I am looking at the downloaded data files and trying to match the filename with the train labels.
For example, in Train_label.csv the first line is
2018-02-01T08:00:00Z,3S31A,11.4
Then I am looking at the training data files by the date 20180201 (Feb. First 2018), I don’t see a matching file. Someone please help me with understanding what I am missing.
I think you might be looking at the wrong file; if you download the training labels CSV files, the first line (after the header) should be:
2019-01-01T08:00:00Z,3A3IE,8.695
If you then open up the satellite metadata file, you can look for a corresponding satellite data granule for this date. For example, the first file listed there is:
20190101T213357_tropomi_la_0.nc
Which would be a sub-set of the TROPOMI data collected on the same day, covering the “LA” area.
I think you may have the incorrect files, since I don’t think that February 2018 is covered by any of the datasets; I would suggest you re-download the training labels file and take another look.
@nayeemmz It may also be the case that satellite data is missing for some samples. This is just a reality of the temporal sparsity of the satellite data. See the thread here for more discussion on the topic: Satellite data not available for some train/test samples
Yes, that is the correct label file for the Particulate Matter track (but not the Trace Gas Track). The first few lines of the corresponding metadata file should look like: