Submitted:
18 February 2024
Posted:
21 February 2024
You are already at the latest version
Abstract
Keywords:
Specifications Table
| Subject | Remote Sensing; Earth-Surface Processes; Big Data Analytics. |
| Specific subject area | Pixel-level global overview of available of cloud-, snow-, and shade-free Landsat and Sentinel-2 observations for terrestrial vegetation analyses |
| Data format | Analysed |
| Type of data | Tabulated data distributed as .csv |
| Data collection | We based our dataset on satellite data available freely and openly in the public domain. See section on Data source location. The used satellite acquisitions were spatially subsampled using a regular 0.18° x 0.18° grid defined in the EPSG:4326 projection and tabulated. |
| Data source location | Landsat data (Collection 2, doi: 10.5066/P918ROHC [2], doi: 10.5066/P9TU80IG [3], doi: 10.5066/P975CC9B [4]) are freely and openly available in the public domain. We accessed Landsat reflectance Level 2, Tier 1 scenes acquired from 1982 through 2023 through Google Earth Engine in December 2022 – January 2023 and in January-February 2024. Sentinel-2 data (pre-Collection-1 doi: 10.5270/S2_-d8we2fl [5], and Collection-1 doi: 10.5270/S2_-742ikth [6]) are freely and openly available in the public domain. We accessed Sentinel-2 top-of-atmosphere (TOA) reflectance Level-1C scenes acquired between 26 June 2015 and 31 December 2023 through Google Earth Engine in July – November 2023 and in January-February 2024. MODIS land cover dynamics product at 500-m resolution (MCD12Q2; Collection 6 doi: 10.5067/MODIS/MCD12Q2.061) is freely and openly available in the public domain. We access the 2001-2019 time series of data through Google Earth Engine in July 2023. |
| Data accessibility | Tabular data on 1982-2023 global availability of usable Landsat and Sentinel-2 observations, accompanied by growing season information are publicly available for download in a data repository: Repository name: Dryad Data identification number: doi.org/10.5061/dryad.gb5mkkwxm Direct URL to data: https://doi.org/10.5061/dryad.gb5mkkwxm (will be made publicly available upon acceptance of the paper) Rasterized version of the tabular data on 1982-2023 global data availability based on Landsat and Sentinel-2 archives can be interactively viewed via Google Earth Engine App: https://katarzynaelewinska.users.earthengine.app/view/worlddataaval |
Value of the Data
- Understanding data availability is crucial for the appropriate selection and parametrization of algorithms used for terrestrial vegetation analyses. Yet, a-priori data exploration is rarely performed due to its high resource and time requirements. The lack of appropriate understanding of data availability can lead to ill-advised selection of algorithms and poorly framed research hypotheses, and thus inferior quality of the final results. Our dataset provides a ready-to-use, pixel-level global overview of the spatio-temporal availability of cloud-, snow-, and shade-free Landsat and Sentinel-2 observations from 1982 through 2023, allowing for informed decision-making for analyses relying on datasets based on these two data archives.
- The dataset comprises information on the availability of cloud-, snow-, and shade-free Landsat and Sentinel-2 pixels sampled daily for 1984-2023 in a regular 0.18°-point grid at the global scale. Consequently, a user can easily query data availability for their specific area of interest and time window. As such, the dataset facilitates parametrization of time series processing algorithms, selection of optimal length of compositing windows, evaluation of data availability for spectral-temporal metrics, land cover classification, trend analyses, and other analysis specific to terrestrial vegetation.
- The dataset provides separately availability information for the Landsat (1982-2023) and Sentinel-2 (2015-2023) data archives. The corresponding structure of the two tabulated files comprising the data allows for seamless integration, while catering to users utilizing only one of the data archives. Furthermore, this separation allows for a straightforward assessment of the added value of joint use of Landsat and Sentinel-2 archives after 2015, as compared to Landsat or Sentinel-2 time series alone.
- The pre-calculated overview of usable data provides insight into the quality of formerly derived datasets and results based on Landsat and/or Sentinel-2 time series that lack explicit data-availability quality assessment.
- The accompanying Google Earth Engine App (https://katarzynaelewinska.users.earthengine.app/view/worlddataaval) offers on-the-fly querying of the datasets. Provided functionality allows exploration of the data availability for a selected sensor constellation, using a user-defined length of aggregation period, and allowing to choose an entire calendar year, a vegetation-specific growing season, or other user-defined time periods. As such, the App provides an interface with a basic data query functionality for exploring Landsat and Sentinel-2 data availability that is designed to be used by a wide range of user groups.
Background

Experimental Design, Materials and Methods
- -
- exclusion of all pixels flagged as clouds and cirrus in the inherent ‘QA60’ cloud mask band;
- -
- exclusion of all pixels with cloud probability >50% as defined in the corresponding Cloud Probability product available for each scene;
- -
- exclusion of cirrus clouds (B10 reflectance >0.01);
- -
- exclusion of clouds based on Cloud Displacement Analysis (CDI<-0.5) [15];
- -
- exclusion of dark pixels (B8 reflectance <0.16) within cloud shadows modelled for each scene with scene-specific sun parameters for the clouds identified in the previous steps. Here we assumed a cloud height of 2,000 m.
- -
- exclusion of pixels within a 40-m buffer (two pixels at 20-m resolution) around each identified cloud and cloud shadow object.
- -
- exclusion of snow pixels identified with a snow mask branch of the Sen2Cor processor [16].
Limitations
Author Contributions
Informed Consent Statement
Acknowledgments
Conflicts of Interest
References
- K.E. Lewińska, D. Frantz, U. Leser, P. Hostert, Usable Observations over Europe: Evaluation of Compositing Windows for Landsat and Sentinel-2 Time Series, Environmental and Earth Sciences, 2023. https://doi.org/10.20944/preprints202308.2174.v2. [CrossRef]
- Earth Resources Observation And Science (EROS) Center, Collection-2 Landsat 4-5 Thematic Mapper (TM) Level-1 Data Products, (1982). https://doi.org/10.5066/P918ROHC. [CrossRef]
- Earth Resources Observation And Science (EROS) Center, Collection-2 Landsat 7 Enhanced Thematic Mapper Plus (ETM+) Level-1 Data Products, (1999). https://doi.org/10.5066/P9TU80IG. [CrossRef]
- Earth Resources Observation And Science (EROS) Center, Collection-2 Landsat 8-9 OLI (Operational Land Imager) and TIRS (Thermal Infrared Sensor) Level-1 Data Products, (2013). https://doi.org/10.5066/P975CC9B. [CrossRef]
- European Space Agency, Sentinel-2 MSI Level-1C TOA Reflectance, Collection 0, (2021). https://doi.org/10.5270/S2_-d8we2fl. [CrossRef]
- European Space Agency, Sentinel-2 MSI Level-1C TOA Reflectance, (2022). https://doi.org/10.5270/S2_-742ikth. [CrossRef]
- K.E. Lewińska, S. Ernst, D. Frantz, U. Leser, P. Hostert, Global overview of cloud-, snow-, and shade-free Landsat (1982-2023) and Sentinel-2 (2015-2023) data, (2024). https://doi.org/10.5061/dryad.gb5mkkwxm. [CrossRef]
- N. Gorelick, M. Hancher, M. Dixon, S. Ilyushchenko, D. Thau, R. Moore, Google Earth Engine: Planetary-scale geospatial analysis for everyone, Remote Sensing of Environment 202 (2017) 18–27. https://doi.org/10.1016/j.rse.2017.06.031. [CrossRef]
- S. Foga, P.L. Scaramuzza, S. Guo, Z. Zhu, R.D. Dilley, T. Beckmann, G.L. Schmidt, J.L. Dwyer, M. Joseph Hughes, B. Laue, Cloud detection algorithm comparison and validation for operational Landsat data products, Remote Sensing of Environment 194 (2017) 379–390. https://doi.org/10.1016/j.rse.2017.03.026. [CrossRef]
- Z. Zhu, C.E. Woodcock, Object-based cloud and cloud shadow detection in Landsat imagery, Remote Sensing of Environment 118 (2012) 83–94. https://doi.org/10.1016/j.rse.2011.10.028. [CrossRef]
- Y. Zhang, C.E. Woodcock, P. Arévalo, P. Olofsson, X. Tang, R. Stanimirova, E. Bullock, K.R. Tarrio, Z. Zhu, M.A. Friedl, A Global Analysis of the Spatial and Temporal Variability of Usable Landsat Observations at the Pixel Scale, Front. Remote Sens. 3 (2022) 894618. https://doi.org/10.3389/frsen.2022.894618. [CrossRef]
- S. Qiu, Z. Zhu, R. Shang, C.J. Crawford, Can Landsat 7 preserve its science capability with a drifting orbit?, Science of Remote Sensing 4 (2021) 100026. https://doi.org/10.1016/j.srs.2021.100026. [CrossRef]
- L. Baetens, C. Desjardins, O. Hagolle, Validation of Copernicus Sentinel-2 Cloud Masks Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure, Remote Sensing 11 (2019) 433. https://doi.org/10.3390/rs11040433. [CrossRef]
- R. Coluzzi, V. Imbrenda, M. Lanfredi, T. Simoniello, A first assessment of the Sentinel-2 Level 1-C cloud mask product to support informed surface analyses, Remote Sensing of Environment 217 (2018) 426–443. https://doi.org/10.1016/j.rse.2018.08.009. [CrossRef]
- D. Frantz, E. Haß, A. Uhl, J. Stoffels, J. Hill, Improvement of the Fmask algorithm for Sentinel-2 images: Separating clouds from bright surfaces based on parallax effects, Remote Sensing of Environment 215 (2018) 471–481. https://doi.org/10.1016/j.rse.2018.04.046. [CrossRef]
- M. Main-Knorn, B. Pflug, J. Louis, V. Debaecker, U. Müller-Wilm, F. Gascon, Sen2Cor for Sentinel-2, in: L. Bruzzone, F. Bovolo, J.A. Benediktsson (Eds.), Image and Signal Processing for Remote Sensing XXIII, SPIE, Warsaw, Poland, 2017: p. 3. https://doi.org/10.1117/12.2278218. [CrossRef]
- M. Friedl, G. Josh, D. Sulla-Menashe, MCD12Q2 MODIS/Terra+Aqua Land Cover Dynamics Yearly L3 Global 500m SIN Grid V006, (2019). https://doi.org/10.5067/MODIS/MCD12Q2.006. [CrossRef]
- S. Skakun, J. Wevers, C. Brockmann, G. Doxani, M. Aleksandrov, M. Batič, D. Frantz, F. Gascon, L. Gómez-Chova, O. Hagolle, D. López-Puigdollers, J. Louis, M. Lubej, G. Mateo-García, J. Osman, D. Peressutti, B. Pflug, J. Puc, R. Richter, J.-C. Roger, P. Scaramuzza, E. Vermote, N. Vesel, A. Zupanc, L. Žust, Cloud Mask Intercomparison eXercise (CMIX): An evaluation of cloud masking algorithms for Landsat 8 and Sentinel-2, Remote Sensing of Environment 274 (2022) 112990. https://doi.org/10.1016/j.rse.2022.112990. [CrossRef]
| File name | Explanation |
|---|---|
| GLOBAL_LND_1982-2023_CSO.csv | Daily data availability derived from Landsat 1982-2023 archives |
| GLOBAL_S2_2015-2022_CSO.csv | Daily data availability derived from Sentinel-2 2015-2023 archives |
| GLOBAL_GrowingSeason.csv | Growing season information for normal and leap years |
| README.md | Text file containing basic information on the distributed datasets |
| Variable | Explanation |
|---|---|
| All datasets | |
| id | Unique identifier |
| Lat | Latitude [in degrees] (EPSG:4326) |
| Lon | Longitude [in degrees] (EPSG:4326) |
| GLOBAL_LND_1982-2023_CSO.csv | |
| L_YYYY_MM_dd | Data availability (binary information: 1 – valid observation; 0 – no data) for a single day where YYYY indicates year, MM indicates month, and dd indicates day. |
| GLOBAL_S2_2015-2023_CSO.csv | |
| L_YYYY_MM_dd | Data availability (binary information: 1 – valid observation; 0 – no data) for a single day where YYYY indicates year, MM indicates month, and dd indicates day. |
| GLOBAL_GrowingSeason.csv | |
| Regular_MM_dd | Information on growing season (1 – within the growing season, 0 – outside the growing season) provided daily for a regular year comprising 365 days, where MM indicates month and dd day of a day of interest. |
| Leap_MM_dd | Information on growing season (1 – within the growing season, 0 – outside the growing season) provided daily for a leap year comprising 366 days, where MM indicates month and dd day of a day of interest. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
