Global Daily Column Average CO2 at 0.1° × 0.1° Spatial Resolution Integrating OCO-3, GOSAT, CAMS with EOF and Deep Learning

Antezana Lopez, Franz Pablo; Zhou, Guanhua; Jing, Guifei; Zhang, Kai; Chen, Liangfu; Chen, Lin; Tan, Yumin

doi:10.1038/s41597-024-04135-w

Download PDF

Data Descriptor
Open access
Published: 14 February 2025

Global Daily Column Average CO₂ at 0.1° × 0.1° Spatial Resolution Integrating OCO-3, GOSAT, CAMS with EOF and Deep Learning

Franz Pablo Antezana Lopez¹,
Guanhua Zhou ORCID: orcid.org/0000-0003-2789-6989¹,
Guifei Jing²,
Kai Zhang³,
Liangfu Chen⁴,
Lin Chen⁵ &
…
Yumin Tan⁶

Scientific Data volume 12, Article number: 268 (2025) Cite this article

5361 Accesses
2 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Accurate global carbon dioxide (CO₂) distribution with high spatial and temporal resolution is essential for understanding its dynamics and impacts on climate change. This study tackles the challenge of data gaps in satellite observations of greenhouse gases, caused by orbital and observational limitations. We reconstructed a comprehensive dataset of Column-averaged CO2 (XCO₂) concentrations by integrating re-analyzed data from the Copernicus Atmosphere Monitoring Service (CAMS) with observations from GOSAT and OCO-3 satellites. Using two advanced data reconstruction methods—Data Interpolating Empirical Orthogonal Functions (DINEOF) and Convolutional Auto-Encoder (DINCAE)—we imputed missing data, preserving spatial and temporal consistency. The combined approach achieved high accuracy, with Pearson correlation values between 0.94 and 0.95 against TCCON measurements, and we also reported root mean square error (RMSE) to assess model performance further. Our results indicate that these techniques generate a daily, high-resolution, gap-free XCO₂ dataset, enabling improved CO₂ monitoring, climate modeling, and policy development.

Full-coverage estimation of CO₂ concentrations in China via multisource satellite data and Deep Forest model

Article Open access 14 November 2024

Improved estimation of carbon dioxide and methane using machine learning with satellite observations over the Arabian Peninsula

Article Open access 04 January 2025

Downscaling top-down CO₂ emissions and sinks in China empowered by hybrid training

Article Open access 23 May 2025

Background & Summary

Global temperatures have been on a steady upward trajectory since the late 19^th century^1,2,3,4, largely driven by human activities that have significantly increased the concentration of greenhouse gases in the atmosphere, with carbon dioxide (CO₂) being the primary contributor to global warming. It has never been more important to reduce CO₂ emissions, especially now that the global XCO₂ level has reached 400 parts per million (ppm)^5,6 Around 2015 and 2016. Nowadays as of July 2024, the value reached 421.1 ppm⁷. Numerous strategies have been adopted to lessen the negative consequences of climate change in response to mounting concerns. The C40 city network, which has established local-level measures to reduce CO₂ emissions in cities, is one famous example^8,9. However, precise data is strongly required to evaluate these initiatives’ success. Although some towns have installed mobile and fixed ground-based measurement networks¹⁰. these networks often face limitations in terms of spatial coverage and temporal consistency. Ground-based stations are generally sparse and only provide data at fixed locations, making it difficult to capture the full variability of CO₂ across diverse regions. In contrast, high temporal and spatial resolution satellite datasets offer comprehensive global coverage, enabling continuous monitoring of urban and remote areas. This allows for a more detailed understanding of CO₂ dynamics, particularly in areas where ground-based measurements are unavailable. Moreover, satellite data provides consistent observations over time, which is critical for tracking short-term fluctuations and long-term trends in carbon emissions and atmospheric concentrations.

The development of remote sensing atmospheric sounding technology has increased the collection of regional and global CO₂ data via satellite measurements^11,12. Radiative transfer theory is employed to estimate the column-averaged carbon dioxide dry air mole fraction (XCO₂) using atmospheric spectral data from spaceborne sensors¹³. However, significant challenges arise due to calibration issues and mismatches between retrieved and observed radiances, leading to biases in XCO₂ estimates¹⁴. Various space agencies and satellites contribute to CO₂ observation programs, yet they face spatial coverage limitations. For instance, NASA’s Orbiting Carbon Observatory-2 (OCO-2), launched in 2014, provides precise CO₂ measurements using near-infrared observations but has a limited observational swath, posing challenges for the comprehensive spatial distribution^15,16,17. Similarly, Japan’s GOSAT and ESA’s efforts have advanced CO₂ monitoring, but all face challenges regarding spatial data coverage.

While the Copernicus Atmosphere Monitoring Service (CAMS) provides a more comprehensive dataset without gaps, encapsulating global XCO₂ information^18,19,20, its specific advantages and limitations compared to other datasets are often underemphasized. CAMS enhances data resolution and precision, especially in reconstructing daily XCO₂ datasets^21,22. But key challenges remain, including gaps in satellite observations caused by cloud cover or technical issues, on the other hand CAMS dataset. To mitigate these gaps, researchers have integrated multisource datasets, such as OCO-2 and OCO-3, with machine learning techniques like neural networks and convolutional neural networks to improve spatial and temporal resolution²³. Data reconstruction algorithms such as the Data Interpolating Convolutional Auto-Encoder (DINCAE)^24,25 and Data Interpolating Empirical Orthogonal Functions (DINEOF)^26,27,28, originally developed for sea surface temperature, are also being adapted to enhance the spatiotemporal coverage of missing CO₂ data.

Earlier research explored various methods aimed at expanding the spatiotemporal coverage of satellite-based CO₂ measurements. These studies mainly addressed the challenge of reconstructing missing data in carbon satellite datasets. Techniques such as the Baxter-King Filter were employed following the application of algorithms like DINEOF and DINCAE to fill data gaps^3,29,30,31, While these methods have been useful, gaps in their application remain. Specifically, DINEOF’s reliance on Empirical Orthogonal Functions (EOFs) to estimate missing data through dominant spatial patterns may oversimplify complex temporal and spatial variability in CO₂ measurements. The comparison of datasets, such as OCO-2 and GOSAT in the near-infrared (NIR) and shortwave infrared (SWIR) regions, helped generate the initial version of the XCO₂ dataset^32,33,34. However, merging datasets from different instruments still presents challenges, particularly regarding consistency and resolution.

While the DINCAE algorithm, using deep learning and a convolutional auto-encoder, has shown promise by efficiently reconstructing missing data from OCO-3 measurements, its application across multiple datasets has yet to be fully explored³⁵. The integration of multisource data, including TCCON, GOSAT, OCO-2, and OCO-3, provides a step forward in reconstructing monthly XCO₂ values, but the fusion of these datasets still faces limitations in accuracy and validation.

DINCAE, as a neural network comprising encoders and decoders, reduces the resolution of satellite data through convolutional layers and then reconstructs the information using interpolation layers^24,36. This approach shows potential for filling gaps in satellite datasets but requires further validation, particularly when applied to complex geophysical processes. Similarly, while DINEOF and DINCAE algorithms have improved the ability to reconstruct data from OCO-3 measurements, the accuracy and reliability of the reconstructed data must be compared against benchmark datasets like TCCON and GOSAT. TCCON, which began with the installation of equipment in Park Falls, Wisconsin, in 2004 and has since expanded to 23 globally operational devices, provides a crucial standard for validating satellite-derived CO₂ data³⁷ However, further comparison and validation efforts are essential to enhance the reliability of reconstructed datasets across different instruments and regions.

This study delves into the reconstruction of a global CO₂ dataset with comprehensive coverage and high spatiotemporal resolution by integrating multisource from OCO-3, GOSAT, CAMS, contributing to the analysis of CO₂ dynamics and their climatic impacts. The novel reconstruction techniques, DINEOF and DINCAE, will be employed to address data gaps due to sensor limitations and cloud cover. To enhance the data reliability and underscore the effectiveness of these methods, the reconstructed dataset will be compared with TCCON and other datasets. Furthermore, we demonstrate the critical role of post-reconstruction filtering in ensuring data integrity, significantly advancing our understanding of the carbon cycle and informing climate policy. This study also reflects on the broader implications of NASA’s OCO-3 mission for climate change analysis, advocating for enhanced satellite coverage to tackle global environmental challenges.

Methods

This study intends to reconstruct global daily gap-free XCO₂ datasets with high resolution to enhance our understanding of CO₂ dynamics, setting the stage for crafting robust climate models and actionable mitigation strategies, with validation and spatiotemporal pattern analysis. A key feature of this research is the employment of a high temporal resolution and consistent sampling method, which reinforces the reliability of the comparative analysis. Figure 1 graphically delineates the systematic methodology undertaken, beginning with the collation of TCCON, OCO-3, and GOSAT datasets, followed by data pre-processing which includes quality filtering and standardization to a 0.1-degree resolution. Data reconstruction is executed using DINEOF and DINCAE algorithms, where deep learning and empirical orthogonal functions merge disparate data into a coherent, gap-free dataset. This process culminates in post-processing steps, applying spatial median filtering and time series noise reduction, refining the data for enhanced accuracy, as demonstrated by the spatial and temporal trend comparisons with TCCON benchmarks.

Input datasets overview

Although GOSAT and CAMS datasets both use data from the GOSAT satellite, their different processing methods ensure they provide unique insights. GOSAT offers direct satellite measurements, while CAMS includes reanalyzed data from multiple sources, enhancing the completeness and resolution of CO2 coverage. Combining these datasets compensates for missing information in sparse datasets like OCO-3, allowing for more accurate interpolation. To create daily column average values of CO₂ maps, we analyzed multiple datasets to validate the information and study each parameter. The list of used datasets is presented in Table 1 and a detailed description.

Table 1 An overview of the data from various sources used in this investigation.

Full size table

GOSAT satellite dataset

The Greenhouse Gas Observing Satellite, also known as “IBUKI” or GOSAT, is essential for monitoring two key greenhouse gases: CO₂ and CH₄^38,39. Key instruments on board include the Fourier Transform Spectrometer (FTS) and Cloud-Aerosol Imager (CAI) of the Thermal and Near-infrared Sensor (TANSO)^40,41. The FTS collects reflected short-wavelength infrared (SWIR) radiation in three bands: 0.76, 1.6, and 2.0 μm, allowing for near-surface CO₂ retrievals³⁹. It also detects thermal infrared radiation (TIR) in the range of 5.5–14.3 μm, using absorption bands near 14 μm to gather CO2 concentration data up to 2 km^39,42. Dense clouds and aerosol interference are captured by CAI, supporting high-quality data filtering.

For this study, we used a bias-corrected monthly GOSAT Level-3 global XCO₂ product, with 2.5° × 2.5° resolution, covering June 2018 to December 2023 (Data is accessible at https://data2.gosat.nies.go.jp for GOSAT). This dataset is derived from FTS SWIR Level-2 data, which is smoothed, extrapolated, and interpolated via the Kriging approach^43,44. fill gaps and improve spatiotemporal XCO₂ coverage⁴³.

OCO-3 satellite dataset

NASA’s OCO-3 satellite, launched in 2019, tracks atmospheric CO₂ with a high spatial resolution of 2.5 km × 0.7 km^45,46,47. Its FTS captures spectral data for accurate CO₂ and CH₄ column measurements, while TCCON data provides detailed validation of the total carbon column¹³. This study uses OCO-3’s monthly SWIR observations from August 2019 to November 2023. OCO-3’s orbital constraints can challenge data collection, but the L2FP algorithm mitigates this by filtering variables, including CO₂ and H₂O ratios, albedo, BRDF coefficients, RMS errors, and atmospheric factors⁴⁸. We verified OCO-3 data against TCCON and atmospheric stations, focusing on the satellite’s spatial coverage from latitudes −52° to 52° and longitudes −180° to 180°.

TCCON in-situ dataset

It consists of high-resolution ground-based Fourier Transform Spectrometers that measure column-averaged CO₂ and other gases such as CH₄ and CO. TCCON has been crucial for validating satellite data from OCO-2, GOSAT, and other missions^{49,50,51,52,53,54}.

We use the extensive GGG2020 dataset⁵⁵ from several TCCON stations, from Burgos⁵⁶, Caltech⁵⁷, Darwin⁵⁸, East Trout Lake⁵⁹, Garmisch⁶⁰, Harwell⁶¹, Hefei⁶², Izana⁶³, Jet propulsion laboratory^64,65, Karlsruhe⁶⁶, Lamont⁶⁷, Lauder^68,69,70, Nicosia⁷¹, Orleans⁷², Paris, Park Falls⁷³, Reunion Island⁷⁴, Rikubetsu⁷⁵, Saga⁷⁶, Tsukuba⁷⁷, Wollongong and Xianghe⁷⁸, all of them distributed around the world, the data were extracted from 2018 to 2023.

CAMS dataset

The CAMS XCO₂ dataset, generated by the ECMWF’s Integrated Forecasting System (IFS) and 4DVar data assimilation system, offers high-resolution global atmospheric data, including greenhouse gases, with 0.75° spatial and three-hour temporal resolution⁷⁹. This study focuses on CO₂ column-mean molar fractions, and differences in spatial and temporal coverage between OCO-3 are accounted for. CAMS XCO₂ data is integrated for consistency and using this data for extrapolation⁸⁰, improving our understanding of atmospheric CO₂ dynamics. Prior studies using CAMS and OCO-2 data for XCO₂ reconstruction via deep learning have shown high accuracy⁸¹.

Initial input datasets, preliminary insights

Terrestrial areas tend to exhibit larger errors compared to marine areas, with GOSAT and Carbon tracker datasets showing particularly high inaccuracies⁸². On the other hand, OCO-2 and OCO-3 data have spatial and temporal resolution problems because it has several missed information, statistical characteristics, uncertainties, and constraints related to temporal fluctuation and various seasons of the year are included in the data comparison between TCCON and OCO-2, including glint and nadir modes¹¹. Regarding the data in terms of enhancing the spatiotemporal coverage of OCO-3 data in this research, the preliminary findings of applying the DINCAE and DINEOF algorithms show encouraging results. The reconstructed data will offer insightful knowledge of the dynamics of CO₂ and its impact on climate change. It was crucial to improve the accuracy and dependability of our reconstructed dataset. Using diligent data filtering techniques to prepare the dataset for in-depth analysis and modeling was crucial. Meaningful advancements in carbon cycle studies, climate modeling, and policy formulation depend on our ability to comprehend the dynamics of CO₂ and their broad consequences for climate change. The improvements made possible by the skillful application of DINCAE and DINEOF algorithms and cutting-edge data filtering methods significantly increased the spatiotemporal coverage and dependability of OCO-3 data. This advancement is crucial because it enhances our understanding of CO₂ dynamics and their significant influence on climate change.

The OCO-3 dataset has data gaps ranging from 99.75% to 100% per day, this data was also filtered by analyzing the quality_flag equal to 0 which generally indicates reliable data. On Earth’s surface, each grid cell corresponds to a 0.1 × 0.1-degree area. This gap hinders our full comprehension of CO₂ levels, potentially compromising the accuracy and completeness of global CO₂ assessments and modeling efforts. Figure 2 presents a temporal analysis illustrating the variation in missing information within OCO-3 satellite data.

Similarly, the GOSAT L3 product, with a coarser spatial resolution of 2.5 × 2.5 degrees, shows empty data in 38–65% of the grid cells each month. On Earth’s surface, this corresponds to an area of approximately 625,000 km². The lack of data in this range makes it difficult to fully understand CO₂ dynamics, especially at local and regional scales. Figure 3 illustrates a temporal analysis of empty information variation of GOSAT L3 V03.05 satellite data products.

Thus, the magnitude and significance of the missing data gaps in both OCO-3 and GOSAT L3 varied to different degrees for CO₂ monitoring. Hence, the ability to collect comprehensive global CO₂ distributions and perform accurate regional analyses is limited. The GOSAT data covers more XCO₂ information spatially, but the temporal and spatial resolution is lower than that of the OCO-3 satellites. To improve our understanding of CO₂ dynamics and contribute to the success of climate change mitigation initiatives, it is crucial to enhance satellite coverage and data collection continuously.

Figure 4 illustrates the geographical distribution of the data utilized in this study. Derived from GOSAT Level 3 and OCO-3 datasets, acquired in December 2022 and December 14, 2022, respectively. The data visualization highlights significant gaps in the global coverage of atmospheric CO₂, showing that the OCO-3 dataset mostly spans the region between 52°N and 52°S. This emphasizes how critical it is to have a complete dataset and how crucial it is to rebuild satellite pictures to close this gap. The CAMS dataset, on the other hand, offers global coverage based on simulation results, highlighting a notable discrepancy in data comprehensiveness and geographic scope and emphasizing the integration of multiple sources for accurate CO₂ monitoring.

The OCO-3 satellite and its state-of-the-art equipment offer a plethora of information essential for tracking atmospheric greenhouse gases. By analyzing its SWIR data, we intend to gain insight into the global carbon cycle and assess the environmental effects of human activities⁸³. Rigorous comparisons with TCCON, fusing the information of CAMS, GOSAT, and OCO-3, will verify the accuracy and reliability of OCO-3 for scientific research and environmental management.

Figure 5 presents an extensive plot against a world map showing the atmospheric XCO₂ concentrations between 2018 and 2023. The scatter plot distinguishes data points according to the source satellite and associated geographic area. Notably, the alignment of patterns between CAMS and OCO-3 indicates a high degree of consistency in XCO₂ measurements, reinforcing the reliability and calibration uniformity of the OCO-3 sensor and CAMS model.

The integrated map highlights the coverage of the OCO-3 satellite in yellow, indicating the spatial emphasis of the dataset and obliquely implying a geographically limited investigation. While the overall trends of CAMS and OCO-3 align closely, GOSAT measurements deviate from this trend, especially in south ocean areas. The datasets exhibit consistency in inland areas, but significant discrepancies arise over ocean regions. These deviations, particularly highlighted in the scatter plot, point to potential regional anomalies, calibration inconsistencies, or differences in sensor sensitivities. In the larger scheme of atmospheric monitoring, these discrepancies are crucial since they could indicate the requirement for further research into regional atmospheric dynamics and cross-calibration. To address this issue, we excluded the GOSAT grid cells with large deviations from CAMS and OCO-3 to improve the reliability of the reconstructed dataset.

The scatterplot matrix displayed in Fig. 6 presents a comparison of XCO₂ distribution with histograms for CAMS, GOSAT, and OCO-3 in the panels along the diagonal. This information reveals the frequency of different XCO2 ppm values captured by each system. The overlapping period for the three datasets (CAMS, OCO-3, and GOSAT) is from 2019-10 to 2020-12, so the assessment and comparison of XCO₂ data to evaluate the input datasets are within this period. The scatterplots in the off-diagonal panels compare the readings for every pair of systems, and the linear regression lines show the correlation between the systems for every location. Best correlation parameters are visible in the comparison of GOSAT and CAMS data, on the other hand, the results among OCO-3 and other datasets are not consistent and, in some cases, have a negative slope in linear correlation, particularly in southern areas. These correlation factors are presented in Table 2, which offers quantitative insights into the degree of agreement between measurements from several satellite systems in a variety of places.

Table 2 Correlation Parameter Comparisons Between GOSAT, OCO-3, and CAMS Datasets.

Full size table

Table 2 presents the correlation parameters by region and dataset. In Europe, CAMS and GOSAT exhibit the highest R² value (0.64) and Pearson correlation (0.80), demonstrating a strong linear relationship and high prediction accuracy in this region. Oceania, however, shows a negative correlation (−0.09) in this particular comparison, indicating a weak and unfavorable association. Europe also shows a steep positive slope (1.39) and a significant negative intercept (−161.49), reinforcing the strong linear relationship. The correlation levels between CAMS and OCO-3 and GOSAT and OCO-3 vary by region. For example, Europe maintains a strong correlation in the CAMS-OCO-3 comparison (Pearson coefficient of 0.68), while Africa shows negative correlations in both GOSAT-OCO-3 (−0.19) and CAMS-OCO-3 (−0.18), pointing to differences in satellite data or localized fluctuations in the atmosphere. These region-specific variations highlight the complexity and variety in interpreting satellite data and underscore the need for localized atmospheric studies.

Dataset reconstruction method

According to previous studies^{84,85,86,87,88,89,90,91}, the most often used technical approaches for modeling relationships are statistical models, machine learning, and deep learning techniques. Despite the widespread use of statistical models, including spatial autoregression models, geographically weighted regression (GWR), and linear regression models, to investigate spatiotemporal relationships, there are still certain shortcomings in estimating XCO₂^92,93,94. Those works focus on the use of Deep Learning and EOF techniques to generate enhanced maps of XCO₂ maps, which are both known in geo-statistics as space-time modal decomposition, with a focus on the crucial part that space-time modal decomposition plays in the data analysis stage of the DINEOF technique⁹⁵. Our research strategy involved thorough data collection, the use of reconstruction techniques, and a strict evaluation process to fully examine the spatiotemporal coverage of OCO-3 data combined with CAMS and GOSAT. Our research had a rich context thanks to the varied climatic circumstances and variations in atmospheric CO₂ levels in our study area. The OCO-3 data were gathered using cutting-edge sensors and equipment deployed on satellite platforms, providing precise measurements of the atmospheric CO₂ level. Forming the backbone of this study.

To assess the spatiotemporal coverage of OCO-3 data, a comprehensive methodology involving data collection, application of DINCAE and DINEOF algorithms, and rigorous evaluation was employed. This approach facilitated a comparison of the algorithm performances through statistical analysis and visual inspection, enhancing our understanding of data reconstruction techniques and their implications for spatiotemporal coverage. The study also acknowledges limitations, particularly the potential biased or ambiguous inherent in OCO-3 data and the influence of specific characteristics of the utilized datasets and the parameter settings in the DINCAE and DINEOF algorithms on their performance. Recognizing these limitations and maintaining methodological rigor is crucial for accurate interpretation of the findings.

DINEOF reconstruction method

DINEOF (Data Interpolating Empirical Orthogonal Function), originally introduced by Pearson⁹⁶, is a sophisticated analytical technique utilized to interpolate missing data and identify spatiotemporal modes. Empirical Orthogonal Function (EOF) analysis forms its foundation. The data matrix A, represented as A_mxn, is the central component of DINEOF. It is a m x n matrix where m is bigger than n. In this case, n is the number of layers, usually expressed in terms of time, such as months, and m denotes all the grid cells that correspond to one layer of the research area, each grid cell arranged in a row⁹⁷.

In DINEOF, the breakdown of the matrix A_mxn into its parts of space and time is essential. A_mxn = V_mxm Z_mxn is a concise representation of this decomposition, which separates A_mxn into a space function V_mxm and a time function Z_mxn. Whereas the time function is made up of related time coefficients, the space function is made up of orthogonal space characteristic fields. These space characteristic fields and their coefficients are concatenated linearly to represent the space field at a given position a_k as represented in the below equation.

$${a}_{k}=\mathop{\sum }\limits_{i=1}^{N}{v}_{i}\cdot {c}_{i}$$

(1)

where ${v}_{i}$ are the vectors of the space characteristic fields and ${c}_{i}$ are the coefficients. When it comes to recreating missing values in the dataset, the EOF decomposition is essential. The first step in this approach is to use EOF to decompose the spatiotemporal variable field A_mxn and produce spatial typical fields. The methodology then involves selecting the first N of these spatial fields and reconstructing the complete space-time variable field with these fields. Thus, the reconstructed matrix, represented as A_mxn, is the product of the time coefficients corresponding to the first N spatial typical fields. This method is a reliable way to handle missing data in complicated datasets since it works especially well with big data volumes and is resilient to local changes. In geo-statistics, it is sometimes referred to as space-time modal decomposition and is a crucial component of the DINEOF method’s data analysis⁹⁵.

DINCAE reconstruction method

The “DINCAE 2.0” methodology employ a convolutional neural network (CNN) created especially designed for reconstruction of missing data in satellite observations. The core of this methodology is a U-Net type network, which is particularly effective because of its deep structure and capacity to capture both local and global properties in the input data. A distinguishing characteristic of the U-Net design is the utilization of skip connections, which aid in maintaining the fine-scale details in the data and are essential for precise reconstruction of complicated geophysical fields^36,98. Assuming a Gaussian distribution of errors, the network is trained to minimize the negative log-likelihood, which can be expressed mathematically as:

$$L(\theta )=-\log \,L(X|\theta )$$

(2)

Where L(θ) is the likelihood function of the parameters θ given the data X. This algorithm is further enhanced by an additional refinement phase, which processes the inputs and outputs through an auto-encoder, in addition to its architecture. This phase deepens the network and enhances the reconstruction quality, especially for complex patterns and interactions^99,100. Moreover, the methodology expands to multivariate reconstruction, enabling it to handle diverse data types, such as wind fields, chlorophyll concentration, and sea surface temperature. For a thorough knowledge and analysis of environmental events, a multivariate approach is essential^26,101,102. Additionally, a noteworthy development for this model is its ability to process non-gridded data, which is common in satellite datasets that vary in formats and resolutions. This flexibility is achieved by modifying the network’s cost function and the input layer, allowing it to efficiently analyze unstructured data inputs and produce organized gridded field outputs.

Figure 7 illustrates a global map collection that displays the monthly mean reconstructed atmospheric CO₂ concentrations over six years, from January 2018 to July 2023. The XCO₂ concentrations fluctuate throughout the year, exhibiting a seasonal pattern, with levels declining from May to September. This decrease can be attributed to increased photosynthetic activities in the Northern Hemisphere during the summer, which removes CO₂ from the atmosphere. In contrast, XCO₂ levels rise between October to April, driven by reduced photosynthetic activity in the winter in the Northern Hemisphere in conjunction with an increase in CO₂ emissions from human activities like industrial processes and heating.

Global Spatiotemporal XCO₂ evolution

Figure 8 provides a thorough analysis of variations in atmospheric CO₂ in different parts of the world, highlighting the important contribution of oceans to Earth’s carbon cycle through their seasonal CO₂ absorption and emission. It emphasizes the unique patterns of XCO₂ exchange with the atmosphere in the Indian Ocean, the seasonal biological productivity and decay of temperate zones in the North Atlantic and North Pacific, and the continents of North America, Asia, and Europe. The waters in the Southern China Sea also show a naturally occurring cyclical constant increase with fluctuations of XCO₂, almost similar trend and seasonal variation like China.

On the other hand, South America’s XCO₂ trajectory, significantly influenced by the Amazon’s vast photosynthetic capacity, is marred by a concerning surge in emissions due to fossil fuel combustion and rampant deforestation. This trend is mirrored paralleled in the Southern Hemisphere regions such as the South Atlantic, South Pacific, and Oceania, where XCO₂ levels, though lower, mirror the distressing upward trend observed in the northern counterparts. The depiction across these diverse landscapes contributes to a comprehensive narrative of the Earth’s carbon flux, illustrating a climate system that is both dynamic and acutely sensitive to changes, set against an ever-increasing concentration of atmospheric greenhouse gases. Notably, the graph identifies the Australian region as having the lowest column-average CO₂ concentrations, marking a distinct region of interest in the global trend patterns.

The comprehensive analysis of the XCO₂ linear growth rate spanning from 2018 to 2023 offers a striking visualization of the escalating CO₂ concentrations worldwide, this is visible in Fig. 9. The intensity of the red hues mapped across various global regions not only underscores the areas with significant increases in XCO₂ levels but also invites a deeper investigation into the underlying causes. These vivid markers serve as indicators of heightened industrial activities, urban sprawl, deforestation, and other carbon-emitting practices, highlighting the urgent need for scrutinizing local and regional environmental policies.

The Amazon Basin in South America, along with regions such as Argentina, Bolivia, Brazil, exhibits areas of elevated XCO₂ levels, as indicated by the visual data. Southern region of Africa similarly shows an increase in XCO₂ concentrations. These trends in the data reflect changes in atmospheric carbon dioxide but do not provide conclusive evidence about the specific causes of these increases, which could be influenced by a range of factors including natural variability caused by the landuse change, short-term atmospheric mixing, or regional land-use changes. In Asia, areas like South and west Asia, including western China, show noticeable XCO₂ increases, possibly linked to industrial activity. Western region of Australia shows a subtler rise in XCO₂ levels, which may be due to both natural processes and localized environmental conditions. However, caution is advised in interpreting these spatial trends as direct indicators of specific sources or processes without accounting for the potential influence of atmospheric dynamics over short periods.

Data Records

The final output of our study is a daily raster dataset, systematically organized in a folder structure and named following the pattern “Reconstructed_YYYYMMDD.tif”, representing gap-free daily XCO₂ concentrations. Each data record is saved as a “.tif” file, ensuring broad compatibility and ease of use for further scientific analysis. The dataset covers an extensive period from January 2018 to November 2023 and boasts a fine spatial resolution of 0.1 degrees, facilitating precise, grid-based analyses of XCO₂ distribution patterns over time. This dataset is publicly available for free download at the Zenodo¹⁰³. Our data has been made publicly available for further academic inquiry and collaboration at the following digital repository: https://zenodo.org/records/13895409. We encourage scholars and researchers to access and utilize this dataset to advance the field.

Reconstructed products

A comparison between model and satellite data for XCO₂ across the Western Hemisphere is shown in Fig. 10. A smoother gradient of XCO₂ concentrations is seen across the final results of the integrated reconstruction of OCO-3, GOSAT, and CAMS, which incorporates all three datasets. This is likely due to the synergistic effect of combining all available data for a more accurate and higher-resolution reconstruction of the atmospheric CO₂ distribution. Figure 10a OCO-3, (b) GOSAT L3, (c) CAMS, and (d) the reconstructed image using only OCO-3 data cannot obtain a complete image and still contain several missing gaps and even noise, so using only one dataset to reconstruct the data is not enough. On the other hand, Fig. 10e shows the integration of GOSAT information on data reconstruction, the resolution of GOSAT does not allow a satisfactory data reconstruction, so finally, we integrate the CAMS data, and the results are presented in Fig. 10f which are the final result of the reconstructed data. Hence, the images show how the model output (CAMS) and the original satellite observations (OCO-3 and GOSAT) differ in terms of data quality and resolution. The reconstructed and merged data maps show improvements in spatial resolution and data continuity, leading to image (f), which, as a result of the integration of all datasets, offers the most accurate and thorough perspective of XCO₂ levels. This demonstrates the importance of merging data from several sources when conducting environmental monitoring and analysis.

Technical Validation

The key difference between GOSAT and CAMS lies in their spatial resolution and the data assimilation techniques used by CAMS, which integrates multiple data sources. These differences, far from being problematic, enhance the robustness of the reconstructed dataset by providing a broader spatial coverage and more frequent updates. The complementary nature of these datasets is critical in filling the gaps present in OCO-3 data, resulting in a more accurate and continuous global XCO₂ dataset. This research aims to reconstruct the dataset using multisource geodata containing XCO₂ information, which has been interpolated through the DINEOF and DINCAE methods. The validation process involves a comprehensive analysis of the differences between the interpolated OCO-3 data and reference datasets obtained from GOSAT and TCCON.

In evaluating the reliability of satellite-derived CO₂ data, a crucial aspect involves comparing datasets from various instruments. Notably, studies^104,105 have conducted comprehensive assessments of the consistency between different satellite missions, including BESD-SCIAMACHY, ACOS-GOSAT, and OCO-2. These assessments involve meticulous analyses of biases, correlation coefficients, and spatiotemporal variations, addressing challenges associated with differences in a priori profiles and averaging kernels across diverse satellite data products, as highlighted by Rodgers and Connor¹⁰⁶. The results confirm a degree of agreement between various satellite datasets, facilitating the integration of multi-satellite observations for a more comprehensive understanding of atmospheric CO₂ concentrations²³.

In Fig. 11, the boxplot shows a succinct analytical comparison of atmospheric XCO₂ across a suite of global monitoring TCCON stations, showcasing the spread and central tendencies within the TCCON, DINCAE, and DINEOF reconstructed datasets. The central clustering around 412 ppm highlights the coherence in global CO₂ levels, with notable consistency at Burgos station, exemplifying minimal cross-method variability. Peaks are closely matched at Caltech and Armstrong Center, with the TCCON readings reaching a maximum of 427.565 ppm, indicative of regional emission spikes. A notable lower bound is observed at East Trout Lake with DINEOF, suggesting lower regional CO₂ concentration levels. Lauder’s OCO-3 data exhibit good agreement with TCCON, reflecting reliable measurements. In contrast, sites like Orleans, Nicosia, and Paris display a tightly packed range, signifying data robustness. Reunion Island’s DINEOF results represent the lower end of measurements, with Xianghe maintaining a consistent range with the global data set, illustrating the extensive span of CO₂ levels captured.

Figure 12a shows a scatterplot comparison between XCO2 values from TCCON and those from the DINEOF technique. A high correlation coefficient (r = 0.94) between the two sets of data indicates a strong linear link, which is revealed by the scatter of dots. The difference between values seen and values projected by an estimator or model is measured by the root mean square error, or RMSE. Quantitative estimates of the average magnitude of the error are provided by the unbiased RMSE (ubRMSE) of 1.581 ppm and the RMSE of 1.610 ppm. With a bias of 0.301 ppm, which is another measure of precision, the DLR measurements are, on average, just marginally higher than the TCCON values.

Figure 12b presents a nuanced comparison, suggesting potential calibration or adjustment enhancements to the DINCAE method. This figure demonstrates a slightly improved correlation over Fig. 12a, with a correlation coefficient of 0.95. Furthermore, it features reduced RMSE and ubRMSE values of 1.441 ppm and 1.400 ppm, respectively, indicating more precise accuracy. While the bias slightly increases to 0.343 ppm, it remains within an acceptable margin, underscoring a closer alignment with the TCCON reference data. The lower RMSE and ubRMSE, alongside the higher correlation coefficient, suggest that the observations in Fig. 12b are more accurate and consistent compared to those aligned with the TCCON benchmark. Although the bias is marginally higher, it’s not significantly detrimental. The density of points closely following the 1:1 line visually confirms a tighter congruence with TCCON readings, hinting at successful calibration or adjustments in addressing systematic discrepancies evident in the DINEOF approach depicted in Fig. 12a.

The data validation process, incorporating scatterplot analysis and spatiotemporal patterns, is a robust methodology to assess the quality and reliability of reconstructed data. Comparing the measurements with TCCON datasets contributes to the refinement of monitoring CO₂ concentrations.

Code availability

The data assembly, processing, and quality-control steps were performed using R and Julia, utilizing established libraries for DINEOF and DINCAE. These implementations relied on the respective open-source libraries available. The dineof was used, that function is available in sinkr library in R, and the DINCAE implementation is available on GitHub https://github.com/gher-uliege/DINCAE. No custom code was generated to automate these procedures, as the available libraries fully supported the required functionalities.

References

Intergovernmental Panel on Climate Change (IPCC). Climate Change 2022 – Impacts, Adaptation and Vulnerability. https://doi.org/10.1017/9781009325844 (Cambridge University Press, 2023).
Hegerl, G. C. et al. Causes of climate change over the historical record. Environmental Research Letters 14, 123006, https://doi.org/10.1088/1748-9326/ab4557 (2019).
Article ADS MATH Google Scholar
Jiang, Y., Gao, Z., He, J., Wu, J. & Christakos, G. Application and Analysis of XCO2 Data from OCO Satellite Using a Synthetic DINEOF–BME Spatiotemporal Interpolation Framework. Remote Sens (Basel) 14, 4422, https://doi.org/10.3390/rs14174422 (2022).
Article ADS Google Scholar
Adak, S., Mandal, N., Mukhopadhyay, A., Maity, P. P. & Sen, S. Current State and Prediction of Future Global Climate Change and Variability in Terms of CO2 Levels and Temperature. in Enhancing Resilience of Dryland Agriculture Under Changing Climate 15–43 https://doi.org/10.1007/978-981-19-9159-2_2 (Springer Nature Singapore, 2023).
Keeling, Ralph F; Keeling, Charles D. Atmospheric Monthly In Situ CO2 Data - Mauna Loa Observatory, Hawaii (Archive 2024-11-13). In Scripps CO2 Program Data. UC San Diego Library Digital Collections. https://doi.org/10.6075/J08W3BHW (2017).
Kozyra, J., Łukasik, Z., Kuśmińska-Fijałkowska, A. & Janota, A. Reduction of CO2 emission as a result of the use of 48-volt electrical installations in passenger cars. Archives of Transport 66, 7–20, https://doi.org/10.5604/01.3001.0016.3124 (2023).
Article ADS Google Scholar
Recent global CO2. GOSAT [online]. National Institute for Environmental Studies (NIES). https://www.gosat.nies.go.jp/en/recent-global-co2.html.
Lei, L. et al. A comparison of atmospheric CO2 concentration GOSAT-based observations and model simulations. Sci China Earth Sci 57, 1393–1402, https://doi.org/10.1007/s11430-013-4807-y (2014).
Article ADS MATH Google Scholar
Leal, J. M. & Paterson, M. Transnational city networks, global political economy, and climate governance: C40 in Mexico and Lima. Rev Int Polit Econ 31, 26–46, https://doi.org/10.1080/09692290.2023.2167849 (2024).
Article Google Scholar
Potier, E. et al. Complementing XCO2 imagery with ground-based CO2 and 14CO2 measurements to monitor CO2 emissions from fossil fuels on a regional to local scale. Atmos Meas Tech 15, 5261–5288, https://doi.org/10.5194/amt-15-5261-2022 (2022).
Article MATH Google Scholar
Wunch, D. et al. Comparisons of the Orbiting Carbon Observatory-2 (OCO-2) CO2 measurements with TCCON. Atmos Meas Tech 10, 2209–2238, https://doi.org/10.5194/amt-10-2209-2017 (2017).
Article MATH Google Scholar
Buchwitz, M. et al. The Greenhouse Gas Climate Change Initiative (GHG-CCI): Comparison and quality assessment of near-surface-sensitive satellite-derived CO2 and CH4 global data sets. Remote Sens Environ 162, 344–362, https://doi.org/10.1016/j.rse.2013.04.024 (2015).
Article ADS Google Scholar
Taylor, T. E. et al. Evaluating the consistency between OCO-2 and OCO-3 XCO ₂ estimates derived from the NASA ACOS version 10 retrieval algorithm. Atmos Meas Tech 16, 3173–3209, https://doi.org/10.5194/amt-16-3173-2023 (2023).
Article MATH Google Scholar
Mauceri, S., Massie, S. & Schmidt, S. Correcting 3D cloud effects in X CO2 retrievals from the Orbiting Carbon Observatory-2 (OCO-2). Atmos Meas Tech 16, 1461–1476, https://doi.org/10.5194/amt-16-1461-2023 (2023).
Article Google Scholar
Crisp, D. et al. The ACOS CO2 retrieval algorithm – Part II: Global XCO2 data characterization. Atmos Meas Tech 5, 687–707, https://doi.org/10.5194/amt-5-687-2012 (2012).
Article MATH Google Scholar
Liu, Y. et al. Effects of spectral sampling rate and range of CO2 absorption bands on XCO2 retrieval from TanSat hyperspectral spectrometer. Chinese Science Bulletin 59, 1485–1491, https://doi.org/10.1007/s11434-014-0215-8 (2014).
Article ADS MATH Google Scholar
Connor, B. et al. Quantification of uncertainties in OCO-2 measurements of XCO2 simulations and linear error analysis. Atmos Meas Tech 9, 5227–5238, https://doi.org/10.5194/amt-9-5227-2016 (2016).
Article MATH Google Scholar
Wang, Y., Yuan, Q., Zhou, S. & Zhang, L. Global spatiotemporal completion of daily high-resolution TCCO from TROPOMI over land using a swath-based local ensemble learning method. ISPRS Journal of Photogrammetry and Remote Sensing 194, 167–180, https://doi.org/10.1016/j.isprsjprs.2022.10.012 (2022).
Article ADS Google Scholar
Park, M., Randel, W. J., Emmons, L. K. & Livesey, N. J. Transport pathways of carbon monoxide in the Asian summer monsoon diagnosed from Model of Ozone and Related Tracers (MOZART). Journal of Geophysical Research: Atmospheres 114 https://doi.org/10.1029/2008JD010621 (2009).
Tang, W. et al. Source Contributions to Carbon Monoxide Concentrations During KORUS‐AQ Based on CAM‐chem Model Applications. Journal of Geophysical Research: Atmospheres 124, 2796–2822, https://doi.org/10.1029/2018JD029151 (2019).
Article ADS MATH Google Scholar
Buontempo, C. et al. The Copernicus Climate Change Service: Climate Science in Action. Bull Am Meteorol Soc 103, E2669–E2687, https://doi.org/10.1175/BAMS-D-21-0315.1 (2022).
Article MATH Google Scholar
Peuch, V.-H. et al. The Copernicus Atmosphere Monitoring Service: From Research to Operations. Bull Am Meteorol Soc 103, E2650–E2668, https://doi.org/10.1175/BAMS-D-21-0314.1 (2022).
Article Google Scholar
Zhang, M. & Liu, G. Mapping contiguous XCO2 by machine learning and analyzing the spatio-temporal variation in China from 2003 to 2019. Science of The Total Environment 858, 159588, https://doi.org/10.1016/j.scitotenv.2022.159588 (2023).
Article PubMed Google Scholar
Barth, A., Alvera-Azcárate, A., Troupin, C. & Beckers, J.-M. DINCAE 2.0: multivariate convolutional neural network with error estimates to reconstruct sea surface temperature satellite and altimetry observations. Geosci Model Dev 15, 2183–2196, https://doi.org/10.5194/gmd-15-2183-2022 (2022).
Article ADS Google Scholar
Barth, A., Alvera-Azcárate, A., Licer, M. & Beckers, J.-M. DINCAE 1.0: a convolutional neural network with error estimates to reconstruct sea surface temperature satellite observations. Geosci Model Dev 13, 1609–1622, https://doi.org/10.5194/gmd-13-1609-2020 (2020).
Article ADS Google Scholar
Alvera-Azcárate, A., Barth, A., Beckers, J.-M. & Weisberg, R. H. Multivariate reconstruction of missing data in sea surface temperature, chlorophyll, and wind satellite fields. J Geophys Res 112, C03008, https://doi.org/10.1029/2006JC003660 (2007).
Article ADS Google Scholar
Alvera-Azcárate, A., Barth, A., Sirjacobs, D., Lenartz, F. & Beckers, J. M. Data Interpolating Empirical Orthogonal Functions (DINEOF): a tool for geophysical data analyses. Mediterr Mar Sci 12, 5, https://doi.org/10.12681/mms.64 (2011).
Article Google Scholar
Alvera-Azcárate, A., Barth, A., Rixen, M. & Beckers, J. M. Reconstruction of incomplete oceanographic data sets using empirical orthogonal functions: application to the Adriatic Sea surface temperature. Ocean Model (Oxf) 9, 325–346, https://doi.org/10.1016/j.ocemod.2004.08.001 (2005).
Article ADS Google Scholar
Lopez, F. P. A., Zhou, G., Jing, G., Zhang, K. & Tan, Y. XCO2 and XCH4 Reconstruction Using GOSAT Satellite Data Based on EOF-Algorithm. Remote Sens (Basel) 14, 2622, https://doi.org/10.3390/rs14112622 (2022).
Article ADS Google Scholar
Yan, X. et al. Application of Synthetic DINCAE–BME Spatiotemporal Interpolation Framework to Reconstruct Chlorophyll–a from Satellite Observations in the Arabian Sea. J Mar Sci Eng 11, 743, https://doi.org/10.3390/jmse11040743 (2023).
Article MATH Google Scholar
Murray, C. J. Cyclical Properties of Baxter-King Filtered Time Series. Review of Economics and Statistics 85, 472–476, https://doi.org/10.1162/003465303765299945 (2003).
Article MATH Google Scholar
Kuze, A. et al. Update on GOSAT TANSO-FTS performance, operations, and data products after more than 6 years in space. Atmos Meas Tech 9, 2445–2461, https://doi.org/10.5194/amt-9-2445-2016 (2016).
Article MATH Google Scholar
Noël, S. et al. XCO2 retrieval for GOSAT and GOSAT-2 based on the FOCAL algorithm. Atmos Meas Tech 14, 3837–3869, https://doi.org/10.5194/amt-14-3837-2021 (2021).
Article MATH Google Scholar
Suto, H. et al. Thermal and near-infrared sensor for carbon observation Fourier transform spectrometer-2 (TANSO-FTS-2) on the Greenhouse gases Observing SATellite-2 (GOSAT-2) during its first year in orbit. Atmos Meas Tech 14, 2013–2039, https://doi.org/10.5194/amt-14-2013-2021 (2021).
Article MATH Google Scholar
Nguyen, P., Shivadekar, S., Laya Chukkapalli, S. S. & Halem, M. Satellite Data Fusion of Multiple Observed XCO2 using Compressive Sensing and Deep Learning. in IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium 2073–2076 https://doi.org/10.1109/IGARSS39084.2020.9323861 (IEEE, 2020).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. in 234–241. https://doi.org/10.1007/978-3-319-24574-4_28 (2015).
Laughner, J. L. et al. The Total Carbon Column Observing Network's GGG2020 data version. Earth Syst. Sci. Data 16, 2197–2260. https://doi.org/10.5194/essd-16-2197-2024 (2024).
Hamazaki, T., Kaneko, Y., Kuze, A. & Kondo, K. Fourier transform spectrometer for Greenhouse Gases Observing Satellite (GOSAT). in (eds. Komar, G. J., Wang, J. & Kimura, T.) 73 https://doi.org/10.1117/12.581198 (2005).
Yokota, T. et al. Global Concentrations of CO2 and CH4 Retrieved from GOSAT: First Preliminary Results. SOLA 5, 160–163, https://doi.org/10.2151/sola.2009-041 (2009).
Article ADS MATH Google Scholar
Suto, H. et al. Updated spectral radiance calibration on TIR bands for TANSO-FTS-2 onboard GOSAT-2. Atmos Meas Tech 15, 5399–5413, https://doi.org/10.5194/amt-15-5399-2022 (2022).
Article MATH Google Scholar
Matsunaga, T. & Tanimoto, H. Greenhouse gas observation by TANSO-3 onboard GOSAT-GW. in Sensors, Systems, and Next-Generation Satellites XXVI (eds. Kimura, T., Babu, S. R. & Hélière, A.) 16 https://doi.org/10.1117/12.2639221 (SPIE, 2022).
Yoshida, Y. et al. Retrieval algorithm for CO2 and CH4 column abundances from short-wavelength infrared spectral observations by the Greenhouse gases observing satellite. Atmos Meas Tech 4, 717–734, https://doi.org/10.5194/amt-4-717-2011 (2011).
Article MATH Google Scholar
Watanabe, H. et al. Global mapping of greenhouse gases retrieved from GOSAT Level 2 products by using a kriging method. Int J Remote Sens 36, 1509–1528, https://doi.org/10.1080/01431161.2015.1011792 (2015).
Article MATH Google Scholar
Xu, Y. et al. Satellite‐derived estimations of spatial and seasonal variation in tropospheric carbon dioxide mass over China. Ecol Evol 3, 4310–4325, https://doi.org/10.1002/ece3.823 (2013).
Article PubMed PubMed Central MATH Google Scholar
Eldering, A., Taylor, T. E., O’Dell, C. W. & Pavlick, R. The OCO-3 mission: measurement objectives and expected performance based on 1 year of simulated data. Atmos Meas Tech 12, 2341–2370, https://doi.org/10.5194/amt-12-2341-2019 (2019).
Article MATH Google Scholar
Yanhui, P., Xiaoying, L., Xian, Y. & Lei, Z. Analysis of Influence to the Solar Array Output by Satellite Attitude Maneuver in Sun Synchronous Orbit. in 3372–3379. https://doi.org/10.1007/978-981-16-9492-9_331 (2022).
Srivastava, P. et al. Establishing Launch Readiness of NASA ISS Instrument OCO-3. in IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium 6101–6104 https://doi.org/10.1109/IGARSS39084.2020.9323631 (IEEE, 2020).
Taylor, T. E. et al. OCO-3 early mission operations and initial (vEarly) XCO2 and SIF retrievals. Remote Sens Environ 251, 112032, https://doi.org/10.1016/j.rse.2020.112032 (2020).
Article Google Scholar
Kiel, M. et al. Urban-focused satellite CO2 observations from the Orbiting Carbon Observatory-3: A first look at the Los Angeles megacity. Remote Sens Environ 258, 112314, https://doi.org/10.1016/j.rse.2021.112314 (2021).
Article MATH Google Scholar
Hedelius, J. K. et al. Evaluation of MOPITT Version 7 joint TIR–NIR X<sub>CO</sub> retrievals with TCCON. Atmos Meas Tech 12, 5547–5572, https://doi.org/10.5194/amt-12-5547-2019 (2019).
Article MATH Google Scholar
Kiel, M. et al. How bias correction goes wrong: measurement of XCO2; affected by erroneous surface pressure estimates. Atmos Meas Tech 12, 2241–2259, https://doi.org/10.5194/amt-12-2241-2019 (2019).
Article MATH Google Scholar
Borsdorff, T. et al. Mapping carbon monoxide pollution from space down to city scales with daily global coverage. Atmos Meas Tech 11, 5507–5518, https://doi.org/10.5194/amt-11-5507-2018 (2018).
Article MATH Google Scholar
O’Dell, C. W. et al. Improved retrievals of carbon dioxide from Orbiting Carbon Observatory-2 with the version 8 ACOS algorithm. Atmos Meas Tech 11, 6539–6576, https://doi.org/10.5194/amt-11-6539-2018 (2018).
Article MATH Google Scholar
Wunch, D. et al. A method for evaluating bias in global measurements of CO<sub>2</sub> total columns from space. Atmos Chem Phys 11, 12317–12337, https://doi.org/10.5194/acp-11-12317-2011 (2011).
Article ADS MATH Google Scholar
Total Carbon Column Observing Network (TCCON) Team. (2022). 2020 TCCON Data Release (2020).
Morino, I., Velazco, V. A., Hori, A., Uchino, O., & Griffith, D. W. T. TCCON data from Burgos, Ilocos Norte (PH), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.burgos01.R0 (2022).
Wennberg, P. O. et al. TCCON data from Caltech (US), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.pasadena01.R0 (2022).
Deutscher, N. M. et al. TCCON data from Darwin (AU), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.darwin01.R0 (2023).
Wunch, D., Mendonca, J., Colebatch, O., Allen, N. T., Blavier, J.-F., Kunz, K., Roche, S., Hedelius, J., Neufeld, G., Springett, S., Worthy, D., Kessler, R., & Strong, K. TCCON data from East Trout Lake, SK (CA), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.easttroutlake01.R0 (2022).
Sussmann, R., & Rettinger, M. TCCON data from Garmisch (DE), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.garmisch01.R0 (2025).
Weidmann, D., Brownsword, R., & Doniki, S. TCCON data from Harwell, Oxfordshire (UK), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.harwell01.R0 (2023).
Liu, C., Wang, W., Sun, Y., & Shan, C. TCCON data from Hefei (PRC), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.hefei01.R1 (2023).
García, O. E., Schneider, M., Herkommer, B., Gross, J., Hase, F., Blumenstock, T., & Sepúlveda, E. TCCON data from Izana (ES), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.izana01.R1 (2022).
Wennberg, P. O., Roehl, C. M., Blavier, J.-F., Wunch, D., & Allen, N. T. TCCON data from Jet Propulsion Laboratory (US), 2011, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.jpl01.R0 (2022).
Wennberg, P. O. et al. TCCON data from Jet Propulsion Laboratory (US), 2007, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.jpl02.R0 (2022).
Hase, F. et al. TCCON data from Karlsruhe (DE), Release GGG2020.R2 (Versión R2) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.karlsruhe01.R2 (2024).
Wennberg, P. O., Wunch, D., Roehl, C. M., Blavier, J.-F., Toon, G. C., & Allen, N. T. TCCON data from Lamont (US), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lamont01.R0 (2022).
Sherlock, V. et al. TCCON data from Lauder (NZ), 120HR, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lauder01.R0 (2022).
Sherlock, V. et al. TCCON data from Lauder (NZ), 125HR, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lauder02.R0 (2022).
Pollard, D. F., Robinson, J., & Shiona, H. TCCON data from Lauder (NZ), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lauder03.R0 (2022).
Petri, C. et al. TCCON data from Nicosia (CY), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.nicosia01.R1 (2024).
Warneke, T., Petri, C., Notholt, J., & Buschmann, M. TCCON data from Orléans (FR), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.orleans01.R0 (2024).
Wennberg, P. O. et al. TCCON data from Park Falls (US), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.parkfalls01.R1 (2022).
De Mazière, M. et al. TCCON data from Réunion Island (RE), Release GGG2020.R0 (Versión R0) https://doi.org/10.14291/tccon.ggg2020.reunion01.R0 (2022).
Morino, I., Ohyama, H., Hori, A., & Ikegami, H. TCCON data from Rikubetsu (JP), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.rikubetsu01.R0 (2022).
Shiomi, K. et al. TCCON data from Saga (JP), Release GGG2020.R0 (Versión R0) [Data set]. https://doi.org/10.14291/tccon.ggg2020.saga01.R0 (2022).
Morino, I., Ohyama, H., Hori, A., & Ikegami, H. TCCON data from Tsukuba (JP), 125HR, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.tsukuba02.R0 (2022).
Zhou, M., Wang, P., Kumps, N., Hermans, C., & Nan, W. TCCON data from Xianghe, China, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.xianghe01.R0 (2022).
Inness, A. et al. The CAMS reanalysis of atmospheric composition. Atmos Chem Phys 19, 3515–3556, https://doi.org/10.5194/acp-19-3515-2019 (2019).
Article ADS MATH Google Scholar
Agustí-Panareda, A. et al. Technical note: The CAMS greenhouse gas reanalysis from 2003 to 2020. Atmos Chem Phys 23, 3829–3859, https://doi.org/10.5194/acp-23-3829-2023 (2023).
Article ADS MATH Google Scholar
Li, T., Wu, J. & Wang, T. Generating daily high-resolution and full-coverage XCO2 across China from 2015 to 2020 based on OCO-2 and CAMS data. Science of The Total Environment 893, 164921, https://doi.org/10.1016/j.scitotenv.2023.164921 (2023).
Article PubMed Google Scholar
Chen, Y. et al. Global-Scale Evaluation of XCO2 Products from GOSAT, OCO-2 and CarbonTracker Using Direct Comparison and Triple Collocation Method. Remote Sens (Basel) 14, 5635, https://doi.org/10.3390/rs14225635 (2022).
Article ADS MATH Google Scholar
Basilio, R. R., Livermore, T. R., Shen, Y. J. & Pollock, H. R. The quest for an OCO (Orbiting Carbon Observatory) re-flight. in (eds. Picard, R. H., Schäfer, K., Comeron, A. & van Weele, M.) 78270A https://doi.org/10.1117/12.867042 (2010).
Sunder, S., Ramsankaran, R. & Ramakrishnan, B. Machine learning techniques for regional scale estimation of high-resolution cloud-free daily sea surface temperatures from MODIS data. ISPRS Journal of Photogrammetry and Remote Sensing 166, 228–240, https://doi.org/10.1016/j.isprsjprs.2020.06.008 (2020).
Article ADS Google Scholar
Li, T., Shen, H., Yuan, Q. & Zhang, L. Geographically and temporally weighted neural networks for satellite-based mapping of ground-level PM2.5. ISPRS Journal of Photogrammetry and Remote Sensing 167, 178–188, https://doi.org/10.1016/j.isprsjprs.2020.06.019 (2020).
Article ADS MATH Google Scholar
Kamir, E., Waldner, F. & Hochman, Z. Estimating wheat yields in Australia using climate records, satellite image time series and machine learning methods. ISPRS Journal of Photogrammetry and Remote Sensing 160, 124–135, https://doi.org/10.1016/j.isprsjprs.2019.11.008 (2020).
Article ADS Google Scholar
Wu, C., Ju, Y., Yang, S., Zhang, Z. & Chen, Y. Reconstructing annual XCO2 at a 1 km×1 km spatial resolution across China from 2012 to 2019 based on a spatial CatBoost method. Environ Res 236, 116866, https://doi.org/10.1016/j.envres.2023.116866 (2023).
Article PubMed Google Scholar
Zhang, L., Li, T., Wu, J. & Yang, H. Global estimates of gap-free and fine-scale CO2 concentrations during 2014–2020 from satellite and reanalysis data. Environ Int 178, 108057, https://doi.org/10.1016/j.envint.2023.108057 (2023).
Article PubMed MATH Google Scholar
Wang, Y. et al. Seamless mapping of long-term (2010–2020) daily global XCO2 and XCH4 from the Greenhouse Gases Observing Satellite (GOSAT), Orbiting Carbon Observatory 2 (OCO-2), and CAMS global greenhouse gas reanalysis (CAMS-EGG4) with a spatiotemporally self-supervised fusion method. Earth Syst Sci Data 15, 3597–3622, https://doi.org/10.5194/essd-15-3597-2023 (2023).
Article ADS MATH Google Scholar
Chen, J. et al. Multi-sensor integrated mapping of global XCO2 from 2015 to 2021 with a local random forest model. ISPRS Journal of Photogrammetry and Remote Sensing 208, 107–120, https://doi.org/10.1016/j.isprsjprs.2024.01.009 (2024).
Article MATH Google Scholar
He, Q. et al. Full-coverage mapping high-resolution atmospheric CO2 concentrations in China from 2015 to 2020: Spatiotemporal variations and coupled trends with particulate pollution. J Clean Prod 428, 139290, https://doi.org/10.1016/j.jclepro.2023.139290 (2023).
Article Google Scholar
Yao, F., Wu, J., Li, W. & Peng, J. A spatially structured adaptive two-stage model for retrieving ground-level PM2.5 concentrations from VIIRS AOD in China. ISPRS Journal of Photogrammetry and Remote Sensing 151, 263–276, https://doi.org/10.1016/j.isprsjprs.2019.03.011 (2019).
Article ADS MATH Google Scholar
Wang, S., Shi, C., Fang, C. & Feng, K. Examining the spatial variations of determinants of energy-related CO2 emissions in China at the city level using Geographically Weighted Regression Model. Appl Energy 235, 95–105, https://doi.org/10.1016/j.apenergy.2018.10.083 (2019).
Article ADS MATH Google Scholar
Liu, Q., Wu, S., Lei, Y., Li, S. & Li, L. Exploring spatial characteristics of city-level CO2 emissions in China and their influencing factors from global and local perspectives. Science of The Total Environment 754, 142206, https://doi.org/10.1016/j.scitotenv.2020.142206 (2021).
Article PubMed MATH Google Scholar
Nguyen, H., Cressie, N. & Braverman, A. Multivariate Spatial Data Fusion for Very Large Remote Sensing Datasets. Remote Sens (Basel) 9, 142, https://doi.org/10.3390/rs9020142 (2017).
Article ADS MATH Google Scholar
Pearson, K. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2, 559–572, https://doi.org/10.1080/14786440109462720 (1901).
Article MATH Google Scholar
Gao, Z., Jiang, Y., He, J. & Wu, J. Spatiotemporal variation analysis of global XCO2 concentration during 2010–2020 based on DINEOF-BME framework and wavelet function. Science of The Total Environment 892, 164750, https://doi.org/10.1016/j.scitotenv.2023.164750 (2023).
Article PubMed Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep Residual Learning for Image Recognition. in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 https://doi.org/10.1109/CVPR.2016.90 (IEEE, 2016).
Liu, H., Jiang, B., Xiao, Y. & Yang, C. Coherent Semantic Attention for Image Inpainting. in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 4169–4178 https://doi.org/10.1109/ICCV.2019.00427 (IEEE, 2019).
Zhang, Z., Stanev, E. V. & Grayek, S. Reconstruction of the Basin‐Wide Sea‐Level Variability in the North Sea Using Coastal Data and Generative Adversarial Networks. J Geophys Res Oceans 125 https://doi.org/10.1029/2020JC016402 (2020).
Olmedo, E., Taupier-Letage, I., Turiel, A. & Alvera-Azcárate, A. Improving SMOS Sea Surface Salinity in the Western Mediterranean Sea through Multivariate and Multifractal Analysis. Remote Sens (Basel) 10, 485, https://doi.org/10.3390/rs10030485 (2018).
Article ADS Google Scholar
Olmedo, E. et al. Nine years of SMOS sea surface salinity global maps at the Barcelona Expert Center. Earth Syst Sci Data 13, 857–888, https://doi.org/10.5194/essd-13-857-2021 (2021).
Article ADS MATH Google Scholar
Lopez, F. P. A. Daily Column average CO2 mapping combining OCO 3, GOSAT, and CAMS Data applying EOF and Deep learning from 2018 to 2023. Zenodo https://doi.org/10.5281/zenodo.13895409 (2024).
Jing, Y., Shi, J., Zhang, P., Wang, T. & Chen, L. Comparison of atmospheric carbon dioxide concentration based on GOSAT and OCO-2 observations. in 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 4071–4073 https://doi.org/10.1109/IGARSS.2016.7730059 (IEEE, 2016).
Kulawik, S. et al. Consistent evaluation of ACOS-GOSAT, BESD-SCIAMACHY, CarbonTracker, and MACC through comparisons to TCCON. Atmos Meas Tech 9, 683–709, https://doi.org/10.5194/amt-9-683-2016 (2016).
Article MATH Google Scholar
Rodgers, C. D. & Connor, B. J. Intercomparison of remote sounding instruments. Journal of Geophysical Research: Atmospheres 108 https://doi.org/10.1029/2002JD002299 (2003).
(TCCON), T. C. C. O. N. Team 2020 TCCON Data Release (Version GGG2020) [Data set], https://doi.org/10.14291/TCCON.GGG2020 (2022).

Download references

Acknowledgements

This project was supported by the National Natural Science Foundation of China (Grant No. 42471425), and the National Key Research and Development Program of China (Grant No. 2021YFB3901000, 2021YFB3901005). We extend our heartfelt gratitude to the data providers of the Orbiting Carbon Observatory-3 (OCO-3), the Greenhouse Gases Observing Satellite (GOSAT), and the Total Carbon Column Observing Network (TCCON). Their invaluable datasets provided the essential foundation for this study, enabling us to achieve a deeper and more comprehensive analysis of atmospheric CO2. We greatly appreciate their commitment to open data sharing, which significantly enriched our research and contributed to advancing the broader scientific understanding of carbon monitoring.

Author information

Authors and Affiliations

School of Instrumentation Science and Optoelectronic Engineering, Beihang University, Beijing, 100191, China
Franz Pablo Antezana Lopez & Guanhua Zhou
Hangzhou International Innovation Institute, Beihang University, Hangzhou, 311115, China
Guifei Jing
State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing, 100012, China
Kai Zhang
Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, 100094, China
Liangfu Chen
National Satellite Meteorological Centre, Beijing, China, 100081, China
Lin Chen
School of Transportation Science and Engineering, Beihang University, Beijing, 100191, China
Yumin Tan

Authors

Franz Pablo Antezana Lopez
View author publications
Search author on:PubMed Google Scholar
Guanhua Zhou
View author publications
Search author on:PubMed Google Scholar
Guifei Jing
View author publications
Search author on:PubMed Google Scholar
Kai Zhang
View author publications
Search author on:PubMed Google Scholar
Liangfu Chen
View author publications
Search author on:PubMed Google Scholar
Lin Chen
View author publications
Search author on:PubMed Google Scholar
Yumin Tan
View author publications
Search author on:PubMed Google Scholar

Contributions

Franz Pablo Antezana Lopez: Algorithm development, data analysis, XCO₂ interpretation; Guanhua Zhou: Research oversight, guidance, theoretical framework, accuracy assurance; Guifei Jing: Communication, manuscript drafting, team collaboration, seamless workflow; Kai Zhang: Data collection, preprocessing, algorithm implementation, technical expertise, reconstruction; Lin Chen: Comparative analysis, data validation, consistency assessment, dataset reliability; Liangfu Chen: Theoretical framework, innovation proposals, remote sensing expertise, Tan Yumin: Revision.

Corresponding author

Correspondence to Guanhua Zhou.

Ethics declarations

Competing interests

We confirm the absence of any competing interests that could potentially influence or bias the research outcomes, methodology, or presentation. Our work was driven by a commitment to scientific integrity, and we assert that no financial, commercial, personal, or intellectual conflicts exist that could cast doubt on the objectivity of our findings. Our research approach is characterized by transparency and a dedication to advancing knowledge in the field of remote sensing and greenhouse gas monitoring, free from any external influences.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Antezana Lopez, F.P., Zhou, G., Jing, G. et al. Global Daily Column Average CO₂ at 0.1° × 0.1° Spatial Resolution Integrating OCO-3, GOSAT, CAMS with EOF and Deep Learning. Sci Data 12, 268 (2025). https://doi.org/10.1038/s41597-024-04135-w

Download citation

Received: 09 April 2024
Accepted: 15 November 2024
Published: 14 February 2025
DOI: https://doi.org/10.1038/s41597-024-04135-w