Abstract
Accurate global carbon dioxide (CO2) distribution with high spatial and temporal resolution is essential for understanding its dynamics and impacts on climate change. This study tackles the challenge of data gaps in satellite observations of greenhouse gases, caused by orbital and observational limitations. We reconstructed a comprehensive dataset of Column-averaged CO2 (XCO2) concentrations by integrating re-analyzed data from the Copernicus Atmosphere Monitoring Service (CAMS) with observations from GOSAT and OCO-3 satellites. Using two advanced data reconstruction methods—Data Interpolating Empirical Orthogonal Functions (DINEOF) and Convolutional Auto-Encoder (DINCAE)—we imputed missing data, preserving spatial and temporal consistency. The combined approach achieved high accuracy, with Pearson correlation values between 0.94 and 0.95 against TCCON measurements, and we also reported root mean square error (RMSE) to assess model performance further. Our results indicate that these techniques generate a daily, high-resolution, gap-free XCO2 dataset, enabling improved CO2 monitoring, climate modeling, and policy development.
Similar content being viewed by others
Background & Summary
Global temperatures have been on a steady upward trajectory since the late 19th century1,2,3,4, largely driven by human activities that have significantly increased the concentration of greenhouse gases in the atmosphere, with carbon dioxide (CO2) being the primary contributor to global warming. It has never been more important to reduce CO2 emissions, especially now that the global XCO2 level has reached 400 parts per million (ppm)5,6 Around 2015 and 2016. Nowadays as of July 2024, the value reached 421.1 ppm7. Numerous strategies have been adopted to lessen the negative consequences of climate change in response to mounting concerns. The C40 city network, which has established local-level measures to reduce CO2 emissions in cities, is one famous example8,9. However, precise data is strongly required to evaluate these initiatives’ success. Although some towns have installed mobile and fixed ground-based measurement networks10. these networks often face limitations in terms of spatial coverage and temporal consistency. Ground-based stations are generally sparse and only provide data at fixed locations, making it difficult to capture the full variability of CO2 across diverse regions. In contrast, high temporal and spatial resolution satellite datasets offer comprehensive global coverage, enabling continuous monitoring of urban and remote areas. This allows for a more detailed understanding of CO2 dynamics, particularly in areas where ground-based measurements are unavailable. Moreover, satellite data provides consistent observations over time, which is critical for tracking short-term fluctuations and long-term trends in carbon emissions and atmospheric concentrations.
The development of remote sensing atmospheric sounding technology has increased the collection of regional and global CO2 data via satellite measurements11,12. Radiative transfer theory is employed to estimate the column-averaged carbon dioxide dry air mole fraction (XCO2) using atmospheric spectral data from spaceborne sensors13. However, significant challenges arise due to calibration issues and mismatches between retrieved and observed radiances, leading to biases in XCO2 estimates14. Various space agencies and satellites contribute to CO2 observation programs, yet they face spatial coverage limitations. For instance, NASA’s Orbiting Carbon Observatory-2 (OCO-2), launched in 2014, provides precise CO2 measurements using near-infrared observations but has a limited observational swath, posing challenges for the comprehensive spatial distribution15,16,17. Similarly, Japan’s GOSAT and ESA’s efforts have advanced CO2 monitoring, but all face challenges regarding spatial data coverage.
While the Copernicus Atmosphere Monitoring Service (CAMS) provides a more comprehensive dataset without gaps, encapsulating global XCO2 information18,19,20, its specific advantages and limitations compared to other datasets are often underemphasized. CAMS enhances data resolution and precision, especially in reconstructing daily XCO2 datasets21,22. But key challenges remain, including gaps in satellite observations caused by cloud cover or technical issues, on the other hand CAMS dataset. To mitigate these gaps, researchers have integrated multisource datasets, such as OCO-2 and OCO-3, with machine learning techniques like neural networks and convolutional neural networks to improve spatial and temporal resolution23. Data reconstruction algorithms such as the Data Interpolating Convolutional Auto-Encoder (DINCAE)24,25 and Data Interpolating Empirical Orthogonal Functions (DINEOF)26,27,28, originally developed for sea surface temperature, are also being adapted to enhance the spatiotemporal coverage of missing CO2 data.
Earlier research explored various methods aimed at expanding the spatiotemporal coverage of satellite-based CO2 measurements. These studies mainly addressed the challenge of reconstructing missing data in carbon satellite datasets. Techniques such as the Baxter-King Filter were employed following the application of algorithms like DINEOF and DINCAE to fill data gaps3,29,30,31, While these methods have been useful, gaps in their application remain. Specifically, DINEOF’s reliance on Empirical Orthogonal Functions (EOFs) to estimate missing data through dominant spatial patterns may oversimplify complex temporal and spatial variability in CO2 measurements. The comparison of datasets, such as OCO-2 and GOSAT in the near-infrared (NIR) and shortwave infrared (SWIR) regions, helped generate the initial version of the XCO2 dataset32,33,34. However, merging datasets from different instruments still presents challenges, particularly regarding consistency and resolution.
While the DINCAE algorithm, using deep learning and a convolutional auto-encoder, has shown promise by efficiently reconstructing missing data from OCO-3 measurements, its application across multiple datasets has yet to be fully explored35. The integration of multisource data, including TCCON, GOSAT, OCO-2, and OCO-3, provides a step forward in reconstructing monthly XCO2 values, but the fusion of these datasets still faces limitations in accuracy and validation.
DINCAE, as a neural network comprising encoders and decoders, reduces the resolution of satellite data through convolutional layers and then reconstructs the information using interpolation layers24,36. This approach shows potential for filling gaps in satellite datasets but requires further validation, particularly when applied to complex geophysical processes. Similarly, while DINEOF and DINCAE algorithms have improved the ability to reconstruct data from OCO-3 measurements, the accuracy and reliability of the reconstructed data must be compared against benchmark datasets like TCCON and GOSAT. TCCON, which began with the installation of equipment in Park Falls, Wisconsin, in 2004 and has since expanded to 23 globally operational devices, provides a crucial standard for validating satellite-derived CO2 data37 However, further comparison and validation efforts are essential to enhance the reliability of reconstructed datasets across different instruments and regions.
This study delves into the reconstruction of a global CO2 dataset with comprehensive coverage and high spatiotemporal resolution by integrating multisource from OCO-3, GOSAT, CAMS, contributing to the analysis of CO2 dynamics and their climatic impacts. The novel reconstruction techniques, DINEOF and DINCAE, will be employed to address data gaps due to sensor limitations and cloud cover. To enhance the data reliability and underscore the effectiveness of these methods, the reconstructed dataset will be compared with TCCON and other datasets. Furthermore, we demonstrate the critical role of post-reconstruction filtering in ensuring data integrity, significantly advancing our understanding of the carbon cycle and informing climate policy. This study also reflects on the broader implications of NASA’s OCO-3 mission for climate change analysis, advocating for enhanced satellite coverage to tackle global environmental challenges.
Methods
This study intends to reconstruct global daily gap-free XCO2 datasets with high resolution to enhance our understanding of CO2 dynamics, setting the stage for crafting robust climate models and actionable mitigation strategies, with validation and spatiotemporal pattern analysis. A key feature of this research is the employment of a high temporal resolution and consistent sampling method, which reinforces the reliability of the comparative analysis. Figure 1 graphically delineates the systematic methodology undertaken, beginning with the collation of TCCON, OCO-3, and GOSAT datasets, followed by data pre-processing which includes quality filtering and standardization to a 0.1-degree resolution. Data reconstruction is executed using DINEOF and DINCAE algorithms, where deep learning and empirical orthogonal functions merge disparate data into a coherent, gap-free dataset. This process culminates in post-processing steps, applying spatial median filtering and time series noise reduction, refining the data for enhanced accuracy, as demonstrated by the spatial and temporal trend comparisons with TCCON benchmarks.
Input datasets overview
Although GOSAT and CAMS datasets both use data from the GOSAT satellite, their different processing methods ensure they provide unique insights. GOSAT offers direct satellite measurements, while CAMS includes reanalyzed data from multiple sources, enhancing the completeness and resolution of CO2 coverage. Combining these datasets compensates for missing information in sparse datasets like OCO-3, allowing for more accurate interpolation. To create daily column average values of CO2 maps, we analyzed multiple datasets to validate the information and study each parameter. The list of used datasets is presented in Table 1 and a detailed description.
GOSAT satellite dataset
The Greenhouse Gas Observing Satellite, also known as “IBUKI” or GOSAT, is essential for monitoring two key greenhouse gases: CO2 and CH438,39. Key instruments on board include the Fourier Transform Spectrometer (FTS) and Cloud-Aerosol Imager (CAI) of the Thermal and Near-infrared Sensor (TANSO)40,41. The FTS collects reflected short-wavelength infrared (SWIR) radiation in three bands: 0.76, 1.6, and 2.0 μm, allowing for near-surface CO2 retrievals39. It also detects thermal infrared radiation (TIR) in the range of 5.5–14.3 μm, using absorption bands near 14 μm to gather CO2 concentration data up to 2 km39,42. Dense clouds and aerosol interference are captured by CAI, supporting high-quality data filtering.
For this study, we used a bias-corrected monthly GOSAT Level-3 global XCO2 product, with 2.5° × 2.5° resolution, covering June 2018 to December 2023 (Data is accessible at https://data2.gosat.nies.go.jp for GOSAT). This dataset is derived from FTS SWIR Level-2 data, which is smoothed, extrapolated, and interpolated via the Kriging approach43,44. fill gaps and improve spatiotemporal XCO2 coverage43.
OCO-3 satellite dataset
NASA’s OCO-3 satellite, launched in 2019, tracks atmospheric CO2 with a high spatial resolution of 2.5 km × 0.7 km45,46,47. Its FTS captures spectral data for accurate CO2 and CH4 column measurements, while TCCON data provides detailed validation of the total carbon column13. This study uses OCO-3’s monthly SWIR observations from August 2019 to November 2023. OCO-3’s orbital constraints can challenge data collection, but the L2FP algorithm mitigates this by filtering variables, including CO2 and H2O ratios, albedo, BRDF coefficients, RMS errors, and atmospheric factors48. We verified OCO-3 data against TCCON and atmospheric stations, focusing on the satellite’s spatial coverage from latitudes −52° to 52° and longitudes −180° to 180°.
TCCON in-situ dataset
It consists of high-resolution ground-based Fourier Transform Spectrometers that measure column-averaged CO2 and other gases such as CH4 and CO. TCCON has been crucial for validating satellite data from OCO-2, GOSAT, and other missions49,50,51,52,53,54.
We use the extensive GGG2020 dataset55 from several TCCON stations, from Burgos56, Caltech57, Darwin58, East Trout Lake59, Garmisch60, Harwell61, Hefei62, Izana63, Jet propulsion laboratory64,65, Karlsruhe66, Lamont67, Lauder68,69,70, Nicosia71, Orleans72, Paris, Park Falls73, Reunion Island74, Rikubetsu75, Saga76, Tsukuba77, Wollongong and Xianghe78, all of them distributed around the world, the data were extracted from 2018 to 2023.
CAMS dataset
The CAMS XCO2 dataset, generated by the ECMWF’s Integrated Forecasting System (IFS) and 4DVar data assimilation system, offers high-resolution global atmospheric data, including greenhouse gases, with 0.75° spatial and three-hour temporal resolution79. This study focuses on CO2 column-mean molar fractions, and differences in spatial and temporal coverage between OCO-3 are accounted for. CAMS XCO2 data is integrated for consistency and using this data for extrapolation80, improving our understanding of atmospheric CO2 dynamics. Prior studies using CAMS and OCO-2 data for XCO2 reconstruction via deep learning have shown high accuracy81.
Initial input datasets, preliminary insights
Terrestrial areas tend to exhibit larger errors compared to marine areas, with GOSAT and Carbon tracker datasets showing particularly high inaccuracies82. On the other hand, OCO-2 and OCO-3 data have spatial and temporal resolution problems because it has several missed information, statistical characteristics, uncertainties, and constraints related to temporal fluctuation and various seasons of the year are included in the data comparison between TCCON and OCO-2, including glint and nadir modes11. Regarding the data in terms of enhancing the spatiotemporal coverage of OCO-3 data in this research, the preliminary findings of applying the DINCAE and DINEOF algorithms show encouraging results. The reconstructed data will offer insightful knowledge of the dynamics of CO2 and its impact on climate change. It was crucial to improve the accuracy and dependability of our reconstructed dataset. Using diligent data filtering techniques to prepare the dataset for in-depth analysis and modeling was crucial. Meaningful advancements in carbon cycle studies, climate modeling, and policy formulation depend on our ability to comprehend the dynamics of CO2 and their broad consequences for climate change. The improvements made possible by the skillful application of DINCAE and DINEOF algorithms and cutting-edge data filtering methods significantly increased the spatiotemporal coverage and dependability of OCO-3 data. This advancement is crucial because it enhances our understanding of CO2 dynamics and their significant influence on climate change.
The OCO-3 dataset has data gaps ranging from 99.75% to 100% per day, this data was also filtered by analyzing the quality_flag equal to 0 which generally indicates reliable data. On Earth’s surface, each grid cell corresponds to a 0.1 × 0.1-degree area. This gap hinders our full comprehension of CO2 levels, potentially compromising the accuracy and completeness of global CO2 assessments and modeling efforts. Figure 2 presents a temporal analysis illustrating the variation in missing information within OCO-3 satellite data.
Similarly, the GOSAT L3 product, with a coarser spatial resolution of 2.5 × 2.5 degrees, shows empty data in 38–65% of the grid cells each month. On Earth’s surface, this corresponds to an area of approximately 625,000 km2. The lack of data in this range makes it difficult to fully understand CO2 dynamics, especially at local and regional scales. Figure 3 illustrates a temporal analysis of empty information variation of GOSAT L3 V03.05 satellite data products.
Thus, the magnitude and significance of the missing data gaps in both OCO-3 and GOSAT L3 varied to different degrees for CO2 monitoring. Hence, the ability to collect comprehensive global CO2 distributions and perform accurate regional analyses is limited. The GOSAT data covers more XCO2 information spatially, but the temporal and spatial resolution is lower than that of the OCO-3 satellites. To improve our understanding of CO2 dynamics and contribute to the success of climate change mitigation initiatives, it is crucial to enhance satellite coverage and data collection continuously.
Figure 4 illustrates the geographical distribution of the data utilized in this study. Derived from GOSAT Level 3 and OCO-3 datasets, acquired in December 2022 and December 14, 2022, respectively. The data visualization highlights significant gaps in the global coverage of atmospheric CO2, showing that the OCO-3 dataset mostly spans the region between 52°N and 52°S. This emphasizes how critical it is to have a complete dataset and how crucial it is to rebuild satellite pictures to close this gap. The CAMS dataset, on the other hand, offers global coverage based on simulation results, highlighting a notable discrepancy in data comprehensiveness and geographic scope and emphasizing the integration of multiple sources for accurate CO2 monitoring.
The OCO-3 satellite and its state-of-the-art equipment offer a plethora of information essential for tracking atmospheric greenhouse gases. By analyzing its SWIR data, we intend to gain insight into the global carbon cycle and assess the environmental effects of human activities83. Rigorous comparisons with TCCON, fusing the information of CAMS, GOSAT, and OCO-3, will verify the accuracy and reliability of OCO-3 for scientific research and environmental management.
Figure 5 presents an extensive plot against a world map showing the atmospheric XCO2 concentrations between 2018 and 2023. The scatter plot distinguishes data points according to the source satellite and associated geographic area. Notably, the alignment of patterns between CAMS and OCO-3 indicates a high degree of consistency in XCO2 measurements, reinforcing the reliability and calibration uniformity of the OCO-3 sensor and CAMS model.
The integrated map highlights the coverage of the OCO-3 satellite in yellow, indicating the spatial emphasis of the dataset and obliquely implying a geographically limited investigation. While the overall trends of CAMS and OCO-3 align closely, GOSAT measurements deviate from this trend, especially in south ocean areas. The datasets exhibit consistency in inland areas, but significant discrepancies arise over ocean regions. These deviations, particularly highlighted in the scatter plot, point to potential regional anomalies, calibration inconsistencies, or differences in sensor sensitivities. In the larger scheme of atmospheric monitoring, these discrepancies are crucial since they could indicate the requirement for further research into regional atmospheric dynamics and cross-calibration. To address this issue, we excluded the GOSAT grid cells with large deviations from CAMS and OCO-3 to improve the reliability of the reconstructed dataset.
The scatterplot matrix displayed in Fig. 6 presents a comparison of XCO2 distribution with histograms for CAMS, GOSAT, and OCO-3 in the panels along the diagonal. This information reveals the frequency of different XCO2 ppm values captured by each system. The overlapping period for the three datasets (CAMS, OCO-3, and GOSAT) is from 2019-10 to 2020-12, so the assessment and comparison of XCO2 data to evaluate the input datasets are within this period. The scatterplots in the off-diagonal panels compare the readings for every pair of systems, and the linear regression lines show the correlation between the systems for every location. Best correlation parameters are visible in the comparison of GOSAT and CAMS data, on the other hand, the results among OCO-3 and other datasets are not consistent and, in some cases, have a negative slope in linear correlation, particularly in southern areas. These correlation factors are presented in Table 2, which offers quantitative insights into the degree of agreement between measurements from several satellite systems in a variety of places.
Table 2 presents the correlation parameters by region and dataset. In Europe, CAMS and GOSAT exhibit the highest R2 value (0.64) and Pearson correlation (0.80), demonstrating a strong linear relationship and high prediction accuracy in this region. Oceania, however, shows a negative correlation (−0.09) in this particular comparison, indicating a weak and unfavorable association. Europe also shows a steep positive slope (1.39) and a significant negative intercept (−161.49), reinforcing the strong linear relationship. The correlation levels between CAMS and OCO-3 and GOSAT and OCO-3 vary by region. For example, Europe maintains a strong correlation in the CAMS-OCO-3 comparison (Pearson coefficient of 0.68), while Africa shows negative correlations in both GOSAT-OCO-3 (−0.19) and CAMS-OCO-3 (−0.18), pointing to differences in satellite data or localized fluctuations in the atmosphere. These region-specific variations highlight the complexity and variety in interpreting satellite data and underscore the need for localized atmospheric studies.
Dataset reconstruction method
According to previous studies84,85,86,87,88,89,90,91, the most often used technical approaches for modeling relationships are statistical models, machine learning, and deep learning techniques. Despite the widespread use of statistical models, including spatial autoregression models, geographically weighted regression (GWR), and linear regression models, to investigate spatiotemporal relationships, there are still certain shortcomings in estimating XCO292,93,94. Those works focus on the use of Deep Learning and EOF techniques to generate enhanced maps of XCO2 maps, which are both known in geo-statistics as space-time modal decomposition, with a focus on the crucial part that space-time modal decomposition plays in the data analysis stage of the DINEOF technique95. Our research strategy involved thorough data collection, the use of reconstruction techniques, and a strict evaluation process to fully examine the spatiotemporal coverage of OCO-3 data combined with CAMS and GOSAT. Our research had a rich context thanks to the varied climatic circumstances and variations in atmospheric CO2 levels in our study area. The OCO-3 data were gathered using cutting-edge sensors and equipment deployed on satellite platforms, providing precise measurements of the atmospheric CO2 level. Forming the backbone of this study.
To assess the spatiotemporal coverage of OCO-3 data, a comprehensive methodology involving data collection, application of DINCAE and DINEOF algorithms, and rigorous evaluation was employed. This approach facilitated a comparison of the algorithm performances through statistical analysis and visual inspection, enhancing our understanding of data reconstruction techniques and their implications for spatiotemporal coverage. The study also acknowledges limitations, particularly the potential biased or ambiguous inherent in OCO-3 data and the influence of specific characteristics of the utilized datasets and the parameter settings in the DINCAE and DINEOF algorithms on their performance. Recognizing these limitations and maintaining methodological rigor is crucial for accurate interpretation of the findings.
DINEOF reconstruction method
DINEOF (Data Interpolating Empirical Orthogonal Function), originally introduced by Pearson96, is a sophisticated analytical technique utilized to interpolate missing data and identify spatiotemporal modes. Empirical Orthogonal Function (EOF) analysis forms its foundation. The data matrix A, represented as Amxn, is the central component of DINEOF. It is a m x n matrix where m is bigger than n. In this case, n is the number of layers, usually expressed in terms of time, such as months, and m denotes all the grid cells that correspond to one layer of the research area, each grid cell arranged in a row97.
In DINEOF, the breakdown of the matrix Amxn into its parts of space and time is essential. Amxn = Vmxm Zmxn is a concise representation of this decomposition, which separates Amxn into a space function Vmxm and a time function Zmxn. Whereas the time function is made up of related time coefficients, the space function is made up of orthogonal space characteristic fields. These space characteristic fields and their coefficients are concatenated linearly to represent the space field at a given position ak as represented in the below equation.
where \({v}_{i}\) are the vectors of the space characteristic fields and \({c}_{i}\) are the coefficients. When it comes to recreating missing values in the dataset, the EOF decomposition is essential. The first step in this approach is to use EOF to decompose the spatiotemporal variable field Amxn and produce spatial typical fields. The methodology then involves selecting the first N of these spatial fields and reconstructing the complete space-time variable field with these fields. Thus, the reconstructed matrix, represented as Amxn, is the product of the time coefficients corresponding to the first N spatial typical fields. This method is a reliable way to handle missing data in complicated datasets since it works especially well with big data volumes and is resilient to local changes. In geo-statistics, it is sometimes referred to as space-time modal decomposition and is a crucial component of the DINEOF method’s data analysis95.
DINCAE reconstruction method
The “DINCAE 2.0” methodology employ a convolutional neural network (CNN) created especially designed for reconstruction of missing data in satellite observations. The core of this methodology is a U-Net type network, which is particularly effective because of its deep structure and capacity to capture both local and global properties in the input data. A distinguishing characteristic of the U-Net design is the utilization of skip connections, which aid in maintaining the fine-scale details in the data and are essential for precise reconstruction of complicated geophysical fields36,98. Assuming a Gaussian distribution of errors, the network is trained to minimize the negative log-likelihood, which can be expressed mathematically as:
Where L(θ) is the likelihood function of the parameters θ given the data X. This algorithm is further enhanced by an additional refinement phase, which processes the inputs and outputs through an auto-encoder, in addition to its architecture. This phase deepens the network and enhances the reconstruction quality, especially for complex patterns and interactions99,100. Moreover, the methodology expands to multivariate reconstruction, enabling it to handle diverse data types, such as wind fields, chlorophyll concentration, and sea surface temperature. For a thorough knowledge and analysis of environmental events, a multivariate approach is essential26,101,102. Additionally, a noteworthy development for this model is its ability to process non-gridded data, which is common in satellite datasets that vary in formats and resolutions. This flexibility is achieved by modifying the network’s cost function and the input layer, allowing it to efficiently analyze unstructured data inputs and produce organized gridded field outputs.
Figure 7 illustrates a global map collection that displays the monthly mean reconstructed atmospheric CO2 concentrations over six years, from January 2018 to July 2023. The XCO2 concentrations fluctuate throughout the year, exhibiting a seasonal pattern, with levels declining from May to September. This decrease can be attributed to increased photosynthetic activities in the Northern Hemisphere during the summer, which removes CO2 from the atmosphere. In contrast, XCO2 levels rise between October to April, driven by reduced photosynthetic activity in the winter in the Northern Hemisphere in conjunction with an increase in CO2 emissions from human activities like industrial processes and heating.
Global Spatiotemporal XCO2 evolution
Figure 8 provides a thorough analysis of variations in atmospheric CO2 in different parts of the world, highlighting the important contribution of oceans to Earth’s carbon cycle through their seasonal CO2 absorption and emission. It emphasizes the unique patterns of XCO2 exchange with the atmosphere in the Indian Ocean, the seasonal biological productivity and decay of temperate zones in the North Atlantic and North Pacific, and the continents of North America, Asia, and Europe. The waters in the Southern China Sea also show a naturally occurring cyclical constant increase with fluctuations of XCO2, almost similar trend and seasonal variation like China.
On the other hand, South America’s XCO2 trajectory, significantly influenced by the Amazon’s vast photosynthetic capacity, is marred by a concerning surge in emissions due to fossil fuel combustion and rampant deforestation. This trend is mirrored paralleled in the Southern Hemisphere regions such as the South Atlantic, South Pacific, and Oceania, where XCO2 levels, though lower, mirror the distressing upward trend observed in the northern counterparts. The depiction across these diverse landscapes contributes to a comprehensive narrative of the Earth’s carbon flux, illustrating a climate system that is both dynamic and acutely sensitive to changes, set against an ever-increasing concentration of atmospheric greenhouse gases. Notably, the graph identifies the Australian region as having the lowest column-average CO2 concentrations, marking a distinct region of interest in the global trend patterns.
The comprehensive analysis of the XCO2 linear growth rate spanning from 2018 to 2023 offers a striking visualization of the escalating CO2 concentrations worldwide, this is visible in Fig. 9. The intensity of the red hues mapped across various global regions not only underscores the areas with significant increases in XCO2 levels but also invites a deeper investigation into the underlying causes. These vivid markers serve as indicators of heightened industrial activities, urban sprawl, deforestation, and other carbon-emitting practices, highlighting the urgent need for scrutinizing local and regional environmental policies.
The Amazon Basin in South America, along with regions such as Argentina, Bolivia, Brazil, exhibits areas of elevated XCO2 levels, as indicated by the visual data. Southern region of Africa similarly shows an increase in XCO2 concentrations. These trends in the data reflect changes in atmospheric carbon dioxide but do not provide conclusive evidence about the specific causes of these increases, which could be influenced by a range of factors including natural variability caused by the landuse change, short-term atmospheric mixing, or regional land-use changes. In Asia, areas like South and west Asia, including western China, show noticeable XCO2 increases, possibly linked to industrial activity. Western region of Australia shows a subtler rise in XCO2 levels, which may be due to both natural processes and localized environmental conditions. However, caution is advised in interpreting these spatial trends as direct indicators of specific sources or processes without accounting for the potential influence of atmospheric dynamics over short periods.
Data Records
The final output of our study is a daily raster dataset, systematically organized in a folder structure and named following the pattern “Reconstructed_YYYYMMDD.tif”, representing gap-free daily XCO2 concentrations. Each data record is saved as a “.tif” file, ensuring broad compatibility and ease of use for further scientific analysis. The dataset covers an extensive period from January 2018 to November 2023 and boasts a fine spatial resolution of 0.1 degrees, facilitating precise, grid-based analyses of XCO2 distribution patterns over time. This dataset is publicly available for free download at the Zenodo103. Our data has been made publicly available for further academic inquiry and collaboration at the following digital repository: https://zenodo.org/records/13895409. We encourage scholars and researchers to access and utilize this dataset to advance the field.
Reconstructed products
A comparison between model and satellite data for XCO2 across the Western Hemisphere is shown in Fig. 10. A smoother gradient of XCO2 concentrations is seen across the final results of the integrated reconstruction of OCO-3, GOSAT, and CAMS, which incorporates all three datasets. This is likely due to the synergistic effect of combining all available data for a more accurate and higher-resolution reconstruction of the atmospheric CO2 distribution. Figure 10a OCO-3, (b) GOSAT L3, (c) CAMS, and (d) the reconstructed image using only OCO-3 data cannot obtain a complete image and still contain several missing gaps and even noise, so using only one dataset to reconstruct the data is not enough. On the other hand, Fig. 10e shows the integration of GOSAT information on data reconstruction, the resolution of GOSAT does not allow a satisfactory data reconstruction, so finally, we integrate the CAMS data, and the results are presented in Fig. 10f which are the final result of the reconstructed data. Hence, the images show how the model output (CAMS) and the original satellite observations (OCO-3 and GOSAT) differ in terms of data quality and resolution. The reconstructed and merged data maps show improvements in spatial resolution and data continuity, leading to image (f), which, as a result of the integration of all datasets, offers the most accurate and thorough perspective of XCO2 levels. This demonstrates the importance of merging data from several sources when conducting environmental monitoring and analysis.
Technical Validation
The key difference between GOSAT and CAMS lies in their spatial resolution and the data assimilation techniques used by CAMS, which integrates multiple data sources. These differences, far from being problematic, enhance the robustness of the reconstructed dataset by providing a broader spatial coverage and more frequent updates. The complementary nature of these datasets is critical in filling the gaps present in OCO-3 data, resulting in a more accurate and continuous global XCO2 dataset. This research aims to reconstruct the dataset using multisource geodata containing XCO2 information, which has been interpolated through the DINEOF and DINCAE methods. The validation process involves a comprehensive analysis of the differences between the interpolated OCO-3 data and reference datasets obtained from GOSAT and TCCON.
In evaluating the reliability of satellite-derived CO2 data, a crucial aspect involves comparing datasets from various instruments. Notably, studies104,105 have conducted comprehensive assessments of the consistency between different satellite missions, including BESD-SCIAMACHY, ACOS-GOSAT, and OCO-2. These assessments involve meticulous analyses of biases, correlation coefficients, and spatiotemporal variations, addressing challenges associated with differences in a priori profiles and averaging kernels across diverse satellite data products, as highlighted by Rodgers and Connor106. The results confirm a degree of agreement between various satellite datasets, facilitating the integration of multi-satellite observations for a more comprehensive understanding of atmospheric CO2 concentrations23.
In Fig. 11, the boxplot shows a succinct analytical comparison of atmospheric XCO2 across a suite of global monitoring TCCON stations, showcasing the spread and central tendencies within the TCCON, DINCAE, and DINEOF reconstructed datasets. The central clustering around 412 ppm highlights the coherence in global CO2 levels, with notable consistency at Burgos station, exemplifying minimal cross-method variability. Peaks are closely matched at Caltech and Armstrong Center, with the TCCON readings reaching a maximum of 427.565 ppm, indicative of regional emission spikes. A notable lower bound is observed at East Trout Lake with DINEOF, suggesting lower regional CO2 concentration levels. Lauder’s OCO-3 data exhibit good agreement with TCCON, reflecting reliable measurements. In contrast, sites like Orleans, Nicosia, and Paris display a tightly packed range, signifying data robustness. Reunion Island’s DINEOF results represent the lower end of measurements, with Xianghe maintaining a consistent range with the global data set, illustrating the extensive span of CO2 levels captured.
Figure 12a shows a scatterplot comparison between XCO2 values from TCCON and those from the DINEOF technique. A high correlation coefficient (r = 0.94) between the two sets of data indicates a strong linear link, which is revealed by the scatter of dots. The difference between values seen and values projected by an estimator or model is measured by the root mean square error, or RMSE. Quantitative estimates of the average magnitude of the error are provided by the unbiased RMSE (ubRMSE) of 1.581 ppm and the RMSE of 1.610 ppm. With a bias of 0.301 ppm, which is another measure of precision, the DLR measurements are, on average, just marginally higher than the TCCON values.
Figure 12b presents a nuanced comparison, suggesting potential calibration or adjustment enhancements to the DINCAE method. This figure demonstrates a slightly improved correlation over Fig. 12a, with a correlation coefficient of 0.95. Furthermore, it features reduced RMSE and ubRMSE values of 1.441 ppm and 1.400 ppm, respectively, indicating more precise accuracy. While the bias slightly increases to 0.343 ppm, it remains within an acceptable margin, underscoring a closer alignment with the TCCON reference data. The lower RMSE and ubRMSE, alongside the higher correlation coefficient, suggest that the observations in Fig. 12b are more accurate and consistent compared to those aligned with the TCCON benchmark. Although the bias is marginally higher, it’s not significantly detrimental. The density of points closely following the 1:1 line visually confirms a tighter congruence with TCCON readings, hinting at successful calibration or adjustments in addressing systematic discrepancies evident in the DINEOF approach depicted in Fig. 12a.
The data validation process, incorporating scatterplot analysis and spatiotemporal patterns, is a robust methodology to assess the quality and reliability of reconstructed data. Comparing the measurements with TCCON datasets contributes to the refinement of monitoring CO2 concentrations.
Code availability
The data assembly, processing, and quality-control steps were performed using R and Julia, utilizing established libraries for DINEOF and DINCAE. These implementations relied on the respective open-source libraries available. The dineof was used, that function is available in sinkr library in R, and the DINCAE implementation is available on GitHub https://github.com/gher-uliege/DINCAE. No custom code was generated to automate these procedures, as the available libraries fully supported the required functionalities.
References
Intergovernmental Panel on Climate Change (IPCC). Climate Change 2022 – Impacts, Adaptation and Vulnerability. https://doi.org/10.1017/9781009325844 (Cambridge University Press, 2023).
Hegerl, G. C. et al. Causes of climate change over the historical record. Environmental Research Letters 14, 123006, https://doi.org/10.1088/1748-9326/ab4557 (2019).
Jiang, Y., Gao, Z., He, J., Wu, J. & Christakos, G. Application and Analysis of XCO2 Data from OCO Satellite Using a Synthetic DINEOF–BME Spatiotemporal Interpolation Framework. Remote Sens (Basel) 14, 4422, https://doi.org/10.3390/rs14174422 (2022).
Adak, S., Mandal, N., Mukhopadhyay, A., Maity, P. P. & Sen, S. Current State and Prediction of Future Global Climate Change and Variability in Terms of CO2 Levels and Temperature. in Enhancing Resilience of Dryland Agriculture Under Changing Climate 15–43 https://doi.org/10.1007/978-981-19-9159-2_2 (Springer Nature Singapore, 2023).
Keeling, Ralph F; Keeling, Charles D. Atmospheric Monthly In Situ CO2 Data - Mauna Loa Observatory, Hawaii (Archive 2024-11-13). In Scripps CO2 Program Data. UC San Diego Library Digital Collections. https://doi.org/10.6075/J08W3BHW (2017).
Kozyra, J., Łukasik, Z., Kuśmińska-Fijałkowska, A. & Janota, A. Reduction of CO2 emission as a result of the use of 48-volt electrical installations in passenger cars. Archives of Transport 66, 7–20, https://doi.org/10.5604/01.3001.0016.3124 (2023).
Recent global CO2. GOSAT [online]. National Institute for Environmental Studies (NIES). https://www.gosat.nies.go.jp/en/recent-global-co2.html.
Lei, L. et al. A comparison of atmospheric CO2 concentration GOSAT-based observations and model simulations. Sci China Earth Sci 57, 1393–1402, https://doi.org/10.1007/s11430-013-4807-y (2014).
Leal, J. M. & Paterson, M. Transnational city networks, global political economy, and climate governance: C40 in Mexico and Lima. Rev Int Polit Econ 31, 26–46, https://doi.org/10.1080/09692290.2023.2167849 (2024).
Potier, E. et al. Complementing XCO2 imagery with ground-based CO2 and 14CO2 measurements to monitor CO2 emissions from fossil fuels on a regional to local scale. Atmos Meas Tech 15, 5261–5288, https://doi.org/10.5194/amt-15-5261-2022 (2022).
Wunch, D. et al. Comparisons of the Orbiting Carbon Observatory-2 (OCO-2) CO2 measurements with TCCON. Atmos Meas Tech 10, 2209–2238, https://doi.org/10.5194/amt-10-2209-2017 (2017).
Buchwitz, M. et al. The Greenhouse Gas Climate Change Initiative (GHG-CCI): Comparison and quality assessment of near-surface-sensitive satellite-derived CO2 and CH4 global data sets. Remote Sens Environ 162, 344–362, https://doi.org/10.1016/j.rse.2013.04.024 (2015).
Taylor, T. E. et al. Evaluating the consistency between OCO-2 and OCO-3 XCO 2 estimates derived from the NASA ACOS version 10 retrieval algorithm. Atmos Meas Tech 16, 3173–3209, https://doi.org/10.5194/amt-16-3173-2023 (2023).
Mauceri, S., Massie, S. & Schmidt, S. Correcting 3D cloud effects in X CO2 retrievals from the Orbiting Carbon Observatory-2 (OCO-2). Atmos Meas Tech 16, 1461–1476, https://doi.org/10.5194/amt-16-1461-2023 (2023).
Crisp, D. et al. The ACOS CO2 retrieval algorithm – Part II: Global XCO2 data characterization. Atmos Meas Tech 5, 687–707, https://doi.org/10.5194/amt-5-687-2012 (2012).
Liu, Y. et al. Effects of spectral sampling rate and range of CO2 absorption bands on XCO2 retrieval from TanSat hyperspectral spectrometer. Chinese Science Bulletin 59, 1485–1491, https://doi.org/10.1007/s11434-014-0215-8 (2014).
Connor, B. et al. Quantification of uncertainties in OCO-2 measurements of XCO2 simulations and linear error analysis. Atmos Meas Tech 9, 5227–5238, https://doi.org/10.5194/amt-9-5227-2016 (2016).
Wang, Y., Yuan, Q., Zhou, S. & Zhang, L. Global spatiotemporal completion of daily high-resolution TCCO from TROPOMI over land using a swath-based local ensemble learning method. ISPRS Journal of Photogrammetry and Remote Sensing 194, 167–180, https://doi.org/10.1016/j.isprsjprs.2022.10.012 (2022).
Park, M., Randel, W. J., Emmons, L. K. & Livesey, N. J. Transport pathways of carbon monoxide in the Asian summer monsoon diagnosed from Model of Ozone and Related Tracers (MOZART). Journal of Geophysical Research: Atmospheres 114 https://doi.org/10.1029/2008JD010621 (2009).
Tang, W. et al. Source Contributions to Carbon Monoxide Concentrations During KORUS‐AQ Based on CAM‐chem Model Applications. Journal of Geophysical Research: Atmospheres 124, 2796–2822, https://doi.org/10.1029/2018JD029151 (2019).
Buontempo, C. et al. The Copernicus Climate Change Service: Climate Science in Action. Bull Am Meteorol Soc 103, E2669–E2687, https://doi.org/10.1175/BAMS-D-21-0315.1 (2022).
Peuch, V.-H. et al. The Copernicus Atmosphere Monitoring Service: From Research to Operations. Bull Am Meteorol Soc 103, E2650–E2668, https://doi.org/10.1175/BAMS-D-21-0314.1 (2022).
Zhang, M. & Liu, G. Mapping contiguous XCO2 by machine learning and analyzing the spatio-temporal variation in China from 2003 to 2019. Science of The Total Environment 858, 159588, https://doi.org/10.1016/j.scitotenv.2022.159588 (2023).
Barth, A., Alvera-Azcárate, A., Troupin, C. & Beckers, J.-M. DINCAE 2.0: multivariate convolutional neural network with error estimates to reconstruct sea surface temperature satellite and altimetry observations. Geosci Model Dev 15, 2183–2196, https://doi.org/10.5194/gmd-15-2183-2022 (2022).
Barth, A., Alvera-Azcárate, A., Licer, M. & Beckers, J.-M. DINCAE 1.0: a convolutional neural network with error estimates to reconstruct sea surface temperature satellite observations. Geosci Model Dev 13, 1609–1622, https://doi.org/10.5194/gmd-13-1609-2020 (2020).
Alvera-Azcárate, A., Barth, A., Beckers, J.-M. & Weisberg, R. H. Multivariate reconstruction of missing data in sea surface temperature, chlorophyll, and wind satellite fields. J Geophys Res 112, C03008, https://doi.org/10.1029/2006JC003660 (2007).
Alvera-Azcárate, A., Barth, A., Sirjacobs, D., Lenartz, F. & Beckers, J. M. Data Interpolating Empirical Orthogonal Functions (DINEOF): a tool for geophysical data analyses. Mediterr Mar Sci 12, 5, https://doi.org/10.12681/mms.64 (2011).
Alvera-Azcárate, A., Barth, A., Rixen, M. & Beckers, J. M. Reconstruction of incomplete oceanographic data sets using empirical orthogonal functions: application to the Adriatic Sea surface temperature. Ocean Model (Oxf) 9, 325–346, https://doi.org/10.1016/j.ocemod.2004.08.001 (2005).
Lopez, F. P. A., Zhou, G., Jing, G., Zhang, K. & Tan, Y. XCO2 and XCH4 Reconstruction Using GOSAT Satellite Data Based on EOF-Algorithm. Remote Sens (Basel) 14, 2622, https://doi.org/10.3390/rs14112622 (2022).
Yan, X. et al. Application of Synthetic DINCAE–BME Spatiotemporal Interpolation Framework to Reconstruct Chlorophyll–a from Satellite Observations in the Arabian Sea. J Mar Sci Eng 11, 743, https://doi.org/10.3390/jmse11040743 (2023).
Murray, C. J. Cyclical Properties of Baxter-King Filtered Time Series. Review of Economics and Statistics 85, 472–476, https://doi.org/10.1162/003465303765299945 (2003).
Kuze, A. et al. Update on GOSAT TANSO-FTS performance, operations, and data products after more than 6 years in space. Atmos Meas Tech 9, 2445–2461, https://doi.org/10.5194/amt-9-2445-2016 (2016).
Noël, S. et al. XCO2 retrieval for GOSAT and GOSAT-2 based on the FOCAL algorithm. Atmos Meas Tech 14, 3837–3869, https://doi.org/10.5194/amt-14-3837-2021 (2021).
Suto, H. et al. Thermal and near-infrared sensor for carbon observation Fourier transform spectrometer-2 (TANSO-FTS-2) on the Greenhouse gases Observing SATellite-2 (GOSAT-2) during its first year in orbit. Atmos Meas Tech 14, 2013–2039, https://doi.org/10.5194/amt-14-2013-2021 (2021).
Nguyen, P., Shivadekar, S., Laya Chukkapalli, S. S. & Halem, M. Satellite Data Fusion of Multiple Observed XCO2 using Compressive Sensing and Deep Learning. in IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium 2073–2076 https://doi.org/10.1109/IGARSS39084.2020.9323861 (IEEE, 2020).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. in 234–241. https://doi.org/10.1007/978-3-319-24574-4_28 (2015).
Laughner, J. L. et al. The Total Carbon Column Observing Network's GGG2020 data version. Earth Syst. Sci. Data 16, 2197–2260. https://doi.org/10.5194/essd-16-2197-2024 (2024).
Hamazaki, T., Kaneko, Y., Kuze, A. & Kondo, K. Fourier transform spectrometer for Greenhouse Gases Observing Satellite (GOSAT). in (eds. Komar, G. J., Wang, J. & Kimura, T.) 73 https://doi.org/10.1117/12.581198 (2005).
Yokota, T. et al. Global Concentrations of CO2 and CH4 Retrieved from GOSAT: First Preliminary Results. SOLA 5, 160–163, https://doi.org/10.2151/sola.2009-041 (2009).
Suto, H. et al. Updated spectral radiance calibration on TIR bands for TANSO-FTS-2 onboard GOSAT-2. Atmos Meas Tech 15, 5399–5413, https://doi.org/10.5194/amt-15-5399-2022 (2022).
Matsunaga, T. & Tanimoto, H. Greenhouse gas observation by TANSO-3 onboard GOSAT-GW. in Sensors, Systems, and Next-Generation Satellites XXVI (eds. Kimura, T., Babu, S. R. & Hélière, A.) 16 https://doi.org/10.1117/12.2639221 (SPIE, 2022).
Yoshida, Y. et al. Retrieval algorithm for CO2 and CH4 column abundances from short-wavelength infrared spectral observations by the Greenhouse gases observing satellite. Atmos Meas Tech 4, 717–734, https://doi.org/10.5194/amt-4-717-2011 (2011).
Watanabe, H. et al. Global mapping of greenhouse gases retrieved from GOSAT Level 2 products by using a kriging method. Int J Remote Sens 36, 1509–1528, https://doi.org/10.1080/01431161.2015.1011792 (2015).
Xu, Y. et al. Satellite‐derived estimations of spatial and seasonal variation in tropospheric carbon dioxide mass over China. Ecol Evol 3, 4310–4325, https://doi.org/10.1002/ece3.823 (2013).
Eldering, A., Taylor, T. E., O’Dell, C. W. & Pavlick, R. The OCO-3 mission: measurement objectives and expected performance based on 1 year of simulated data. Atmos Meas Tech 12, 2341–2370, https://doi.org/10.5194/amt-12-2341-2019 (2019).
Yanhui, P., Xiaoying, L., Xian, Y. & Lei, Z. Analysis of Influence to the Solar Array Output by Satellite Attitude Maneuver in Sun Synchronous Orbit. in 3372–3379. https://doi.org/10.1007/978-981-16-9492-9_331 (2022).
Srivastava, P. et al. Establishing Launch Readiness of NASA ISS Instrument OCO-3. in IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium 6101–6104 https://doi.org/10.1109/IGARSS39084.2020.9323631 (IEEE, 2020).
Taylor, T. E. et al. OCO-3 early mission operations and initial (vEarly) XCO2 and SIF retrievals. Remote Sens Environ 251, 112032, https://doi.org/10.1016/j.rse.2020.112032 (2020).
Kiel, M. et al. Urban-focused satellite CO2 observations from the Orbiting Carbon Observatory-3: A first look at the Los Angeles megacity. Remote Sens Environ 258, 112314, https://doi.org/10.1016/j.rse.2021.112314 (2021).
Hedelius, J. K. et al. Evaluation of MOPITT Version 7 joint TIR–NIR X<sub>CO</sub> retrievals with TCCON. Atmos Meas Tech 12, 5547–5572, https://doi.org/10.5194/amt-12-5547-2019 (2019).
Kiel, M. et al. How bias correction goes wrong: measurement of XCO2; affected by erroneous surface pressure estimates. Atmos Meas Tech 12, 2241–2259, https://doi.org/10.5194/amt-12-2241-2019 (2019).
Borsdorff, T. et al. Mapping carbon monoxide pollution from space down to city scales with daily global coverage. Atmos Meas Tech 11, 5507–5518, https://doi.org/10.5194/amt-11-5507-2018 (2018).
O’Dell, C. W. et al. Improved retrievals of carbon dioxide from Orbiting Carbon Observatory-2 with the version 8 ACOS algorithm. Atmos Meas Tech 11, 6539–6576, https://doi.org/10.5194/amt-11-6539-2018 (2018).
Wunch, D. et al. A method for evaluating bias in global measurements of CO<sub>2</sub> total columns from space. Atmos Chem Phys 11, 12317–12337, https://doi.org/10.5194/acp-11-12317-2011 (2011).
Total Carbon Column Observing Network (TCCON) Team. (2022). 2020 TCCON Data Release (2020).
Morino, I., Velazco, V. A., Hori, A., Uchino, O., & Griffith, D. W. T. TCCON data from Burgos, Ilocos Norte (PH), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.burgos01.R0 (2022).
Wennberg, P. O. et al. TCCON data from Caltech (US), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.pasadena01.R0 (2022).
Deutscher, N. M. et al. TCCON data from Darwin (AU), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.darwin01.R0 (2023).
Wunch, D., Mendonca, J., Colebatch, O., Allen, N. T., Blavier, J.-F., Kunz, K., Roche, S., Hedelius, J., Neufeld, G., Springett, S., Worthy, D., Kessler, R., & Strong, K. TCCON data from East Trout Lake, SK (CA), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.easttroutlake01.R0 (2022).
Sussmann, R., & Rettinger, M. TCCON data from Garmisch (DE), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.garmisch01.R0 (2025).
Weidmann, D., Brownsword, R., & Doniki, S. TCCON data from Harwell, Oxfordshire (UK), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.harwell01.R0 (2023).
Liu, C., Wang, W., Sun, Y., & Shan, C. TCCON data from Hefei (PRC), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.hefei01.R1 (2023).
García, O. E., Schneider, M., Herkommer, B., Gross, J., Hase, F., Blumenstock, T., & Sepúlveda, E. TCCON data from Izana (ES), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.izana01.R1 (2022).
Wennberg, P. O., Roehl, C. M., Blavier, J.-F., Wunch, D., & Allen, N. T. TCCON data from Jet Propulsion Laboratory (US), 2011, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.jpl01.R0 (2022).
Wennberg, P. O. et al. TCCON data from Jet Propulsion Laboratory (US), 2007, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.jpl02.R0 (2022).
Hase, F. et al. TCCON data from Karlsruhe (DE), Release GGG2020.R2 (Versión R2) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.karlsruhe01.R2 (2024).
Wennberg, P. O., Wunch, D., Roehl, C. M., Blavier, J.-F., Toon, G. C., & Allen, N. T. TCCON data from Lamont (US), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lamont01.R0 (2022).
Sherlock, V. et al. TCCON data from Lauder (NZ), 120HR, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lauder01.R0 (2022).
Sherlock, V. et al. TCCON data from Lauder (NZ), 125HR, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lauder02.R0 (2022).
Pollard, D. F., Robinson, J., & Shiona, H. TCCON data from Lauder (NZ), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.lauder03.R0 (2022).
Petri, C. et al. TCCON data from Nicosia (CY), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.nicosia01.R1 (2024).
Warneke, T., Petri, C., Notholt, J., & Buschmann, M. TCCON data from Orléans (FR), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.orleans01.R0 (2024).
Wennberg, P. O. et al. TCCON data from Park Falls (US), Release GGG2020.R1 (Versión R1) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.parkfalls01.R1 (2022).
De Mazière, M. et al. TCCON data from Réunion Island (RE), Release GGG2020.R0 (Versión R0) https://doi.org/10.14291/tccon.ggg2020.reunion01.R0 (2022).
Morino, I., Ohyama, H., Hori, A., & Ikegami, H. TCCON data from Rikubetsu (JP), Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.rikubetsu01.R0 (2022).
Shiomi, K. et al. TCCON data from Saga (JP), Release GGG2020.R0 (Versión R0) [Data set]. https://doi.org/10.14291/tccon.ggg2020.saga01.R0 (2022).
Morino, I., Ohyama, H., Hori, A., & Ikegami, H. TCCON data from Tsukuba (JP), 125HR, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.tsukuba02.R0 (2022).
Zhou, M., Wang, P., Kumps, N., Hermans, C., & Nan, W. TCCON data from Xianghe, China, Release GGG2020.R0 (Versión R0) [Data set]. CaltechDATA. https://doi.org/10.14291/tccon.ggg2020.xianghe01.R0 (2022).
Inness, A. et al. The CAMS reanalysis of atmospheric composition. Atmos Chem Phys 19, 3515–3556, https://doi.org/10.5194/acp-19-3515-2019 (2019).
Agustí-Panareda, A. et al. Technical note: The CAMS greenhouse gas reanalysis from 2003 to 2020. Atmos Chem Phys 23, 3829–3859, https://doi.org/10.5194/acp-23-3829-2023 (2023).
Li, T., Wu, J. & Wang, T. Generating daily high-resolution and full-coverage XCO2 across China from 2015 to 2020 based on OCO-2 and CAMS data. Science of The Total Environment 893, 164921, https://doi.org/10.1016/j.scitotenv.2023.164921 (2023).
Chen, Y. et al. Global-Scale Evaluation of XCO2 Products from GOSAT, OCO-2 and CarbonTracker Using Direct Comparison and Triple Collocation Method. Remote Sens (Basel) 14, 5635, https://doi.org/10.3390/rs14225635 (2022).
Basilio, R. R., Livermore, T. R., Shen, Y. J. & Pollock, H. R. The quest for an OCO (Orbiting Carbon Observatory) re-flight. in (eds. Picard, R. H., Schäfer, K., Comeron, A. & van Weele, M.) 78270A https://doi.org/10.1117/12.867042 (2010).
Sunder, S., Ramsankaran, R. & Ramakrishnan, B. Machine learning techniques for regional scale estimation of high-resolution cloud-free daily sea surface temperatures from MODIS data. ISPRS Journal of Photogrammetry and Remote Sensing 166, 228–240, https://doi.org/10.1016/j.isprsjprs.2020.06.008 (2020).
Li, T., Shen, H., Yuan, Q. & Zhang, L. Geographically and temporally weighted neural networks for satellite-based mapping of ground-level PM2.5. ISPRS Journal of Photogrammetry and Remote Sensing 167, 178–188, https://doi.org/10.1016/j.isprsjprs.2020.06.019 (2020).
Kamir, E., Waldner, F. & Hochman, Z. Estimating wheat yields in Australia using climate records, satellite image time series and machine learning methods. ISPRS Journal of Photogrammetry and Remote Sensing 160, 124–135, https://doi.org/10.1016/j.isprsjprs.2019.11.008 (2020).
Wu, C., Ju, Y., Yang, S., Zhang, Z. & Chen, Y. Reconstructing annual XCO2 at a 1 km×1 km spatial resolution across China from 2012 to 2019 based on a spatial CatBoost method. Environ Res 236, 116866, https://doi.org/10.1016/j.envres.2023.116866 (2023).
Zhang, L., Li, T., Wu, J. & Yang, H. Global estimates of gap-free and fine-scale CO2 concentrations during 2014–2020 from satellite and reanalysis data. Environ Int 178, 108057, https://doi.org/10.1016/j.envint.2023.108057 (2023).
Wang, Y. et al. Seamless mapping of long-term (2010–2020) daily global XCO2 and XCH4 from the Greenhouse Gases Observing Satellite (GOSAT), Orbiting Carbon Observatory 2 (OCO-2), and CAMS global greenhouse gas reanalysis (CAMS-EGG4) with a spatiotemporally self-supervised fusion method. Earth Syst Sci Data 15, 3597–3622, https://doi.org/10.5194/essd-15-3597-2023 (2023).
Chen, J. et al. Multi-sensor integrated mapping of global XCO2 from 2015 to 2021 with a local random forest model. ISPRS Journal of Photogrammetry and Remote Sensing 208, 107–120, https://doi.org/10.1016/j.isprsjprs.2024.01.009 (2024).
He, Q. et al. Full-coverage mapping high-resolution atmospheric CO2 concentrations in China from 2015 to 2020: Spatiotemporal variations and coupled trends with particulate pollution. J Clean Prod 428, 139290, https://doi.org/10.1016/j.jclepro.2023.139290 (2023).
Yao, F., Wu, J., Li, W. & Peng, J. A spatially structured adaptive two-stage model for retrieving ground-level PM2.5 concentrations from VIIRS AOD in China. ISPRS Journal of Photogrammetry and Remote Sensing 151, 263–276, https://doi.org/10.1016/j.isprsjprs.2019.03.011 (2019).
Wang, S., Shi, C., Fang, C. & Feng, K. Examining the spatial variations of determinants of energy-related CO2 emissions in China at the city level using Geographically Weighted Regression Model. Appl Energy 235, 95–105, https://doi.org/10.1016/j.apenergy.2018.10.083 (2019).
Liu, Q., Wu, S., Lei, Y., Li, S. & Li, L. Exploring spatial characteristics of city-level CO2 emissions in China and their influencing factors from global and local perspectives. Science of The Total Environment 754, 142206, https://doi.org/10.1016/j.scitotenv.2020.142206 (2021).
Nguyen, H., Cressie, N. & Braverman, A. Multivariate Spatial Data Fusion for Very Large Remote Sensing Datasets. Remote Sens (Basel) 9, 142, https://doi.org/10.3390/rs9020142 (2017).
Pearson, K. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2, 559–572, https://doi.org/10.1080/14786440109462720 (1901).
Gao, Z., Jiang, Y., He, J. & Wu, J. Spatiotemporal variation analysis of global XCO2 concentration during 2010–2020 based on DINEOF-BME framework and wavelet function. Science of The Total Environment 892, 164750, https://doi.org/10.1016/j.scitotenv.2023.164750 (2023).
He, K., Zhang, X., Ren, S. & Sun, J. Deep Residual Learning for Image Recognition. in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 https://doi.org/10.1109/CVPR.2016.90 (IEEE, 2016).
Liu, H., Jiang, B., Xiao, Y. & Yang, C. Coherent Semantic Attention for Image Inpainting. in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 4169–4178 https://doi.org/10.1109/ICCV.2019.00427 (IEEE, 2019).
Zhang, Z., Stanev, E. V. & Grayek, S. Reconstruction of the Basin‐Wide Sea‐Level Variability in the North Sea Using Coastal Data and Generative Adversarial Networks. J Geophys Res Oceans 125 https://doi.org/10.1029/2020JC016402 (2020).
Olmedo, E., Taupier-Letage, I., Turiel, A. & Alvera-Azcárate, A. Improving SMOS Sea Surface Salinity in the Western Mediterranean Sea through Multivariate and Multifractal Analysis. Remote Sens (Basel) 10, 485, https://doi.org/10.3390/rs10030485 (2018).
Olmedo, E. et al. Nine years of SMOS sea surface salinity global maps at the Barcelona Expert Center. Earth Syst Sci Data 13, 857–888, https://doi.org/10.5194/essd-13-857-2021 (2021).
Lopez, F. P. A. Daily Column average CO2 mapping combining OCO 3, GOSAT, and CAMS Data applying EOF and Deep learning from 2018 to 2023. Zenodo https://doi.org/10.5281/zenodo.13895409 (2024).
Jing, Y., Shi, J., Zhang, P., Wang, T. & Chen, L. Comparison of atmospheric carbon dioxide concentration based on GOSAT and OCO-2 observations. in 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 4071–4073 https://doi.org/10.1109/IGARSS.2016.7730059 (IEEE, 2016).
Kulawik, S. et al. Consistent evaluation of ACOS-GOSAT, BESD-SCIAMACHY, CarbonTracker, and MACC through comparisons to TCCON. Atmos Meas Tech 9, 683–709, https://doi.org/10.5194/amt-9-683-2016 (2016).
Rodgers, C. D. & Connor, B. J. Intercomparison of remote sounding instruments. Journal of Geophysical Research: Atmospheres 108 https://doi.org/10.1029/2002JD002299 (2003).
(TCCON), T. C. C. O. N. Team 2020 TCCON Data Release (Version GGG2020) [Data set], https://doi.org/10.14291/TCCON.GGG2020 (2022).
Acknowledgements
This project was supported by the National Natural Science Foundation of China (Grant No. 42471425), and the National Key Research and Development Program of China (Grant No. 2021YFB3901000, 2021YFB3901005). We extend our heartfelt gratitude to the data providers of the Orbiting Carbon Observatory-3 (OCO-3), the Greenhouse Gases Observing Satellite (GOSAT), and the Total Carbon Column Observing Network (TCCON). Their invaluable datasets provided the essential foundation for this study, enabling us to achieve a deeper and more comprehensive analysis of atmospheric CO2. We greatly appreciate their commitment to open data sharing, which significantly enriched our research and contributed to advancing the broader scientific understanding of carbon monitoring.
Author information
Authors and Affiliations
Contributions
Franz Pablo Antezana Lopez: Algorithm development, data analysis, XCO2 interpretation; Guanhua Zhou: Research oversight, guidance, theoretical framework, accuracy assurance; Guifei Jing: Communication, manuscript drafting, team collaboration, seamless workflow; Kai Zhang: Data collection, preprocessing, algorithm implementation, technical expertise, reconstruction; Lin Chen: Comparative analysis, data validation, consistency assessment, dataset reliability; Liangfu Chen: Theoretical framework, innovation proposals, remote sensing expertise, Tan Yumin: Revision.
Corresponding author
Ethics declarations
Competing interests
We confirm the absence of any competing interests that could potentially influence or bias the research outcomes, methodology, or presentation. Our work was driven by a commitment to scientific integrity, and we assert that no financial, commercial, personal, or intellectual conflicts exist that could cast doubt on the objectivity of our findings. Our research approach is characterized by transparency and a dedication to advancing knowledge in the field of remote sensing and greenhouse gas monitoring, free from any external influences.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Antezana Lopez, F.P., Zhou, G., Jing, G. et al. Global Daily Column Average CO2 at 0.1° × 0.1° Spatial Resolution Integrating OCO-3, GOSAT, CAMS with EOF and Deep Learning. Sci Data 12, 268 (2025). https://doi.org/10.1038/s41597-024-04135-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-024-04135-w