Data Processing

Reference paper. More detailed descriptions of the FLUXNET2015 dataset and the ONEFlux processing pipeline are available in the dataset reference paper, co-authored by data teams and members of all site teams contributing to FLUXNET2015 Tier One sites:

Pastorello, G., Trotta, C., Canfora, E. et al. The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data. Sci Data 7, 225 (2020). https://doi.org/10.1038/s41597-020-0534-3

The data processing pipeline for the FLUXNET2015 Dataset was developed in a collaboration between personnel from the European Ecosystem Fluxes Database, the ICOS Ecosystem Thematic Centre (ICOS-ETC), and the AmeriFlux Management Project (AMP). It adapts code developed by the community and integrates it with code developed by these teams into a consistent and uniform data processing pipeline. The starting point for the data processing is half-hourly data collected and processed at FLUXNET sites. The pipeline generates uniform, high-quality derived data products suitable for studies that require intercomparability of data from multiple sites. The harmonization and data quality control activities are particularly important for the FLUXNET2015 Dataset. Regional network coordination offices, in particular those for OzFLUX-TERN, ChinaFlux, AsiaFlux, and FLUXNET-Canada, are participating in the harmonization and data quality screening for the sites in their networks.

Figure 1. Processing Pipeline for FLUXNET2015 Release.

Temporal aggregations are generated in the last step for each variable. These are all combined into the data distribution formats at the Product Merging step, which generates the final file representation for data products at all temporal aggregation resolutions.
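As an illustration of what a temporal aggregation involves, the sketch below averages half-hourly values into daily means. It is a minimal example under stated assumptions, not the pipeline's own code: the function name aggregate_daily, the use of period-start timestamps, and the choice of a plain arithmetic mean that skips missing values are all hypothetical simplifications.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def aggregate_daily(records):
    """Average (timestamp, value) pairs into one mean per calendar day.

    Assumptions (not taken from the pipeline): timestamps mark the start
    of each half-hour, missing values are None and are skipped, and a day
    without any valid value maps to None.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    days = set()
    for timestamp, value in records:
        day = timestamp.date()
        days.add(day)
        if value is not None:
            sums[day] += value
            counts[day] += 1
    return {
        day: (sums[day] / counts[day] if counts[day] else None)
        for day in sorted(days)
    }


# Two synthetic days of half-hourly data (48 records per day).
start = datetime(2015, 1, 1)
half_hourly = [
    (start + timedelta(minutes=30 * i), 1.0 if i < 48 else 3.0)
    for i in range(96)
]
daily = aggregate_daily(half_hourly)
```

Coarser resolutions (weekly, monthly, yearly) could be sketched the same way by changing the grouping key; how the actual pipeline treats quality flags and partially filled periods is not covered here.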

1. QA/QC Procedures

Before product generation starts, data for each site go through quality assurance / quality control (QA/QC) steps tailored to the generation of these derived data products (e.g., gap-filling or uncertainty estimation). A few of these QA/QC steps are described in Pastorello et al. 2014 (eScience). Quality checks are applied to single variables (e.g., overall trends at multiple temporal resolutions), to multiple or combined variables (e.g., variables that should vary comparably), or take the form of more specialized tests (e.g., comparing measured radiation to the maximum top-of-atmosphere radiation expected for a given location).
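The radiation test mentioned above can be sketched as follows. This is an illustrative example only, not the pipeline's implementation: the function names, the solar constant value, the Cooper approximation for solar declination, and the assumption that the time is given as local solar time are all choices made for this sketch.

```python
import math

SOLAR_CONSTANT = 1361.0  # W m-2, approximate value assumed for this sketch


def toa_radiation(lat_deg, day_of_year, solar_hour):
    """Approximate top-of-atmosphere radiation on a horizontal surface (W m-2).

    Uses textbook approximations: Cooper's formula for declination and a
    cosine correction for the Earth-Sun distance. solar_hour is local
    solar time in hours (12.0 is solar noon).
    """
    lat = math.radians(lat_deg)
    declination = math.radians(23.45) * math.sin(
        2 * math.pi * (284 + day_of_year) / 365
    )
    distance_factor = 1 + 0.033 * math.cos(2 * math.pi * day_of_year / 365)
    hour_angle = math.radians(15.0 * (solar_hour - 12.0))
    cos_zenith = (
        math.sin(lat) * math.sin(declination)
        + math.cos(lat) * math.cos(declination) * math.cos(hour_angle)
    )
    # Below the horizon the sun contributes nothing.
    return max(0.0, SOLAR_CONSTANT * distance_factor * cos_zenith)


def exceeds_toa(sw_in, lat_deg, day_of_year, solar_hour, tolerance=0.0):
    """Return True when measured incoming radiation is above the TOA bound."""
    return sw_in > toa_radiation(lat_deg, day_of_year, solar_hour) + tolerance


# A reading of 1500 W m-2 at the equator near the March equinox is suspicious.
suspicious = exceeds_toa(1500.0, lat_deg=0.0, day_of_year=80, solar_hour=12.0)
```

A flagged value is a candidate for inspection rather than proof of an error; timestamp shifts, for example, can produce the same symptom as a sensor problem.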

2. Micrometeorological Processing

Meteorological data currently include a selection of variables that will be expanded in upcoming incremental releases. The variables are gap-filled with the Marginal Distribution Sampling (MDS) method; where possible, a subset of them is also downscaled to site level from ERA-Interim reanalysis data, following the method described by Vuichard and Papale 2015 (ESSD). A proposed optimal combination of the two products is also produced (identified with _F).

Half-hourly or hourly:

The file contains a set of meteorological variables gap-filled and/or downscaled from the ERA-Interim dataset. The downscaling, both in space (from grid cell to tower) and in time (from 6-hourly to half-hourly), is applied to seven variables using regressions against the site measurements, when available, according to the method described in Vuichard and Papale 2015.
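The regression idea behind the downscaling can be illustrated with a small sketch. It assumes, as a simplification, an ordinary least-squares line fitted between reanalysis values and the site measurements at the time steps where both exist; the function names are hypothetical, and the variable-specific treatment and temporal interpolation in Vuichard and Papale 2015 are not reproduced here.

```python
def fit_linear(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def downscale(reanalysis, site):
    """Adjust a reanalysis series toward site conditions.

    reanalysis: list of floats, already on the site's time step (assumed).
    site: list of floats or None, the tower measurements with gaps.
    The regression is fitted only where a site measurement is available,
    then applied to the whole reanalysis series.
    """
    pairs = [(r, s) for r, s in zip(reanalysis, site) if s is not None]
    slope, intercept = fit_linear(
        [r for r, _ in pairs], [s for _, s in pairs]
    )
    return [slope * r + intercept for r in reanalysis]


# Synthetic example: the site reads 2 * reanalysis + 1 where it has data.
era_values = [0.0, 1.0, 2.0, 3.0, 4.0]
site_values = [1.0, None, 5.0, 7.0, None]
adjusted = downscale(era_values, site_values)
```

The adjusted series covers every time step, including those where the tower has no measurement, which is what makes it usable as a gap-filling source.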

The gap-filling of the site-level measurements has been done using the MDS method as described in Reichstein et al. 2005.
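The core idea of MDS, replacing a missing value with the mean of values observed under similar conditions within a nearby time window, can be sketched as below. This is a heavily simplified illustration, not the method as published or as implemented in the pipeline: the function name mds_fill, the driver and tolerance arguments, and the window sizes are assumptions of this sketch, and the fallback steps of the full algorithm (fewer drivers, mean diurnal course, quality flags) are omitted.

```python
def mds_fill(target, drivers, tolerances, steps_per_day=48, windows_days=(7, 14)):
    """Fill None entries in target with the mean of similar-condition values.

    target: list of floats or None.
    drivers: dict mapping a driver name to a list of floats with the same
        length as target (assumed gap-free in this sketch).
    tolerances: dict mapping a driver name to the largest absolute
        difference that still counts as "similar conditions".
    The search window grows through windows_days until a match is found;
    a gap with no match in any window stays None.
    """
    filled = list(target)
    n = len(target)
    for i, value in enumerate(target):
        if value is not None:
            continue
        for days in windows_days:
            half_width = days * steps_per_day
            lo, hi = max(0, i - half_width), min(n, i + half_width + 1)
            similar = [
                target[j]
                for j in range(lo, hi)
                if target[j] is not None
                and all(
                    abs(drivers[name][j] - drivers[name][i]) <= tolerances[name]
                    for name in tolerances
                )
            ]
            if similar:
                filled[i] = sum(similar) / len(similar)
                break
    return filled


# Synthetic example with a single driver and an illustrative tolerance.
values = [10.0, None, 20.0, 12.0]
radiation = {"SW_IN": [100.0, 110.0, 400.0, 120.0]}
result = mds_fill(values, radiation, {"SW_IN": 50.0})
```

In this example the gap is filled from the two records whose radiation is within the tolerance of the gap's own radiation, while the record taken under much brighter conditions is ignored.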