HEMCO data directories

From Geos-chem
Jump to navigation Jump to search

On this page we describe the directory tree from which the HEMCO emissions component can read emissions inventories and other atmospheric data sets.

Overview

The HEMCO emissions component can read several types of emission inventories, as well as other types atmospheric data sets, such as production and loss rates, or concentration data. We have collated all of this data into a comprehensive directory tree structure. Each folder of the HEMCO data directory tree represents a particular emissions inventory or other data set.

At present, the HEMCO data directory tree resides on the disk servers at Harvard University (and soon at Dalhousie University). We have created a package that will let you download this directory tree to your local disk storage space. For more information, please see the Downloading the HEMCO data directories section below.

Please see our list of recommended default emission inventories, which has been compiled by the Emissions and Deposition Working Group.

--Bob Yantosca (talk) 22:27, 7 September 2017 (UTC)

Downloading the HEMCO data directories

We recommend the following methods for downloading the HEMCO data directories, depending on which GEOS-Chem version you are using:

GEOS-Chem versions Recommended download method Alternate download method
12.7.0 and later The GEOS-Chem dry-run option Manual download from either
12.6.3 and prior The hemco_data_download package Manual download from either

--Bob Yantosca (talk) 17:07, 3 December 2020 (UTC)

HEMCO data directory structure

The sections below describe each of the data sets contained in the HEMCO directory tree, grouped by type.

In each of the tables below:

The Inventory column contains a link that describes each data set in more detail. Most of these links point to existing pages on the GEOS-Chem wiki.
The Data file info column points to the README files that are stored with each data set. Each README provides a list of the files contained within a given folder, information about the data contained in each file, and (often) a description of how the files were created.
The Path column shows the location of each data set, with respect to $ROOT, the top-level HEMCO directory. For example, on the Harvard disk server, $ROOT points to the directory /mnt/gcgrid/data/ExtData/HEMCO/.
The Status column describes the current status of each data set:
DEFAULT SETTING Denotes that the data set is part of the standard emissions configuration for GEOS-Chem, that is:
OPTIONAL SETTING Denotes that the data set is not part of the the standard GEOS-Chem emissions configuration.
  • But you can still use the data set in your research applications if you wish.
If use GEOS-Chem, then we recommend that you download the data sets that are listed as DEFAULT SETTING, as these form the standard GEOS-Chem emissions configuration. You are free, however, to use any of the data sets listed below. Your choice of data sets will depend on your particular research needs.

NOTE: The inventories are grouped and listed in the same order as in the HEMCO_Config.rc version that ships with GEOS-Chem 13.0.0.

Regional inventories

Inventory More info Path Data size Status
APEI
  • Canadian emissions, 1989-2014
README $ROOT/APEI/v2016-11/ 68 MB OPTIONAL SETTING
NEI2011_HOURLY
  • Hourly CONUS emissions from NEI2011 inventory
README $ROOT/NEI2011/v2015-03 248 GB OPTIONAL SETTING
NEI2011_MONTHLY_MEAN
  • Monthly mean CONUS emissions from NEI2011 inventory
README $ROOT/NEI2011/v2017-02-MM 248 GB OPTIONAL SETTING
MIX
  • Asian anthropogenic emissions
README $ROOT/MIX/v2015-03 623 MB OPTIONAL SETTING
DICE_Africa
  • African emissions inventory
README $ROOT/DICE_Africa/v2016-10 202 MB OPTIONAL

Global inventories

The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic and biofuel emissions.

Please also see list of species included in each of these global inventories and regional inventories.

Inventory More info Path Data size Status
CEDS
  • Community Emissions Data System
README $ROOT/CEDS/v2020-08 73 GB DEFAULT SETTING
  • Introduced in 13.0.0
  • Reference: McDuffie et al 2020, ESSD
CEDS_byFuelType
  • Sectorial CEDS emissions
README $ROOT/CEDS/v2020-08 73 GB OPTIONAL SETTING
  • Incurs additional I/O overhead
  • Introduced in 13.0.0
  • Reference: McDuffie et al 2020, ESSD
EDGARv43
  • EDGAR global emissions v4.3
README $ROOT/EDGARv43/v2016-11 6.2 GB OPTIONAL SETTING
  • Superseded by CEDS in 13.0.0
  • Used for sectors not included in DICE-Africa inventory
  • Turning on this option will also include trash burning emissions of NOx, SOx, CO, NH3, BC and OC from Wiedinmyer et al. (2014) and NMVOC emissions from CEDS
HTAP
  • HTAP global anthro+biofuel emissions
README $ROOT/HTAP/v2015-03 4.5 GB OPTIONAL SETTING
  • NOTE: HTAP includes some regional emissions from other inventories. If you select HTAP you might not need to select the other regional inventories in HEMCO. See this document for more information.
GEIA_NH3
  • GEIA natural source NH3 emissions
README $ROOT/NH3/v2014-07 8.1 MB DEFAULT SETTING
SEABIRD_NH3
  • Croft et al NH3 emissions from arctic sea birds
README HEMCO/NH3/v2018-04/ 104 MB DEFAULT SETTING
POET
  • Anthropogenic EOH inventory
README $ROOT/POET/v2017-03/ 38 MB OPTIONAL SETTING
TZOMPASOSA_C2H6
  • Tzompa-Soza et al [2017] C2H6 emissions
README $ROOT/C2H6_2010/v2019-06 536 KB DEFAULT SETTING
  • Overwrites the C2H6 emissions from the CEDS to address underestimates
XIAO_C3H8
  • Y. Xiao et al [2008] C3H8 emissions
README $ROOT/XIAO/v2014-09 308 KB DEFAULT SETTING
  • Overwrites the C3H8 emissions from CEDS to address underestimates
LIANG_BROMOCARB
  • (Liang et al VSL emissions))
README $ROOT/BROMINE/v2015-02 156 KB DEFAULT SETTING
ORDONEZ_IONOCARB
  • Ordonez et al Iodine emissions
README $ROOT/IODINE/v2017-09 4.4 MB DEFAULT SETTING
AEIC
  • MIT aircraft emissions inventory
README $ROOT/AEIC/v2015-01 2.0 GB DEFAULT SETTING
DECAYING_PLANTS
  • ALD2 and EOH emissions from decaying plants
README $ROOT/ALD2/v2017-03 8.1 MB DEFAULT SETTING
AFCID
  • Anthropogenic PM2.5 dust source
README $ROOT/AFCID/2018-04 4.4 MB DEFAULT SETTING
  • In GEOS-Chem 12.5.0 AFCID was separated from the DEAD dust emissions extension to allow for use with offline dust emissions as well.

Ship emissions

The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic aircraft and ship emissions.

Inventory More info Path Data size Status
CEDS_SHIP
  • CEDS ship emissions
README $ROOT/CEDS/v2020-08 73 GB DEFAULT SETTING
  • Emissions are summed over all fuel sectors.
CEDS_SHIP_byFuelType
  • Sectorial CEDS ship emissions
README $ROOT/CEDS/v2020-08 73 GB OPTIONAL SETTING
  • Emissions are for individual fuel sectors. Incurs additional I/O overhead.
HTAP_SHIP
  • CO, NO, SO2 ship emissions from the HTAP inventory
README $ROOT/HTAP/v2015-03 4.5 GB OPTIONAL SETTING
ICOADS_SHIP
  • CO ship emissions from the ICOADS inventory
README $ROOT/ICOADS_SHIP/v2014-07 4.2 MB OPTIONAL SETTING
ARCTAS_SHIP
  • SO2 ship emissions from the ARCTAS inventory
README $ROOT/ARCTAS_SHIP/v2014-07 508 KB OPTIONAL SETTING
CORBETT_SHIP
  • SO2 ship emissions from Corbett et al inventory
README $ROOT/CORBETT_SHIP/v2014-07 1.8 MB OPTIONAL SETTING
NEI2011_SHIP
  • Hourly ship emissions from the NEI_2011 inventory
README $ROOT/NEI2011/v2015-03 248 GB OPTIONAL SETTING
  • NOTE: Only extends a few km from land, so this is more of a coastal emissions inventory.
NEI2011_SHIP_MONMEAN
  • Monthly mean ship emissions from the NEI2011 inventory
README $ROOT/NEI2011/v2017-02-MM 248 GB OPTIONAL SETTING
  • NOTE: Only extends a few km from land, so this is more of a coastal emissions inventory.

NOTES about ship emissions:

  1. CEDS ship emissions are the default ship inventory in GEOS-Chem 12.1.0 and later versions.
    • Data files for CEDS ship emissions are stored with the other CEDS emission sector files in the HEMCO data path: $ROOT/CEDS/v2018-04/.
  2. If CEDS emissions are turned off above then:
    • ARCTAS should be used over ICOADS, CORBETT, and HTAP for SO2
    • ICOADS should be used for CO and NO.

Future emissions

Inventory More info Path Data size Status
RCP_3PD
  • Emissions for scenario RCP_3PD
README $ROOT/RCP/v2020-07/RCP_3PD 91 MB OPTIONAL SETTING
RCP_45
  • Emissions for scenario RCP_45
README $ROOT/RCP/v2020-07/RCP_45 93 MB OPTIONAL SETTING
RCP_60
  • Emissions for scenario RCP_60
README $ROOT/RCP/v2020-07/RCP_60 92 MB OPTIONAL SETTING
RCP_85
  • Emissions for scenario RCP_85
README $ROOT/RCP/v2020-07/RCP_85 93 MB OPTIONAL SETTING

Biomass burning emissions (not implemented as extensions)

NOTE: Some biomass burning inventories are implemented as HEMCO extensions. See this section below for more information.

The following subdirectories of the HEMCO directory tree contain inventories of biomass burning emissions.

Inventory More info Path Data size Status
GFAS
  • CAMS Global Fire Assimilation System estimated emissions
  • Injection height info is taken from
    • ECMWF fire observations
    • ECMWF operational weather forecasts.
README $ROOT/GFAS/v2018-09 49 GB OPTIONAL SETTING
  • Monthly data are available from 2003-present and are updated monthly
QFED
  • Quick Fire Emissions Database (v2.5r1)
README $ROOT/QFED/v2018-07 247 GB OPTIONAL SETTING
  • QFED v2.5r1 data extends from 2000-2020

Offline grid-independent emissions

Offline grid-independent emissions were generated by running high-resolution (0.25°x0.3125° for GEOS-FP, 0.5°x0.625° for MERRA-2) simulations using the HEMCO standalone and online emissions from the HEMCO extensions.

Inventory More info Path Data size Status
OFFLINE_DUST
  • Dust emissions archived from HEMCO standalone simulations
README $ROOT/OFFLINE_DUST/v2019-01 18 GB DEFAULT SETTING
  • Introduced in 12.5.0
  • You must also turn off the DustDead and DustGinoux extensions
OFFLINE_BIOGENICVOC
  • Biogenic VOC emissions generated from HEMCO standalone simulations
README $ROOT/OFFLINE_BIOVOC/v2019-01 1400 GB DEFAULT SETTING
  • Introduced in 12.5.0
  • NOTE: The MEGAN extension should remain on so that it can calculate biogenic emissions for some species not in this inventory.
OFFLINE_SEASALT
  • Sea salt emissions generated from HEMCO standalone simulations
README $ROOT/OFFLINE_SEASALT/v2019-01 876 GB DEFAULT SETTING
  • Introduced in 12.5.0
  • To use this, turn off the SeaSalt extension
OFFLINE_SOILNOX
  • Soil NOx emissions generated from HEMCO standalone simulations
README $ROOT/OFFLINE_SOILNOX/v2019-01 48 GB DEFAULT SETTING
  • Introduced in 12.5.0
  • You must also turn off the SoilNOx extension

Aerosol emissions

The following subdirectories of the HEMCO directory tree contain aerosol emission inventories.

NOTE: AeroCom volcanic SO2 emissions are listed under the Emissions implemented as HEMCO extensions.

Inventory Data file info Path Directory size Status
Secondary organic aerosols README $ROOT/SOA/2014-07 3.0 MB DEFAULT SETTING
  • Contain various inputs for SOA simulations.
Spatially varying OM/OC ratios for SOA species README $ROOT/OMOC/v2018-01 8.6 MB OPTIONAL
  • Controlled by a switch in input.geos
    (default is OFF)

Emissions implemented as HEMCO extensions

The following subdirectories of the HEMCO directory tree contain input data used by various HEMCO extensions. These HEMCO extensions compute emissions for quantities that depend on meteorological variables (e.g. emissions from lighting, biogenic processes, etc.).

NOTE: GFED3, GFED4 and FINNv1 are implemented as HEMCO extensions, but we have listed them with the other biomass burning inventories, for clarity.

Inventory Data file info Path Directory size Status
Acetone ocean exchange README $ROOT/ACET/v2014-07 52 KB DEFAULT SETTING
ALD2 ocean exchange README $ROOT/ALD2/v2017-03 52 KB DEFAULT SETTING
DMS ocean exchange README $ROOT/DMS/v2015-07 3.0 MB DEFAULT SETTING
  • This is the default in v11-01 and higher versions
  • Uses Lana 2011 climatology
NO from lightning README $ROOT/OFFLINE_LIGHTNING/v2019-01/ 47 GB DEFAULT SETTING
PARANOX ship plume model README $ROOT/PARANOX/v2015-02 162 MB DEFAULT SETTING
  • Computes the aging of emissions in ship exhaust plumes.
AeroCom volcanic emissions README $ROOT/VOLCANO/v2019-08 660 MB DEFAULT SETTING
Optional inventories (turned OFF by default in the standard emissions configuration)
DEAD dust model README $ROOT/DUST_DEAD/2014-07 712 KB OPTIONAL
  • May be used instead of offline dust emissions
MEGAN biogenic emissions README $ROOT/MEGAN/v2018-05 12 MB OPTIONAL
  • May be used instead of offline biogenic VOC emissions
NO from soils/fertilizers README $ROOT/SOILNOX/v2014-07 79 MB OPTIONAL
  • May be used instead of offline soil NOx emissions
GINOUX dust model README $ROOT/DUST_GINOUX/2014-07 36 KB OPTIONAL
  • You can use this in place of the DEAD dust model.

GEOS-Chem specialty simulation data

The following subdirectories of the HEMCO directory tree input data (emissions, oxidants, etc.) for use with the GEOS-Chem specialty simulations. If you do not regularly use these simulations, you may choose not to download these data directories.

Inventory Data file info Path Directory size Status
Active inventories (turned ON by default in the standard emissions configuration)
Aerosol-only simulation README $ROOT/OFFLINE_AEROSOL/v2014-09 165 MB DEFAULT SETTING
CH4 simulation README1
README2
$ROOT/CH4/v2017-10
$ROOT/CH4/v2014-09
274 MB DEFAULT SETTING
CO2 simulation README1
README2
README3
README4
README5
README6
$ROOT/CO2/v2015-04
$ROOT/CO2/v2015-04/BIO/
$ROOT/CO2/v2015-04/BIOFUEL/
$ROOT/CO2/v2015-04/CHEM/
$ROOT/CO2/v2015-04/FOSSIL/
$ROOT/CO2/v2015-04/OCEAN/
1033 MB DEFAULT SETTING
Mercury simulation README1
README2
README3
README4
README5
README6
README7
README8
README9
README10
$ROOT/MERCURY/v2014-09
$ROOT/MERCURY/v2014-09/ARTISANAL
$ROOT/MERCURY/v2014-09/BrOx
$ROOT/MERCURY/v2014-09/Hg2_PARTITION
$ROOT/MERCURY/v2014-09/JVALUES
$ROOT/MERCURY/v2014-09/NATURAL
$ROOT/MERCURY/v2014-09/NEI2005
$ROOT/MERCURY/v2014-09/OCEAN
$ROOT/MERCURY/v2014-09/SOIL
$ROOT/MERCURY/v2014-09/STREETS
342 MB DEFAULT SETTING
POPs simulation README $$ROOT/POPs/v2015-08 809 MB DEFAULT SETTING
RRTMG radiative transfer model README $ROOT/RRTMG/v2018-11 19 MB DEFAULT SETTING
Tagged CO simulation README $ROOT/TAGGED_CO/v2017-04 260 KB DEFAULT SETTING
Tagged O3 simulation README $ROOT/TAGGED_O3/v2014-09 372 MB DEFAULT SETTING
O3 for offline simulations README $ROOT/O3/v2014-09 130 MB DEFAULT SETTING
OH for offline simulations
README1
README2
$ROOT/OH/v2014-09
$ROOT/OH/v2014-09/v5-07-08
$ROOT/OH/v2014-09/v7-02-03.GMI
148 MB DEFAULT SETTING
H2O2 for offline simulations README $ROOT/OXIDANTS/v2014-07 32 MB DEFAULT SETTING
Oceanic Chlorophyll-A for Hg simulations README $ROOT/CHLA/v2014-07 2.0 MB DEFAULT SETTING
Obsolete inventories (superseded by newer developments)
CH3I simulation README $ROOT/CH3I/v2014-07 280 KB OBSOLETE
  • This simulation is no longer used in GEOS-Chem (awaiting revival)
POPs simulation README $ROOT/POPs/v2014-09 622 MB OBSOLETE
  • Superseded by v2015-08
Tagged CO simulation README $ROOT/TAGGED_CO/v2014-08q 260 KB OBSOLETE
  • Superseded by v2017-04

--Bob Yantosca (talk) 19:01, 2 July 2018 (UTC)

Other inputs for HEMCO

The following subdirectories of the HEMCO directory tree input data for various HEMCO functions. These include regional masks, emission scale factors, and grid information.

Inventory Data file info Path Directory size Status
Active inventories (turned ON by default in the standard emissions configuration)
Annual scale factors README $ROOT/AnnualScalar/v2014-07 3.0 MB DEFAULT SETTING
  • Used to scale NEI2005 and other emissions to years other than the inventory base year.
NEI2005 scale factors README $ROOT/NEI2005/v2014-09 8.1 MB DEFAULT SETTING
ID codes for each country README $ROOT/COUNTRY_ID/v2015-04 15 MB DEFAULT SETTING
  • ISO ALPHA-3 standard country codes are contained on an 0.1° x 0.1° grid. HEMCO can use these ID's to apply country-level scale factors to any emissions field.
Mask files for regional emissions README $ROOT/MASKS/v2014-07 51 MB DEFAULT SETTING
  • Masks define the geographical regions where the various regional emissions (e.g. BRAVO, CAC, NEI) are to be applied.
MAP_A2A regridding data README $ROOT/MAP_A2A/v2014-07 152 KB DEFAULT SETTING
Obsolete inventories (superseded by newer developments)
Weekly scale factors README $ROOT/WEEKSCALE/v2014-07 4.0 KB OBSOLETE
  • Not currently in use

--Bob Yantosca (talk) 19:05, 2 July 2018 (UTC)

The hemco_data_download package

Obtaining the hemco_data_download package

Obsolete.jpg

The hemco_data_download package will be retired in GEOS-Chem 12.7.0 in favor of the GEOS-Chem dry run option.

To obtain the hemco_data_download package, use Git to clone this repository

git clone https://github.com/GCST/hemco_data_download.git

This will create a directory named hemco_data_download, in which you should see the following files:

File Description
README File with an overall description of the directory contents
hemcoDataDownload.pl Perl script to download HEMCO data directories
hemcoDataDownload.rc Configuration file for the hemcoDataDownload.pl script. In this file you can specify which HEMCO data directories you would like to download and which you would like to omit.
forTesting.rc A configuration file that you can use for testing or debugging. This will tell hemcoDataDownload.pl only to download a couple of emissions inventories with files that do not take up much disk space.

--Bob Yantosca (talk) 17:02, 5 December 2016 (UTC)

Setting up the configuration file

Obsolete.jpg

The hemco_data_download package will be retired in GEOS-Chem 12.7.0 in favor of the GEOS-Chem dry run option.

The configuration files (i.e. hemcoDataDownload.rc and forTesting.rc) are pretty much self-explanatory.

At the top of the configuration file you will see this section:

###############################################################################
#                                                                             #
#  Specify the remote and local HEMCO data paths, plus other options.         #
#                                                                             #
###############################################################################

Remote HEMCO data path | ftp://ftp.as.harvard.edu/gcgrid/data/ExtData/HEMCO
Your HEMCO data path   | /as/scratch/bmy/HEMCO
Verbose output         | yes
Dryrun only?           | no
Path Description
Remote HEMCO data path Location on the FTP server from which you are going to download the data. This can be from either Harvard or from Dalhousie. (For now we will use the Harvard server). You can edit this accordingly.
Your HEMCO data path The root-level directory for HEMCO data on your own disk space. If you are not sure where to place this, then ask your sysadmin.
Verbose output Lets you specify if you want to print out extra output during the download process. This can be set to either "yes" or "no".
Dryrun only Allows you to print out the data download commands without actually downloading the data. This is useful for debugging. This can be set to either "yes" or "no".

In the next section you specify all of the HEMCO inventories that you want to download. You will see this header:

###############################################################################
#                                                                             #
#  THE FOLLOWING DATA DIRECTORIES WILL BE DOWNLOADED.                         #
#                                                                             #
#  These data directories comprise the recommended emissions configuration    #
#  for typical GEOS-Chem full-chemistry and specialty simulations.            #
#                                                                             #
#  NOTE: In most cases, you only have to specify the top-level folder.        #
#  All subfolders will be downloaded automatically.                           #
#                                                                             #
###############################################################################

#=============================+================================================
# AEROSOLS                    | Directory paths
#=============================+================================================
AEROCOM volcano emissions     | $ROOT/VOLCANO/v2014-10
Bond et al BC/OC              | $ROOT/BCOC_BOND/v2014-07
Cooke et al BC/OC             | $ROOT/BCOC_COOKE/v2014-07
Secondary organic aerosols    | $ROOT/SOA/v2014-07
... etc ...

Each line specifies the name of a HEMCO emissions inventory and the data path where it can be found on disk, relative to the root data path. NOTE: The script will replace the $ROOT token with the value you gave to the "HEMCO remote data path" above. (Lines starting with the comment character # will be ignored.)

Any inventory found in this section will be downloaded. To prevent an inventory from being downloaded you can either comment it out (i.e. place a # in the first column) or move the inventory to the next section.

The final section specifies HEMCO emission inventories that you do not wish to download. The section looks like this:

###############################################################################
#                                                                             #
#  THE FOLLOWING DATA DIRECTORIES WILL NOT BE DOWNLOADED.                     #
#                                                                             #
#  These data directories contain are optional emissions inventories that     #
#  are not used in typical GEOS-Chem simulations.  If you wish to download    #
#  any of these inventories, simply move the corresponding entry for each     #
#  inventory to the previous section.                                         #
#                                                                             #
###############################################################################

CH3I simulation (obsolete)    | $ROOT/CH3I/v2014-07
Chlorophyll A                 | $ROOT/CHLA/v2014-07
... etc ...

--Bob Y. 13:55, 12 February 2015 (EST)

Downloading the data

Obsolete.jpg

The hemco_data_download package will be retired in GEOS-Chem 12.7.0 in favor of the GEOS-Chem dry run option.

Once you have set up your configuration file, you can run the hemcoDataDownload.pl script to start downloading the HEMCO data to your local server. To run the script you can type:

hemcoDataDownload.pl

If you do not specify a configuration file name, then the hemcoDataDownload.pl scriptwill read the default configuration file hemcoDataDownload.rc configuration file. If you wish to specify a different configuration file name, simply pass that as an argument to the script, e.g.

hemcoDataDownload.pl myNewConfigFile.rc

Before you start downloading GB's of data, we recommend that you run a short test to make sure that the data directories are being copied to the proper locations on your disk server. For this purpose, we have provided a configuration file named forTesting.rc. Typing

hemcoDataDownload.pl forTesting.pl

will only download a couple of data inventories that do not take up much disk space. This allows you to ensure that the data transfer is sucessful without making you wait a long time.

--Bob Y. 13:57, 12 February 2015 (EST)

Submitting new data for use with HEMCO

If you have a new emissions inventory or other atmospheric data set that you would like to add to the HEMCO data directories, then please contact the GEOS-Chem Support Team. Data files must be in COARDS-compliant or CF-compliant netCDF format. Please see the following pages for more information:

  1. Preparing data files for use with HEMCO
  2. Submitting updates for inclusion in GEOS-Chem

--Bob Yantosca (talk) 22:24, 14 January 2020 (UTC)