HEMCO data directories
On this page we describe the directory tree from which the HEMCO emissions component can read emissions inventories and other atmospheric data sets.
Overview
The HEMCO emissions component can read several types of emission inventories, as well as other types atmospheric data sets, such as production and loss rates, or concentration data. We have collated all of this data into a comprehensive directory tree structure. Each folder of the HEMCO data directory tree represents a particular emissions inventory or other data set.
At present, the HEMCO data directory tree resides on the disk servers at Harvard University (and soon at Dalhousie University). We have created a package that will let you download this directory tree to your local disk storage space. For more information, please see the Downloading the HEMCO data directories section below.
Please see our list of recommended default emission inventories, which has been compiled by the Emissions and Deposition Working Group.
--Bob Yantosca (talk) 22:27, 7 September 2017 (UTC)
Downloading the HEMCO data directories
We recommend the following methods for downloading the HEMCO data directories, depending on which GEOS-Chem version you are using:
GEOS-Chem versions | Recommended download method | Alternate download method |
---|---|---|
12.7.0 and later | The GEOS-Chem dry-run option | Manual download from either |
12.6.3 and prior | The hemco_data_download package | Manual download from either |
--Bob Yantosca (talk) 17:07, 3 December 2020 (UTC)
HEMCO data directory structure
The sections below describe each of the data sets contained in the HEMCO directory tree, grouped by type.
In each of the tables below:
- The Inventory column contains a link that describes each data set in more detail. Most of these links point to existing pages on the GEOS-Chem wiki.
- The Data file info column points to the README files that are stored with each data set. Each README provides a list of the files contained within a given folder, information about the data contained in each file, and (often) a description of how the files were created.
- The Path column shows the location of each data set, with respect to $ROOT, the top-level HEMCO directory. For example, on the Harvard disk server, $ROOT points to the directory /mnt/gcgrid/data/ExtData/HEMCO/.
- The Status column describes the current status of each data set:
DEFAULT SETTING Denotes that the data set is part of the standard emissions configuration for GEOS-Chem, that is: - It is used in the GEOS-Chem 1-month and 1-year full-chemistry benchmark simulations, or
- It is used in one or more of the GEOS-Chem specialty simulations.
OPTIONAL SETTING Denotes that the data set is not part of the the standard GEOS-Chem emissions configuration. - But you can still use the data set in your research applications if you wish.
- If use GEOS-Chem, then we recommend that you download the data sets that are listed as DEFAULT SETTING, as these form the standard GEOS-Chem emissions configuration. You are free, however, to use any of the data sets listed below. Your choice of data sets will depend on your particular research needs.
NOTE: The inventories are grouped and listed in the same order as in the HEMCO_Config.rc version that ships with GEOS-Chem 13.0.0.
Regional inventories
Inventory | More info | Path | Data size | Status |
---|---|---|---|---|
APEI
|
README | $ROOT/APEI/v2016-11/ | 68 MB | OPTIONAL SETTING |
NEI2011_HOURLY
|
README | $ROOT/NEI2011/v2015-03 | 248 GB | OPTIONAL SETTING |
NEI2011_MONTHLY_MEAN
|
README | $ROOT/NEI2011/v2017-02-MM | 248 GB | OPTIONAL SETTING |
MIX
|
README | $ROOT/MIX/v2015-03 | 623 MB | OPTIONAL SETTING |
DICE_Africa
|
README | $ROOT/DICE_Africa/v2016-10 | 202 MB | OPTIONAL |
Global inventories
The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic and biofuel emissions.
Please also see list of species included in each of these global inventories and regional inventories.
Inventory | More info | Path | Data size | Status |
---|---|---|---|---|
CEDS
|
README | $ROOT/CEDS/v2020-08 | 73 GB | DEFAULT SETTING
|
CEDS_byFuelType
|
README | $ROOT/CEDS/v2020-08 | 73 GB | OPTIONAL SETTING
|
EDGARv43
|
README | $ROOT/EDGARv43/v2016-11 | 6.2 GB | OPTIONAL SETTING
|
HTAP
|
README | $ROOT/HTAP/v2015-03 | 4.5 GB | OPTIONAL SETTING
|
GEIA_NH3
|
README | $ROOT/NH3/v2014-07 | 8.1 MB | DEFAULT SETTING |
SEABIRD_NH3
|
README | HEMCO/NH3/v2018-04/ | 104 MB | DEFAULT SETTING |
POET
|
README | $ROOT/POET/v2017-03/ | 38 MB | OPTIONAL SETTING |
TZOMPASOSA_C2H6
|
README | $ROOT/C2H6_2010/v2019-06 | 536 KB | DEFAULT SETTING
|
XIAO_C3H8
|
README | $ROOT/XIAO/v2014-09 | 308 KB | DEFAULT SETTING
|
LIANG_BROMOCARB
|
README | $ROOT/BROMINE/v2015-02 | 156 KB | DEFAULT SETTING |
ORDONEZ_IONOCARB
|
README | $ROOT/IODINE/v2017-09 | 4.4 MB | DEFAULT SETTING |
AEIC
|
README | $ROOT/AEIC/v2015-01 | 2.0 GB | DEFAULT SETTING |
DECAYING_PLANTS
|
README | $ROOT/ALD2/v2017-03 | 8.1 MB | DEFAULT SETTING |
AFCID
|
README | $ROOT/AFCID/2018-04 | 4.4 MB | DEFAULT SETTING
|
Ship emissions
The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic aircraft and ship emissions.
Inventory | More info | Path | Data size | Status |
---|---|---|---|---|
CEDS_SHIP
|
README | $ROOT/CEDS/v2020-08 | 73 GB | DEFAULT SETTING
|
CEDS_SHIP_byFuelType
|
README | $ROOT/CEDS/v2020-08 | 73 GB | OPTIONAL SETTING
|
HTAP_SHIP
|
README | $ROOT/HTAP/v2015-03 | 4.5 GB | OPTIONAL SETTING
|
ICOADS_SHIP
|
README | $ROOT/ICOADS_SHIP/v2014-07 | 4.2 MB | OPTIONAL SETTING
|
ARCTAS_SHIP
|
README | $ROOT/ARCTAS_SHIP/v2014-07 | 508 KB | OPTIONAL SETTING
|
CORBETT_SHIP
|
README | $ROOT/CORBETT_SHIP/v2014-07 | 1.8 MB | OPTIONAL SETTING
|
NEI2011_SHIP
|
README | $ROOT/NEI2011/v2015-03 | 248 GB | OPTIONAL SETTING
|
NEI2011_SHIP_MONMEAN
|
README | $ROOT/NEI2011/v2017-02-MM | 248 GB | OPTIONAL SETTING
|
NOTES about ship emissions:
- CEDS ship emissions are the default ship inventory in GEOS-Chem 12.1.0 and later versions.
- Data files for CEDS ship emissions are stored with the other CEDS emission sector files in the HEMCO data path: $ROOT/CEDS/v2018-04/.
- If CEDS emissions are turned off above then:
- ARCTAS should be used over ICOADS, CORBETT, and HTAP for SO2
- ICOADS should be used for CO and NO.
Future emissions
Inventory | More info | Path | Data size | Status |
---|---|---|---|---|
RCP_3PD
|
README | $ROOT/RCP/v2020-07/RCP_3PD | 91 MB | OPTIONAL SETTING |
RCP_45
|
README | $ROOT/RCP/v2020-07/RCP_45 | 93 MB | OPTIONAL SETTING |
RCP_60
|
README | $ROOT/RCP/v2020-07/RCP_60 | 92 MB | OPTIONAL SETTING |
RCP_85
|
README | $ROOT/RCP/v2020-07/RCP_85 | 93 MB | OPTIONAL SETTING |
Biomass burning emissions (not implemented as extensions)
NOTE: Some biomass burning inventories are implemented as HEMCO extensions. See this section below for more information.
The following subdirectories of the HEMCO directory tree contain inventories of biomass burning emissions.
Inventory | More info | Path | Data size | Status |
---|---|---|---|---|
GFAS
|
README | $ROOT/GFAS/v2018-09 | 49 GB | OPTIONAL SETTING
|
QFED
|
README | $ROOT/QFED/v2018-07 | 247 GB | OPTIONAL SETTING
|
Offline grid-independent emissions
Offline grid-independent emissions were generated by running high-resolution (0.25°x0.3125° for GEOS-FP, 0.5°x0.625° for MERRA-2) simulations using the HEMCO standalone and online emissions from the HEMCO extensions.
Inventory | More info | Path | Data size | Status |
---|---|---|---|---|
OFFLINE_DUST
|
README | $ROOT/OFFLINE_DUST/v2019-01 | 18 GB | DEFAULT SETTING
|
OFFLINE_BIOGENICVOC
|
README | $ROOT/OFFLINE_BIOVOC/v2019-01 | 1400 GB | DEFAULT SETTING
|
OFFLINE_SEASALT
|
README | $ROOT/OFFLINE_SEASALT/v2019-01 | 876 GB | DEFAULT SETTING
|
OFFLINE_SOILNOX
|
README | $ROOT/OFFLINE_SOILNOX/v2019-01 | 48 GB | DEFAULT SETTING
|
Aerosol emissions
The following subdirectories of the HEMCO directory tree contain aerosol emission inventories.
NOTE: AeroCom volcanic SO2 emissions are listed under the Emissions implemented as HEMCO extensions.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Secondary organic aerosols | README | $ROOT/SOA/2014-07 | 3.0 MB | DEFAULT SETTING
|
Spatially varying OM/OC ratios for SOA species | README | $ROOT/OMOC/v2018-01 | 8.6 MB | OPTIONAL
|
Emissions implemented as HEMCO extensions
The following subdirectories of the HEMCO directory tree contain input data used by various HEMCO extensions. These HEMCO extensions compute emissions for quantities that depend on meteorological variables (e.g. emissions from lighting, biogenic processes, etc.).
NOTE: GFED3, GFED4 and FINNv1 are implemented as HEMCO extensions, but we have listed them with the other biomass burning inventories, for clarity.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Acetone ocean exchange | README | $ROOT/ACET/v2014-07 | 52 KB | DEFAULT SETTING |
ALD2 ocean exchange | README | $ROOT/ALD2/v2017-03 | 52 KB | DEFAULT SETTING |
DMS ocean exchange | README | $ROOT/DMS/v2015-07 | 3.0 MB | DEFAULT SETTING
|
NO from lightning | README | $ROOT/OFFLINE_LIGHTNING/v2019-01/ | 47 GB | DEFAULT SETTING
|
PARANOX ship plume model | README | $ROOT/PARANOX/v2015-02 | 162 MB | DEFAULT SETTING
|
AeroCom volcanic emissions | README | $ROOT/VOLCANO/v2019-08 | 660 MB | DEFAULT SETTING
|
Optional inventories (turned OFF by default in the standard emissions configuration) | ||||
DEAD dust model | README | $ROOT/DUST_DEAD/2014-07 | 712 KB | OPTIONAL
|
MEGAN biogenic emissions | README | $ROOT/MEGAN/v2018-05 | 12 MB | OPTIONAL
|
NO from soils/fertilizers | README | $ROOT/SOILNOX/v2014-07 | 79 MB | OPTIONAL
|
GINOUX dust model | README | $ROOT/DUST_GINOUX/2014-07 | 36 KB | OPTIONAL
|
GEOS-Chem specialty simulation data
The following subdirectories of the HEMCO directory tree input data (emissions, oxidants, etc.) for use with the GEOS-Chem specialty simulations. If you do not regularly use these simulations, you may choose not to download these data directories.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Aerosol-only simulation | README | $ROOT/OFFLINE_AEROSOL/v2014-09 | 165 MB | DEFAULT SETTING |
CH4 simulation | README1 README2 |
$ROOT/CH4/v2017-10 $ROOT/CH4/v2014-09 |
274 MB | DEFAULT SETTING |
CO2 simulation | README1 README2 README3 README4 README5 README6 |
$ROOT/CO2/v2015-04 $ROOT/CO2/v2015-04/BIO/ $ROOT/CO2/v2015-04/BIOFUEL/ $ROOT/CO2/v2015-04/CHEM/ $ROOT/CO2/v2015-04/FOSSIL/ $ROOT/CO2/v2015-04/OCEAN/ |
1033 MB | DEFAULT SETTING |
Mercury simulation | README1 README2 README3 README4 README5 README6 README7 README8 README9 README10 |
$ROOT/MERCURY/v2014-09 $ROOT/MERCURY/v2014-09/ARTISANAL $ROOT/MERCURY/v2014-09/BrOx $ROOT/MERCURY/v2014-09/Hg2_PARTITION $ROOT/MERCURY/v2014-09/JVALUES $ROOT/MERCURY/v2014-09/NATURAL $ROOT/MERCURY/v2014-09/NEI2005 $ROOT/MERCURY/v2014-09/OCEAN $ROOT/MERCURY/v2014-09/SOIL $ROOT/MERCURY/v2014-09/STREETS |
342 MB | DEFAULT SETTING |
POPs simulation | README | $$ROOT/POPs/v2015-08 | 809 MB | DEFAULT SETTING
|
RRTMG radiative transfer model | README | $ROOT/RRTMG/v2018-11 | 19 MB | DEFAULT SETTING
|
Tagged CO simulation | README | $ROOT/TAGGED_CO/v2017-04 | 260 KB | DEFAULT SETTING
|
Tagged O3 simulation | README | $ROOT/TAGGED_O3/v2014-09 | 372 MB | DEFAULT SETTING |
O3 for offline simulations | README | $ROOT/O3/v2014-09 | 130 MB | DEFAULT SETTING |
OH for offline simulations | README1 README2 |
$ROOT/OH/v2014-09 $ROOT/OH/v2014-09/v5-07-08 $ROOT/OH/v2014-09/v7-02-03.GMI |
148 MB | DEFAULT SETTING |
H2O2 for offline simulations | README | $ROOT/OXIDANTS/v2014-07 | 32 MB | DEFAULT SETTING |
Oceanic Chlorophyll-A for Hg simulations | README | $ROOT/CHLA/v2014-07 | 2.0 MB | DEFAULT SETTING |
Obsolete inventories (superseded by newer developments) | ||||
CH3I simulation | README | $ROOT/CH3I/v2014-07 | 280 KB | OBSOLETE
|
POPs simulation | README | $ROOT/POPs/v2014-09 | 622 MB | OBSOLETE
|
Tagged CO simulation | README | $ROOT/TAGGED_CO/v2014-08q | 260 KB | OBSOLETE
|
--Bob Yantosca (talk) 19:01, 2 July 2018 (UTC)
Non-emissions data
The following subdirectories of the HEMCO directory tree input data contains non-emissions data sets. Most of these are used as inputs to GEOS-Chem's chemistry modules (i.e. photolysis, stratospheric chemistry, etc.).
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Timezone offsets from UTC | README | $ROOT/TIMEZONES/v2015-02 | 264 KB | DEFAULT SETTING
|
UV surface albedoes | README | $ROOT/UVALBEDO/v2019-06 | 476 KB | DEFAULT SETTING
|
TOMS/SBUV O3 columns | README | $ROOT/TOMS_SBUV/v2019-06 | 21 MB | DEFAULT SETTING |
Stratospheric Bry from CCM | README | $ROOT/STRAT/v2015-01/Bry | 385 MB | DEFAULT SETTING
|
GMI strat chem mechanism | README | $ROOT/GMI/v2015-02 | 16 GB | DEFAULT SETTING
|
UCX chemistry mechanism | README | $ROOT/UCX/v2018-02 | 12 GB | DEFAULT SETTING
|
MODIS XLAI (leaf area index per land type) |
README | $ROOT/MODIS_XLAI/v2017-07 | 203 MB | TDEFAULT SETTING
|
Olson land map (masks for each land type) |
README | $ROOT/OLSON_MAP/v2019-02 | 1.9 MB | TDEFAULT SETTING
|
Yuan-processed MODIS XLAI (leaf area index per land type) |
README | $ROOT/Yuan_XLAI/v2019-03 | 203 MB | TDEFAULT SETTING
|
Obsolete inventories (superseded by newer developments) | ||||
TOMS/SBUV O3 columns | README | $ROOT/TOMS_SBUV/v2015-03 | 17 MB | OBSOLETE
|
Other inputs for HEMCO
The following subdirectories of the HEMCO directory tree input data for various HEMCO functions. These include regional masks, emission scale factors, and grid information.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Annual scale factors | README | $ROOT/AnnualScalar/v2014-07 | 3.0 MB | DEFAULT SETTING
|
NEI2005 scale factors | README | $ROOT/NEI2005/v2014-09 | 8.1 MB | DEFAULT SETTING |
ID codes for each country | README | $ROOT/COUNTRY_ID/v2015-04 | 15 MB | DEFAULT SETTING
|
Mask files for regional emissions | README | $ROOT/MASKS/v2014-07 | 51 MB | DEFAULT SETTING
|
MAP_A2A regridding data | README | $ROOT/MAP_A2A/v2014-07 | 152 KB | DEFAULT SETTING |
Obsolete inventories (superseded by newer developments) | ||||
Weekly scale factors | README | $ROOT/WEEKSCALE/v2014-07 | 4.0 KB | OBSOLETE
|
--Bob Yantosca (talk) 19:05, 2 July 2018 (UTC)
The hemco_data_download package
Obtaining the hemco_data_download package
The hemco_data_download package will be retired in GEOS-Chem 12.7.0 in favor of the GEOS-Chem dry run option.
To obtain the hemco_data_download package, use Git to clone this repository
git clone https://github.com/GCST/hemco_data_download.git
This will create a directory named hemco_data_download, in which you should see the following files:
File | Description |
---|---|
README | File with an overall description of the directory contents |
hemcoDataDownload.pl | Perl script to download HEMCO data directories |
hemcoDataDownload.rc | Configuration file for the hemcoDataDownload.pl script. In this file you can specify which HEMCO data directories you would like to download and which you would like to omit. |
forTesting.rc | A configuration file that you can use for testing or debugging. This will tell hemcoDataDownload.pl only to download a couple of emissions inventories with files that do not take up much disk space. |
--Bob Yantosca (talk) 17:02, 5 December 2016 (UTC)
Setting up the configuration file
The hemco_data_download package will be retired in GEOS-Chem 12.7.0 in favor of the GEOS-Chem dry run option.
The configuration files (i.e. hemcoDataDownload.rc and forTesting.rc) are pretty much self-explanatory.
At the top of the configuration file you will see this section:
############################################################################### # # # Specify the remote and local HEMCO data paths, plus other options. # # # ############################################################################### Remote HEMCO data path | ftp://ftp.as.harvard.edu/gcgrid/data/ExtData/HEMCO Your HEMCO data path | /as/scratch/bmy/HEMCO Verbose output | yes Dryrun only? | no
Path | Description |
---|---|
Remote HEMCO data path | Location on the FTP server from which you are going to download the data. This can be from either Harvard or from Dalhousie. (For now we will use the Harvard server). You can edit this accordingly. |
Your HEMCO data path | The root-level directory for HEMCO data on your own disk space. If you are not sure where to place this, then ask your sysadmin. |
Verbose output | Lets you specify if you want to print out extra output during the download process. This can be set to either "yes" or "no". |
Dryrun only | Allows you to print out the data download commands without actually downloading the data. This is useful for debugging. This can be set to either "yes" or "no". |
In the next section you specify all of the HEMCO inventories that you want to download. You will see this header:
############################################################################### # # # THE FOLLOWING DATA DIRECTORIES WILL BE DOWNLOADED. # # # # These data directories comprise the recommended emissions configuration # # for typical GEOS-Chem full-chemistry and specialty simulations. # # # # NOTE: In most cases, you only have to specify the top-level folder. # # All subfolders will be downloaded automatically. # # # ############################################################################### #=============================+================================================ # AEROSOLS | Directory paths #=============================+================================================ AEROCOM volcano emissions | $ROOT/VOLCANO/v2014-10 Bond et al BC/OC | $ROOT/BCOC_BOND/v2014-07 Cooke et al BC/OC | $ROOT/BCOC_COOKE/v2014-07 Secondary organic aerosols | $ROOT/SOA/v2014-07 ... etc ...
Each line specifies the name of a HEMCO emissions inventory and the data path where it can be found on disk, relative to the root data path. NOTE: The script will replace the $ROOT token with the value you gave to the "HEMCO remote data path" above. (Lines starting with the comment character # will be ignored.)
Any inventory found in this section will be downloaded. To prevent an inventory from being downloaded you can either comment it out (i.e. place a # in the first column) or move the inventory to the next section.
The final section specifies HEMCO emission inventories that you do not wish to download. The section looks like this:
############################################################################### # # # THE FOLLOWING DATA DIRECTORIES WILL NOT BE DOWNLOADED. # # # # These data directories contain are optional emissions inventories that # # are not used in typical GEOS-Chem simulations. If you wish to download # # any of these inventories, simply move the corresponding entry for each # # inventory to the previous section. # # # ############################################################################### CH3I simulation (obsolete) | $ROOT/CH3I/v2014-07 Chlorophyll A | $ROOT/CHLA/v2014-07 ... etc ...
--Bob Y. 13:55, 12 February 2015 (EST)
Downloading the data
The hemco_data_download package will be retired in GEOS-Chem 12.7.0 in favor of the GEOS-Chem dry run option.
Once you have set up your configuration file, you can run the hemcoDataDownload.pl script to start downloading the HEMCO data to your local server. To run the script you can type:
hemcoDataDownload.pl
If you do not specify a configuration file name, then the hemcoDataDownload.pl scriptwill read the default configuration file hemcoDataDownload.rc configuration file. If you wish to specify a different configuration file name, simply pass that as an argument to the script, e.g.
hemcoDataDownload.pl myNewConfigFile.rc
Before you start downloading GB's of data, we recommend that you run a short test to make sure that the data directories are being copied to the proper locations on your disk server. For this purpose, we have provided a configuration file named forTesting.rc. Typing
hemcoDataDownload.pl forTesting.pl
will only download a couple of data inventories that do not take up much disk space. This allows you to ensure that the data transfer is sucessful without making you wait a long time.
--Bob Y. 13:57, 12 February 2015 (EST)
Submitting new data for use with HEMCO
If you have a new emissions inventory or other atmospheric data set that you would like to add to the HEMCO data directories, then please contact the GEOS-Chem Support Team. Data files must be in COARDS-compliant or CF-compliant netCDF format. Please see the following pages for more information:
--Bob Yantosca (talk) 22:24, 14 January 2020 (UTC)