HEMCO data directories
On this page we describe the directory tree from which the HEMCO emissions component can read emissions inventories and other atmospheric data sets.
Contents
- 1 Overview
- 2 HEMCO data directory structure
- 2.1 Aerosols
- 2.2 Anthropogenic and biofuel emissions
- 2.3 Anthropogenic aircraft and ship emissions
- 2.4 Biomass burning emissions
- 2.5 Emissions implemented as HEMCO extensions
- 2.6 Future and historical emissions
- 2.7 GEOS-Chem specialty simulation data
- 2.8 Seawater concentrations
- 2.9 Stratospheric data
- 2.10 Other inputs for HEMCO
- 3 Downloading the HEMCO data directories
Overview
The HEMCO emissions component can read several types of emission inventories, as well as other types atmospheric data sets, such as production and loss rates, or concentration data. We have collated all of this data into a comprehensive directory tree structure. Each folder of the HEMCO data directory tree represents a particular emissions inventory or other data set.
At present, the HEMCO data directory tree resides on the disk servers at Harvard University (and soon at Dalhousie University). We have created a package that will let you download this directory tree to your local disk storage space. For more information, please see the Downloading the HEMCO data directories section below.
--Bob Y. 10:22, 13 February 2015 (EST)
HEMCO data directory structure
The sections below describe each of the data sets contained in the HEMCO directory tree, grouped by type. Each data set is contained in a subdirectory of the HEMCO root directory, which is specified by the $ROOT token. (For example, on the Harvard disk server, $ROOT points to the directory /mnt/gcgrid/data/ExtData/HEMCO/.)
--Bob Y. 10:49, 13 February 2015 (EST)
Aerosols
The following subdirectories of the HEMCO directory tree contain aerosol emission inventories.
Inventory | Data file info | Path | Status |
---|---|---|---|
AEROCOM volcanic emissions | README | $ROOT/VOLCANO/v2014-10 | CURRENTLY USED |
Tami Bond et al (2007) EC and OC emissions | README | $ROOT/BCOC_BOND/v2014-07 | CURRENTLY USED |
Cooke et al BC and OC emissions | README | $ROOT/BCOC_COOKE/v2014-07 | OPTIONAL
|
Secondary organic aerosols | README | $ROOT/HEMCO/SOA/2014-07 | CURRENTLY USED |
--Bob Y. 15:35, 12 February 2015 (EST)
Anthropogenic and biofuel emissions
The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic and biofuel emissions.
Inventory | Data file info | Path | Status |
---|---|---|---|
GEIA global anthropogenic | README | $ROOT/GEIA/v2014-07 | Slated for replacement |
GEIA NH3 (anthro, biofuel, natural source) | README | $ROOT/NH3/v2014-07 | CURRENTLY USED
|
EDGAR v3 global anthropogenic | README | $ROOT/EDGAR/v2014-07 | CURRENTLY USED
|
EDGAR v4.2 global anthropogenic | README | $ROOT/EDGARv42/v2015-02 | PENDING BENCHMARK APPROVAL
|
RETRO VOC emissions | README | $ROOT/RETRO/v2014-07/orig_kgC | CURRENTLY USED |
Yevich & Logan biofuels | README | $ROOT/BIOFUEL/v2014-07 | CURRENTLY USED |
BRAVO regional anthropogenic | README | $ROOT/BRAVO/v2014-07 | CURRENTLY USED |
CAC regional anthropogenic | README | $ROOT/CAC/v2014-07 | CURRENTLY USED |
EMEP regional anthropogenic | README | $ROOT/EMEP/v2015-01 | CURRENTLY USED |
NEI2005 regional anthro/biofuel | README | $ROOT/NEI2005/v2014-09 | CURRENTLY USED |
NEI/VISTAS scale factors | README | $ROOT/VISTAS/v2014-07 | CURRENTLY USED |
Streets regional anthro | README | $ROOT/STREETS/v2014-07 | CURRENTLY USED |
MASAGE agricultural NH3 | README | $ROOT/MASAGE_NH3/v2015-02 | CURRENTLY USED |
Yaping Xiao et al C2H6 and C3H8 anthropogenic | README | $ROOT/XIAO/v2014-09 | CURRENTLY USED |
--Bob Y. 12:17, 13 February 2015 (EST)
Anthropogenic aircraft and ship emissions
The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic aircraft and ship emissions.
Inventory | Data file info | Path | Status |
---|---|---|---|
AEIC aircraft | README | $ROOT/AEIC/v2014-10 | CURRENTLY USED |
ARCTAS ship emissions (SO2) | README | $ROOT/ARCTAS_SHIP/v2014-07 | CURRENTLY USED
|
Corbett et al ship emissions (SO2) | README | $ROOT/VOLCANO/v2014-10 | OPTIONAL
|
ICOADS ship (CO) | README | $ROOT/ICOADS_SHIP/v2014-07 | CURRENTLY USED
|
EDGAR v3 ship (CO) | README | $ROOT/EDGAR/v2014-07 | OPTIONAL
|
EMEP ship (CO, SO2) | README | $ROOT/EMEP/v2015-01 | CURRENTLY USED
|
--Bob Y. 11:03, 13 February 2015 (EST)
Biomass burning emissions
The following subdirectories of the HEMCO directory tree contain inventories of biomass burning emissions.
Inventory | Data file info | Path | Status |
---|---|---|---|
GFED3 biomass | README | $ROOT/GFED3/v2014-10 | CURRENTLY USED
|
FINN biomass | README | $ROOT/FINN/v2015-02 | OPTIONAL
|
QFED biomass | README | $ROOT/QFED/v2014-09 | OPTIONAL
|
GFED2 biomass | README | $ROOT/GFED2/v2014-07 | OBSOLETE
|
Duncan et al biomass | README | $ROOT/BIOBURN/v2014-07 | OBSOLETE
|
--Bob Y. 15:48, 13 February 2015 (EST)
Emissions implemented as HEMCO extensions
The following subdirectories of the HEMCO directory tree contain input data used by various HEMCO extensions. These HEMCO extensions compute emissions for quantities that depend on meteorological variables (e.g. emissions from lighting, biogenic processes, etc.).
Inventory | Data file info! | Path | Status |
---|---|---|---|
DEAD dust model | README | $ROOT/DUST_DEAD/2014-07 | CURRENTLY USED
|
GINOUX dust model | README | $ROOT/DUST_GINOUX/2014-07 | OPTIONAL
|
MEGAN biogenic emissions | README | $ROOT/MEGAN/v2014-07 | CURRENTLY USED |
NO from lightning | README | $ROOT/LIGHTNOX/v2014-07 | CURRENTLY USED |
NO from soils/fertilizers | README | $ROOT/SOILNOX/v2014-07 | CURRENTLY USED |
PARANOX ship plume model | README | $ROOT/PARANOX/v2014-07 | CURRENTLY USED |
--Bob Y. 10:49, 13 February 2015 (EST)
Future and historical emissions
The following subdirectories of the HEMCO directory tree contain historical and future emissions inventories.
Inventory | Data file info | Path | Status |
---|---|---|---|
RCP future scenarios | README | $ROOT/RCP/RCP26 | STILL BEING IMPLEMENTED |
--Bob Y. 11:07, 13 February 2015 (EST)
GEOS-Chem specialty simulation data
The following subdirectories of the HEMCO directory tree input data (emissions, oxidants, etc.) for use with the GEOS-Chem specialty simulations. If you do not regularly use these simulations, you may choose not to download these data directories.
Inventory | Data file info | Path | Status |
---|---|---|---|
Aerosol-only simulation | README | $ROOT/OFFLINE_AEROSOL/v2014-09 | CURRENTLY USED |
CH4 simulation | CH4 simulation | $ROOT/CH4/v2014-09 | CURRENTLY USED |
CO2 simulation | README | $ROOT/CO2/v2014-09 | CURRENTLY USED |
Mercury simulation | README | $ROOT/MERCURY/v2014-09/ | CURRENTLY USED |
POPs simulation | README | $ROOT/POPs/v2014-09 | CURRENTLY USED |
Tagged CO simulation | README | $ROOT/TAGGED_CO/v2014-08 | CURRENTLY USED |
Tagged O3 simulation | README | $ROOT/TAGGED_O3/v2014-09 | CURRENTLY USED |
O3 for offline simulations | README | $ROOT/O3/v2014-09/ | CURRENTLY USED |
OH for offline simulations | README | $ROOT/OH/v2014-09 | CURRENTLY USED |
H2O2 for offline simulations | README | $ROOT/OXIDANTS/v2014-07 | CURRENTLY USED |
CH3I simulation | README | $ROOT/CH3I/v2014-07 | OBSOLETE
|
--Bob Y. 11:17, 13 February 2015 (EST)
Seawater concentrations
The following subdirectories of the HEMCO directory tree input data contains seawater concentrations used by the acetone ocean loss and sulfate chemistry modules.
Inventory | Data file info | Path | Status |
---|---|---|---|
Acetone seawater | README | $ROOT/ACET/v2014-07 | CURRENTLY USED |
DMS seawater | README | $ROOT/DMS/v2014-07 | CURRENTLY USED |
--Bob Y. 11:22, 13 February 2015 (EST)
Stratospheric data
The following subdirectories of the HEMCO directory tree input data contains data for use with the GEOS-Chem's stratospheric chemistry mechanism.
Inventory | Data file info | Path | Status |
---|---|---|---|
Stratospheric Bry data | README | $ROOT/STRAT/v2015-01/Bry | CURRENTLY USED |
--Bob Y. 11:23, 13 February 2015 (EST)
Other inputs for HEMCO
The following subdirectories of the HEMCO directory tree input data for various HEMCO functions. These include regional masks, timezones, and emission scale factors.
Inventory | Data file info | Path | Status |
---|---|---|---|
Annual scale factors | README | $ROOT/AnnualScalar/v2014-07 | CURRENTLY USED |
Mask files | README | $ROOT/MASKS/v2014-07 | CURRENTLY USED |
MAP_A2A regridding data | README | $ROOT/MAP_A2A/v2014-07 | CURRENTLY USED |
Timezone offsets from UTC | README | $ROOT/TIMEZONES/v2015-02 | CURRENTLY USED |
Weekly scale factors | README | $ROOT/WEEKSCALE/v2014-07 | CURRENTLY USED |
--Bob Y. 11:23, 13 February 2015 (EST)
Downloading the HEMCO data directories
The GEOS-Chem Support Team has created a package called hemco_data_download. With this package, you can download the various emissions inventories and related data files for HEMCO to your own disk server. Furthermore, you can specify which data directories that you would like to download (as well as those you would like to ignore) via a configuration file.
Obtaining the hemco_data_download package
To obtain the hemco_data_download package, use Git to clone this repository
git clone https://github.com/GCST/hemco_data_download.git
This will create a directory named hemco_data_download, in which you should see the following files:
- README
- File with an overall description of the directory contents
- hemcoDataDownload.pl
- Perl script to download HEMCO data directories
- hemcoDataDownload.rc
- Configuration file for the hemcoDataDownload.pl script. In this file you can specify which HEMCO data directories you would like to download and which you would like to omit.
- forTesting.rc
- A configuration file that you can use for testing or debugging. This will tell hemcoDataDownload.pl only to download a couple of emissions inventories with files that do not take up much disk space.
--Bob Y. 13:58, 12 February 2015 (EST)
Setting up the configuration file
The configuration files (i.e. hemcoDataDownload.rc and forTesting.rc) are pretty much self-explanatory.
At the top of the configuration file you will see this section:
############################################################################### # # # Specify the remote and local HEMCO data paths, plus other options. # # # ############################################################################### Remote HEMCO data path | ftp://ftp.as.harvard.edu/gcgrid/data/ExtData/HEMCO Your HEMCO data path | /as/scratch/bmy/HEMCO Verbose output | yes Dryrun only? | no
- Remote HEMCO data path is the location on the FTP server from which you are going to download the data. This can be from either Harvard or from Dalhousie. (For now we will use the Harvard server). You can edit this accordingly.
- Your HEMCO data path specifies the root-level directory for HEMCO data on your own disk space. If you are not sure where to place this, then ask your sysadmin.
- Verbose output lets you specify if you want to print out extra output during the download process. This can be set to either "yes" or "no".
- Dryrun only flag allows you to print out the data download commands without actually downloading the data. This is useful for debugging.
In the next section you specify all of the HEMCO inventories that you want to download. You will see this header:
############################################################################### # # # THE FOLLOWING DATA DIRECTORIES WILL BE DOWNLOADED. # # # # These data directories comprise the recommended emissions configuration # # for typical GEOS-Chem full-chemistry and specialty simulations. # # # ############################################################################### #=============================+================================================ # AEROSOLS | Directory paths #=============================+================================================ AEROCOM volcano emissions | $ROOT/VOLCANO/v2014-10 Bond et al BC/OC | $ROOT/BCOC_BOND/v2014-07 Cooke et al BC/OC | $ROOT/BCOC_COOKE/v2014-07 Secondary organic aerosols | $ROOT/SOA/v2014-07 ... etc ...
Each line specifies the name of a HEMCO emissions inventory and the data path where it can be found on disk, relative to the root data path. NOTE: The script will replace the $ROOT token with the value you gave to the "HEMCO remote data path" above. (Lines starting with the comment character # will be ignored.)
Any inventory found in this section will be downloaded. To prevent an inventory from being downloaded you can either comment it out (i.e. place a # in the first column) or move the inventory to the next section.
The final section specifies HEMCO emission inventories that you do not wish to download. The section looks like this:
############################################################################### # # # THE FOLLOWING DATA DIRECTORIES WILL NOT BE DOWNLOADED. # # # # These data directories contain are optional emissions inventories that # # are not used in typical GEOS-Chem simulations. If you wish to download # # any of these inventories, simply move the corresponding entry for each # # inventory to the previous section. # # # ############################################################################### CH3I simulation (obsolete) | $ROOT/CH3I/v2014-07 Chlorophyll A | $ROOT/CHLA/v2014-07 ... etc ...
--Bob Y. 13:55, 12 February 2015 (EST)
Downloading the data
Once you have set up your configuration file, you can run the hemcoDataDownload.pl script to start downloading the HEMCO data to your local server. To run the script you can type:
hemcoDataDownload.pl
If you do not specify a configuration file name, then the hemcoDataDownload.pl scriptwill read the default configuration file hemcoDataDownload.rc configuration file. If you wish to specify a different configuration file name, simply pass that as an argument to the script, e.g.
hemcoDataDownload.pl myNewConfigFile.rc
Before you start downloading GB's of data, we recommend that you run a short test to make sure that the data directories are being copied to the proper locations on your disk server. For this purpose, we have provided a configuration file named forTesting.rc. Typing
hemcoDataDownload.pl forTesting.pl
will only download a couple of data inventories that do not take up much disk space. This allows you to ensure that the data transfer is sucessful without making you wait a long time.
--Bob Y. 13:57, 12 February 2015 (EST)