HEMCO data directories

From Geos-chem
Jump to navigation Jump to search

On this page we describe the directory tree from which the HEMCO emissions component can read emissions inventories and other atmospheric data sets.

Overview

The HEMCO emissions component can read several types of emission inventories, as well as other types atmospheric data sets, such as production and loss rates, or concentration data. We have collated all of this data into a comprehensive directory tree structure. Each folder of the HEMCO data directory tree represents a particular emissions inventory or other data set.

At present, the HEMCO data directory tree resides on the disk servers at Harvard University (and soon at Dalhousie University). We have created a package that will let you download this directory tree to your local disk storage space. For more information, please see the Downloading the HEMCO data directories section below.

--Bob Y. 10:22, 13 February 2015 (EST)

HEMCO data directory structure

The sections below describe each of the data sets contained in the HEMCO directory tree, grouped by type. Each data set is contained in a subdirectory of the HEMCO root directory, which is specified by the $ROOT token. (For example, on the Harvard disk server, $ROOT points to the directory /mnt/gcgrid/data/ExtData/HEMCO/.)

The Status column in the tables below denote the following:

CURRENTLY USED Denotes that the data set is part of the standard emissions configuration for GEOS-Chem, that is:
OPTIONAL Denotes means that the data set is not used in the standard GEOS-Chem emissions configuration, but may be used in place of another data set.
TO BE ADDED SOON Denotes that the data set is in the process of being added to the HEMCO data directory tree.
OBSOLETE Denotes that the data set is outdated and has been removed from the standard GEOS-Chem emissions configuration.
NOT USED Denotes that the data set may be up-to-date, but is not being used in the standard GEOS-Chem emissions configuration for a particular reason.

If use GEOS-Chem, then we recommend that you download the data sets that are listed as CURRENTLY USED, as these form the standard GEOS-Chem emissions configuration. You are free, however, to use any of the data sets listed below. Your choice of data sets will depend on your particular research needs.

--Bob Y. 15:12, 17 March 2015 (EDT)

Aerosol emissions

The following subdirectories of the HEMCO directory tree contain aerosol emission inventories.

Inventory Data file info Path Status
AEROCOM volcanic emissions README $ROOT/VOLCANO/v2015-02 CURRENTLY USED
  • Default global inventory for volcanic SO2.
Tami Bond et al (2007) BC and OC emissions README $ROOT/BCOC_BOND/v2014-07 CURRENTLY USED
  • Default global inventory for black carbon (BC) and organic carbon (OC).
Secondary organic aerosols README $ROOT/HEMCO/SOA/2014-07 CURRENTLY USED
  • Contain various inputs for SOA simulations.
Cooke et al BC and OC emissions README $ROOT/BCOC_COOKE/v2014-07 OPTIONAL
  • Can be used in place of Bond et al (2007).

--Bob Y. 10:15, 18 March 2015 (EDT)

Anthropogenic and biofuel emissions

The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic and biofuel emissions.

Please also see list of species included in each of these global inventories and regional inventories.

Inventory Data file info Path Status
EDGAR v4.2 global anthropogenic README $ROOT/EDGARv42/v2015-02/CO
$ROOT/EDGARv42/v2015-02/NH3
$ROOT/EDGARv42/v2015-02/NO
$ROOT/EDGARv42/v2015-02/SO2
$ROOT/EDGARv42/v2015-02/VOCv2
CURRENTLY USED
  • Default global inventory for NOx, CO, SO2, and NH3 (replaces EDGAR v3).
  • The following sectors are not used:
    • EDGAR v4.2 ship emissions
    • EDGAR v4.2 soil NO emissions
    • EDGAR v4.2 biomass emissions
  • NAP is still taken from EDGAR v2 data.
GEIA NH3 (anthro, biofuel, natural source) README $ROOT/NH3/v2014-07 CURRENTLY USED
  • Default global inventory for biofuel & natural NH3.
  • Anthro NH3 is overwritten by EDGAR v4.2
  • Slated for replacement after GEOS-Chem v10-01 ships.
RETRO VOC emissions README $ROOT/RETRO/v2014-07/orig_kgC CURRENTLY USED
Yevich & Logan biofuels README $ROOT/BIOFUEL/v2014-07 CURRENTLY USED
  • Default global biofuel emissions inventory.
BRAVO regional anthropogenic README $ROOT/BRAVO/v2014-07 CURRENTLY USED
  • Overwrites CO, NO, and SO2 over Mexico.
CAC regional anthropogenic README $ROOT/CAC/v2014-07 CURRENTLY USED
  • Overwrites CO, NO, NH3, and SOx over Canada.
EMEP regional anthropogenic README $ROOT/EMEP/v2015-01 CURRENTLY USED
  • Overwrites CO, NH3, NO, SO2, PRPE, ALK4, C2H6, ALD2, and MEK over Europe.
MASAGE agricultural NH3 README $ROOT/MASAGE_NH3/v2015-02 CURRENTLY USED
  • Adds NH3 from agricultural processes to the anthropogenic NH3 category.
MIX regional anthro README $ROOT/MIX/v2015-03 CURRENTLY USED
  • Replaces David Streets et al emissions.
  • The base years for MIX is 2008 and 2010, as opposed to 2006 for Streets.
NEI2005 regional anthro/biofuel README $ROOT/NEI2005/v2014-09 CURRENTLY USED
  • Overwrites ACET, ALK4, ALD2, BC, C2H6, C3H8, CO, CH2O, MEK, NH3, NIT, NO, OC, PRPE, SO2, and SO4 over the USA.
  • Use NEI2011 for simulations covering the period 2011-2013; otherwise use NEI2005.
  • NOTE: NEI2005 is eventually going to be retired.
NEI/VISTAS scale factors README1
README2
$ROOT/VISTAS/v2014-07/VISTAS_NOX
$ROOT/VISTAS/v2014-07/VISTAS_WEWD
CURRENTLY USED
  • Used to scale NO in the NEI2005 inventory.
NEI2011 regional anthro/biofuel README $ROOT/NEI2011/v2015-03 CURRENTLY USED
  • Overwrites CO, NO, NO2, HNO2, NH3, CH2O, RCHO, MACR, ACET, ALK4, PRPE, EOH, MOH, XYLE, TOLU, BENZ, SO2, SO4, C2H4, C2H6, C3H8, ALD2, BC, and OC over the USA.
  • Use NEI2011 for simulations covering the period 2011-2013; otherwise use NEI2005.
NEI2011 agricultural emissions only README $ROOT/NEI2011_ag_only/v2015-02 CURRENTLY USED
Yaping Xiao et al C2H6 and C3H8 anthropogenic README $ROOT/XIAO/v2014-09 CURRENTLY USED
HTAP global anthro+biofuel README $ROOT/HTAP/v2015-03 OPTIONAL
  • Can be used instead of EDGAR v4.2 emissions.
  • NOTE: HTAP includes some regional emissions from other inventories. If you select HTAP you might not need to select the other regional inventories in HEMCO. See this document for more information.
Streets regional anthro README1
README2
$ROOT/STREETS/v2014-07 OPTIONAL
  • Streets is being retired in favor of MIX.
  • We will keep Streets as a research option. You may use it if you wish.
NEI2008 regional anthro/biofuel README $ROOT/NEI2008/v2015-02 TO BE ADDED SOON
GEIA global anthropogenic README $ROOT/GEIA/v2014-07 OBSOLETE
  • Slated for replacement by newer inventories
EDGAR v3 global anthropogenic README $ROOT/EDGAR/v2014-07 OBSOLETE

--Bob Y. 16:22, 17 March 2015 (EDT)

Anthropogenic aircraft and ship emissions

The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic aircraft and ship emissions.

Inventory Data file info Path Status
AEIC aircraft README $ROOT/AEIC/v2014-10 CURRENTLY USED
  • Default global aircraft emissions inventory.
  • Contains fuel burned, NO, CO, and hydrocarbons.
ARCTAS ship emissions (SO2) README $ROOT/ARCTAS_SHIP/v2014-07 CURRENTLY USED
  • Default inventory for ship SO2.
  • Overwritten by EMEP over Europe.
ICOADS ship (CO) README $ROOT/ICOADS_SHIP/v2014-07 CURRENTLY USED
  • Default inventory for ship CO.
  • Overwritten by EMEP over Europe.
EMEP ship (CO, NO, SO2) README $ROOT/EMEP/v2015-01 CURRENTLY USED
  • Overwrites ship CO, NO, SO2 over Europe.
  • Stored together w/ other EMEP data files.
Corbett et al ship emissions (SO2) README $ROOT/VOLCANO/v2014-10 OPTIONAL
  • Can be used to overwrite ARCTAS ship SO2.
HTAP ship (CO, NO, SO2) README $ROOT/HTAP/v2015-03 OPTIONAL
NEI2011 ship (several species) README $ROOT/NEI2011/v2015-03 OPTIONAL
  • Can be used to overwrite the following ship emissions over North America: NO, NO2, HONO, CO, NH3, CH2O, RCHO, MACR, ACET, C3H8, MEK, ALK4, PRPE, EOH, MOH, XYLE, TOLU, BENZ, SO2, SO4, C2H4, C2H6, ALD2, BCPI, BCPO, OCPI, OCPO.
EDGAR v3 ship (CO) README $ROOT/EDGAR/v2014-07 OBSOLETE
EDGAR v4.2 ship README - NOT USED
  • EDGAR v4.2 ship emissions are not used because they are lumped with other non-road transportation sectors (aircraft, rail, etc), and thus cannot be easily separated.

--Bob Y. 10:07, 18 March 2015 (EDT)

Biomass burning emissions

The following subdirectories of the HEMCO directory tree contain inventories of biomass burning emissions.

Inventory Data file info Path Status
GFED4 biomass README $ROOT/GFED4/v2015-03 CURRENTLY USED
  • Default global biomass burning inventory.
  • Monthly-mean data is available from 1996-2013
  • Use daily scale factors from GFED3.
  • Use 3-hourly scale factors from GFED3.
GFED3 biomass README $ROOT/GFED3/v2014-10 CURRENTLY USED
  • GEOS-Chem benchmarks now use GFED4 monthly data, but
    • The GFED_daily option reads daily scale factors from the GFED3 data directory.
    • The GFED_3hourly option reads 3-hourly scale factors from the GFED3 data directory.
FINN biomass README $ROOT/FINN/v2015-02 OPTIONAL
  • You may choose to replace GFED with FINN for research purposes.
  • If you do not need FINN, you may choose not to download it, in order to save disk space.
QFED biomass README $ROOT/QFED/v2014-09 OPTIONAL
  • You may choose to replace GFED3 with QFED for research purposes.
  • If you do not need QFED, you may choose not to download it, in order to save disk space.
GFED2 biomass README $ROOT/GFED2/v2014-07 OBSOLETE
  • Superseded by GFED4.
Duncan et al biomass README $ROOT/BIOBURN/v2014-07 OBSOLETE
  • Superseded by GFED4.

--Bob Y. 15:03, 17 March 2015 (EDT)

Bromine emissions

The following subdirectories of the HEMCO directory tree contain inventories of bromine emissions.

NOTE: See our Non-emissions data section for information about stratospheric bromine concentration data.

Inventory Data file info Path Status
Liang et al VSL emissions README $ROOT/STRAT/v2015-01/Bry CURRENTLY USED

--Bob Y. 15:03, 17 March 2015 (EDT)

Emissions implemented as HEMCO extensions

The following subdirectories of the HEMCO directory tree contain input data used by various HEMCO extensions. These HEMCO extensions compute emissions for quantities that depend on meteorological variables (e.g. emissions from lighting, biogenic processes, etc.).

NOTE: GFED3, GFED4 and FINN are implemented as HEMCO extensions, but we have listed them with the other biomass burning inventories, for clarity.

Inventory Data file info! Path Status
Acetone ocean exchange README $ROOT/ACET/v2014-07 CURRENTLY USED
DEAD dust model README $ROOT/DUST_DEAD/2014-07 CURRENTLY USED
  • Default dust mobilization scheme
DMS ocean exchange README $ROOT/DMS/v2014-07 CURRENTLY USED
MEGAN biogenic emissions README $ROOT/MEGAN/v2015-02 CURRENTLY USED
  • Default global biogenic emissions inventory.
NO from lightning README $ROOT/LIGHTNOX/v2014-07 CURRENTLY USED
NO from soils/fertilizers README $ROOT/SOILNOX/v2014-07 CURRENTLY USED
PARANOX ship plume model README $ROOT/PARANOX/v2015-02 CURRENTLY USED
  • Computes the aging of emissions in ship exhaust plumes.
GINOUX dust model README $ROOT/DUST_GINOUX/2014-07 OPTIONAL
  • You can use this in place of the DEAD dust model.

--Bob Y. 10:09, 18 March 2015 (EDT)

Future and historical emissions

The following subdirectories of the HEMCO directory tree contain historical and future emissions inventories.

Inventory Data file info Path Status
RCP future scenarios README $ROOT/RCP/v2015-02 OPTIONAL
  • RCP emissions are not part of the standard GEOS-Chem emissions configuration but may be used as a research option.

--Bob Y. 15:04, 17 March 2015 (EDT)

GEOS-Chem specialty simulation data

The following subdirectories of the HEMCO directory tree input data (emissions, oxidants, etc.) for use with the GEOS-Chem specialty simulations. If you do not regularly use these simulations, you may choose not to download these data directories.

Inventory Data file info Path Status
Aerosol-only simulation README $ROOT/OFFLINE_AEROSOL/v2014-09 CURRENTLY USED
CH4 simulation README $ROOT/CH4/v2014-09 CURRENTLY USED
CO2 simulation README1
README2
README3
README4
README5
$ROOT/CO2/v2014-09
$ROOT/CO2/v2014-09/BBIO/
$ROOT/CO2/v2014-09/CHEM/
$ROOT/CO2/v2014-09/FOSSIL/
$ROOT/CO2/v2014-09/OCEAN/
CURRENTLY USED
Mercury simulation README1
README2
README3
README4
README5
README6
README7
README8
README9
$ROOT/MERCURY/v2014-09/ARTISANAL
$ROOT/MERCURY/v2014-09/BrOx
$ROOT/MERCURY/v2014-09/Hg2_PARTITION
$ROOT/MERCURY/v2014-09/JVALUES
$ROOT/MERCURY/v2014-09/NATURAL
$ROOT/MERCURY/v2014-09/NEI2005
$ROOT/MERCURY/v2014-09/OCEAN
$ROOT/MERCURY/v2014-09/SOIL
$ROOT/MERCURY/v2014-09/STREETS
CURRENTLY USED
POPs simulation README $ROOT/POPs/v2014-09 CURRENTLY USED
Tagged CO simulation README $ROOT/TAGGED_CO/v2014-08 CURRENTLY USED
Tagged O3 simulation README $ROOT/TAGGED_O3/v2014-09 CURRENTLY USED
O3 for offline simulations README $ROOT/O3/v2014-09 CURRENTLY USED
OH for offline simulations README1
README2
$ROOT/OH/v2014-09/v5-07-08
$ROOT/OH/v2014-09/v7-02-03.GMI
CURRENTLY USED
H2O2 for offline simulations README $ROOT/OXIDANTS/v2014-07 CURRENTLY USED
Oceanic Chlorophyll-A for Hg simulations README $ROOT/CHLA/v2014-07 CURRENTLY USED
CH3I simulation README $ROOT/CH3I/v2014-07 OBSOLETE
  • This simulation is no longer used in GEOS-Chem (awaiting revival)

--Bob Y. 15:06, 17 March 2015 (EDT)

Non-emissions data

The following subdirectories of the HEMCO directory tree input data contains non-emissions data sets. Most of these are used as inputs to GEOS-Chem's chemistry modules (i.e. photolysis, stratospheric chemistry, etc.).

Inventory Data file info Path Status
GMI strat chem mechanism README $ROOT/GMI/v2015-02 CURRENTLY USED
  • Used to compute P & L of species in the stratosphere.
Stratospheric Bry from CCM README $ROOT/STRAT/v2015-01/Bry CURRENTLY USED
  • Required for the for the full-chemistry simulations.
Timezone offsets from UTC README $ROOT/TIMEZONES/v2015-02 CURRENTLY USED
  • Used by HEMCO to compute emissions that depend on local time rather than on UTC time.
TOMS/SBUV O3 columns README $ROOT/TOMS_SBUV/v2015-03 CURRENTLY USED
UV surface albedoes README $ROOT/UVALBEDO/v2015-03 CURRENTLY USED

--Bob Y. 15:07, 17 March 2015 (EDT)

Other inputs for HEMCO

The following subdirectories of the HEMCO directory tree input data for various HEMCO functions. These include regional masks, emission scale factors, and grid information.

Inventory Data file info Path Status
Annual scale factors README $ROOT/AnnualScalar/v2014-07 CURRENTLY USED
  • Used to scale NEI2005 and other emissions to years other than the inventory base year.
Mask files for regional emissions README $ROOT/MASKS/v2014-07 CURRENTLY USED
  • Masks define the geographical regions where the various regional emissions (e.g. BRAVO, CAC, NEI) are to be applied.
MAP_A2A regridding data README $ROOT/MAP_A2A/v2014-07 CURRENTLY USED
Weekly scale factors README $ROOT/WEEKSCALE/v2014-07 OBSOLETE
  • Not currently in use

--Bob Y. 15:08, 17 March 2015 (EDT)

Downloading the HEMCO data directories

The GEOS-Chem Support Team has created a package called hemco_data_download. With this package, you can download the various emissions inventories and related data files for HEMCO to your own disk server. Furthermore, you can specify which data directories that you would like to download (as well as those you would like to ignore) via a configuration file.

Obtaining the hemco_data_download package

To obtain the hemco_data_download package, use Git to clone this repository

git clone https://github.com/GCST/hemco_data_download.git

This will create a directory named hemco_data_download, in which you should see the following files:

README
File with an overall description of the directory contents
hemcoDataDownload.pl
Perl script to download HEMCO data directories
hemcoDataDownload.rc
Configuration file for the hemcoDataDownload.pl script. In this file you can specify which HEMCO data directories you would like to download and which you would like to omit.
forTesting.rc
A configuration file that you can use for testing or debugging. This will tell hemcoDataDownload.pl only to download a couple of emissions inventories with files that do not take up much disk space.

--Bob Y. 13:58, 12 February 2015 (EST)

Setting up the configuration file

The configuration files (i.e. hemcoDataDownload.rc and forTesting.rc) are pretty much self-explanatory.

At the top of the configuration file you will see this section:

###############################################################################
#                                                                             #
#  Specify the remote and local HEMCO data paths, plus other options.         #
#                                                                             #
###############################################################################

Remote HEMCO data path | ftp://ftp.as.harvard.edu/gcgrid/data/ExtData/HEMCO
Your HEMCO data path   | /as/scratch/bmy/HEMCO
Verbose output         | yes
Dryrun only?           | no
  1. Remote HEMCO data path is the location on the FTP server from which you are going to download the data. This can be from either Harvard or from Dalhousie. (For now we will use the Harvard server). You can edit this accordingly.
  2. Your HEMCO data path specifies the root-level directory for HEMCO data on your own disk space. If you are not sure where to place this, then ask your sysadmin.
  3. Verbose output lets you specify if you want to print out extra output during the download process. This can be set to either "yes" or "no".
  4. Dryrun only flag allows you to print out the data download commands without actually downloading the data. This is useful for debugging.

In the next section you specify all of the HEMCO inventories that you want to download. You will see this header:

###############################################################################
#                                                                             #
#  THE FOLLOWING DATA DIRECTORIES WILL BE DOWNLOADED.                         #
#                                                                             #
#  These data directories comprise the recommended emissions configuration    #
#  for typical GEOS-Chem full-chemistry and specialty simulations.            #
#                                                                             #
#  NOTE: In most cases, you only have to specify the top-level folder.        #
#  All subfolders will be downloaded automatically.                           #
#                                                                             #
###############################################################################

#=============================+================================================
# AEROSOLS                    | Directory paths
#=============================+================================================
AEROCOM volcano emissions     | $ROOT/VOLCANO/v2014-10
Bond et al BC/OC              | $ROOT/BCOC_BOND/v2014-07
Cooke et al BC/OC             | $ROOT/BCOC_COOKE/v2014-07
Secondary organic aerosols    | $ROOT/SOA/v2014-07
... etc ...

Each line specifies the name of a HEMCO emissions inventory and the data path where it can be found on disk, relative to the root data path. NOTE: The script will replace the $ROOT token with the value you gave to the "HEMCO remote data path" above. (Lines starting with the comment character # will be ignored.)

Any inventory found in this section will be downloaded. To prevent an inventory from being downloaded you can either comment it out (i.e. place a # in the first column) or move the inventory to the next section.

The final section specifies HEMCO emission inventories that you do not wish to download. The section looks like this:

###############################################################################
#                                                                             #
#  THE FOLLOWING DATA DIRECTORIES WILL NOT BE DOWNLOADED.                     #
#                                                                             #
#  These data directories contain are optional emissions inventories that     #
#  are not used in typical GEOS-Chem simulations.  If you wish to download    #
#  any of these inventories, simply move the corresponding entry for each     #
#  inventory to the previous section.                                         #
#                                                                             #
###############################################################################

CH3I simulation (obsolete)    | $ROOT/CH3I/v2014-07
Chlorophyll A                 | $ROOT/CHLA/v2014-07
... etc ...

--Bob Y. 13:55, 12 February 2015 (EST)

Downloading the data

Once you have set up your configuration file, you can run the hemcoDataDownload.pl script to start downloading the HEMCO data to your local server. To run the script you can type:

hemcoDataDownload.pl

If you do not specify a configuration file name, then the hemcoDataDownload.pl scriptwill read the default configuration file hemcoDataDownload.rc configuration file. If you wish to specify a different configuration file name, simply pass that as an argument to the script, e.g.

hemcoDataDownload.pl myNewConfigFile.rc

Before you start downloading GB's of data, we recommend that you run a short test to make sure that the data directories are being copied to the proper locations on your disk server. For this purpose, we have provided a configuration file named forTesting.rc. Typing

hemcoDataDownload.pl forTesting.pl

will only download a couple of data inventories that do not take up much disk space. This allows you to ensure that the data transfer is sucessful without making you wait a long time.

--Bob Y. 13:57, 12 February 2015 (EST)

Submitting new data for use with HEMCO

If you have a new emissions inventory or other atmospheric data set that you would like to add to the HEMCO data directories, then please contact the GEOS-Chem Support Team. Data files must be in COARDS-compliant or CF-compliant netCDF format. Please see our Preparing data files for use with HEMCO wiki page for more information.

--Bob Y. 11:49, 5 March 2015 (EST)