HEMCO data directories: Difference between revisions

From Geos-chem
Jump to navigation Jump to search
Line 489: Line 489:
--[[User:Bmy|Bob Y.]] 13:41, 4 March 2015 (EST)
--[[User:Bmy|Bob Y.]] 13:41, 4 March 2015 (EST)


=== Other inputs for HEMCO ===
=== Scale factors and masks ===


The following subdirectories of the HEMCO directory tree input data for various HEMCO functions.  These include regional masks, timezones, and emission scale factors.
The following subdirectories of the HEMCO directory tree input data for various HEMCO functions.  These include regional masks, timezones, and emission scale factors.

Revision as of 18:45, 4 March 2015

On this page we describe the directory tree from which the HEMCO emissions component can read emissions inventories and other atmospheric data sets.

Overview

The HEMCO emissions component can read several types of emission inventories, as well as other types atmospheric data sets, such as production and loss rates, or concentration data. We have collated all of this data into a comprehensive directory tree structure. Each folder of the HEMCO data directory tree represents a particular emissions inventory or other data set.

At present, the HEMCO data directory tree resides on the disk servers at Harvard University (and soon at Dalhousie University). We have created a package that will let you download this directory tree to your local disk storage space. For more information, please see the Downloading the HEMCO data directories section below.

--Bob Y. 10:22, 13 February 2015 (EST)

HEMCO data directory structure

The sections below describe each of the data sets contained in the HEMCO directory tree, grouped by type. Each data set is contained in a subdirectory of the HEMCO root directory, which is specified by the $ROOT token. (For example, on the Harvard disk server, $ROOT points to the directory /mnt/gcgrid/data/ExtData/HEMCO/.)

--Bob Y. 10:49, 13 February 2015 (EST)

Aerosol emissions

The following subdirectories of the HEMCO directory tree contain aerosol emission inventories.

Inventory Data file info Path Status
AEROCOM volcanic emissions README $ROOT/VOLCANO/v2014-10 CURRENTLY USED
Tami Bond et al (2007) EC and OC emissions README $ROOT/BCOC_BOND/v2014-07 CURRENTLY USED
Cooke et al BC and OC emissions README $ROOT/BCOC_COOKE/v2014-07 OPTIONAL
  • Can be used to replace Bond et al (2007).
Secondary organic aerosols README $ROOT/HEMCO/SOA/2014-07 CURRENTLY USED

--Bob Y. 15:35, 12 February 2015 (EST)

Anthropogenic and biofuel emissions

The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic and biofuel emissions.

Inventory Data file info Path Status
GEIA global anthropogenic README $ROOT/GEIA/v2014-07 OBSOLETE
  • Slated for replacement by newer inventories
GEIA NH3 (anthro, biofuel, natural source) README $ROOT/NH3/v2014-07 CURRENTLY USED
EDGAR v3 global anthropogenic README $ROOT/EDGAR/v2014-07 CURRENTLY USED
EDGAR v4.2 global anthropogenic README $ROOT/EDGARv42/v2015-02/CO
$ROOT/EDGARv42/v2015-02/NH3
$ROOT/EDGARv42/v2015-02/NO
$ROOT/EDGARv42/v2015-02/SO2
$ROOT/EDGARv42/v2015-02/VOCv2
PENDING BENCHMARK APPROVAL
HTAP global anthro+biofuel TBD TBD TO BE ADDED SOON
RETRO VOC emissions README $ROOT/RETRO/v2014-07/orig_kgC CURRENTLY USED
Yevich & Logan biofuels README $ROOT/BIOFUEL/v2014-07 CURRENTLY USED
BRAVO regional anthropogenic README $ROOT/BRAVO/v2014-07 CURRENTLY USED
CAC regional anthropogenic README $ROOT/CAC/v2014-07 CURRENTLY USED
EMEP regional anthropogenic README $ROOT/EMEP/v2015-01 CURRENTLY USED
NEI2005 regional anthro/biofuel README $ROOT/NEI2005/v2014-09 CURRENTLY USED
  • NEI2005 may soon be superseded by NEI2008 and/or NEI2011.
NEI2008 regional anthro/biofuel README $ROOT/NEI2008/v2015-02 TO BE ADDED SOON
NEI2011 regional anthro/biofuel README $ROOT/NEI2011/v2015-02 TO BE ADDED SOON
NEI2011 agricultural emissions only README $ROOT/NEI2011_ag_only/v2015-02 TO BE ADDED SOON
NEI/VISTAS scale factors README1
README2
$ROOT/VISTAS/v2014-07/VISTAS_NOX
$ROOT/VISTAS/v2014-07/VISTAS_WEWD
CURRENTLY USED
Streets regional anthro README1
README2
$ROOT/STREETS/v2014-07 CURRENTLY USED
MIX regional anthro README TBD TO BE ADDED SOON
  • Slated as a replacement for Streets et al emissions.
MASAGE agricultural NH3 README $ROOT/MASAGE_NH3/v2015-02 CURRENTLY USED
Yaping Xiao et al C2H6 and C3H8 anthropogenic README $ROOT/XIAO/v2014-09 CURRENTLY USED

--Bob Y. 16:23, 13 February 2015 (EST)

Anthropogenic aircraft and ship emissions

The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic aircraft and ship emissions.

Inventory Data file info Path Status
AEIC aircraft README $ROOT/AEIC/v2014-10 CURRENTLY USED
ARCTAS ship emissions (SO2) README $ROOT/ARCTAS_SHIP/v2014-07 CURRENTLY USED
  • Default inventory for ship SO2
  • Overwritten by EMEP over Europe
Corbett et al ship emissions (SO2) README $ROOT/VOLCANO/v2014-10 OPTIONAL
  • Can be used to overwrite ARCTAS SO2
ICOADS ship (CO) README $ROOT/ICOADS_SHIP/v2014-07 CURRENTLY USED
  • Default inventory for ship CO
  • Overwritten by EMEP over Europe
EDGAR v3 ship (CO) README $ROOT/EDGAR/v2014-07 OPTIONAL
  • Can be used instead of ICOADS
  • Stored together w/ other EDGAR data files
  • Soon to be replaced by EDGAR v4.2
EMEP ship (CO, SO2) README $ROOT/EMEP/v2015-01 CURRENTLY USED
  • Overwrites CO, SO2 over Europe
  • Stored together w/ other EMEP data files

--Bob Y. 11:03, 13 February 2015 (EST)

Biomass burning emissions

The following subdirectories of the HEMCO directory tree contain inventories of biomass burning emissions.

Inventory Data file info Path Status
GFED4 biomass TBD TBD TO BE ADDED SOON
  • You will be able to select either GFED3 or GFED4.
GFED3 biomass README $ROOT/GFED3/v2014-10 CURRENTLY USED
  • Default biomass burning inventory.
FINN biomass README $ROOT/FINN/v2015-02 OPTIONAL
  • You may choose to replace GFED3 with FINN for research purposes.
  • If you do not need FINN, you may choose not to download it, in order to save disk space.
QFED biomass README $ROOT/QFED/v2014-09 OPTIONAL
  • You may choose to replace GFED3 with QFED for research purposes.
  • If you do not need QFED, you may choose not to download it, in order to save disk space.
GFED2 biomass README $ROOT/GFED2/v2014-07 OBSOLETE
  • Superseded by GFED3.
Duncan et al biomass README $ROOT/BIOBURN/v2014-07 OBSOLETE
  • Superseded by GFED3.

--Bob Y. 13:40, 4 March 2015 (EST)

Bromine emissions

The following subdirectories of the HEMCO directory tree contain inventories of bromine emissions.

NOTE: Stratospheric bromine concentrations are included in our Non-emissions data section.

Inventory Data file info Path Status
Liang et al VSL emissions README $ROOT/STRAT/v2015-01/Bry CURRENTLY USED

--Bob Y. 13:42, 4 March 2015 (EST)

Emissions implemented as HEMCO extensions

The following subdirectories of the HEMCO directory tree contain input data used by various HEMCO extensions. These HEMCO extensions compute emissions for quantities that depend on meteorological variables (e.g. emissions from lighting, biogenic processes, etc.).

NOTE: GFED3 and FINN are implemented as HEMCO extensions, but we have listed them with the other biomass burning inventories, for clarity.

Inventory Data file info! Path Status
DEAD dust model README $ROOT/DUST_DEAD/2014-07 CURRENTLY USED
  • Default dust mobilization scheme
GINOUX dust model README $ROOT/DUST_GINOUX/2014-07 OPTIONAL
  • You can use this in place of the DEAD model.
MEGAN biogenic emissions README $ROOT/MEGAN/v2015-02 CURRENTLY USED
NO from lightning README $ROOT/LIGHTNOX/v2014-07 CURRENTLY USED
NO from soils/fertilizers README $ROOT/SOILNOX/v2014-07 CURRENTLY USED
PARANOX ship plume model README $ROOT/PARANOX/v2015-02 CURRENTLY USED

--Bob Y. 13:09, 18 February 2015 (EST)

Future and historical emissions

The following subdirectories of the HEMCO directory tree contain historical and future emissions inventories.

Inventory Data file info Path Status
RCP future scenarios README $ROOT/RCP/v2015-02 OPTIONAL
  • RCP emissions are not part of the standard GEOS-Chem emissions configuration but may be used as a research option.

--Bob Y. 12:51, 27 February 2015 (EST)

GEOS-Chem specialty simulation data

The following subdirectories of the HEMCO directory tree input data (emissions, oxidants, etc.) for use with the GEOS-Chem specialty simulations. If you do not regularly use these simulations, you may choose not to download these data directories.

Inventory Data file info Path Status
Aerosol-only simulation README $ROOT/OFFLINE_AEROSOL/v2014-09 CURRENTLY USED
CH4 simulation README $ROOT/CH4/v2014-09 CURRENTLY USED
CO2 simulation README1
README2
README3
README4
README5
$ROOT/CO2/v2014-09
$ROOT/CO2/v2014-09/BBIO/
$ROOT/CO2/v2014-09/CHEM/
$ROOT/CO2/v2014-09/FOSSIL/
$ROOT/CO2/v2014-09/OCEAN/
CURRENTLY USED
Mercury simulation README1
README2
README3
README4
README5
$ROOT/MERCURY/v2014-09/ARTISANAL
$ROOT/MERCURY/v2014-09/NATURAL
$ROOT/MERCURY/v2014-09/NEI2005
$ROOT/MERCURY/v2014-09/SOIL
$ROOT/MERCURY/v2014-09/STREETS
CURRENTLY USED
POPs simulation README $ROOT/POPs/v2014-09 CURRENTLY USED
Tagged CO simulation README $ROOT/TAGGED_CO/v2014-08 CURRENTLY USED
Tagged O3 simulation README $ROOT/TAGGED_O3/v2014-09 CURRENTLY USED
O3 for offline simulations README $ROOT/O3/v2014-09 CURRENTLY USED
OH for offline simulations README1
README2
$ROOT/OH/v2014-09/v5-07-08
$ROOT/OH/v2014-09/v7-02-03.GMI
CURRENTLY USED
H2O2 for offline simulations README $ROOT/OXIDANTS/v2014-07 CURRENTLY USED
CH3I simulation README $ROOT/CH3I/v2014-07 OBSOLETE
  • This simulation is no longer used in GEOS-Chem (awaiting revival)

--Bob Y. 10:37, 19 February 2015 (EST)

Seawater concentrations

The following subdirectories of the HEMCO directory tree input data contains seawater concentrations used by the acetone ocean loss and sulfate chemistry modules.

Inventory Data file info Path Status
Acetone seawater README $ROOT/ACET/v2014-07 CURRENTLY USED
DMS seawater README $ROOT/DMS/v2014-07 CURRENTLY USED

--Bob Y. 11:22, 13 February 2015 (EST)

Non-emissions data

The following subdirectories of the HEMCO directory tree input data contains non-emissions data sets. Most of these are used as inputs to GEOS-Chem's chemistry modules (i.e. photolysis, stratospheric chemistry, etc.).

Inventory Data file info Path Status
Timezone offsets from UTC README $ROOT/TIMEZONES/v2015-02 CURRENTLY USED
Stratospheric Bry from CCM README $ROOT/STRAT/v2015-01/Bry CURRENTLY USED
GMI strat chem mechanism README $ROOT/GMI/v2015-02 CURRENTLY USED
UV surface albedoes README $ROOT/UVALBEDO/v2015-03 CURRENTLY USED

--Bob Y. 13:41, 4 March 2015 (EST)

Scale factors and masks

The following subdirectories of the HEMCO directory tree input data for various HEMCO functions. These include regional masks, timezones, and emission scale factors.

Inventory Data file info Path Status
Annual scale factors README $ROOT/AnnualScalar/v2014-07 CURRENTLY USED
Mask files README $ROOT/MASKS/v2014-07 CURRENTLY USED
MAP_A2A regridding data README $ROOT/MAP_A2A/v2014-07 CURRENTLY USED
Weekly scale factors README $ROOT/WEEKSCALE/v2014-07 OBSOLETE
  • Not currently in use

--Bob Y. 13:29, 4 March 2015 (EST)

Downloading the HEMCO data directories

The GEOS-Chem Support Team has created a package called hemco_data_download. With this package, you can download the various emissions inventories and related data files for HEMCO to your own disk server. Furthermore, you can specify which data directories that you would like to download (as well as those you would like to ignore) via a configuration file.

Obtaining the hemco_data_download package

To obtain the hemco_data_download package, use Git to clone this repository

git clone https://github.com/GCST/hemco_data_download.git

This will create a directory named hemco_data_download, in which you should see the following files:

README
File with an overall description of the directory contents
hemcoDataDownload.pl
Perl script to download HEMCO data directories
hemcoDataDownload.rc
Configuration file for the hemcoDataDownload.pl script. In this file you can specify which HEMCO data directories you would like to download and which you would like to omit.
forTesting.rc
A configuration file that you can use for testing or debugging. This will tell hemcoDataDownload.pl only to download a couple of emissions inventories with files that do not take up much disk space.

--Bob Y. 13:58, 12 February 2015 (EST)

Setting up the configuration file

The configuration files (i.e. hemcoDataDownload.rc and forTesting.rc) are pretty much self-explanatory.

At the top of the configuration file you will see this section:

###############################################################################
#                                                                             #
#  Specify the remote and local HEMCO data paths, plus other options.         #
#                                                                             #
###############################################################################

Remote HEMCO data path | ftp://ftp.as.harvard.edu/gcgrid/data/ExtData/HEMCO
Your HEMCO data path   | /as/scratch/bmy/HEMCO
Verbose output         | yes
Dryrun only?           | no
  1. Remote HEMCO data path is the location on the FTP server from which you are going to download the data. This can be from either Harvard or from Dalhousie. (For now we will use the Harvard server). You can edit this accordingly.
  2. Your HEMCO data path specifies the root-level directory for HEMCO data on your own disk space. If you are not sure where to place this, then ask your sysadmin.
  3. Verbose output lets you specify if you want to print out extra output during the download process. This can be set to either "yes" or "no".
  4. Dryrun only flag allows you to print out the data download commands without actually downloading the data. This is useful for debugging.

In the next section you specify all of the HEMCO inventories that you want to download. You will see this header:

###############################################################################
#                                                                             #
#  THE FOLLOWING DATA DIRECTORIES WILL BE DOWNLOADED.                         #
#                                                                             #
#  These data directories comprise the recommended emissions configuration    #
#  for typical GEOS-Chem full-chemistry and specialty simulations.            #
#                                                                             #
#  NOTE: In most cases, you only have to specify the top-level folder.        #
#  All subfolders will be downloaded automatically.                           #
#                                                                             #
###############################################################################

#=============================+================================================
# AEROSOLS                    | Directory paths
#=============================+================================================
AEROCOM volcano emissions     | $ROOT/VOLCANO/v2014-10
Bond et al BC/OC              | $ROOT/BCOC_BOND/v2014-07
Cooke et al BC/OC             | $ROOT/BCOC_COOKE/v2014-07
Secondary organic aerosols    | $ROOT/SOA/v2014-07
... etc ...

Each line specifies the name of a HEMCO emissions inventory and the data path where it can be found on disk, relative to the root data path. NOTE: The script will replace the $ROOT token with the value you gave to the "HEMCO remote data path" above. (Lines starting with the comment character # will be ignored.)

Any inventory found in this section will be downloaded. To prevent an inventory from being downloaded you can either comment it out (i.e. place a # in the first column) or move the inventory to the next section.

The final section specifies HEMCO emission inventories that you do not wish to download. The section looks like this:

###############################################################################
#                                                                             #
#  THE FOLLOWING DATA DIRECTORIES WILL NOT BE DOWNLOADED.                     #
#                                                                             #
#  These data directories contain are optional emissions inventories that     #
#  are not used in typical GEOS-Chem simulations.  If you wish to download    #
#  any of these inventories, simply move the corresponding entry for each     #
#  inventory to the previous section.                                         #
#                                                                             #
###############################################################################

CH3I simulation (obsolete)    | $ROOT/CH3I/v2014-07
Chlorophyll A                 | $ROOT/CHLA/v2014-07
... etc ...

--Bob Y. 13:55, 12 February 2015 (EST)

Downloading the data

Once you have set up your configuration file, you can run the hemcoDataDownload.pl script to start downloading the HEMCO data to your local server. To run the script you can type:

hemcoDataDownload.pl

If you do not specify a configuration file name, then the hemcoDataDownload.pl scriptwill read the default configuration file hemcoDataDownload.rc configuration file. If you wish to specify a different configuration file name, simply pass that as an argument to the script, e.g.

hemcoDataDownload.pl myNewConfigFile.rc

Before you start downloading GB's of data, we recommend that you run a short test to make sure that the data directories are being copied to the proper locations on your disk server. For this purpose, we have provided a configuration file named forTesting.rc. Typing

hemcoDataDownload.pl forTesting.pl

will only download a couple of data inventories that do not take up much disk space. This allows you to ensure that the data transfer is sucessful without making you wait a long time.

--Bob Y. 13:57, 12 February 2015 (EST)