HEMCO data directories
On this page we describe the directory tree from which the HEMCO emissions component can read emissions inventories and other atmospheric data sets.
Contents
- 1 Overview
- 2 Default GEOS-Chem emissions configurations
- 3 HEMCO data directory structure
- 3.1 Aerosol emissions
- 3.2 Anthropogenic and biofuel emissions
- 3.3 Anthropogenic aircraft and ship emissions
- 3.4 Biomass burning emissions
- 3.5 Offline grid-independent emissions
- 3.6 Emissions implemented as HEMCO extensions
- 3.7 Future and historical emissions
- 3.8 GEOS-Chem specialty simulation data
- 3.9 Halogen emissions
- 3.10 Natural emissions data
- 3.11 Non-emissions data
- 3.12 Other inputs for HEMCO
- 4 Downloading the HEMCO data directories
- 5 Submitting new data for use with HEMCO
Overview
The HEMCO emissions component can read several types of emission inventories, as well as other types atmospheric data sets, such as production and loss rates, or concentration data. We have collated all of this data into a comprehensive directory tree structure. Each folder of the HEMCO data directory tree represents a particular emissions inventory or other data set.
At present, the HEMCO data directory tree resides on the disk servers at Harvard University (and soon at Dalhousie University). We have created a package that will let you download this directory tree to your local disk storage space. For more information, please see the Downloading the HEMCO data directories section below.
Please see our list of recommended default emission inventories, which has been compiled by the Emissions and Deposition Working Group.
--Bob Yantosca (talk) 22:27, 7 September 2017 (UTC)
Default GEOS-Chem emissions configurations
Christoph Keller and the GEOS-Chem Support Team have prepared documents describing the standard GEOS-Chem emissions configurations for recent GEOS-Chem versions. These emissions configurations are specified in the HEMCO configuration file (aka HEMCO_Config.rc), which is read by GEOS-Chem at startup.
Configuration | Released | Notes |
---|---|---|
Default emissions configuration for GEOS-Chem 12.6.0 | 21 Oct 2019 | |
Default emissions configuration for GEOS-Chem 12.3.0 | 01 Apr 2019 | Also applies to GEOS-Chem 12.3.1 |
Default emissions configuration for GEOS-Chem 12.2.0 | 19 Feb 2019 | Also applies to GEOS-Chem 12.2.1 |
Default emissions configuration for GEOS-Chem 12.1.0 | 26 Nov 2018 | Also applies to GEOS-Chem 12.1.1 |
Default emissions configuration for GEOS-Chem 12.0.0 | 10 Aug 2018 | Also applies to GEOS-Chem 12.0.1, 12.0.2, and 12.0.3 |
Information about each individual inventory or data source may be found in the following sections.
HEMCO data directory structure
The sections below describe each of the data sets contained in the HEMCO directory tree, grouped by type.
In each of the tables below:
- The Inventory column contains a link that describes each data set in more detail. Most of these links point to existing pages on the GEOS-Chem wiki.
- The Data file info column points to the README files that are stored with each data set. Each README provides a list of the files contained within a given folder, information about the data contained in each file, and (often) a description of how the files were created.
- The Path column shows the location of each data set, with respect to $ROOT, the top-level HEMCO directory. For example, on the Harvard disk server, $ROOT points to the directory /mnt/gcgrid/data/ExtData/HEMCO/.
- The Status column describes the current status of each data set:
CURRENTLY USED Denotes that the data set is part of the standard emissions configuration for GEOS-Chem, that is: - It is used in the GEOS-Chem 1-month and 1-year full-chemistry benchmark simulations, or
- It is used in one or more of the GEOS-Chem specialty simulations.
OPTIONAL Denotes that the data set is not part of the the standard GEOS-Chem emissions configuration. - But you can still use the data set in your research applications if you wish.
TO BE ADDED SOON Denotes that the data set is not yet ready for use with HEMCO, but will be added soon. OBSOLETE Denotes that the data set has been removed from the standard GEOS-Chem emissions configuration. - In most cases, OBSOLETE means that a newer version of the data is available and should be used instead.
NOT USED Denotes that the data set is not used in the standard GEOS-Chem emissions configuration, even if it is current. - For example, another data source is being used in place of this data set for a particular reason.
- If use GEOS-Chem, then we recommend that you download the data sets that are listed as CURRENTLY USED, as these form the standard GEOS-Chem emissions configuration. You are free, however, to use any of the data sets listed below. Your choice of data sets will depend on your particular research needs.
--Bob Y. 10:53, 18 March 2015 (EDT)
Aerosol emissions
The following subdirectories of the HEMCO directory tree contain aerosol emission inventories.
NOTE: AeroCom volcanic SO2 emissions are listed under the Emissions implemented as HEMCO extensions.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Secondary organic aerosols | README | $ROOT/SOA/2014-07 | 3.0 MB | CURRENTLY USED
|
Optional inventories (turned OFF by default in the standard emissions configuration) | ||||
Spatially varying OM/OC ratios for SOA species | README | $ROOT/OMOC/v2018-01 | 8.6 MB | OPTIONAL
|
Obsolete inventories (superseded by newer developments) | ||||
AEROCOM volcanic emissions | README | $ROOT/VOLCANO/v2015-02 | 1.6 GB | OBSOLETE
|
Tami Bond et al (2007) BC and OC emissions | README | $ROOT/BCOC_BOND/v2014-07 | 3.0 MB | OBSOLETE
|
Cooke et al BC and OC emissions | README | $ROOT/BCOC_COOKE/v2014-07 | 744 KB | OBSOLETE
|
Anthropogenic and biofuel emissions
The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic and biofuel emissions.
Please also see list of species included in each of these global inventories and regional inventories.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
APEI Canadian emissions (1989-2014) |
README | $ROOT/APEI/v2016-11/ | 68 MB | CURRENTLY USED
|
NEI2011 hourly US emissions | README | $ROOT/NEI2011/v2015-03 | 248 GB | CURRENTLY USED
|
NEI2011 monthly mean US emissions | README | $ROOT/NEI2011/v2017-02-MM | 248 GB | CURRENTLY USED
|
MIX Asian anthropogenic emissions | README | $ROOT/MIX/v2015-03 | 623 MB | CURRENTLY USED
|
DICE-Africa anthropogenic emissions | README | $ROOT/DICE_Africa/v2016-10 | 202 MB | CURRENTLY USED
|
EDGAR v4.3 anthropogenic emissions | README | $ROOT/EDGARv43/v2016-11 | 6.2 GB | CURRENTLY USED
|
CEDS anthropogenic emissions | README | $ROOT/CEDS/v2018-04 | 140 GB | CURRENTLY USED
|
POET (anthropogenic EOH inventory) | README | $ROOT/POET/v2017-03/ | 38 MB | CURRENTLY USED
|
C2H6 global emissions from Tzompa et al. (2017) | README | $ROOT/C2H6_2010/v2019-06 | 536 KB | CURRENTLY USED
|
C3H8 global emissions from Xiao et al. (2008) | README | $ROOT/XIAO/v2014-09 | 308 KB | CURRENTLY USED
|
Yevich & Logan biofuels | README | $ROOT/BIOFUEL/v2014-07 | 1.1 MB | CURRENTLY USED
|
Anthropogenic PM2.5 dust source (AFCID) | README | $ROOT/AFCID/2018-04 | 4.4 MB | CURRENTLY USED
|
Optional inventories (turned OFF by default in the standard emissions configuration) | ||||
Trash burning emissions | README | HEMCO/TrashEmis/v2015-03/ | 86 MB | OPTIONAL
|
HTAP global anthro+biofuel | README | $ROOT/HTAP/v2015-03 | 4.5 GB | OPTIONAL
|
MASAGE agricultural NH3 | README | $ROOT/MASAGE_NH3/v2015-02 | 4.4 MB | OPTIONAL
|
NEI2011 agricultural emissions only | README | $ROOT/NEI2011_ag_only/v2015-03 | 648 MB | OPTIONAL |
Obsolete inventories (superseded by newer developments) | ||||
NEI/VISTAS scale factors | README1 README2 |
$ROOT/VISTAS/v2014-07 $ROOT/VISTAS/v2014-07/VISTAS_NOX $ROOT/VISTAS/v2014-07/VISTAS_WEWD |
46 MB | OBSOLETE
|
RETRO VOC emissions | README | $ROOT/RETRO/v2014-07/orig_kgC | 38 MB | OBSOLETE
|
Streets regional anthro | README1 README2 |
$ROOT/STREETS/v2014-07 | 1.7 MB | OBSOLETE
|
EMEP regional anthropogenic | README | $ROOT/EMEP/v2015-03 | 24 MB | OBSOLETE |
BRAVO regional anthropogenic | README | $ROOT/BRAVO/v2014-07 | 68 KB | OBSOLETE |
CAC regional anthropogenic | README | $ROOT/CAC/v2016-09 | 29 MB | OBSOLETE
|
NEI2008 regional anthro/biofuel | README | $ROOT/NEI2008/v2015-02 | 3.0 GB | OBSOLETE
|
EDGAR v4.2 global anthropogenic | README | $ROOT/EDGARv42/v2015-02 $ROOT/EDGARv42/v2015-02/CO $ROOT/EDGARv42/v2015-02/NH3 $ROOT/EDGARv42/v2015-02/NO $ROOT/EDGARv42/v2015-02/SO2 $ROOT/EDGARv42/v2015-02/VOCv2 |
4.5 GB | OBSOLETE
|
GEIA global anthropogenic | README | $ROOT/GEIA/v2014-07 | 4.8 MB | OBSOLETE
|
EDGAR v3 global anthropogenic | README | $ROOT/EDGAR/v2014-07 | 40 MB | OBSOLETE
|
Anthropogenic aircraft and ship emissions
The following subdirectories of the HEMCO directory tree contain inventories of anthropogenic aircraft and ship emissions.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
AEIC aircraft | README | $ROOT/AEIC/v2015-01 | 2.0 GB | CURRENTLY USED
|
CEDS ship | README | $ROOT/CEDS/v2018-04 | 140 GB | CURRENTLY USED
|
Optional inventories (turned OFF by default in the standard emissions configuration) | ||||
ARCTAS ship emissions (SO2) | README | $ROOT/ARCTAS_SHIP/v2014-07 | 508 KB | OPTIONAL
|
ICOADS ship (CO) | README | $ROOT/ICOADS_SHIP/v2014-07 | 4.2 MB | OPTIONAL
|
Corbett et al ship emissions (SO2) | README | $ROOT/CORBETT_SHIP/v2014-07 | 1.8 MB | OPTIONAL
|
HTAP ship (CO, NO, SO2) | README | $ROOT/HTAP/v2015-03 | 4.5 GB | OPTIONAL
|
NEI2011 ship (several species) | README | $ROOT/NEI2011/v2015-03 | 248 GB | OPTIONAL
|
Obsolete inventories (superseded by newer developments) | ||||
EMEP ship (CO, NO, SO2) | README | $ROOT/EMEP/v2015-03 | 24 MB | OBSOLETE
|
EDGAR v3 ship (CO) | README | $ROOT/EDGAR/v2014-07 | 40 MB | OBSOLETE |
EDGAR v4.2 ship | README | $ROOT/EDGARv42/v2015-02 | 4.5 GB | NOT USED
|
NOTES about ship emissions:
- CEDS ship emissions are the default ship inventory in GEOS-Chem 12.1.0 and later versions.
- Data files for CEDS ship emissions are stored with the other CEDS emission sector files in the HEMCO data path: $ROOT/CEDS/v2018-04/.
- If CEDS emissions are turned off above then:
- ARCTAS should be used over ICOADS, CORBETT, and HTAP for SO2
- ICOADS should be used for CO and NO.
- The NEI2011 ship inventory only goes a few km from land, so it isn’t a complete "ship inventory". This is turned off by default in the HEMCO configuration files that ship with GEOS-Chem.
Biomass burning emissions
The following subdirectories of the HEMCO directory tree contain inventories of biomass burning emissions.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
GFED4 biomass burning (v4.1) | README | $ROOT/GFED4/v2015-10 | 692 MB | CURRENTLY USED
|
Optional inventories (turned OFF by default in the standard emissions configuration) | ||||
GFAS biomass burning emissions | README | $ROOT/GFAS/v2018-09 | 45 GB | OPTIONAL
|
FINN biomass burning (v1.6) | README | $ROOT/FINN/v2018-04 | 1.1 GB | TO BE ADDED SOON
|
FINN biomass burning (v1.5) | README | $ROOT/FINN/v2015-02 | 689 MB | OPTIONAL
|
QFED biomass burning (v2.5r1) | README | $ROOT/QFED/v2018-07 | 57 GB | OPTIONAL
|
QFED biomass burning (v2.4r6) | README | $ROOT/QFED/v2014-09 | 57 GB | OPTIONAL
|
Obsolete inventories (superseded by newer developments) | ||||
GFED4 biomass burning (v4.0) | README | $ROOT/GFED4/v2015-06 | 115 MB | OBSOLETE
|
GFED3 biomass burning | README | $ROOT/GFED3/v2014-10 | 343 MB | OBSOLETE
|
GFED2 biomass burning | README | $ROOT/GFED2/v2014-07 | 196 KB | OBSOLETE
|
Duncan et al biomass burning | README | $ROOT/BIOBURN/v2014-07 | 925 MB | OBSOLETE
|
Offline grid-independent emissions
Offline grid-independent emissions were generated by running high-resolution (0.25°x0.3125° for GEOS-FP, 0.5°x0.625° for MERRA-2) simulations using the HEMCO standalone and online emissions from the HEMCO extensions.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Offline dust emissions | README | $ROOT/OFFLINE_DUST/v2019-01 | 18 GB | CURRENTLY USED
|
Offline biogenic VOC emissions | README | $ROOT/OFFLINE_BIOVOC/v2019-01 | 1006 GB | CURRENTLY USED
|
Offline sea salt emissions | README | $ROOT/OFFLINE_SEASALT/v2019-01 | 848 GB | CURRENTLY USED
|
Offline soil NOx emissions | README | $ROOT/OFFLINE_SOILNOX/v2019-01 | 46 GB | CURRENTLY USED
|
Emissions implemented as HEMCO extensions
The following subdirectories of the HEMCO directory tree contain input data used by various HEMCO extensions. These HEMCO extensions compute emissions for quantities that depend on meteorological variables (e.g. emissions from lighting, biogenic processes, etc.).
NOTE: GFED3, GFED4 and FINNv1 are implemented as HEMCO extensions, but we have listed them with the other biomass burning inventories, for clarity.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Acetone ocean exchange | README | $ROOT/ACET/v2014-07 | 52 KB | CURRENTLY USED |
ALD2 ocean exchange | README | $ROOT/ALD2/v2017-03 | 52 KB | CURRENTLY USED |
DMS ocean exchange | README | $ROOT/DMS/v2015-07 | 3.0 MB | CURRENTLY USED
|
NO from lightning | README | $ROOT/OFFLINE_LIGHTNING/v2019-01/ | 47 GB | CURRENTLY USED
|
PARANOX ship plume model | README | $ROOT/PARANOX/v2015-02 | 162 MB | CURRENTLY USED
|
AeroCom volcanic emissions | README | $ROOT/VOLCANO/v2019-08 | 660 MB | CURRENTLY USED
|
Optional inventories (turned OFF by default in the standard emissions configuration) | ||||
DEAD dust model | README | $ROOT/DUST_DEAD/2014-07 | 712 KB | OPTIONAL
|
MEGAN biogenic emissions | README | $ROOT/MEGAN/v2018-05 | 12 MB | OPTIONAL
|
NO from soils/fertilizers | README | $ROOT/SOILNOX/v2014-07 | 79 MB | OPTIONAL
|
GINOUX dust model | README | $ROOT/DUST_GINOUX/2014-07 | 36 KB | OPTIONAL
|
Obsolete inventories (superseded by newer developments) | ||||
NO from lightning | README | $ROOT/LIGHTNOX/v2017-09 | 8.5 MB | OBSOLETE
|
DMS ocean exchange | README | $ROOT/DMS/v2014-07 | 2.2 MB | OBSOLETE
|
MEGAN biogenic emissions | README | $ROOT/MEGAN/v2015-02 | 2.0 MB | OBSOLETE
|
Future and historical emissions
The following subdirectories of the HEMCO directory tree contain historical and future emissions inventories.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
RCP future scenarios | README | $ROOT/RCP/v2015-02 | 368 MB | OPTIONAL
|
GEOS-Chem specialty simulation data
The following subdirectories of the HEMCO directory tree input data (emissions, oxidants, etc.) for use with the GEOS-Chem specialty simulations. If you do not regularly use these simulations, you may choose not to download these data directories.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Aerosol-only simulation | README | $ROOT/OFFLINE_AEROSOL/v2014-09 | 165 MB | CURRENTLY USED |
CH4 simulation | README1 README2 |
$ROOT/CH4/v2017-10 $ROOT/CH4/v2014-09 |
274 MB | CURRENTLY USED |
CO2 simulation | README1 README2 README3 README4 README5 README6 |
$ROOT/CO2/v2015-04 $ROOT/CO2/v2015-04/BIO/ $ROOT/CO2/v2015-04/BIOFUEL/ $ROOT/CO2/v2015-04/CHEM/ $ROOT/CO2/v2015-04/FOSSIL/ $ROOT/CO2/v2015-04/OCEAN/ |
1033 MB | CURRENTLY USED |
Mercury simulation | README1 README2 README3 README4 README5 README6 README7 README8 README9 README10 |
$ROOT/MERCURY/v2014-09 $ROOT/MERCURY/v2014-09/ARTISANAL $ROOT/MERCURY/v2014-09/BrOx $ROOT/MERCURY/v2014-09/Hg2_PARTITION $ROOT/MERCURY/v2014-09/JVALUES $ROOT/MERCURY/v2014-09/NATURAL $ROOT/MERCURY/v2014-09/NEI2005 $ROOT/MERCURY/v2014-09/OCEAN $ROOT/MERCURY/v2014-09/SOIL $ROOT/MERCURY/v2014-09/STREETS |
342 MB | CURRENTLY USED |
POPs simulation | README | $$ROOT/POPs/v2015-08 | 809 MB | CURRENTLY USED
|
RRTMG radiative transfer model | README | $ROOT/RRTMG/v2018-11 | 19 MB | CURRENTLY USED
|
Tagged CO simulation | README | $ROOT/TAGGED_CO/v2017-04 | 260 KB | CURRENTLY USED
|
Tagged O3 simulation | README | $ROOT/TAGGED_O3/v2014-09 | 372 MB | CURRENTLY USED |
O3 for offline simulations | README | $ROOT/O3/v2014-09 | 130 MB | CURRENTLY USED |
OH for offline simulations | README1 README2 |
$ROOT/OH/v2014-09 $ROOT/OH/v2014-09/v5-07-08 $ROOT/OH/v2014-09/v7-02-03.GMI |
148 MB | CURRENTLY USED |
H2O2 for offline simulations | README | $ROOT/OXIDANTS/v2014-07 | 32 MB | CURRENTLY USED |
Oceanic Chlorophyll-A for Hg simulations | README | $ROOT/CHLA/v2014-07 | 2.0 MB | CURRENTLY USED |
Obsolete inventories (superseded by newer developments) | ||||
CH3I simulation | README | $ROOT/CH3I/v2014-07 | 280 KB | OBSOLETE
|
POPs simulation | README | $ROOT/POPs/v2014-09 | 622 MB | OBSOLETE
|
Tagged CO simulation | README | $ROOT/TAGGED_CO/v2014-08q | 260 KB | OBSOLETE
|
--Bob Yantosca (talk) 19:01, 2 July 2018 (UTC)
Halogen emissions
The following subdirectories of the HEMCO directory tree contain inventories of bromine and iodine emissions.
NOTE: See our Non-emissions data section for information about stratospheric bromine concentration data.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Liang et al VSL emissions | README | $ROOT/BROMINE/v2015-02 | 156 KB | CURRENTLY USED |
Ordonez et al Iodine emissions | README | $ROOT/IODINE/v2017-09 | 4.4 MB | CURRENTLY USED
|
Natural emissions data
The following subdirectories of the HEMCO directory tree contain inventories natural emissions.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
GEIA NH3 (natural source) | README | $ROOT/NH3/v2014-07 | 8.1 MB | CURRENTLY USED |
NH3 emissions from arctic sea birds | README | HEMCO/NH3/v2018-04/ | 104 MB | CURRENTLY USED
|
ALD2 and EOH emissions from decaying plants | README | $ROOT/ALD2/v2017-03 | 8.1 MB | CURRENTLY USED |
Non-emissions data
The following subdirectories of the HEMCO directory tree input data contains non-emissions data sets. Most of these are used as inputs to GEOS-Chem's chemistry modules (i.e. photolysis, stratospheric chemistry, etc.).
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Timezone offsets from UTC | README | $ROOT/TIMEZONES/v2015-02 | 264 KB | CURRENTLY USED
|
UV surface albedoes | README | $ROOT/UVALBEDO/v2019-06 | 476 KB | CURRENTLY USED
|
TOMS/SBUV O3 columns | README | $ROOT/TOMS_SBUV/v2019-06 | 21 MB | CURRENTLY USED |
Stratospheric Bry from CCM | README | $ROOT/STRAT/v2015-01/Bry | 385 MB | CURRENTLY USED
|
GMI strat chem mechanism | README | $ROOT/GMI/v2015-02 | 16 GB | CURRENTLY USED
|
UCX chemistry mechanism | README | $ROOT/UCX/v2018-02 | 12 GB | CURRENTLY USED
|
MODIS XLAI (leaf area index per land type) |
README | $ROOT/MODIS_XLAI/v2017-07 | 203 MB | TCURRENTLY USED
|
Olson land map (masks for each land type) |
README | $ROOT/OLSON_MAP/v2019-02 | 1.9 MB | TCURRENTLY USED
|
Yuan-processed MODIS XLAI (leaf area index per land type) |
README | $ROOT/Yuan_XLAI/v2019-03 | 203 MB | TCURRENTLY USED
|
Obsolete inventories (superseded by newer developments) | ||||
TOMS/SBUV O3 columns | README | $ROOT/TOMS_SBUV/v2015-03 | 17 MB | OBSOLETE
|
Other inputs for HEMCO
The following subdirectories of the HEMCO directory tree input data for various HEMCO functions. These include regional masks, emission scale factors, and grid information.
Inventory | Data file info | Path | Directory size | Status |
---|---|---|---|---|
Active inventories (turned ON by default in the standard emissions configuration) | ||||
Annual scale factors | README | $ROOT/AnnualScalar/v2014-07 | 3.0 MB | CURRENTLY USED
|
NEI2005 scale factors | README | $ROOT/NEI2005/v2014-09 | 8.1 MB | CURRENTLY USED |
ID codes for each country | README | $ROOT/COUNTRY_ID/v2015-04 | 15 MB | CURRENTLY USED
|
Mask files for regional emissions | README | $ROOT/MASKS/v2014-07 | 51 MB | CURRENTLY USED
|
MAP_A2A regridding data | README | $ROOT/MAP_A2A/v2014-07 | 152 KB | CURRENTLY USED |
Obsolete inventories (superseded by newer developments) | ||||
Weekly scale factors | README | $ROOT/WEEKSCALE/v2014-07 | 4.0 KB | OBSOLETE
|
--Bob Yantosca (talk) 19:05, 2 July 2018 (UTC)
Downloading the HEMCO data directories
The GEOS-Chem Support Team has created a package called hemco_data_download. With this package, you can download the various emissions inventories and related data files for HEMCO to your own disk server. Furthermore, you can specify which data directories that you would like to download (as well as those you would like to ignore) via a configuration file.
Obtaining the hemco_data_download package
To obtain the hemco_data_download package, use Git to clone this repository
git clone https://github.com/GCST/hemco_data_download.git
This will create a directory named hemco_data_download, in which you should see the following files:
File | Description |
---|---|
README | File with an overall description of the directory contents |
hemcoDataDownload.pl | Perl script to download HEMCO data directories |
hemcoDataDownload.rc | Configuration file for the hemcoDataDownload.pl script. In this file you can specify which HEMCO data directories you would like to download and which you would like to omit. |
forTesting.rc | A configuration file that you can use for testing or debugging. This will tell hemcoDataDownload.pl only to download a couple of emissions inventories with files that do not take up much disk space. |
--Bob Yantosca (talk) 17:02, 5 December 2016 (UTC)
Setting up the configuration file
The configuration files (i.e. hemcoDataDownload.rc and forTesting.rc) are pretty much self-explanatory.
At the top of the configuration file you will see this section:
############################################################################### # # # Specify the remote and local HEMCO data paths, plus other options. # # # ############################################################################### Remote HEMCO data path | ftp://ftp.as.harvard.edu/gcgrid/data/ExtData/HEMCO Your HEMCO data path | /as/scratch/bmy/HEMCO Verbose output | yes Dryrun only? | no
Path | Description |
---|---|
Remote HEMCO data path | Location on the FTP server from which you are going to download the data. This can be from either Harvard or from Dalhousie. (For now we will use the Harvard server). You can edit this accordingly. |
Your HEMCO data path | The root-level directory for HEMCO data on your own disk space. If you are not sure where to place this, then ask your sysadmin. |
Verbose output | Lets you specify if you want to print out extra output during the download process. This can be set to either "yes" or "no". |
Dryrun only | Allows you to print out the data download commands without actually downloading the data. This is useful for debugging. This can be set to either "yes" or "no". |
In the next section you specify all of the HEMCO inventories that you want to download. You will see this header:
############################################################################### # # # THE FOLLOWING DATA DIRECTORIES WILL BE DOWNLOADED. # # # # These data directories comprise the recommended emissions configuration # # for typical GEOS-Chem full-chemistry and specialty simulations. # # # # NOTE: In most cases, you only have to specify the top-level folder. # # All subfolders will be downloaded automatically. # # # ############################################################################### #=============================+================================================ # AEROSOLS | Directory paths #=============================+================================================ AEROCOM volcano emissions | $ROOT/VOLCANO/v2014-10 Bond et al BC/OC | $ROOT/BCOC_BOND/v2014-07 Cooke et al BC/OC | $ROOT/BCOC_COOKE/v2014-07 Secondary organic aerosols | $ROOT/SOA/v2014-07 ... etc ...
Each line specifies the name of a HEMCO emissions inventory and the data path where it can be found on disk, relative to the root data path. NOTE: The script will replace the $ROOT token with the value you gave to the "HEMCO remote data path" above. (Lines starting with the comment character # will be ignored.)
Any inventory found in this section will be downloaded. To prevent an inventory from being downloaded you can either comment it out (i.e. place a # in the first column) or move the inventory to the next section.
The final section specifies HEMCO emission inventories that you do not wish to download. The section looks like this:
############################################################################### # # # THE FOLLOWING DATA DIRECTORIES WILL NOT BE DOWNLOADED. # # # # These data directories contain are optional emissions inventories that # # are not used in typical GEOS-Chem simulations. If you wish to download # # any of these inventories, simply move the corresponding entry for each # # inventory to the previous section. # # # ############################################################################### CH3I simulation (obsolete) | $ROOT/CH3I/v2014-07 Chlorophyll A | $ROOT/CHLA/v2014-07 ... etc ...
--Bob Y. 13:55, 12 February 2015 (EST)
Downloading the data
Once you have set up your configuration file, you can run the hemcoDataDownload.pl script to start downloading the HEMCO data to your local server. To run the script you can type:
hemcoDataDownload.pl
If you do not specify a configuration file name, then the hemcoDataDownload.pl scriptwill read the default configuration file hemcoDataDownload.rc configuration file. If you wish to specify a different configuration file name, simply pass that as an argument to the script, e.g.
hemcoDataDownload.pl myNewConfigFile.rc
Before you start downloading GB's of data, we recommend that you run a short test to make sure that the data directories are being copied to the proper locations on your disk server. For this purpose, we have provided a configuration file named forTesting.rc. Typing
hemcoDataDownload.pl forTesting.pl
will only download a couple of data inventories that do not take up much disk space. This allows you to ensure that the data transfer is sucessful without making you wait a long time.
--Bob Y. 13:57, 12 February 2015 (EST)
New features for the GEOS-Chem v10-01 public release
For the GEOS-Chem v10-01 public release, we modified the default download options in the hemcoDataDownload.pl script. We changed the default wget options from:
wget r -nH -q ...
to
wget -r -nH -c -N -q ...
This will tell wget to only download new or modified files, instead of trying to download the entire data archive from scratch. This should hopefully subsequent data download processes faster.
--Bob Y. (talk) 18:35, 16 June 2015 (UTC)
Submitting new data for use with HEMCO
If you have a new emissions inventory or other atmospheric data set that you would like to add to the HEMCO data directories, then please contact the GEOS-Chem Support Team. Data files must be in COARDS-compliant or CF-compliant netCDF format. Please see our Preparing data files for use with HEMCO wiki page for more information.
--Bob Y. 11:49, 5 March 2015 (EST)