Difference between revisions of "GCHP Output Data"

From Geos-chem
Jump to: navigation, search
(Data Visualization and Post-Processing)
m
 
(10 intermediate revisions by one other user not shown)
Line 1: Line 1:
 +
----
 +
<span style="color:crimson;font-size:120%">'''The GCHP documentation has moved to https://gchp.readthedocs.io/.''' The GCHP documentation on http://wiki.seas.harvard.edu/ will stay online for several months, but it is outdated and no longer active!</span>
 +
----
 +
 
__FORCETOC__
 
__FORCETOC__
 
'''''[[Running_GCHP:_Configuration|Previous]] | [[Developing_GCHP|Next]] | [[Getting Started with GCHP]] | [[GCHP Main Page]]'''''
 
'''''[[Running_GCHP:_Configuration|Previous]] | [[Developing_GCHP|Next]] | [[Getting Started with GCHP]] | [[GCHP Main Page]]'''''
 
#[[GCHP_Hardware_and_Software_Requirements|Hardware and Software Requirements]]
 
#[[GCHP_Hardware_and_Software_Requirements|Hardware and Software Requirements]]
 +
#[[Setting_Up_the_GCHP_Environment|Setting Up the GCHP Environment]]
 
#[[Downloading_GCHP|Downloading Source Code and Data Directories]]
 
#[[Downloading_GCHP|Downloading Source Code and Data Directories]]
 +
#[[Compiling_GCHP|Compiling]]
 
#[[Obtaining_a_GCHP_Run_Directory|Obtaining a Run Directory]]
 
#[[Obtaining_a_GCHP_Run_Directory|Obtaining a Run Directory]]
#[[Setting_Up_the_GCHP_Environment|Setting Up the GCHP Environment]]
 
#[[Compiling_GCHP|Compiling]]
 
 
#[[Running_GCHP:_Basics|Running GCHP: Basics]]
 
#[[Running_GCHP:_Basics|Running GCHP: Basics]]
 
#[[Running_GCHP:_Configuration|Running GCHP: Configuration]]
 
#[[Running_GCHP:_Configuration|Running GCHP: Configuration]]
Line 12: Line 16:
 
#[[GCHP_Run_Configuration_Files|Run Configuration Files]]
 
#[[GCHP_Run_Configuration_Files|Run Configuration Files]]
 
<br>
 
<br>
 
Please note that documentation on this page primarily reflects the latest GCHP public release which is currently the GCHP 12 series. The documentation will be updated for the GCHP 13.0.0 release over the coming months.
 
  
 
==GCHP Output==
 
==GCHP Output==
  
 
===List of Output Files===
 
===List of Output Files===
Running GCHP produces several output data and log files. The table belows lists what files to expect. All of these files are deleted if you run <tt>make cleanup_output</tt> within the run directory.
+
GCHP produces the following output data and log files:
  
 
{| border=1 cellspacing=0 cellpadding=5
 
{| border=1 cellspacing=0 cellpadding=5
 
|-valign="top" bgcolor="#CCCCCC" valign="top"
 
|-valign="top" bgcolor="#CCCCCC" valign="top"
 
!width="400px"|Filename
 
!width="400px"|Filename
!width="500px"|Description
+
!width="400px"|Description
!width="200px"|Notes
+
!width="300px"|Notes
  
 
|-valign="top"
 
|-valign="top"
 
|<tt>OutputDir/GCHP.{collection}.YYYYMMDD_HHmmz.nc4</tt>
 
|<tt>OutputDir/GCHP.{collection}.YYYYMMDD_HHmmz.nc4</tt>
|Diagnostic output data. Note that you can change the filename format for any collection within HISTORY.rc.
+
|Diagnostic output data
|
+
|You may customize the filename format for any collection within config file <tt>HISTORY.rc</tt>
  
 
|-valign="top"
 
|-valign="top"
 
|<tt>gchp.log</tt>
 
|<tt>gchp.log</tt>
|Primary GCHP log file. Sample run scripts <tt>gchp.run</tt> and <tt>gchp.loca.run</tt> are configured to send all stdout to this file by default.  
+
|If adapting an example run script, this file is the primary GCHP log file.
|Please include this file when [[Submitting GEOS-Chem support requests|submitting a support request]].
+
|Includes all model standard output as well output from sourcing <tt>gchp.env</tt> and <tt>runConfig.sh</tt>. If using the multi-run option then this file is not produced; standard output is sent to the job log file instead per job.
  
|-valign="top"
 
|<tt>PET00000.GEOSCHEMchem.log</tt>
 
|Can be ignored.
 
|
 
 
|-valign="top"
 
|-valign="top"
 
|<tt>HEMCO.log</tt>
 
|<tt>HEMCO.log</tt>
|Log file generated by HEMCO. This file is also output when running GEOS-Chem Classic.
+
|Log file generated by HEMCO.  
|Please include this file when [[Submitting GEOS-Chem support requests|submitting a support request]].
+
|This file is also output when running GEOS-Chem Classic.
  
 
|-valign="top"
 
|-valign="top"
 
|<tt>cap_restart</tt>
 
|<tt>cap_restart</tt>
|MAPL output text file containing GCHP run end time to be used as the next run's start time if doing multi-segmented runs. If not doing multi-segmented runs then this file should be deleted prior to rerunning so that start information in <tt>CAP.rc</tt> is used instead.
+
|MAPL output text file containing GCHP run end time.
|
+
|This file may be used as the next run's start time if doing multi-segmented runs. If not doing multi-segmented runs then this file should be deleted prior to rerunning so that start information in <tt>CAP.rc</tt> is used instead.
  
 
|-valign="top"
 
|-valign="top"
 
|<tt>*.rcx</tt>
 
|<tt>*.rcx</tt>
|Can be ignored.
+
|History collection information.
|
+
|One file per collection.
  
 
|-valign="top"  
 
|-valign="top"  
 
|<tt>EGRESS</tt>
 
|<tt>EGRESS</tt>
 +
|Empty file produced by MAPL.
 
|Can be ignored.
 
|Can be ignored.
|
 
  
 
|-valign="top"  
 
|-valign="top"  
 
|<tt>logfile.000000.out</tt>
 
|<tt>logfile.000000.out</tt>
|Can be ignored.
+
|Log file for advection settings.
 
|
 
|
  
 
|-valign="top"  
 
|-valign="top"  
|<tt>trac_avg.gchp_c24_standard.YYYYMMDDHHmm</tt>
+
|<tt>warnings_and_errors.log</tt>
|Can be ignored. All GCHP data output diagnostics data is stored in the <tt>OutputDir</tt> subdirectory.  
+
|MAPL logging layer output.
|
+
|Can be ignored since GCHP does not currently use GMAO library pFlogger.
 +
 
 
|}
 
|}
  
If you run GCHP using SLURM then you will also have a slurm output file that will contain potentially valuable information about your run.
+
If you run GCHP using a job scheduler then you will also have a system output file.
  
 
===Restart Files===
 
===Restart Files===
  
GCHP restart files are called "checkpoint" files and are output to the main level of the run directory. The primary restart file is output at the end of the run and is named "gcchem_internal_checkpoint.nc" by default (see the previous chapter on configuring restart files). However, if is renamed to include 'restart' and the timestamp automatically by the sample run scripts. If you are outputting checkpoint files at regular intervals during your run then they will be output with format "gcchem_internal_checkpoint.nc.YYYYMMDD_HHmmz.bin". Despite the extension, these files are netcdf files. You may need to rename them to have the ".nc" extension for compatibility with your netcdf viewer or analysis software.
+
GCHP restart files are called "checkpoint" files and are output to the main level of the run directory. The primary restart file is output at the end of the run and is named "gcchem_internal_checkpoint.nc" by default (see the previous chapter on configuring restart files). However, if is renamed to include 'restart' and the timestamp automatically by the sample run scripts.
  
 
The GCHP checkpoint files contains all variables that are stored in the MAPL Internal state in GCHP and are analogous to the combined contents of the GEOS-Chem Classic and HEMCO restart files. There is one notable difference. '''All 3D variables in the GCHP restart file are vertically flipped. This is an important distinction to know about when comparing or preparing GCHP restart files.'''
 
The GCHP checkpoint files contains all variables that are stored in the MAPL Internal state in GCHP and are analogous to the combined contents of the GEOS-Chem Classic and HEMCO restart files. There is one notable difference. '''All 3D variables in the GCHP restart file are vertically flipped. This is an important distinction to know about when comparing or preparing GCHP restart files.'''
Line 81: Line 80:
 
===Diagnostic Data===
 
===Diagnostic Data===
  
'''Important note: Starting in GCHP 12.5.0 you may output data on lat-lon grids via conservative online regridding using ESMF. Documentation for how to do this is not yet on the wiki but is coming soon. See the GCHP 12.5.0 HISTORY.rc file for instructions.'''
+
GCHP diagnostic data is in netCDF-4 format and is stored in the <tt>OutputDir</tt> directory within your run directory. The output filename format is <tt>GCHP.''collection''.YYYYMMDD_HHmmz.nc4</tt> where one or more collection names are configured in the output configuration file <tt>HISTORY.rc</tt>. You are free to create as many additional collections with any names you choose by adding them to <tt>HISTORY.rc</tt>. You may include both 2D and 3D arrays in a single collection. However, all 3D arrays must have the same 3rd dimension size. In GCHP the 3rd dimension size is limited to # of level centers (72) and # of level edges (73).  
 
+
GCHP diagnostic data is in netCDF-4 format and is stored in the <tt>OutputDir</tt> directory within your run directory. The output filename format is <tt>GCHP.''collection''.YYYYMMDD_HHmmz.nc4</tt> where one or more collection names are configured in the output configuration file <tt>HISTORY.rc</tt>. You are free to create as many additional collections with any names you choose by adding them to <tt>HISTORY.rc</tt>. Any 2D or 3D arrays stored in <tt>State_Met</tt>, <tt>State_Chm</tt>, or <tt>State_Diag</tt> can be included in a History collection. You may include both 2D and 3D arrays in a single collection, but be aware that all 3D arrays must have the same 3rd dimension size. In GCHP the 3rd dimension size is limited to # of level centers (72) and # of level edges (73).  
+
  
The emissions diagnostics file is different from the netcdf emissions diagnostics file output by GEOS-Chem Classic. In GCHP the emissions to include in the file are listed in HISTORY.rc and the output file is sent to <tt>OutputDir</tt> with the same naming format as other diagnostic collections. All 3d data within the file are flipped vertically (surface is last element in the level dimension). Note that this is also true of the GCHP restart file due to the way MAPL stores the internal state.
+
Unlike in GEOS-Chem Classic, the contents of the HEMCO emissions diagnostic file are listed in <tt>HISTORY.rc</tt> and the output file has the same naming format as other diagnostic collections. '''Like the GCHP restart file, all 3D HEMCO diagnostics are flipped vertically relative to other outputs.''' This means the first element in the output array corresponds to top-of-atmosphere and the last element is surface. The GCPy python package includes being able to vertically flip the level dimension prior to plotting to make post-run analysis easier.
  
 
==Data Visualization and Post-Processing==
 
==Data Visualization and Post-Processing==

Latest revision as of 15:41, 8 December 2020


The GCHP documentation has moved to https://gchp.readthedocs.io/. The GCHP documentation on http://wiki.seas.harvard.edu/ will stay online for several months, but it is outdated and no longer active!



Previous | Next | Getting Started with GCHP | GCHP Main Page

  1. Hardware and Software Requirements
  2. Setting Up the GCHP Environment
  3. Downloading Source Code and Data Directories
  4. Compiling
  5. Obtaining a Run Directory
  6. Running GCHP: Basics
  7. Running GCHP: Configuration
  8. Output Data
  9. Developing GCHP
  10. Run Configuration Files


GCHP Output

List of Output Files

GCHP produces the following output data and log files:

Filename Description Notes
OutputDir/GCHP.{collection}.YYYYMMDD_HHmmz.nc4 Diagnostic output data You may customize the filename format for any collection within config file HISTORY.rc
gchp.log If adapting an example run script, this file is the primary GCHP log file. Includes all model standard output as well output from sourcing gchp.env and runConfig.sh. If using the multi-run option then this file is not produced; standard output is sent to the job log file instead per job.
HEMCO.log Log file generated by HEMCO. This file is also output when running GEOS-Chem Classic.
cap_restart MAPL output text file containing GCHP run end time. This file may be used as the next run's start time if doing multi-segmented runs. If not doing multi-segmented runs then this file should be deleted prior to rerunning so that start information in CAP.rc is used instead.
*.rcx History collection information. One file per collection.
EGRESS Empty file produced by MAPL. Can be ignored.
logfile.000000.out Log file for advection settings.
warnings_and_errors.log MAPL logging layer output. Can be ignored since GCHP does not currently use GMAO library pFlogger.

If you run GCHP using a job scheduler then you will also have a system output file.

Restart Files

GCHP restart files are called "checkpoint" files and are output to the main level of the run directory. The primary restart file is output at the end of the run and is named "gcchem_internal_checkpoint.nc" by default (see the previous chapter on configuring restart files). However, if is renamed to include 'restart' and the timestamp automatically by the sample run scripts.

The GCHP checkpoint files contains all variables that are stored in the MAPL Internal state in GCHP and are analogous to the combined contents of the GEOS-Chem Classic and HEMCO restart files. There is one notable difference. All 3D variables in the GCHP restart file are vertically flipped. This is an important distinction to know about when comparing or preparing GCHP restart files.

Diagnostic Data

GCHP diagnostic data is in netCDF-4 format and is stored in the OutputDir directory within your run directory. The output filename format is GCHP.collection.YYYYMMDD_HHmmz.nc4 where one or more collection names are configured in the output configuration file HISTORY.rc. You are free to create as many additional collections with any names you choose by adding them to HISTORY.rc. You may include both 2D and 3D arrays in a single collection. However, all 3D arrays must have the same 3rd dimension size. In GCHP the 3rd dimension size is limited to # of level centers (72) and # of level edges (73).

Unlike in GEOS-Chem Classic, the contents of the HEMCO emissions diagnostic file are listed in HISTORY.rc and the output file has the same naming format as other diagnostic collections. Like the GCHP restart file, all 3D HEMCO diagnostics are flipped vertically relative to other outputs. This means the first element in the output array corresponds to top-of-atmosphere and the last element is surface. The GCPy python package includes being able to vertically flip the level dimension prior to plotting to make post-run analysis easier.

Data Visualization and Post-Processing

The GEOS-Chem python package GCPy is fully functional with GCHP output files. The same plotting functions are used for both GEOS-Chem Classic lat-lon data and GCHP cubed-sphere data, and contain the capability to automatically regrid between grid types and resolutions. Please see the GCPy GitHub page.



Previous | Next | Getting Started with GCHP | GCHP Main Page