Developing GEOS-Chem: Difference between revisions

From Geos-chem
Jump to navigation Jump to search
(Created page with "Image: Page is under construction.jpg Here we provide guidelines for writing clear, concise, and effective Fortran source code. == Background == We have written GEOS-Ch...")
 
 
(113 intermediate revisions by 2 users not shown)
Line 1: Line 1:
[[Image: Page is under construction.jpg]]
<span style="color:darkorange">'''''NOTE: We will be updating this coding style guide in the near future.  The recent efforts to interface GEOS-Chem with ESMs like NASA GEOS, CESM, BCC, and WRF necessitate a departure from past programming practice.  We will keep you updated.'''''</span>


Here we provide guidelines for writing clear, concise, and effective Fortran source code.


== Background ==
== Background ==


We have written GEOS-Chem in the Fortran-90 (F90) language.  F90 introduces several new features over its predecessor, Fortran-77 (F77), including allocatable arrays, modules, derived types, and new programming statements.  We invite you to consult [[GEOS-Chem#Online tutorials|this list of Fortran tutorials].
We have written GEOS-Chem in the Fortran-90 (F90) language.  F90 introduces several new features over its predecessor, Fortran-77 (F77), including allocatable arrays, modules, derived types, and new programming statements.  We invite you to consult [[GEOS-Chem_basics#Fortran_resources|this list of Fortran tutorials]].


F90 contains all of the features of F77. Legacy code adhering to the F77 standard should build with  any F90 compiler. Although we have written GEOS-Chem source code to the F90 standard, you should be aware that many 3rd-party routines included in GEOS-Chem&mdash;such as, [[Photolysis_mechanism|FAST-JX], and [[ISORROPIA_II|ISORROPIA]]&mdash;were originally ritten to the F77 standard.  In the past several GEOS-Chem releases, we have started to replaced older F77 code with F90 code, especially to  facilitate our [[GEOS-Chem HP|GEOS-Chem High Performance ("HP)") project]].
F90 contains all of the features of F77. Legacy code adhering to the F77 standard should build with  any F90 compiler. Although we have written GEOS-Chem source code to the F90 standard, you should be aware that many 3rd-party routines included in GEOS-Chem&mdash;such as, [[Photolysis_mechanism|FAST-JX], and [[ISORROPIA_II|ISORROPIA]]&mdash;were originally written to the F77 standard.  In the past several GEOS-Chem releases, we have started to replaced older F77 code with F90 code, especially to  facilitate our [[GEOS-Chem HP|GEOS-Chem High Performance (aka GCHP) project]].


You should also be aware that several other updates to the Fortran standard (F95, F2000, F2003) have been introduced in recent years. These new Fortran versions contain the same functionality as F90, but generally omit the obsolete F77-style features. Most modern Fortran compilers include support for all of these standards.
You should also be aware that several other updates to the Fortran standard (F95, F2000, F2003) have been introduced in recent years. These new Fortran versions contain the same functionality as F90, but generally omit the obsolete F77-style features. Most modern Fortran compilers include support for all of these standards.


== Style points ===
== Code layout ==


=== Fixed format vs. free-format ===
=== Fixed format vs. free format ===


We previously wrote GEOS-Chem source code using '''fixed-format layout''', which has the following requirements:
We previously wrote GEOS-Chem source code using '''fixed-format layout''', which has the following requirements:


*A line continuation character (usually <tt>&amp;</tt>) is placed in column 6  
*A line continuation character (usually <code>&</code>) is placed in column 6  
*Numeric labels can occupy column 2 through column 5
*Numeric labels can occupy column 2 through column 5
*Executable statements begin in column 7 and extend to column 72
*Executable statements begin in column 7 and extend to column 72


We chose the fixed-format layout for compatibility with the F77-style include files that were contained in the <tt>Headers/</tt> directory. Starting with GEOS-Chem v9-01-03, we converted all header files to [http://acmg.seas.harvard.edu/geos/doc/man/chapter_7.html#ProgUnits modules].  This allows the use of '''free-format layout''', which does not restrict where you place executable statements. (To continue a free-format statement across several lines of code, you must place an ampersand <tt>&quot;&amp;&quot;</tt> character at the end of each line.) We recommend that you write all new GEOS-Chem code using the free format layout.
We chose the fixed-format layout for compatibility with the F77-style include files that were contained in the <code>Headers/</code> directory. Starting with [[GEOS-Chem v9-01-03]], we converted all header files to [[#Modules|Fortran modules]].  This allows the use of '''free-format layout''', which does not restrict where you place executable statements. (To continue a free-format statement across several lines of code, you must place an ampersand (<code>&;</code>) character at the end of each line.)  


=== Denoting code authors ===
Modules in GEOS-Chem that are still written in F77 fixed-format layout are denoted with the <tt>.F</tt> extension, while modules written in F90 free-format layout are denoted with the <tt>.F90</tt> extension. '''We recommend that you write all new GEOS-Chem code using the free format layout.'''
 
=== Modules ===
 
Place all of your new GEOS-Chem code into Fortran 90 modules, if possible. These allow you to save both and routines and variables within a single program unit.
 
An example module is shown below.
 
!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !MODULE: MY\_MODULE
!
! !DESCRIPTION: \subsection*{Overview}
! Module MY\_MODULE contains variables and routines to do something.
! (bmy, 5/1/03)
!\\
!\\
! !INTERFACE
!
        MODULE MY_MODULE
!
! !USES:
!
      IMPLICIT NONE
      ! Make everything private...
      PRIVATE
!
! !PUBLIC MEMBER FUNCTIONS:
!
      PUBLIC :: DRIVER
      PUBLIC :: INIT_MY_MODULE
      PUBLIC :: CLEANUP_MY_MODULE
!
! !REVISION HISTORY:
!  (1 ) Bug fix in MY_SUBROUTINE (bmy, 5/1/03)
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !PRIVATE DATA MEMBERS:
!
      ! Argument to the SIN function
      REAL*8, ALLOCATABLE :: B(:)
      CONTAINS
!EOC
!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: DRIVER
!
! !DESCRIPTION: \subsection*{Overview}
! Subroutine DRIVER is the driver routine for MY_MODULE. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE DRIVER
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !LOCAL VARIABLES:
!
      LOGICAL, SAVE :: FIRST = .TRUE.
      INTEGER, :: I
      REAL*8 :: X, Y1, Y2
      REAL*8, PARAMETER :: PI180 = 180d0 / 3.14159265358979323d0
 
      !=================================================================
      ! DRIVER begins here!
      !=================================================================
      ! Initialize
      IF ( FIRST ) THEN
          CALL INIT_MY_MODULE()
      ENDIF
      !=================================================================
      ! Loop over degrees
      !=================================================================
      DO I = 1, 360
          ! Convert to radians
          B = DBLE( I ) / PI180
          ! Call subroutine
          CALL MY_SUBROUTINE( B(I), Y1 )
          ! Call function
          Y2 = MY_FUNCTION( B(I) )
          ! Write output
          WRITE( 6, '(i3, 1x, 3(f13.6,1x))' ) I, B(I), Y1, Y2
      ENDDO
 
      ! Return to calling program
      END SUBROUTINE DRIVER
!EOC
!-----------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: MY\_SUBROUTINE
!
! !DESCRIPTION: \subsection*{Overview}
! Function MY_SUBROUTINE returns the sine of a number. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE MY_SUBROUTINE( X, Y )
!
! !INPUT PARAMETERS:
!
      ! Argument for the sine function
      REAL*8, INTENT(IN) :: X
!
! !RETURN VALUE:
!   
      REAL*8, INTENT(OUT) :: Y
!
! !REVISION HISTORY:
! (1 ) Corrected typographic error (bmy, 5/1/03)
!EOP
!------------------------------------------------------------------------------
!BOC
      !=================================================================
      ! MY_SUBROUTINE begins here!
      !=================================================================
      Y = SIN( X )
 
      ! Return to calling program
      END SUBROUTINE MY_SUBROUTINE
!EOC
!-----------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: MY\_FUNCTION
!
! !DESCRIPTION: \subsection*{Overview}
! Function MY_FUNCTION returns the sine of a number. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      FUNCTION MY_FUNCTION( X ) RESULT( Y )
!
! !INPUT PARAMETERS:
!
      ! Argument for the sine function
      REAL*8, INTENT(IN) :: X
!
! !RETURN VALUE:
!   
      REAL*8, INTENT(OUT) :: Y
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
 
      !=================================================================
      ! MY_FUNCTION begins here!
      !=================================================================
      Y = SIN( X )
      ! Return to calling program
      END FUNCTION MY_FUNCTION
!EOC
!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: INIT\_MY\_MODULE
!
! !DESCRIPTION: \subsection*{Overview}
! Subroutine INIT\_MY\_MODULE allocates and zeroes the B array. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE INIT_MY_MODULE
!
! !USES
!
      USE ERROR_MOD, ONLY : ALLOC_ERR
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !LOCAL VARIABLES
!
      INTEGER :: AS
      !=================================================================
      ! INIT_MY_MODULE begins here!
      !=================================================================
      ! Allocate B, check status
      ALLOCATE( B(360), STAT=AS )
      ! Error check
      IF ( AS /= 0 ) CALL ALLOC_ERR( 'B' )
      ! Zero B
      B(:) = 0d0
      ! Return to calling program
      END SUBROUTINE INIT_MY_MODULE
!EOC
!-----------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: CLEANUP\_MY\_MODULE
!
! !DESCRIPTION: \subsection*{Overview}
! Subroutine CLEANUP_MY_MODULE deallocates all module arrays. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE CLEANUP_MY_MODULE()
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
      !=================================================================
      ! CLEANUP_MY_MODULE begins here!
      !=================================================================
      IF ( ALLOCATED( B ) ) DEALLOCATE( B )
      ! Return to calling program
      END SUBROUTINE CLEANUP_MY_MODULE
  !EOC
  END MODULE MY_MODULE<
 
Fortran 90 modules written for GEOS-Chem contain the following features:
 
{| border=1 cellspacing=0 cellpadding=5
|-valign="top" bgcolor="#CCCCCC"
!width="200px"|Item
!width="800px"|Description
 
|-valign="top"
|Module header
|This is similar to the subroutine header. Contains a list of all module variables, routines, and modification notes, in [[Automatic_documentation_with_protex|Automatic documentation with ProTex]]
 
|-valign="top"
|<code>PRIVATE</code> declarations
|You can "hide" certain routines or variables within a F90 module from other routines or modules. Ideally your module will be like a "black box" with only a few routines or variables that are publicly accessible. By convention, in GEOS-Chem, we declare everything <code>PRIVATE</code> first then indicate the public routines.
 
|-valign="top"
|<code>IMPLICIT NONE</code>
|If you declare <code>IMPLICIT NONE</code> once at the top of the module, then you don't have to declare it in the individual subroutines; the declaration "carries through" below.
 
|-valign="top"
|Module variables and arrays
|Any variables declared above the <code>CONTAINS</code> statement are as if they were stored in F77 common blocks; that is, they are preserved between calls to the various module routines. Also, you should declare module arrays as <code>ALLOCATABLE</code> whenever possible. This allows you to only set aside memory for that array if a certain routine is called. This results in more efficient memory management.
 
|-valign="top"
|<code>CONTAINS</code> statement
| All module variables and <code>PRIVATE</code> declarations go above the <code>CONTAINS</code> statement.  Module routines and functions go below the <code>CONTAINS</code> statement
;Subroutines and functions: Your module will contain various subroutines and functions depending on the particular needs at hand.
 
|-valign="top"
|An INIT routine
|Your module should have a subroutine named <code>INIT_modulename</code>, which initializes and allocates all module arrays.
 
|-valign="top"
|A CLEANUP routine
|Your module should have a subroutine named <code>CLEANUP_modulename</code>, which deallocates module arrays. (You only need this if you have allocatable arrays).
 
|}
 
F90 modules are very similar to classes in Java and C++; they allow you to group data and the routines which work on the data in a single package.


Generally, we denote code authors by a 3-letter abbreviation in source code comments <tt>(bmy, 5/1/03)</tt>. However, some authors have choosen to sign their code updates with their computer account names, e.g.<tt>(mpayer, 7/15/12)</tt>.
Theoretically, each GEOS-Chem routine and/or variable should belong to a module. In practice, this is not true, as we do have separate subroutine and function files from some of the older routines which were written by 3rd party sources. '''However, you should write all new  GEOS-Chem code in module form.'''


== Headers, declarations, indentations, white space ==
Also, it is OK for modules to reference other modules. If Module B references Module A, all that you have to do is to make sure that Module B declaration in the Makefile refers to Module A. The GEOS-Chem Makefiles have been prepared for you by the [[GEOS-Chem Support Team]], so in most instances, you will not have to concern yourself with this.


=== Automatic documentation with ProTeX ===
=== Automatic documentation with ProTeX ===


We generate detailed <a href="chapter_9.html">GEOS-Chem reference documentation</a> with [[Automatic documentation with protex|ProTeX]]. ProTeX is a perl script developed by [http://gmao.gsfc.nasa.gov/ NASA/GMAO] that generates ProTeX files from  GEOS-Chem's subroutine, function, and module comment headers. The LaTeX file can then be compiled to produce both PostScript and PDF output.
We generate detailed [https://geoschem.github.io/gcclassic-manpage-archive/man.GC_12/chapter_14.html reference documentation] with [[Automatic documentation with protex|ProTeX]]. ProTeX is a perl script developed by [http://gmao.gsfc.nasa.gov/ NASA/GMAO] that generates ProTeX files from  GEOS-Chem's subroutine, function, and module comment headers. The LaTeX file can then be compiled to produce both PostScript and PDF output.


To use ProTeX, you need to add a standard header at the top of each subroutine, function, and module. The header looks like this:</p>
To use ProTeX, you need to add a standard header at the top of each subroutine, function, and module. The header looks like this:


  !------------------------------------------------------------------------------
  !------------------------------------------------------------------------------
Line 149: Line 446:
Note that the ProTeX header contains the following elements:
Note that the ProTeX header contains the following elements:


#Indicators (<tt>!BOP</tt>, <tt>!EOP</tt>) as to where ProTeX should look for source code header comments<
#Indicators (<code>!BOP</code>, <code>!EOP</code>) as to where ProTeX should look for source code header comments<
#Indicators (<tt>!BOC</tt>, <tt>!EOC</tt>) as to where the source code begins and ends. ProTeX will ignore code between <tt>!BOC</tt> and <tt>!EOC</tt> unless you tell it otherwise.
#Indicators (<code>!BOC</code>, <code>!EOC</code>) as to where the source code begins and ends. ProTeX will ignore code between <code>!BOC</code> and <code>!EOC</code> unless you tell it otherwise.
#A description of the routine with the original date
#A description of the routine with the original date
#A list of input &amp; output arguments
#A list of input &amp; output arguments
#A list of references (if applicable)
#A list of references (if applicable)
#References to F90 modules (if any) after the <tt>USE</tt> keyword followed by the <tt>IMPLICIT NONE</tt> declaration
#References to F90 modules (if any) after the <code>USE</code> keyword followed by the <code>IMPLICIT NONE</code> declaration
#Each time the file is modified the REVISION HISTORY section should be updated.
#Each time the file is modified the <code>REVISION HISTORY</code> section should be updated.
#Declarations for local variables
#Declarations for local variables
#Declarations for Fortran parameters (aka constants)
#Declarations for Fortran parameters (aka constants)
#Declarations for external functions (if necessary)
#Declarations for external functions (if necessary)


Fortran 90 allows you to declare an argument to a subroutine or function with one of the following descriptors:</p>
Fortran 90 allows you to declare an argument to a subroutine or function with one of the following descriptors:


#<tt>INTENT(IN)</tt> (read-only),
#<code>INTENT(IN)</code> (read-only),
#<tt>INTENT(OUT)</tt> (write-only) or
#<code>INTENT(OUT)</code> (write-only) or
#<tt>INTENT(INOUT)</tt> (read-and-write)
#<code>INTENT(INOUT)</code> (read-and-write)


<p align="justify">This helps you to prevent overwriting arguments which should not be overwritten.</p>
This helps you to prevent overwriting arguments which should not be overwritten.


== Style Guide ==


=== Denoting code authors ===


<p align="justify">We recommend that you separate sections of source code with one or more divider comments:</p>
Generally, we denote code authors by a 3-letter abbreviation in source code comments <code>(bmy, 5/1/03)</code>.


<blockquote>
=== The more comments, the better! ===


<pre class="courier14">
It is good practice to add copious comments to your source code.  This will make sure that current and future GEOS-Chem users will be able to understand the modifications that you have added.


!=================================================================
Keep in mind that many GEOS-Chem users (including the [[GEOS-Chem Support Team]]) will probably not be as familiar with your specific area of research as you are.  While most users will probably get the "big picture" of what you are doing, they will be less knowledgeable about the little details (i.e. why does this reaction rate have a value of X, why is this parameter set to Y, why was this IF statement deleted, etc.).  Providing sufficient source code documentation will eliminate any such guesswork.


! 1st level header
=== Section headers ===


!=================================================================
We recommend that you separate sections of source code with one or more divider comments:
!=================================================================
! 1st level header
!=================================================================
    !--------------------------------------------------------------
    ! 2nd level header
    !--------------------------------------------------------------
      !-----------------------
      ! 3rd-level header
      !-----------------------


Line up each header with the the code that follows immediately below it. You can mix and match these styles as you wish. You don't always have to extend the header all the way across the screen.


If you are modifying a fixed-format source code file, end the major section headers at column #72. This will remind your reader where the limits of the allowable source code region occurs.  If you are modifying a free-format source code file, feel free to extend the ends of the major section headers to about column 76 or 77.


  !--------------------------------------------------------------
=== Capitals vs. lower-case ===


  ! 2nd level header
We used to write GEOS-Chem source code in all capitals. We chose this approach  to avoid mistaking similar characters,  such as the  lowercase <code>l</code> and the number <code>1</code>, which in some fonts look remarkably alike. You may at  your discretion mix capitals and lowercase in variable names and subroutine names, particularly if you want to improve readability or preserve scientific abbreviations ( e.g. <code>Rn_Pb_Be_mod.F</code>).


  !--------------------------------------------------------------
We still recommend that you write Fortran reserved words in all capitals: <code>SUBROUTINE</code>, <code>CALL</code>, <code>IF/THEN/ENDIF</code>, <code>DO/ENDDO</code>, <code>WHERE/ENDWHERE</code>. This distinguishes Fortran language elements from your variable and routine names.


Because GEOS-Chem contains a significant amount of 3rd-party code, you will see many routines where the code is written in all lowercase letters. Feel free to leave this code as-is. Reformatting a fixed-format source code file  can be  a cumbersome task (and maybe not the best use of your precious time)


Good editors such as <code>emacs</code> can "colorize" the different words in a program (e.g. statements, variables, and quoted text are rendered in different colors), thus making the code infinitely more readable.


      !-----------------------
=== Indent by 3 spaces ===


      ! 3rd-level header
Indent each new block of code 3 columns deeper than the last block:
 
      !----------------------</pre>
 
</blockquote>
 
<p align="justify">Line up each header with the the code that follows immediately below it. You can mix and match these styles as you wish. You don't always have to extend the header all the way across the screen.</p>
 
<p align="justify">If you are modifying a fixed-format source code file, end the major section headers at column #72. This will remind your reader where the limits of the allowable source code region occurs.  If you are modifying a free-format source code file, feel free to extend the ends of the major section headers to about column 76 or 77.</p>
 
<p align="justify"><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><strong><em>A7.2.2</em></strong> We used to write GEOS-Chem source code in ALL CAPITALS. We chose this approach  to avoid mistaking similar characters,  such as the  lowercase <tt>l</tt> and the number <tt>1</tt>, which in some fonts look remarkably alike. You may at  your discretion mix capitals and lowercase in variable names and subroutine names, particularly if you want to improve readability or preserve scientific abbreviations ( e.g. <tt>Rn_Pb_Be_mod.F</tt>).</p>
 
<p align="justify">We still recommend that you write Fortran reserved words in ALL CAPITALS: <tt>SUBROUTINE</tt>, <tt>CALL</tt>, <tt>IF/THEN/ENDIF</tt>, <tt>DO/ENDDO</tt>, <tt>WHERE/ENDWHERE</tt>. This distinguishes Fortran language elements from your variable and routine names.</p>
 
<p align="justify">Because GEOS-Chem contains a significant amount of
 
  3rd-party code, you will see many routines where the code is written in all lowercase letters. Feel free to leave this code as-is. Reformatting a fixed-format source code file  can be  a cumbersome task (and maybe not the best use of your precious time).</p>
 
<p align="justify"> Good  editors such as <tt>emacs</tt> and <tt>Xemacs</tt> can "colorize" the different words in a program (e.g. statements, variables, and quoted text are rendered in different colors), thus making the code infinitely more readable..</p>
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.2.3 </span>
 
Indent each new block of code 3 columns deeper than the last block: </p>
 
<blockquote>
 
<pre>        1        2        3        4
 
1234567890123456789012345678901234567890


          1        2        3        4
1234567890123456789012345678901234567890
       IF ( X == 0 ) THEN
       IF ( X == 0 ) THEN
         PRINT*, 'Hello, X is 0!'
         PRINT*, 'Hello, X is 0!'
 
 
 
         IF ( Y == 0 ) THEN
         IF ( Y == 0 ) THEN
             PRINT*, 'Hello, X is 0 and Y is also 0!'
             PRINT*, 'Hello, X is 0 and Y is also 0!'
         ENDIF
         ENDIF
      ENDIF


      ENDIF</pre>
A good editor such as <code>emacs</code> will indent for you automatically. You can specify the level of indent in your editor setup file (e.g. <code>.emacs</code>).


</blockquote>
=== Indenting nested DO loops ===


<p align="justify">A good editor such as <tt>emacs</tt> will indent for you automatically. You can specify the level of indent in your editor setup file (e.g. <tt>.emacs</tt>).</p>
Omit indentation for each line of a multiple <code>DO</code> loop. Use this syntax:


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
DO N = 1, NNPAR
DO L = 1, LLPAR
DO J = 1, JJPAR
DO I = 1, IIPAR
    A(I,J,L,N) = A(I,J,L,N) / 2.0d0
ENDDO
ENDDO
ENDDO
ENDDO


<p align="justify"><span class="size16bi"> A7.2.4 </span>
instead of letting the DO loops "creep" unnecessarily across the screen:


Omit
DO N = 1, NNPAR
    DO L = 1, LLPAR
      DO J = 1, JJPAR
          DO I = 1, IIPAR
            A(I,J,L,N) = A(I,J,L,N) / 2.0d0
          ENDDO
      ENDDO
    ENDDO
ENDDO


indentation for each line of a multiple DO loop. Use this syntax:</p>
=== Enhance readability with white space ===


<blockquote>
Use LOTS of white space to separate code. For example instead of


  <pre>DO N = 1, NNPAR
IF(X.eq.0)CALL DOLOOP(X,Y,Z,A,B,C,1,2,3)


DO L = 1, LLPAR
type:


DO J = 1, JJPAR
IF ( X == 0 ) CALL DOLOOP( X, Y, Z, A, B, C, 1, 2, 3 )


DO I = 1, IIPAR
This makes the code much more readable, which prevents bugs.. If  you can't read what you've written, you will never find your mistake! Don't be afraid to continue the statement to the next line if necessary.


  A(I,J,L,N) = A(I,J,L,N) / 2.0d0
In the above example, we also have used the new F90 equality test operator <code>==</code> (see below for more information).


ENDDO
=== Line up code into columns for better readability ===


ENDDO
Always line up quantities in assignment statements. Instead of:


ENDDO
A=1
THISLOOP=2
C=3
D=THISLOOP*C - A


ENDDO</pre>
add white space so that all of the equals signs fall in the same column.


</blockquote>
A        = 1
THISLOOP = 2
C        = 3
D        = ( THISLOOP * C ) - A


<p align="justify">instead of letting the DO loops &quot;creep&quot; unneccessarily across the screen:</p>
Your eyes can make more sense out of text that is lined up into vertical columns.


<blockquote>
Group multiple terms of an equation with parentheses (as we have done for the D equation above). The  compiler evaluates evaluate expressions in the order the parentheses are placed, from innermost to outermost.


<pre>DO N = 1, NNPAR
=== Use of white space for arrays and functions ===


  DO L = 1, LLPAR
Do not leave white space between array indices:


      DO J = 1, JJPAR
A = X(I,J,L)


        DO I = 1, IIPAR
Leave white space between arguments of functions and WRITE statements:


            A(I,J,L,N) = A(I,J,L,N) / 2.0d0
A = MYFUNCTION( I, J, L )
WRITE( 6, '(a)' )


        ENDDO
=== Line up arguments into columns ===


      ENDDO
Line up arguments in subroutine or function calls. If there are an even number of arguments, break them up symmetrically:


  ENDDO
Fortran-77 fixed-file format:


ENDDO</pre>
      CALL MYSUB( THIS_A, THIS_B, THIS_C,
    &            THIS_D, THIS_E, THIS_F )
      X = MYFUNCTION( THIS_A, THIS_B, THIS_C,
    &                THIS_D, THIS_E, THIS_F )


</blockquote>
Fortran-90 free-format:


      CALL MYSUB( THIS_A, THIS_B, THIS_C, &
                  THIS_D, THIS_E, THIS_F )
      X = MYFUNCTION( THIS_A, THIS_B, THIS_C, &


=== Use language features of Fortran-90 ===


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
==== Operators ====


<p align="justify"><strong><em>A7.2.5</em></strong>
Use the new F90-style operators instead of the older F77-style operators:


Use LOTS of white space to separate code. For example instead of</p>
>== instead of .EQ.
/= instead of .NE.
>  instead of .GT.
>= instead of .GE.
< instead of .LT.
>= instead of .LE.
.EQV. (or .eqv, it's case-insensitive) when comparing two logical values, i.e.
 
    IF ( MY_LOGICAL .eqv .FALSE. ) THEN ...


<blockquote><pre>IF(X.eq.0)CALL DOLOOP(X,Y,Z,A,B,C,1,2,3)</pre></blockquote>
The new operators take up only one or two spaces (instead of four spaces) and are easier to read.


<p align="justify">type:</p>
Some operators (<code>.AND.</code>, <code>.OR.</code, and <code>.NOT.</code>) are the same in F90 as in F77. This is also true of the boolean values (<code>.TRUE.</code> and <code>.FALSE.</code>).


<blockquote><pre>IF ( X == 0 ) CALL DOLOOP( X, Y, Z, A, B, C, 1, 2, 3 )</pre></blockquote>
==== Comment characters ====


<p align="justify">This makes the code much more readable,
Use the F90 <code>!</code> comment character. This replaces the F77 comment character (a <code>C</code> placed in the first column of fixed-format code). You can can create very legible comments by lining up the <code>!</code> with your source code.


which prevents bugs.. If  you can't read what you've written, you will never find your mistake! Don't be afraid to continue the statement to the next line if necessary.</p>
==== Continuation characters ====


<p align="justify">In the above example, we also have used the new F90 equality test
To continue an instruction from one line to the next, place an ampersand (<code>&</code>) at the end of the line (for free-format layout).


  operator <tt>==</tt> (see
If you are working with a source code file that has not yet been converted to free-format, place an ampersand in column 6 of each continued line of code.


  below for more information). </p>
==== Array assignment statements ====


Use the F90 array assignment functionality  to assign a value to all elements of an array without using array indices. For example:


INTEGER  :: ARRAY(10,10)
REAL(fp) :: ARRAY_F(10,10)
ARRAY(:,:)  = 0
ARRAY_F(:,:) = 0.0_fp


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
You can also leave off the default array mask <code>(:,:),</code> so
 
<p align="justify"><span class="size16bi">A7.2.6 </span>
 
Always line up  quantities in assignment statements. Instead of:</p>
 
<blockquote>
 
<pre>A=1
 
THISLOOP=2
 
C=3
 
D=THISLOOP*C - A</pre>
 
</blockquote>
 
<p align="justify">add white space so that all of the equals signs fall in the same column.</p>
 
<blockquote>
 
<pre>A        = 1
 
THISLOOP = 2
 
C        = 3
 
D        = ( THISLOOP * C ) - A</pre>
 
</blockquote>
 
<p align="justify">Your eyes can make more sense out of text that is lined up into vertical columns.</p>
 
<p align="justify">Group multiple terms of an equation with parentheses (as we have done for the D equation above). The  compiler evaluates evaluate expressions in the order the parentheses are placed, from innermost to outermost.</p>
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.2.7</span>
 
Do not leave white space between array indices: </p>
 
<blockquote><pre>A = X(I,J,L)</pre></blockquote>
 
<p align="justify">Leave  white space between arguments of functions and WRITE statements:</p>
 
<blockquote>
 
<pre>A = MYFUNCTION( I, J, L )
 
WRITE( 6, '(a)' ) A</pre>
 
</blockquote>
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.2.8 </span>
 
Line up arguments in
 
subroutine or function calls. If there are an even
 
number of arguments, break them up symmetrically:</p>
 
<blockquote>
 
<pre> CALL MYSUB( THIS_A, THIS_B, THIS_C,
 
&            THIS_D, THIS_E, THIS_F )
 
</pre>
 
</blockquote>
 
<p align="justify">or</p>
 
<blockquote>
 
<pre> X = MYFUNCTION( THIS_A, THIS_B, THIS_C,
 
&amp;                THIS_D, THIS_E, THIS_F )</pre>
 
</blockquote>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p class="size20bi"><a name="DataTypes" id="DataTypes"></a>A7.3
 
Numeric and character data types</p>
 
<p align="justify"><span class="size16bi">A7.3.1 </span>
 
GEOS-Chem uses the following data types:</p>
 
<blockquote>
 
<pre>LOGICAL            --> TRUE or FALSE switch
 
INTEGER            --> 32 byte integer value
 
REAL*4            --> 32 byte floating-point value
 
REAL*8            --> 64 byte floating-point value
 
CHARACTER(LEN=255) --> string w/ 255 characters (the max limit)</pre>
 
</blockquote>
 
<p>where</p>
 
<blockquote>
 
<pre>INTEGER can express numbers from -2e9  to +2e9
 
REAL*4  can express numbers from -1e38  to +1e38
 
REAL*8  can express numbers from -1e312 to +1e312</pre>
 
</blockquote>
 
<p align="justify">Use <tt>REAL*8</tt> as your default floating-point type. This will prevent round off and precision truncation errors. </p>
 
<p align="justify">However, you should use  <tt>REAL*4</tt> data type instead of <tt>REAL*8</tt> in the following instances:</p>
 
<ul class="disc">
 
<li>
 
<p align="justify"><span class="size16bi">Diagnostic arrays:</span> GEOS-Chem
 
diagnostic values rarely exceed 10<sup>38</sup>, since they
 
are usually in units such as kg, v/v, ppbv, or molecules/cm<sup>2</sup>/s.</p>
 
</li>
 
<li>
 
<p align="justify"><span class="size16bi">Arrays and scalars required for binary or netCDF file I/O:</span> Declaring these types of variables <tt>REAL*4</tt> will help to keep file sizes as small as possible, thus saving disk space.</p>
 
</li>
 
</ul>
 
<p align="justify">Some third-party routines used by GEOS-Chem (such as <tt>TPCORE</tt>) use the <tt>REAL</tt> datatype. Most compilers (but not all) interpret  <tt>REAL</tt> as <tt>REAL*4</tt>.  If you want to force the compiler to interpret <tt>REAL</tt> as <tt>REAL*8</tt>, you must use a special compiler switch (usually <tt>-r8,</tt> but check your compiler manual to be sure). The GEOS-Chem Makefiles already take care of this for you..</p>
 
<p class="size16bi" align="justify">NOTE: See the <a href="http://wiki.geos-chem.org/Floating_point_math_issues" target="_blank">GEOS-Chem wiki page on floating point math
 
issues</a> for more information. </p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.3.2 
 
</span>Use <tt>D</tt> (double-precision) exponents when assigning a value to a <tt>REAL*8</tt> variable:</p>
 
<blockquote>
 
  <pre>REAL*8 :: PI
 
PI = 3.14159265358979323d0</pre>
 
</blockquote>
 
<p align="justify">instead of this syntax:</p>
 
<blockquote>
 
<pre>REAL*8 :: PI
 
PI = 3.14159265359e0</pre>
 
</blockquote>


<p align="justify">In F90, the <tt>E</tt> (single-precision) exponent only yields about 7 decimal places of precision, whereas the <tt>D</tt> exponent yields 15 or 16 decimal places of precision. Using <tt>D</tt> exponents prevents roundoff errors.</p>
ARRAY  = 0
  ARRAY_F = 0.0_fp


is also acceptable.


==== PARAMETER declaration statements ====


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
Initialize <code>PARAMETER</code> constants on the same line where they are declared:


<p align="justify"><span class="size16bi">A7.3.3</span>
REAL(fp), PARAMETER :: PI  = 3.14159265358979323_fp
REAL(fp), PARAMETER :: AVO = 6.022140857e+23_fp


Declare strings with 255 characters, even if you don't know
==== Array constructors ====


a priori how long the string will be. Use the <tt>TRIM</tt> statement to strip excess white space. For example:</p>
Use F90 direct assignment statements to assign values to arrays:


<blockquote>
REAL*8 :: A(2) = (/ 1.0d0, 2.0d0 /)


<pre>CHARACTER(LEN=255) :: STR
Avoid using obsolete F77 syntax:


REAL*8 A(2)
DATA A / 1.0, 2.0 /


CAVEAT! In the above example the <code>A</code> array will be automatically given the <code>SAVE</code> attribute.  This means that its values will be stored from one subroutine call to the next.  This may not be what you had intended to do, so just be aware of this.


STR = 'I am the very model of a modern major general...'
==== The SAVE attribute ====


Include the SAVE attribute on the same line where the variable is declared:


LOGICAL, SAVE :: FIRSTTIME = .TRUE.


WRITE( 6, '(a)' ) TRIM( STR )</pre>
Avoid using obsolete F77 syntax:


</blockquote>
LOGICAL FIRSTTIME
SAVE FIRSTTIME
DATA FIRSTTIME / .TRUE. /


Note: <code>SAVE</code>d variables within a subroutine or function keep their values from one call to the next. This allows you to do some initialization only on the first call to a subroutine, for example:


 
  LOGICAL, SAVE :: FIRSTTIME = .TRUE.
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p class="size20bi"><a name="Conversions" id="Conversions"></a>A7.4
 
Converting from number to character (and vice versa)</p>
 
<p align="justify"><span class="size16bi">A7.4.1</span>
 
Several GEOS-Chem
 
routines convert a numeric variable to a character variable (and vice-versa). This usually happens when a number representing a date or time has to be incorporated into a file path. </p>
 
<p align="justify">Use a Fortran internal write statement to  convert from  from <tt>INTEGER</tt> to <tt>CHARACTER</tt>. This looks like a regular <tt>WRITE</tt> statement, only that the unit number is replaced by a character variable. Think of it as "writing" your number into a character variable instead of to a file.</p>
 
<p align="justify">For example, the following code creates a string containing the date 20110701:</p>
 
<blockquote>
 
  <pre>! Declare variables
 
CHARACTER(LEN=8) :: DATE_STR
 
INTEGER          :: DATE = 20110701
 
 
 
! FORTRAN internal write -- converts number to string
 
WRITE( DATE_STR, '(i8.8)' ) DATE</pre>
 
</blockquote>
 
<p align="justify">You can incorporate DATE_STR into a directory or file path. The format <tt>'(i8.8)'</tt> tells the <tt>WRITE</tt> command to make the string 8 characters long, and to change any spaces (if they exist) into preceding zeroes.</p>
 
<p align="justify">Caveat: Some versions of the PGI compiler did not allow internal WRITE statements. You can use the <tt>ENCODE</tt> function instead:</p>
 
<blockquote>
 
  <pre>! Declare variables
 
CHARACTER(LEN=8) :: DATE_STR
 
INTEGER :: DATE = 20010701
 
 
 
! Writes value from DATE into DATE_STR
 
ENCODE( 8, '(i8.8)', DATE_STR ) DATE</pre>
 
</blockquote>
 
<p align="justify">You must specify the length of the character variable (in this case, 8), the
 
format string, and the character variable when calling <tt>ENCODE</tt>.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.4.2</span> Use a Fortran internal read to extract numbers from a character string. This is the reverse of the Fortran internal write described above:</p>
 
<blockquote>
 
  <pre class="courier16" align="left">! Declare variables
 
CHARACTER(LEN=8) :: DATE_STR = '20010101'
 
INTEGER :: DATE
 
 
 
! FORTRAN internal read -- converts string to number
 
READ( DATE_STR, '(i8.8)' ) DATE</pre>
 
</blockquote>
 
<p align="justify">Caveat: Some versions of the PGI compiler did not allow Fortran internal reads. You can use the <tt>DECODE</tt> function instead:</p>
 
<blockquote>
 
<pre>! Declare variables
 
CHARACTER(LEN=8) :: DATE_STR = '20010101'
 
INTEGER :: DATE
 
 
 
! Reads string value from DATE_STR into DATE
 
DECODE( 8, '(i8.8)', DATE_STR ) DATE</pre>
 
</blockquote>
 
<p align="justify">As with <tt>ENCODE</tt>, you must specify the length of the character variable
 
(in this case, 8), the format string, and the character variable in the call to <tt>DECODE</tt>.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.4.3</span> When writing GEOS-Chem code, you may have to use both the internal read / internal write and <tt>ENCODE</tt> / <tt>DECODE</tt> methods simultaneously,  separated by the appropriate <a href="chapter_3.html#CPP">C-preprocessor</a> compiler flags.</p>
 
<blockquote>
 
<pre>! Declare variables
 
CHARACTER(LEN=8) :: DATE_STR
 
INTEGER :: DATE = 20010701
 
   
   
 
  IF ( FIRSTTIME ) THEN
#if defined( LINUX_PGI )
 
  ! Write numeric value from DATE into DATE_STR for PGI Linux
 
  ENCODE( 8, '(i8)', DATE_STR ) DATE
 
#else
 
  ! FORTRAN Internal Write for other platforms
 
  WRITE( DATE_STR, '(i8)' ) DATE
 
#endif</pre>
 
</blockquote>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p class="size20bi"><a name="NewFeatures" id="NewFeatures"></a>A7.5 New language features of Fortran 90</p>
 
<p align="justify"><span class="size16bi">A7.5.1</span>
 
Use the new F90-style operators instead of the older F77-style operators:</p>
 
<blockquote>
 
<pre>== instead of .EQ.
 
/= instead of .NE.
 
> instead of .GT.
 
>= instead of .GE.
 
<  instead of .LT.
 
>= instead of .LE.
 
</pre>
 
</blockquote>
 
<p align="justify">The new operators take up only one or two spaces (instead of four spaces) and are easier to read.</p>
 
<p align="justify">Some operators (<tt>.AND.</tt>, <tt>.OR.</tt>, and <tt>.NOT.</tt>) are the same in F90 as in F77. This is also true of the boolean values (<tt>.TRUE.</tt> and <tt>.FALSE.</tt>).</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.5.2</span>
 
Use the F90 <tt>!</tt> comment character. This replaces the F77 comment character (a <tt>C</tt> placed in the first column of fixed-format code). You can can create very legible comments by lining up the <tt>!</tt> with your source code.</p>
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.5.3</span> To continue an instruction from one line to the next, place an ampersand (<tt>&amp;</tt>) at the end of the line (for free-format layout).</p>
 
<p align="justify">If you are working with a source code file that has not yet been converted to free-format, place an ampersand in column 6 of each continued line of code.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.5.4</span>
 
Use the F90 array assignment functionality  to assign a value to all elements of an array without using array indices. For example:</p>
 
<blockquote>
 
<pre>INTEGER :: ARRAY(10,10)
 
ARRAY(:,:) = 0</pre>
 
</blockquote>
 
<p align="justify">You can also leave off the default array mask <tt>(:,:),</tt> so</p>
 
<blockquote><pre>ARRAY = 0</pre></blockquote>
 
<p align="justify">is also acceptable. </p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.5.5</span>  Initialize <tt>PARAMETER</tt> constants on the same line where they are declared:</p>
 
<blockquote><pre>REAL*8, PARAMETER :: PI = 3.14159265358979323d0</pre></blockquote>
 
<p align="justify">Avoid using obsolete F77 syntax:</p>
 
<blockquote>
 
<pre>REAL*8 PI
 
PARAMETER( PI = 3.14159265358979323 )</pre>
 
</blockquote>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p><span class="size16bi">A7.5.6</span> Use F90 direct assignment statements  to assign values to arrays:</p>
 
<blockquote><pre>REAL*8 :: A(2) = (/ 1.0d0, 2.0d0 /)</pre></blockquote>
 
<p>Avoid using obsolete F77 syntax:</p>
 
<blockquote>
 
<pre>REAL*8 A(2)
 
DATA A / 1.0, 2.0 /</pre>
 
</blockquote>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.5.7
 
</span> Include the SAVE attribute on the same line where the variable is declared:</p>
 
<blockquote><pre>LOGICAL, SAVE :: FIRSTTIME = .TRUE.</pre></blockquote>
 
<p align="justify">Avoid using obsolete F77 syntax:</p>
 
<blockquote><pre>LOGICAL FIRSTTIME
 
SAVE FIRSTTIME
 
DATA FIRSTTIME / .TRUE. /</pre>
 
</blockquote>
 
Note: <tt>SAVE</tt>d variables within a subroutine or function keep their values from one call to the next. This allows you to do some initialization only on the first call to a subroutine, for example:
 
<blockquote>
 
<pre>LOGICAL, SAVE :: FIRSTTIME = .TRUE.
 
 
 
IF ( FIRSTTIME ) THEN
 
   PRINT*, 'This is the first time this routine is called!'
   PRINT*, 'This is the first time this routine is called!'
   FIRSTTIME = .FALSE.
   FIRSTTIME = .FALSE.
ENDIF


ENDIF</pre>
Since the value of <code>FIRSTTIME</code> is saved between calls, the sentence above will only be printed out on the first call to the subroutine.
 
</blockquote>
 
<p align="justify">Since the value of <tt>FIRSTTIME</tt> is saved between calls, the sentence above will only be printed out on the first call to the subroutine.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>


<p align="justify"><span class="size16bi">A7.5.8 </span> Include the <tt>EXTERNAL</tt> attribute on the same line where a function's type is declared. Use this syntax:</p>
==== The EXTERNAL attribute ====


<blockquote><pre>INTEGER, EXTERNAL :: MYFUNC</pre></blockquote>
Include the <code>EXTERNAL</code> attribute on the same line where a function's type is declared. Use this syntax:


<p align="justify">Avoid using obsolete F77 syntax:</p>
INTEGER, EXTERNAL :: MYFUNC


<blockquote>
Avoid using obsolete F77 syntax:


<pre>INTEGER MYFUNC
INTEGER MYFUNC
EXTERNAL MYFUNC


EXTERNAL MYFUNC</pre>
NOTE: If you are referencing functions contained within a module file, then you do not need to declare it with <code>EXTERNAL</code>. The <code>EXTERNAL</code> statement is a hangover from F77.


</blockquote>
==== The SELECT CASE statement ====


<p align="justify">NOTE: If you are referencing functions contained within a module file, then you do not need to declare it with <tt>EXTERNAL</tt>. The <tt>EXTERNAL</tt> statement is a hangover from F77.</p>
Use the F90 <code>SELECT CASE</code> statement to pick from a list of options. In F77 you would have had to write:
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.5.9 </span>
 
Use the F90 <tt>SELECT CASE</tt> statement to pick from a list of options. In F77 you
 
would have had to write:</p>
 
<blockquote>
 
  <pre>IF ( X .EQ. 1 ) THEN


IF ( X .EQ. 1 ) THEN
 
   CALL MYSUB( X1 )
   CALL MYSUB( X1 )
 
ELSE IF ( X .EQ. 2 ) THEN  
ELSE IF ( X .EQ. 2 ) THEN  
 
   CALL MYSUB( X2 )
   CALL MYSUB( X2 )
 
ELSE
ELSE
 
   CALL MYSUB( X0 )
   CALL MYSUB( X0 )
ENDIF


ENDIF</pre>
but with F90, you can write this instead:


</blockquote>
SELECT CASE ( X )
    CASE( 1 )
      CALL MYSUB( X1 )
    CASE( 2 )
      CALL MYSUB( X2 )
    CASE DEFAULT
      CALL MYSUB( X0 )
END SELECT


<p align="justify">but with F90, you can write this instead:</p>
<code>SELECT CASE</code> also works with character constants:


<blockquote>
SELECT CASE ( TRIM( TRACERNAME ) )
    CASE( 'NOx' )
      CALL MYSUB( 1 )
    CASE( 'Ox' )
      CALL MYSUB( 2 )
    CASE DEFAULT
      CALL MYSUB( 3 )
END SELECT


<pre>SELECT CASE ( X )
A note of warning: You cannot specify <code>CASE( X )</code> where <code>X</code> is a variable, or else the compiler will balk. Then you  must resort to the <code>IF - ELSE IF</code> block structure as shown above.


  CASE( 1 )
==== Logarithms ====


      CALL MYSUB( X1 )
Use the F90 intrinsic functions <code>LOG( X )</code> and <code>LOG10( X )</code> to take the natural and common logarithms of <code>X</code>.  Here <code>X</code> may be declared either as <code>REAL(f4)</code> or <code>REAL(f8)</code>.


  CASE( 2 )  
Avoid using the F77 intrinsic functions <code>ALOG( X )</code> and <code>ALOG10( X )</code>.  Some compilers do not support these functions.


      CALL MYSUB( X2 )
==== Keyword arguments ====


  CASE DEFAULT
Use F90's keyword argument capability to clarify subroutine calls. You can replace this:


      CALL MYSUB( X0 )
  CALL READ_A1( NYMD,    NHMS,
&              ALBEDO,  CLDTOT,  EFLUX,    EVAP,   
&              ... etc ...                        )


END SELECT</pre>
with:


</blockquote>
  CALL READ_A1( NYMD    = NYMD,
&              NHMS    = NHMS,
&              ALBEDO  = ALBD,
&              CLDTOT  = CLDTOT
&              EFLUX    = EFLUX
&              EVAP    = EVAP
&            ... etc ...      )


<p align="justify"><tt>SELECT CASE</tt> also works with character constants:</p>
In the call above, you see several  pairs of variable names separated by an equals sign: <code>ALBEDO  = ALBD</code>


<blockquote>
The name on the left of the equals sign is the name of the argument as defined in the subroutine (i.e. <code>ALBEDO</code>).  This is often called the "dummy argument" because it is just a name for the memory that is getting passed to it from outside of the subroutine.


<pre>SELECT CASE ( TRIM( TRACERNAME ) )
The name on the right of the equals sign is the name of the variable that we are passing down to the subroutine.  Here we are using the <code>ALBD</code> variable (from GEOS-Chem module <code>dao_mod.F</code> and passing that down to <code>READ_A1</code> as the <code>ALBEDO</code> argument.


  CASE( 'NOx' )
                      THIS_D, THIS_E, THIS_F )


      CALL MYSUB( 1 )
=== Numeric and character data types ===


  CASE( 'Ox' )
==== Data types ====


      CALL MYSUB( 2 )
GEOS-Chem uses the following basic data types:


  CASE DEFAULT
{| border=1 cellspacing=0 cellpadding=5
|-valign="top" bgcolor="#CCCCCC"
!width="300px"|Data type
!width="150px"|Fortran type
!width="150px"|Flexible precision definintion
!width="200px"|Range


      CALL MYSUB( 3 )
|-valign="top"
|TRUE or FALSE values ||<code>LOGICAL</code> || ||


END SELECT</pre>
|-valign="top"
|Whole numbers ||<code>INTEGER</code> || ||-2 x 10<sup>9</sup> up to +2 x 10<sup>9</sup>


</blockquote>
|-valign="top"
|4-byte (aka 32-bit) floating point real values || <code>REAL*4</code> || <code>REAL(f4)</code> || -1 x 10<sup>38</sup> up to +1 x 10<sup>38</sup>


<p align="justify">A note of warning: You cannot specify <tt>CASE( X )</tt> where <tt>X</tt> is a variable, or else the compiler will balk. Then you  must resort to the <tt>IF - ELSE IF</tt> block structure as shown above.</p>
|-valign="top"
|8-byte (aka 64-bit) floating point real values || <code>REAL*8</code> || <code>REAL(f8)</code> || -1 x 10<sup>312</sup> up to +1 x 10<sup>312</sup>


|-valign="top"
|Character strings || <code>CHARACTER(LEN=255)</code> || ||


|}


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
However, you should use 4-byte floating point instead of 8-byte floating point in the following instances:


<p align="justify"><span class="size16bi">A7.5.10 </span> Use the F90 intrinsic functions <tt>LOG( X )</tt> and <tt>LOG10( X )</tt> to take the natural and common logarithms of <tt>X</tt>. Here <tt>X</tt> may be declared either as <tt>REAL*4</tt> or <tt>REAL*8</tt>.</p>
# Diagnostic arrays: GEOS-Chem diagnostic values rarely exceed 10<sup>38</sup>, since they are usually in units such as kg, v/v, ppbv, or molecules/cm<sup>2</sup>/s.
#Arrays and scalars required for binary or netCDF file I/O: Declaring these types of variables <code>REAL*4</code> will help to keep file sizes as small as possible, thus saving disk space.


<p align="justify">Avoid using the  F77 intrinsic functions <tt>ALOG( X )</tt> and <tt>ALOG10( X )</tt>.  Some compilers do not support these functions.</p>
Some third-party routines used by GEOS-Chem (such as <code>TPCORE</code>) use the <code>REAL</code> datatype. Most compilers (but not all) interpret <code>REAL</code> as a 4-byte floating point realIf you want to force the compiler to interpret <code>REAL</code> as an 8-byte floating point real, you must use a special compiler switch (usually <code>-r8,</code> but check your compiler manual to be sure). The GEOS-Chem Makefiles already take care of this for you.


NOTE: See [[Floating point math issues|our ''Floating point math issues'' wiki page]] for more information.


==== Flexible precision ====


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
[[GEOS-Chem v10-01]] and later versions use a [[Flexible precision in GEOS-Chem|flexible precision definition]]. Floating point variables can be defined with a [http://fortranwiki.org/fortran/show/Real+precision Fortran KIND parameter] that can be switched between 4-byte and 8-bytes at compile time.


<p align="justify"><span class="size16bi">A7.5.11 </span> Use F90's keyword argument capability to clarify subroutine calls. You can replace  this:</p>
Here is an example of how you can use flexible precision in GEOS-Chem:


USE PRECISION_MOD          ! Contains the fp, f4, f8 parameters used to define real values
 
REAL*4  :: VAR_REAL4      ! This was the old way to define a  4-byte real (from Fortran-77)
REAL*8  :: VAR REAL8      ! This was the old way to define an 8-byte real (from Fortran-77)
REAL(f4) :: VAR_REAL4_NEW  ! This is the new way to define a  4-byte real (introduced in Fortran 90)
REAL(f8) :: VAR_REAL8_NEW  ! This is the new way to define an 8-byte real (introduced in Fortran 90)
REAL(fp) :: VAR_FLEXIBLE  ! This is an 8-byte real by default, but can be switched to a 4-byte real at compile time


Declaring floating point real variables with <code>REAL(fp)</code> allows you to change from the default 8-byte precision to 4-byte precision by compiling GEOS-Chem with the <code>PRECISION=4</code> switch.  (NOTE: [[#Data types|As described in the prior section]], some floating point variables should always be declared as 4-byte reals, such as those variables that get written to disk.)


<blockquote>
We have not yet made full use of the flexible precision in GEOS-Chem as of [[GEOS-Chem v11-01]].  This will happen in a later version.


<pre> SUBROUTINE READ_A1( NYMD,    NHMS,
==== Use the Fortran-90 exponent syntax ====


&                    ALBEDO,  CLDTOT,  EFLUX,    EVAP,   
Use <code>_fp</code> (double-precision) suffixes when assigning a constant value to a <code>REAL(fp)</code> variable:


&                    ... etc ...                        )</pre>
REAL(fp) :: PI, ONE_THOUSAND
PI          = 3.14159265358979323_fp
ONE_THOUSAND = 1.0e+3_fp


</blockquote>
This will ensure that the value will be declared with sufficient precision.


<p align="justify">with this:</p>
Similarly, if you have used <code>REAL(f4)</code> to declare 4-byte floating point variables, then use this syntax: 


<blockqute>
REAL(f4) :: PI, ONE_THOUSAND
PI          = 3.14159265358979323_f4
ONE_THOUSAND = 1.0e+3_f4


<pre>     CALL READ_A1( NYMD    = NYMD,
and similarly for <code>REAL(f8)</code>:


    &              NHMS    = NHMS,
REAL(f9) :: PI, ONE_THOUSAND
PI          = 3.14159265358979323_f8
ONE_THOUSAND = 1.0e+3_f9


    &              ALBEDO  = ALBD,
==== Default string length ====


    &              CLDTOT  = CLDTOT
Declare strings with 255 characters, even if you don't know a priori how long the string will be. Use the <code>TRIM</code> statement to strip excess white space. For example:


    &amp;              EFLUX    = EFLUX
CHARACTER(LEN=255) :: STR
STR = 'I am the very model of a modern major general...'
WRITE( 6, '(a)' ) TRIM( STR )


    &amp;              EVAP    = EVAP
=== Converting from number to character (and vice-versa) ===


                    ... etc ...      )</pre>
==== Internal write statements ====


</blockquote>
Several GEOS-Chem routines convert a numeric variable to a character variable (and vice-versa). This usually happens when a number representing a date or time has to be incorporated into a file path.


<p align="justify">In the call above, you see several  pairs of variable names separated by an equals sign:<blockquote><pre>ALBEDO  = ALBD,</pre></blockquote>
Use a Fortran internal write statement to  convert from  from <code>INTEGER</code> to <code>CHARACTER</code>. This looks like a  regular <code>WRITE</code> statement, only that the unit number is replaced by a character variable. Think of it as "writing" your number into a character variable instead of to a file.


<p align="justify">The name on the left of the equals sign is the name of the argument as defined in the subroutine (i.e. <tt>ALBEDO</tt>).  This is often called the "dummy argument" because it is just a name for the memory that is getting passed to it from outside of the subroutine.</p> 
For example, the following code creates a string containing the date 20110701:


<p align="justify">The name on the right of the equals sign is the name of the variable that we are passing down to the subroutine. Here we are using the <tt>ALBD</tt> variable (from GEOS-Chem module <tt>dao_mod.F</tt> and passing that down to READ_A1 as the <tt>ALBEDO</tt> argument.
! Declare variables
CHARACTER(LEN=8) :: DATE_STR
INTEGER          :: DATE = 20110701
! FORTRAN internal write -- converts number to string
  WRITE( DATE_STR, '(i8.8)' ) DATE


You can incorporate <code>DATE_STR</code> into a directory or file path. The format <code>'(i8.8)'</code> tells the <code>WRITE</code> command to make the string 8 characters long, and to change any spaces (if they exist) into preceding zeroes.


==== Internal read statements ====


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
Use a Fortran internal read to extract numbers from a character string. This is the reverse of the Fortran internal write described above:


<p class="size20bi"><a name="MathOpt" id="MathOpt"></a>A7.6
! Declare variables
CHARACTER(LEN=8) :: DATE_STR = '20010101'
INTEGER :: DATE
! FORTRAN internal read -- converts string to number
READ( DATE_STR, '(i8.8)' ) DATE


Math optimizations</p>
=== Math optimizations ===


<p align="justify"><span class="size16bi">A7.6.1 </span> Logarithms and exponentiation are the most computationally expensive mathematical functions. Avoid using the traditional formulation:</p>
==== Write polynomials efficiently ====


<blockquote><pre> Y = A0 + A1*X + A2*X**2 + A3*X**3 + A4*X**4 + A5*X**5 </pre></blockquote>
Logarithms and exponentiation are the most computationally expensive mathematical functions. Avoid using the traditional formulation:


<p align="justify">but instead use parentheses to group the polynomial terms.</p>
Y = A0 + A1*X + A2*X**2 + A3*X**3 + A4*X**4 + A5*X**5


<blockquote><pre > Y = A0 + X* ( A1 + X* ( A2 + X* ( A3 + X* ( A4 + X* ( A5 )))))</pre></blockquote>
but instead use parentheses to group the polynomial terms.


<p align="justify">This modified polynomial expression replaces the costly exponentiations with more efficient multiplication operations. A Nth order polynomial will
Y = A0 + X* ( A1 + X* ( A2 + X* ( A3 + X* ( A4 + X* ( A5 )))))


now only require N multiplications.</p>
This modified polynomial expression replaces the costly exponentiations with more efficient multiplication operations. A Nth order polynomial will now only require N multiplications.


==== Exercise care with exponential approximations ====


Although it is tempting to replace <code>EXP( -x )</code> with the first-order approximation <code>1 - x,</code> this can result in negative values for certain values of <code>x</code>. This can cause negative tracer concentrations.


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
=== DO loops ===
 
<p align="justify"><span class="size16bi">A7.6.2</span>
 
Although it is tempting to replace <tt>EXP( -x )</tt> with the first-order approximation <tt>1 - x,</tt> this can result in negative values for certain values of <tt>x</tt>. This can cause negative tracer concentrations.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p class="size20bi"><a name="DoLoops" id="DoLoops"></a>A7.7
 
DO loops</p>
 
<p align="justify"><span class="size16bi">A7.7.1 </span> Always use the CYCLE statement to skip to the next iteration in a DO-loop. Use this syntax:</p>
 
<blockquote>
 
<pre>DO I = 1, 1000


==== The CYCLE statement ====


Always use the <code>CYCLE</code> statement to skip to the next iteration in a <code>DO</code> loop. Use this syntax:


DO I = 1, 1000
   ! Skip to next iteration
   ! Skip to next iteration
   IF ( X(I) == 0 ) CYCLE
   IF ( X(I) == 0 ) CYCLE
   
   
   ! Print
   ! Print
   PRINT*, X(I)
   PRINT*, X(I)
ENDDO


ENDDO</pre>
instead of this obsolete F77 syntax:
 
</blockquote>
 
 
 
<p align="justify">instead of this obsolete F77 syntax:</p>
 
<blockquote>
 
<pre>DO 100 I = 1, 1000
 
  IF ( X(I) .EQ. 0 ) GOTO 100
 
  PRINT*, X(I)
 
100 CONTINUE</pre>
 
</blockquote>
 
<p align="justify">Notice that the F90 <tt>ENDDO</tt> statement replaces the labeled <tt>CONTINUE</tt> statement.  This results in cleaner code.</p>


DO 100 I = 1, 1000
    IF ( X(I) .EQ. 0 ) GOTO 100
    PRINT*, X(I)
100 CONTINUE


Notice that the F90 <code>ENDDO</code> statement replaces the labeled <code>CONTINUE</code> statement.  This results in cleaner code.


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
==== The EXIT statement ====


<p align="justify"><span class="size16bi">A7.7.2 </span>
Always use the <code>EXIT</code> statement to completely exit from a <code>DO</code> loop. Use this syntax:
 
Always use the EXIT statement to completely exit from a DO loop. Use this syntax:</p>
 
<blockquote>
 
<pre>DO I = 1, 1000


DO I = 1, 1000
   
   
   ! Break out of this loop if X==0!
   ! Break out of this loop if X==0!
   IF ( X(I) == 0 ) EXIT
   IF ( X(I) == 0 ) EXIT
   
   
   ! Print info if we have not exited
   ! Print info if we have not exited
  <nowiki>WRITE( 6, '(''Iteration: '', i5, f13.6)' ) I, X(I)</nowiki>
ENDDO
PRINT*, 'outside loop'</pre>


  WRITE( 6, '(''Iteration: '', i5, f13.6)' ) I, X(I)
instead of this obsolete F77 syntax, which requires a <code>GOTO</code> statement:
 
ENDDO
 
 
 
PRINT*, 'outside loop'</pre>
 
</blockquote>
 
<p align="justify">instead of this obsolete F77 syntax:</p>
 
<blockquote>
 
<pre>C Do-loop
 
DO 100 I = 1, 1000
 
  IF ( X(I) .EQ. 0 ) GOTO 200
 
  PRINT*, 'inside loop'
 
100 CONTINUE
 
 
 
C Continue outside loop
 
200 CONTINUE
 
PRINT*, 'outside loop'</pre>
 
</blockquote>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.7.3
 
</span>Structure your DO loops so that they access memory in the most optimal manner. Fortran is a
 
column-major language, which means that arrays are stored in memory by columns
 
first, then by rows. If you have declared an array such as:</p>
 
<blockquote>
 
  <pre>INTEGER :: I, J, L, N
 
REAL*8  :: ARRAY(IIPAR,JJPAR,LLPAR,NNPAR)</pre>
 
</blockquote>
 
<p align="justify">then for optimal efficiency, the leftmost dimension (<tt>I</tt>) needs to vary the fastest, and needs to be accessed by the innermost DO-loop.  Then the next leftmost dimension (<tt>J</tt>) should be accessed by the next innermost DO-loop, and so on. </p>
 
<p align="justify">Therefore, the proper way to loop over this array is:</p>
 
<blockquote>
 
<pre>DO N = 1, NNPAR
 
DO L = 1, LLPAR
 
DO J = 1, JJPAR
 
DO I = 1, IIPAR
 
  ARRAY(I,J,L,N) = ARRAY(I,J,L,N) * 2.0d0
 
ENDDO
 
ENDDO
 
ENDDO
 
ENDDO </pre>
 
</blockquote>
 
<p align="justify">Note that the <tt>I</tt> index is varying most often, since it is the innermost DO-loop, then <tt>J</tt>, <tt>L</tt>, and <tt>N</tt>. This is opposite to how a car's odometer reads.</p>
 
<p align="justify">If you loop through an array in this fashion, with leftmost indicies varying fastest, then the code minimizes the number of times it has to load subsections of the array into cache memory. In this optimal manner of execution, all of the array elements sitting in the cache memory are read in the proper order before the next array subsection needs to be loaded into the cache. But if you step through array elements in the wrong order, the number of cache loads is proportionally increased. Because it takes a finite amount of time to reload array elements into cache memory, the more times you have to access the cache, the longer it will take the code to execute. This can slow down the code dramatically.</p>
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.7.4</span>
 
Use &quot;infinite&quot; DO loops in situations where you need to iterate for an unknown number of iterations before some exit criterion is reached. This is most often used to read data from a file whose length you do not know a priori. </p>
 
<p align="justify">Here is an example of an infinite DO loop used to read data from disk:</p>
 
<blockquote>
 
<pre>INTEGER :: I, IOS, IUNIT
 
REAL*8 :: A(10000)
 
 
 
! Open file
 
OPEN( IUNIT, FILE='myfile.txt', IOSTAT=IOS )
 
 
 
! Quit if we can't open the file
 
IF ( IOS /= 0 ) THEN
 
  PRINT*, 'Error opening the file!'
 
  STOP
 
ENDIF
 
 
 
! Infinite DO loop for reading data from "myfile.txt"
 
DO
 
 
 
  ! Read I and A(I) from the file
 
  READ( IUNIT, '(i3,f11.3)', IOSTAT=IOS ) I, A(I)


C Do-loop
DO 100 I = 1, 1000
   
   
 
    IF ( X(I) .EQ. 0 ) GOTO 200
  ! IOS < 0 is end-of-file, so we exit the loop
    PRINT*, 'inside loop'
 
  IF ( IOS < 0 ) EXIT
 
 
  ! If IOS < 0, then this is an I/O error,
 
  ! so we print an error msg and quit
 
  IF ( IOS > 0 ) THEN
 
      PRINT*, 'I/O error reading from file!
 
      STOP
 
  ENDIF
 
ENDDO
 
   
   
 
  100 CONTINUE
! Close the file after we exit the loop
 
CLOSE( IUNIT )</pre>
 
</blockquote>
 
<p align="justify">In this example, the DO loop wants to execute forever, but it is stopped by an external criterion&#8212;the end of the file. The <tt>OPEN</tt>, <tt>READ</tt>, and <tt>CLOSE</tt> functions can return the I/O status to a variable if you specify via the <tt>IOSTAT</tt> keyword. If the I/O status variable (in this case, named IOS) is negative, then this is a normal end-of-file condition, and so we can just exit the loop and close the file. However, if <tt>IOS</tt> is a nonzero, positive number, then this means that we have encountered a real error condition.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p align="justify"><span class="size16bi">A7.7.5</span>
 
Reserve the following variable names for DO-loop and array indices:</p>
 
 
 
<ul>
 
  <li>Use <tt>I</tt> for the longitude index</li>
 
  <li>Use <tt>J</tt> for the latitude index</li>
 
  <li>Use <tt>L</tt> for the altitude index</li>
 
  <li>Use <tt>N</tt> for the quantity index (tracer, species, etc.)</pre>
 
  </li>
 
</ul>
 
<p align="justify">Therefore, you can assume that any DO-loop that uses <tt>I</tt>, <tt>J</tt>, and <tt>L</tt> is looping over grid box longitude, latitude, and altitude dimensions, and that any DO-loop which uses <tt>N</tt>  is looping over  tracer, species, or some other quantity. These special indices should NOT be used to refer other quantities. In this way, if you should get an  error message such as:</p>
 
<blockquote><pre> Error at grid box (I,J) = (23,34)!</pre></blockquote>
 
<p align="justify">you will immediately understand <tt>(I,J)</tt> are the  longitude and latitude indices
 
of the box where the error happened.</p>
 
<p align="justify">NOTE: In some 3rd-party routines, you will see <tt>K</tt> used instead of <tt>L</tt> to denote the altitude index.  We recommend leaving these as-is, so as to avoid having to rewrite entire sections of 3rd-party code.  However, you should use <tt>L</tt> for the altitude index in any new GEOS-Chem code that you write.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p class="size20bi"><a name="Modules" id="Modules"></a>A7.8 Fortran-90 modules</p>
 
<p align="justify"><span class="size16bi">A7.8.1</span>
 
Place all of your new GEOS-Chem code into  <a href="chapter_7.html#ProgUnits">Fortran 90 modules</a>, if possible. These allow you to save both
 
and routines and variables within a single program unit.</p>
 
<p align="justify">An example module is shown below.</p>
 
<blockquote>
 
  <pre class="courier14">
 
!------------------------------------------------------------------------------
 
!          Harvard University Atmospheric Chemistry Modeling Group            !
 
!------------------------------------------------------------------------------
 
!BOP
 
!
 
! !MODULE: MY\_MODULE
 
!
 
! !DESCRIPTION: \subsection*{Overview}
 
! Module MY\_MODULE contains variables and routines to do something.
 
! (bmy, 5/1/03)
 
!\\
 
!\\
 
! !INTERFACE
 
      MODULE MY_MODULE
 
!
 
! !USES:
 
!
 
      IMPLICIT NONE
 
 
 
      ! Make everything private...
 
      PRIVATE
 
!
 
! !PUBLIC MEMBER FUNCTIONS:&nbsp;
 
!
 
      PUBLIC :: DRIVER
 
      PUBLIC :: INIT_MY_MODULE
 
      PUBLIC :: CLEANUP_MY_MODULE
 
!
 
! !REVISION HISTORY:
 
!  (1 ) Bug fix in MY_SUBROUTINE (bmy, 5/1/03)
 
!EOP
 
!------------------------------------------------------------------------------
 
!
 
! !PRIVATE DATA MEMBERS:
 
!
 
      ! Argument to the SIN function
 
      REAL*8, ALLOCATABLE :: B(:)
 
 
 
      CONTAINS
 
 
 
!------------------------------------------------------------------------------
 
!          Harvard University Atmospheric Chemistry Modeling Group            !
 
!------------------------------------------------------------------------------
 
!BOP
 
!
 
! !IROUTINE: DRIVER
 
!
 
! !DESCRIPTION: \subsection*{Overview}
 
! Subroutine DRIVER is the driver routine for MY_MODULE. (bmy, 5/1/03)
 
!\\
 
!\\
 
! !INTERFACE:
 
      SUBROUTINE DRIVER
 
!
 
! !REVISION HISTORY:
 
!
 
!EOP
 
!------------------------------------------------------------------------------
 
!BOC
 
!
 
! !LOCAL VARIABLES:
 
!
 
      LOGICAL, SAVE :: FIRST = .TRUE.
 
      INTEGER, :: I
 
      REAL*8 :: X, Y1, Y2
 
      REAL*8, PARAMETER :: PI180 = 180d0 / 3.14159265358979323d0
 
   
   
C Continue outside loop
200 CONTINUE
PRINT*, 'outside loop'


      !=================================================================
==== Ordering of DO loops ====
 
      ! DRIVER begins here!
 
      !=================================================================


Structure your <code>DO</code> loops so that they access memory in the most optimal manner. Fortran is a column-major language, which means that arrays are stored in memory by columns first, then by rows. If you have declared an array such as:


      ! Initialize
INTEGER :: I, J, L, N
REAL*8  :: ARRAY(IIPAR,JJPAR,LLPAR,NNPAR)


      IF ( FIRST ) THEN
then for optimal efficiency, the leftmost dimension (<code>I</code>) needs to vary the fastest, and needs to be accessed by the innermost <code>DO</code> loop.  Then the next leftmost dimension (<code>J</code>) should be accessed by the next innermost <code>DO</code> loop, and so on.


        CALL INIT_MY_MODULE
Therefore, the proper way to loop over this array is:


      ENDIF
DO N = 1, NNPAR
DO L = 1, LLPAR
DO J = 1, JJPAR
DO I = 1, IIPAR
    ARRAY(I,J,L,N) = ARRAY(I,J,L,N) * 2.0d0
ENDDO
ENDDO
ENDDO
ENDDO


Note that the <code>I</code> index is varying most often, since it is the innermost <code>DO</code> loop, then <code>J</code>, <code>L</code>, and <code>N</code>. This is opposite to how a car's odometer reads.


      !=================================================================
If you loop through an array in this fashion, with leftmost indicies varying fastest, then the code minimizes the number of times it has to load subsections of the array into cache memory. In this optimal manner of execution, all of the array elements sitting in the cache memory are read in the proper order before the next array subsection needs to be loaded into the cache. But if you step through array elements in the wrong order, the number of cache loads is proportionally increased. Because it takes a finite amount of time to reload array elements into cache memory, the more times you have to access the cache, the longer it will take the code to execute. This can slow down the code dramatically.


      ! Loop over degrees
==== Infinite DO loop with EXIT ====


      !=================================================================
Use "infinite" <code>DO</code> loops in situations where you need to iterate for an unknown number of iterations before some exit criterion is reached. This is most often used to read data from a file whose length you do not know a priori.


      DO I = 1, 360
Here is an example of an infinite <code>DO</code> loop used to read data from disk:


INTEGER :: I, IOS, IUNIT
REAL*8 :: A(10000)
   
   
 
! Open file
        ! Convert to radians
OPEN( IUNIT, FILE='myfile.txt', IOSTAT=IOS )
 
        B = DBLE( I ) / PI180
 
   
   
 
! Quit if we can't open the file
        ! Call subroutine
IF ( IOS /= 0 ) THEN
 
    PRINT*, 'Error opening the file!'
        CALL MY_SUBROUTINE( B(I), Y1 )
    STOP
 
ENDIF
   
   
 
! Infinite DO loop for reading data from "myfile.txt"
        ! Call function
DO
 
        Y2 = MY_FUNCTION( B(I) )
 
   
   
 
    ! Read I and A(I) from the file
        ! Write output
    READ( IUNIT, '(i3,f11.3)', IOSTAT=IOS ) I, A(I)
 
 
        WRITE( 6, '(i3, 1x, 3(f13.6,1x))' ) I, B(I), Y1, Y2
    ! IOS < 0 is end-of-file, so we exit the loop
 
    IF ( IOS < 0 ) EXIT
      ENDDO  
 
    ! If IOS < 0, then this is an I/O error,
    ! so we print an error msg and quit
    IF ( IOS > 0 ) THEN
      PRINT*, 'I/O error reading from file!
      STOP
    ENDIF
ENDDO
   
   
! Close the file after we exit the loop
CLOSE( IUNIT )


      ! Return to calling program
In this example, the <code>DO</code> loop wants to execute forever, but it is stopped by an external criterion&mdash;the end of the file. The <code>OPEN</code>, <code>READ</code>, and <code>CLOSE</code> functions can return the I/O status to a variable if you specify via the <code>IOSTAT</code> keyword. If the I/O status variable (in this case, named <code>IOS</code>) is negative, then this is a normal end-of-file condition, and so we can just exit the loop and close the file. However, if <code>IOS</code> is a nonzero, positive number, then this means that we have encountered a real error condition.
 
      END SUBROUTINE DRIVER
 
!EOC
 
!-----------------------------------------------------------------------------
 
!          Harvard University Atmospheric Chemistry Modeling Group            !
 
!------------------------------------------------------------------------------
 
!BOP
 
!
 
! !IROUTINE: MY\_SUBROUTINE
 
!
 
! !DESCRIPTION: \subsection*{Overview}
 
! Function MY_SUBROUTINE returns the sine of a number. (bmy, 5/1/03)
 
!\\
 
!\\
 
! !INTERFACE:
 
 


      SUBROUTINE MY_SUBROUTINE( X, Y )
==== Reserved DO loop index variable names ====


!
Reserve the following variable names for <code>DO</code> loop and array indices:


! !INPUT PARAMETERS:
#Use <code>I</code> for the longitude index
#Use <code>J</code> for the latitude index
#Use <code>L</code> for the altitude index
#Use <code>N</code> for the quantity index (tracer, species, etc.)


!
Therefore, you can assume that any <code>DO</code> loop that uses <code>I</code>, <code>J</code>, and <code>L</code> is looping over grid box longitude, latitude, and altitude dimensions, and that any <code>DO</code> loop which uses <code>N</code>  is looping over  species or some other quantity. These special indices should NOT be used to refer other quantities. In this way, if you should get an  error message such as:


      ! Argument for the sine function
Error at grid box (I,J) = (23,34)!


      REAL*8, INTENT(IN) :: X
you will immediately understand <code>(I,J)</code> are the longitude and latitude indices of the box where the error happened.


!
NOTE: In some 3rd-party routines, you will see <code>K</code> used instead of <code>L</code> to denote the altitude index.  We recommend leaving these as-is, so as to avoid having to rewrite entire sections of 3rd-party code.  However, you should use <code>L</code> for the altitude index in any new GEOS-Chem code that you write.


! !RETURN VALUE:
== Programming techniques to promote running in HPC environments ==


!   
=== Restrict screen output to the root CPU ===


      REAL*8, INTENT(OUT) :: Y
When GEOS-Chem connects to an external GCM&mdash;such as NASA's GEOS-5 GCM&mdash;it will utilize MPI parallelization. Each CPU will perform all of GEOS-Chem's operations, but on a much smaller geographical domain (i.e. on a single vertical column or several vertical columns). This also means that each CPU will print informational messages to the screen (or log file, if you redirect screen output there) simultaneously. All of these I/O operations can seriously impact performance.


!
We have now started the process of bracketing <code>WRITE</code> and <code>PRINT</code> statements with <code>IF</code> blocks. In [[GEOS-Chem v9-01-03]] and higher versions, you will see code like this:


! !REVISION HISTORY:
  IF ( am_I_Root ) THEN
    WRITE( 6, '(a  )' ) REPEAT( '=', 79 )         
    WRITE( 6, '(a,/)' ) 'G E O S - C H E M  U S E R  I N P U T'
    WRITE( 6, 100  ) TRIM( FILENAME )
100 FORMAT( 'READ_INPUT_FILE: Reading ', a )       
  ENDIF


! (1 ) Corrected typographic error (bmy, 5/1/03)
When we use MPI parallelization, the <code>am_I_Root</code> variable, which is passed as an argument to subroutines and functions, determines if we are on the root CPU.  This will restrict the printing of informational messages to just the root CPU. We encourage you to use this approach as you write new GEOS-Chem code.


!EOP
For more information, please see [[Programming_techniques_for_HPC_environments#Restricting_screen_and_log_file_output_to_the_root_CPU|this wiki post]].


!------------------------------------------------------------------------------
=== Do not use fixed parameters for file unit numbers===


!BOC
[[GEOS-Chem v9-01-02]] and prior versions used fixed parameters (stored in module <code>GeosUtil/file_mod.F</code>) for Fortran logical unit numbers. Logical unit numbers, or LUN's, are numeric values that are used in Fortran <code>OPEN</code>, <code>WRITE</code>, <code>READ</code>, and <code>CLOSE</code> statements.  A typical GEOS-Chem file read operation looks like this:


USE FILE_MOD, ONLY : IU_FILE
   
   
 
...
      !=================================================================
 
      ! MY_SUBROUTINE begins here!
 
      !=================================================================
 
      Y = SIN( X )
 
   
   
 
! Open file
      ! Return to calling program
OPEN( UNIT=IU_FILE, FILE=TRIM( FILENAME ) ... )
 
 
      END SUBROUTINE MY_SUBROUTINE
! Read data
 
READ( IU_FILE, IOSTAT=IOS ) ...
!EOC
 
!-----------------------------------------------------------------------------
 
!          Harvard University Atmospheric Chemistry Modeling Group            !
 
!------------------------------------------------------------------------------
 
!BOP
 
!
 
! !IROUTINE: MY\_FUNCTION
 
!
 
! !DESCRIPTION: \subsection*{Overview}
 
! Function MY_FUNCTION returns the sine of a number. (bmy, 5/1/03)
 
!\\
 
!\\
 
! !INTERFACE:
 
 
 
      FUNCTION MY_FUNCTION( X ) RESULT( Y )
 
!
 
! !INPUT PARAMETERS:
 
!
 
      ! Argument for the sine function
 
      REAL*8, INTENT(IN) :: X
 
!
 
! !RETURN VALUE:
 
!   
 
      REAL*8, INTENT(OUT) :: Y
 
!
 
! !REVISION HISTORY:
 
!
 
!EOP
 
!------------------------------------------------------------------------------
 
!BOC
 
   
   
! Close file
CLOSE( IU_FILE )


      !=================================================================
But when we connect GEOS-Chem to an external GCM, we can no longer rely on pre-defined LUNs.  The GCM may have already assigned the LUN that we are trying to use to a different file.  In [[GEOS-Chem v9-01-03]] and higher versions, all LUNs are determined dynamically. We now call a function (<code>inquireMod/findFreeLUN</code>) to return the next unused LUN. You will now see code that looks like this:
 
      ! MY_FUNCTION begins here!
 
      !=================================================================
 
      Y = SIN( X )


USE inquireMod, ONLY : findFreeLUN
   
   
 
...
      ! Return to calling program
 
      END FUNCTION MY_FUNCTION
 
!EOC
 
!------------------------------------------------------------------------------
 
!!          Harvard University Atmospheric Chemistry Modeling Group            !
 
!------------------------------------------------------------------------------
 
!BOP
 
!
 
! !IROUTINE: INIT\_MY\_MODULE
 
!
 
! !DESCRIPTION: \subsection*{Overview}
 
! Subroutine INIT\_MY\_MODULE allocates and zeroes the B array. (bmy, 5/1/03)
 
!\\
 
!\\
 
! !INTERFACE:
 
      SUBROUTINE INIT_MY_MODULE
 
!
 
! !USES
 
!
 
      USE ERROR_MOD, ONLY : ALLOC_ERR
 
!
 
! !REVISION HISTORY:
 
!
 
!EOP
 
!------------------------------------------------------------------------------
 
!BOC
 
!
 
! !LOCAL VARIABLES
 
      INTEGER :: AS
 
   
   
 
INTEGER :: IU_FILE
      !=================================================================
 
      ! INIT_MY_MODULE begins here!
 
      !=================================================================
 
   
   
 
...
      ! Allocate B, check status
 
      ALLOCATE( B(360), STAT=AS )
 
   
   
 
! Find a free file LUN
      ! Error check
IU_FILE = findFreeLUN()
 
      IF ( AS /= 0 ) CALL ALLOC_ERR( 'B' )
 
   
   
 
! Open file
      ! Zero B
OPEN( UNIT=IU_FILE, FILE=TRIM( FILENAME ) ... )
 
      B(:) = 0d0
 
   
   
! Read data
READ( IU_FILE, IOSTAT=IOS ) ...
     
! Close file
CLOSE( IU_FILE )


      ! Return to calling program
For more information, please see [[Programming_techniques_for_HPC_environments#Using_findFreeLUN_to_assign_logical_unit_numbers_for_file_I.2FO|this wiki post]].


      END SUBROUTINE INIT_MY_MODULE
=== Do not refer to public variables via USE statements ===


!EOC
We used to refer to variables in modules by means of the <code>USE</code> statement, such as:


!-----------------------------------------------------------------------------
! Reference variables from dao_mod.F
USE DAO_MOD, ONLY : UWND, VWND, T


!!          Harvard University Atmospheric Chemistry Modeling Group            !
While many routines in GEOS-Chem still use this syntax, [[Derived_type_objects_used_by_GEOS-Chem#Using_derived-type_objects_to_replace_USE_statements|we are abandoning this approach]]. Instead of referring to module variables with <code>USE</code>, we shall (wherever expedient) use derived type objects to contain variables and arrays, which then can be passed from one routine to another via the argument list. We are doin[[File:g]] this to facilitate our [[GEOS-Chem HP]], which seeks to embed GEOS-Chem within the NASA GEOS-5 GCM.</p>


!------------------------------------------------------------------------------
The only things that you should obtain from modules via the USE statement will be:


!BOP
#Subroutines
#Functions
#Derived type definitions


!
such as:


! !IROUTINE: CLEANUP\_MY\_MODULE
  ! Refer to derived type definition from gigc_state_met_mod.F90
 
USE State_Met_Mod, ONLY : MetState
!
 
! Refer to subroutines from dao_mod.F
! !DESCRIPTION: \subsection*{Overview}
USE DAO_MOD, ONLY: AIRQNT, COSSZA
 
! Subroutine CLEANUP_MY_MODULE deallocates all module arrays. (bmy, 5/1/03)
 
!\\
 
!\\
 
! !INTERFACE:
 
      SUBROUTINE CLEANUP_MY_MODULE
 
!
 
! !REVISION HISTORY:
 
!
 
!EOP
 
!------------------------------------------------------------------------------
 
!BOC
 
 
 
      !=================================================================
 
      ! CLEANUP_MY_MODULE begins here!
 
      !=================================================================
 
      IF ( ALLOCATED( B ) ) DEALLOCATE( B )
 
 
 
      ! Return to calling program
 
      END SUBROUTINE CLEANUP_MY_MODULE
 
!EOC
 
END MODULE MY_MODULE</pre>
 
</blockquote>
 
 
 
<p align="justify">Fortran 90 modules written for GEOS-Chem  contain the following features:</p>
 
<ol>
 
<li>
 
  <p align="justify">Module header: This is similar to the subroutine header. Contains a list of all module variables, routines,
 
and modification notes, in <a href="http://wiki.geos-chem.org/Automatic_documentation_with_protex" target="_blank">ProTeX</a> style.</p>
 
</li>
 
<li>
 
  <p align="justify"><tt>PRIVATE</tt> declarations: You can "hide" certain routines or variables within a F90 module from other routines or modules. Ideally your module will be like a "black box" with only a few routines or variables that are publicly accessible. By convention, in GEOS-Chem, we declare everything <tt>PRIVATE</tt> first then indicate the public routines.</p>
 
</li>
 
<li>
 
  <p align="justify"><tt>IMPLICIT NONE</tt>: If you declare <tt>IMPLICIT NONE</tt> once at the top of the module, then you
 
don't have to declare it in the individual subroutines; the declaration "carries through" below.</p>
 
</li>
 
<li>
 
  <p align="justify">Module variables and arrays: Any variables declared above the <tt>CONTAINS</tt> statement are as if they were stored in F77 common blocks; that is, they are preserved between calls to the various module routines. Also, you should declare module arrays as <tt>ALLOCATABLE</tt> whenever possible. This allows you to only set aside memory for that array if a
 
certain routine is called. This results in more efficient memory management.</p>
 
</li>
 
<li>
 
  <p align="justify"><tt>CONTAINS</tt> statement: All module variables and <tt>PRIVATE</tt> declarations go above the <tt>CONTAINS</tt> statement. Module routines and functions go below the <tt>CONTAINS</tt> statement.</p>
 
</li>
 
<li>
 
  <p align="justify">Subroutines and functions: Your module will contain various subroutines and functions depending on the particular needs at hand.</p>
 
</li>
 
<li>
 
  <p align="justify">An INIT routine: Your module should have a subroutine named <tt>INIT_modulename</tt>, which initializes and allocates all module arrays.</p>
 
</li>
 
<li>
 
  <p align="justify">A CLEANUP routine: Your module should have a subroutine named <tt>CLEANUP_modulename</tt>, which deallocates module arrays. (You only need this if you have allocatable arrays).</p>
 
</li>
 
</ol>
 
<p align="justify">F90 modules are very similar to classes in Java and C++; they allow you to group data and the routines which work on the data in a single package.</p>
 
<p align="justify">Theoretically, each GEOS-Chem routine and/or variable should belong to a module. In practice, this is not true, as we do have separate subroutine and function files from some of the older routines which were written by 3rd party sources. <span class="size16bi">However, you should write all new  GEOS-Chem code  in module form</span>.</p>
 
<p align="justify">Also, it is OK for modules to reference other modules. If Module B references Module A, all that you have to do is to make sure that Module B declaration in the Makefile refers to Module A. The GEOS-Chem Makefiles have been prepared for you by the <a href="http://wiki.geos-chem.org/GEOS-Chem_Support_Team" target="_blank">GEOS-Chem Support Team</a>, so in most instances, you will not have to concern yourself with this.</p>
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
 
 
<p align="justify"><strong><em>A7.8.2</em></strong> We used to refer to variables in modules by means of the <tt>USE</tt> statement, such as:</p>
 
<blockquote>
 
  <pre>! Reference variables from dao_mod.F
 
USE DAO_MOD, ONLY : UWND, VWND, T</pre>
 
</blockquote>
 
<p align="justify"> While many routines in GEOS-Chem still use this syntax, <a href="http://wiki.geos-chem.org/Derived_type_objects_used_by_the_Grid-Independent_GEOS-Chem#Using_derived-type_objects_to_replace_USE_statements" target="_blank">we are abandoning this approach</a>. Instead of referring to module variables with <tt>USE</tt>, we shall (wherever expedient) use derived type objects to contain variables and arrays, which then can be passed from one routine to another via the argument list. We are doing this to facilitate our <a href="http://wiki.geos-chem.org/Grid-independent_GEOS-Chem" target="_blank">Grid Independent GEOS-Chem project</a>, which seeks to embed GEOS-Chem within the NASA GEOS-5 GCM.</p>
 
<p align="justify">Eventually, the only things that you should obtain from modules via the USE statement will be:</p>
 
<ol>
 
  <li>Subroutines</li>
 
  <li>Functions</li>
 
  <li>Derived type definitions</li>
 
</ol>
 
<p align="justify">such as:</p>
 
<blockquote>
 
  <pre>! Refer to derived type definition from gigc_state_met_mod.F90
 
USE GIGC_State_Met_Mod, ONLY : MetState
 
 
 
! Refer to subroutines from dao_mod.F
 
USE DAO_MOD, ONLY: AIRQNT, COSSZA</pre></blockquote>
 
<p align="justify">For more information about derived types, please see <a href="appendix_8.html">Appendix 8</a>.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
<p class="size20bi"><a name="GIGC" id="GIGC"></a>A7.9 Programming techniques to facilitate grid-independence</p>
 
<p align="justify"><span class="size16bi">A7.9.1</span> When GEOS-Chem connects to an external GCM&mdash;such as NASA's GEOS-5 GCM&mdash;it will utilize MPI parallelization. Each CPU will perform all of GEOS-Chem's operations, but on a much smaller geographical domain (i.e. on a single vertical column or several vertical columns). This also means that each CPU will print informational messages to the screen (or log file, if you redirect screen output there) simultaneously. All of these I/O operations can seriously impact performance.</p>
 
<p align="justify">We have now started the process of bracketing <tt>WRITE</tt> and <tt>PRINT</tt> statements with <tt>IF</tt> blocks. In <a href="http://wiki.geos-chem.org/GEOS-Chem_v9-01-03" target="_blank">GEOS-Chem v9-01-03</a> and higher versions, you will see code like this:</p>
 
<blockquote>
 
<pre>      IF ( am_I_Root ) THEN
 
        WRITE( 6, '(a  )' ) REPEAT( '=', 79 )         
 
        WRITE( 6, '(a,/)' ) 'G E O S - C H E M  U S E R  I N P U T'
 
        WRITE( 6, 100  ) TRIM( FILENAME )
 
100    FORMAT( 'READ_INPUT_FILE: Reading ', a )       
 
      ENDIF</pre></blockquote>
 
<p align="justify">When we use MPI parallelization, the <tt>am_I_Root</tt> variable, which is passed as an argument to subroutines and functions, determines if we are on the root CPU.  This will restrict the printing of informational messages to just the root CPU. We encourage you to use this approach as you write new GEOS-Chem code.</p>
 
<p align="justify">For more information, please see <a href="http://wiki.geos-chem.org/Programming_techniques_used_for_the_Grid-Independent_GEOS-Chem#Restricting_screen_and_log_file_output_to_the_root_CPU" target="_blank">this post on our Grid-Independent GEOS-Chem wiki page</a>.
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
 
 
<p align="justify"><span class="size16bi">A7.9.2</span> <a href="http://wiki.geos-chem.org/GEOS-Chem_v9-01-02" target="_blank">GEOS-Chem v9-01-02</a> and prior versions used fixed parameters (stored in module <tt>GeosUtil/file_mod.F</tt>) for Fortran logical unit numbers. Logical unit numbers, or LUN's, are numeric values that are used in Fortran <tt>OPEN</tt>, <tt>WRITE</tt>, <tt>READ</tt>, and <tt>CLOSE</tt> statements.  A typical GEOS-Chem file read operation looks like this:</p>
 
<blockquote>
 
  <pre>      USE FILE_MOD, ONLY : IU_FILE
 
      ...
 
     
 
      ! Open file
 
      OPEN( UNIT=IU_FILE, FILE=TRIM( FILENAME ) ... )
 
     
 
      ! Read data
 
      READ( IU_FILE, IOSTAT=IOS ) ...
 
     
 
      ! Close file
 
      CLOSE( IU_FILE )</pre></blockquote>
 
 
 
<p>But when we connect GEOS-Chem to an external GCM, we can no longer rely on pre-defined LUNs.  The GCM may have already assigned the LUN that we are trying to use to a different file.  Therefore, starting in <a href="http://wiki.geos-chem.org/GEOS-Chem_v9-01-03" target="_blank">GEOS-Chem v9-01-03</a>, all LUNs will be determined dynamically. We now call a function (<tt>inquireMod/findFreeLUN</tt>) to return the next unused LUN. You will now see code that looks like this:</p>
 
<blockquote>
 
  <pre>      USE inquireMod, ONLY : findFreeLUN
 
      ...
 
     
 
      INTEGER :: IU_FILE
 
      ...
 
 
 
      ! Find a free file LUN
 
      IU_FILE = findFreeLUN()
 
   
 
      ! Open file
 
      OPEN( UNIT=IU_FILE, FILE=TRIM( FILENAME ) ... )
 
     
 
      ! Read data
 
      READ( IU_FILE, IOSTAT=IOS ) ...
 
     
 
      ! Close file
 
      CLOSE( IU_FILE )</pre></blockquote>
 
 
 
 
 
<p>For more information, please see <a href="http://wiki.geos-chem.org/Programming_techniques_used_for_the_Grid-Independent_GEOS-Chem#Using_findFreeLUN_to_assign_logical_unit_numbers_for_file_I.2FO" target="_blank">this post on our Grid-Independent GEOS-Chem wiki page</a>.</p>
 
 
 
<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
 
 
 
<p align="justify"><span class="size16bi">A7.9.3</span> As mentioned in <a href="#Modules">Appendix 7.8.2</a>, we are now moving away from referring to module variables via <tt>USE</tt> statements. Please see <a href="http://wiki.geos-chem.org/Derived_type_objects_used_by_the_Grid-Independent_GEOS-Chem" target="_blank"> this wiki post on our Grid Independent GEOS-Chem wiki page</a> for more information about how we are passing data between subroutines via derived type objects.</p>


<p><img src="../../img/black_rule.jpg" height="2" width="100%"></p>
Here are some tips for moving arrays or adding new arrays to the derived type objects:


<p class="size14bi" align="center"><a href="appendix_6.html">Previous</a> | <a href="appendix_8.html">Next</a> | <a href="appendix_7.html" target="_parent">Printable
* Instead of finding and replacing every instance of an array with <tt>State_Chm&array</tt> throughout a module, create pointers at the top of each routine (e.g. <code>localarray => State_Chm%array</code>). This is the method we used when replacing the species array (<code>Spc => State_Chm%Species</code>).
* Where possible, clean up routine arguments. If State_Chm is already passed, then there is no need to also pass <tt>State_Chm%array</tt>.
* Consider adding arrays used for bpch diagnostics into <tt>State_Diag</tt> instead of <tt>State_Chm</tt>.
* Try to avoid confusing fields names in the State_Chm object. For example, in <tt>DO_DIAG_OH State_Chm%AIR_MASS</tt> may be confused with fields in <tt>State_Met</tt>. If we use pointers at the top of routines, we can still use the existing local array name to avoid having to change code deep in subroutines (e.g. <tt>AIR_MASS => State_Chm%Sum_Air_Mass</tt>).
* Allocate <tt>State_Chm</tt> fields in routine <tt>Init_State_Chm</tt> instead of local modules.
* The GCST highly recommends running [[Unit_Tester_for_GEOS-Chem_12|unit tests]] and [[Performing_Difference_Tests_with_GEOS-Chem|difference tests]] often as you make these updates. Unit tests will ensure you don't break any simulations and difference tests will ensure you aren't introducing any changes to model output.


View (no frames)</a></p>
=== More on derived-type objects===


</body></html>
We are now moving away from including module variables into subroutines or functions via <code>USE</code> statements. Please see [[Derived_type_objects_used_by_GEOS-Chem|this wiki post on our ''Derived type objects used by the Grid-Independent GEOS-Chem'' wiki page]] for more information about how we are passing data between subroutines via derived type objects.

Latest revision as of 19:59, 20 September 2022

NOTE: We will be updating this coding style guide in the near future. The recent efforts to interface GEOS-Chem with ESMs like NASA GEOS, CESM, BCC, and WRF necessitate a departure from past programming practice. We will keep you updated.


Background

We have written GEOS-Chem in the Fortran-90 (F90) language. F90 introduces several new features over its predecessor, Fortran-77 (F77), including allocatable arrays, modules, derived types, and new programming statements. We invite you to consult this list of Fortran tutorials.

F90 contains all of the features of F77. Legacy code adhering to the F77 standard should build with any F90 compiler. Although we have written GEOS-Chem source code to the F90 standard, you should be aware that many 3rd-party routines included in GEOS-Chem—such as, [[Photolysis_mechanism|FAST-JX], and ISORROPIA—were originally written to the F77 standard. In the past several GEOS-Chem releases, we have started to replaced older F77 code with F90 code, especially to facilitate our GEOS-Chem High Performance (aka GCHP) project.

You should also be aware that several other updates to the Fortran standard (F95, F2000, F2003) have been introduced in recent years. These new Fortran versions contain the same functionality as F90, but generally omit the obsolete F77-style features. Most modern Fortran compilers include support for all of these standards.

Code layout

Fixed format vs. free format

We previously wrote GEOS-Chem source code using fixed-format layout, which has the following requirements:

  • A line continuation character (usually &) is placed in column 6
  • Numeric labels can occupy column 2 through column 5
  • Executable statements begin in column 7 and extend to column 72

We chose the fixed-format layout for compatibility with the F77-style include files that were contained in the Headers/ directory. Starting with GEOS-Chem v9-01-03, we converted all header files to Fortran modules. This allows the use of free-format layout, which does not restrict where you place executable statements. (To continue a free-format statement across several lines of code, you must place an ampersand (&;) character at the end of each line.)

Modules in GEOS-Chem that are still written in F77 fixed-format layout are denoted with the .F extension, while modules written in F90 free-format layout are denoted with the .F90 extension. We recommend that you write all new GEOS-Chem code using the free format layout.

Modules

Place all of your new GEOS-Chem code into Fortran 90 modules, if possible. These allow you to save both and routines and variables within a single program unit.

An example module is shown below.

!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !MODULE: MY\_MODULE
!
! !DESCRIPTION: \subsection*{Overview}
! Module MY\_MODULE contains variables and routines to do something.
! (bmy, 5/1/03)
!\\
!\\
! !INTERFACE
!
       MODULE MY_MODULE
!
! !USES:
! 
     IMPLICIT NONE

     ! Make everything private...
     PRIVATE
!
! !PUBLIC MEMBER FUNCTIONS:
!
     PUBLIC :: DRIVER
     PUBLIC :: INIT_MY_MODULE
     PUBLIC :: CLEANUP_MY_MODULE
!
! !REVISION HISTORY:
!  (1 ) Bug fix in MY_SUBROUTINE (bmy, 5/1/03)
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !PRIVATE DATA MEMBERS:
!
     ! Argument to the SIN function
     REAL*8, ALLOCATABLE :: B(:)

     CONTAINS
!EOC 
!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: DRIVER
!
! !DESCRIPTION: \subsection*{Overview}
! Subroutine DRIVER is the driver routine for MY_MODULE. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE DRIVER
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !LOCAL VARIABLES:
!
      LOGICAL, SAVE :: FIRST = .TRUE.
      INTEGER, :: I
      REAL*8 :: X, Y1, Y2
      REAL*8, PARAMETER :: PI180 = 180d0 / 3.14159265358979323d0
 
      !=================================================================
      ! DRIVER begins here!
      !=================================================================

      ! Initialize 
      IF ( FIRST ) THEN
         CALL INIT_MY_MODULE()
      ENDIF

      !=================================================================
      ! Loop over degrees
      !=================================================================
      DO I = 1, 360

         ! Convert to radians
         B = DBLE( I ) / PI180

         ! Call subroutine
         CALL MY_SUBROUTINE( B(I), Y1 )

         ! Call function
         Y2 = MY_FUNCTION( B(I) )

         ! Write output
         WRITE( 6, '(i3, 1x, 3(f13.6,1x))' ) I, B(I), Y1, Y2
      ENDDO 
 
      ! Return to calling program
      END SUBROUTINE DRIVER
!EOC
!-----------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: MY\_SUBROUTINE
!
! !DESCRIPTION: \subsection*{Overview}
! Function MY_SUBROUTINE returns the sine of a number. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE MY_SUBROUTINE( X, Y )
!
! !INPUT PARAMETERS: 
!
      ! Argument for the sine function
      REAL*8, INTENT(IN) :: X
!
! !RETURN VALUE:
!     
      REAL*8, INTENT(OUT) :: Y 
!
! !REVISION HISTORY:
! (1 ) Corrected typographic error (bmy, 5/1/03)
!EOP
!------------------------------------------------------------------------------
!BOC
      !=================================================================
      ! MY_SUBROUTINE begins here!
      !=================================================================
      Y = SIN( X )
 
      ! Return to calling program
      END SUBROUTINE MY_SUBROUTINE
!EOC
!-----------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: MY\_FUNCTION
!
! !DESCRIPTION: \subsection*{Overview}
! Function MY_FUNCTION returns the sine of a number. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
      FUNCTION MY_FUNCTION( X ) RESULT( Y )
!
! !INPUT PARAMETERS: 
!
      ! Argument for the sine function
      REAL*8, INTENT(IN) :: X
!
! !RETURN VALUE:
!     
      REAL*8, INTENT(OUT) :: Y 
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
 
      !=================================================================
      ! MY_FUNCTION begins here!
      !=================================================================
      Y = SIN( X )

      ! Return to calling program
      END FUNCTION MY_FUNCTION
!EOC
!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: INIT\_MY\_MODULE
!
! !DESCRIPTION: \subsection*{Overview}
! Subroutine INIT\_MY\_MODULE allocates and zeroes the B array. (bmy, 5/1/03)
!\\
!\\ 
! !INTERFACE:
! 
     SUBROUTINE INIT_MY_MODULE
!
! !USES
!
      USE ERROR_MOD, ONLY : ALLOC_ERR
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !LOCAL VARIABLES
!
     INTEGER :: AS

      !=================================================================
      ! INIT_MY_MODULE begins here!
      !=================================================================

      ! Allocate B, check status
      ALLOCATE( B(360), STAT=AS )

      ! Error check
      IF ( AS /= 0 ) CALL ALLOC_ERR( 'B' )

      ! Zero B
      B(:) = 0d0

      ! Return to calling program
      END SUBROUTINE INIT_MY_MODULE
!EOC
!-----------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: CLEANUP\_MY\_MODULE
!
! !DESCRIPTION: \subsection*{Overview}
! Subroutine CLEANUP_MY_MODULE deallocates all module arrays. (bmy, 5/1/03)
!\\
!\\
! !INTERFACE:
!
     SUBROUTINE CLEANUP_MY_MODULE()
!
! !REVISION HISTORY:
!
!EOP
!------------------------------------------------------------------------------
!BOC

      !=================================================================
      ! CLEANUP_MY_MODULE begins here!
      !=================================================================
      IF ( ALLOCATED( B ) ) DEALLOCATE( B )

      ! Return to calling program
      END SUBROUTINE CLEANUP_MY_MODULE
 !EOC

 END MODULE MY_MODULE<

Fortran 90 modules written for GEOS-Chem contain the following features:

Item Description
Module header This is similar to the subroutine header. Contains a list of all module variables, routines, and modification notes, in Automatic documentation with ProTex
PRIVATE declarations You can "hide" certain routines or variables within a F90 module from other routines or modules. Ideally your module will be like a "black box" with only a few routines or variables that are publicly accessible. By convention, in GEOS-Chem, we declare everything PRIVATE first then indicate the public routines.
IMPLICIT NONE If you declare IMPLICIT NONE once at the top of the module, then you don't have to declare it in the individual subroutines; the declaration "carries through" below.
Module variables and arrays Any variables declared above the CONTAINS statement are as if they were stored in F77 common blocks; that is, they are preserved between calls to the various module routines. Also, you should declare module arrays as ALLOCATABLE whenever possible. This allows you to only set aside memory for that array if a certain routine is called. This results in more efficient memory management.
CONTAINS statement All module variables and PRIVATE declarations go above the CONTAINS statement. Module routines and functions go below the CONTAINS statement
Subroutines and functions
Your module will contain various subroutines and functions depending on the particular needs at hand.
An INIT routine Your module should have a subroutine named INIT_modulename, which initializes and allocates all module arrays.
A CLEANUP routine Your module should have a subroutine named CLEANUP_modulename, which deallocates module arrays. (You only need this if you have allocatable arrays).

F90 modules are very similar to classes in Java and C++; they allow you to group data and the routines which work on the data in a single package.

Theoretically, each GEOS-Chem routine and/or variable should belong to a module. In practice, this is not true, as we do have separate subroutine and function files from some of the older routines which were written by 3rd party sources. However, you should write all new GEOS-Chem code in module form.

Also, it is OK for modules to reference other modules. If Module B references Module A, all that you have to do is to make sure that Module B declaration in the Makefile refers to Module A. The GEOS-Chem Makefiles have been prepared for you by the GEOS-Chem Support Team, so in most instances, you will not have to concern yourself with this.

Automatic documentation with ProTeX

We generate detailed reference documentation with ProTeX. ProTeX is a perl script developed by NASA/GMAO that generates ProTeX files from GEOS-Chem's subroutine, function, and module comment headers. The LaTeX file can then be compiled to produce both PostScript and PDF output.

To use ProTeX, you need to add a standard header at the top of each subroutine, function, and module. The header looks like this:

!------------------------------------------------------------------------------
!          Harvard University Atmospheric Chemistry Modeling Group            !
!------------------------------------------------------------------------------
!BOP
!
! !IROUTINE: metero
!
! !DESCRIPTION: Subroutine METERO calculates meteorological constants needed
!  for the dry deposition velocity module. (lwh, gmg, djj, 1989, 1994; bmy,
!  10/3/05)
!\\
!\\
! !INTERFACE:
!
      SUBROUTINE METERO( State_Met, CZ1,    TC0,  OBK,       CFRAC,  
     &                   RADIAT,    AZO,    USTR, ZH,        LSNOW, 
     &                   RHB,       PRESSU, W10,  SUNCOS_MID        )
!
! !USES:
!
      USE GIGC_State_Met_Mod, ONLY : MetState 
      IMPLICIT NONE
!
! !INPUT PARAMETERS:
!
      TYPE(MetState), INTENT(IN)  :: State_Met   ! Meteorology State object
!
! !OUTPUT PARAMETERS:
!
      LOGICAL, INTENT(OUT) :: LSNOW (MAXIJ)  ! Flag for denoting snow/ice
      REAL*8,  INTENT(OUT) :: CZ1   (MAXIJ)  ! Midpt ht of 1st model level [m]
      REAL*8,  INTENT(OUT) :: TC0   (MAXIJ)  ! Grid box sfc temperature [K]
      REAL*8,  INTENT(OUT) :: OBK   (MAXIJ)  ! Monin-Obhukov length [m]
      REAL*8,  INTENT(OUT) :: CFRAC (MAXIJ)  ! Column cloud fraction [unitless]
      REAL*8,  INTENT(OUT) :: RADIAT(MAXIJ)  ! Solar radiation @ ground [W/m2]
      REAL*8,  INTENT(OUT) :: RHB   (MAXIJ)  ! Rel humidity at sfc [unitless]
      REAL*8,  INTENT(OUT) :: USTR  (MAXIJ)  ! Friction velocity [m/s]
      REAL*8,  INTENT(OUT) :: ZH    (MAXIJ)  ! PBL height [m]
      REAL*8,  INTENT(OUT) :: PRESSU(MAXIJ)  ! Local surface pressure [Pa]
      REAL*8,  INTENT(OUT) :: W10   (MAXIJ)  ! 10 meter windspeed [m/s]
      REAL*8,  INTENT(OUT) :: SUNCOS_MID(MAXIJ)  ! COS(SZA) @ midpt of current
                                                 !  chemistry timestep
     ! Dimension AZO for GCAP or GEOS met fields (swu, bmy, 5/25/05)
#if  defined( GCAP )
      REAL*8, INTENT(OUT)  :: AZO(NTYPE)     ! Roughness heights, by landtype
#else
      REAL*8, INTENT(OUT)  :: AZO(MAXIJ)     ! Roughness heights, by grid box
#endif
!
! !REMARKS:
!  NOTE: We save into arrays of dimension MAXIJ=IIPAR*JJPAR for compatibility
!  with the legacy drydep routine DEPVEL.
!                                                                             .
!  References (see full citations above):
!  ============================================================================
!  (1 ) Wesely, M. L., 1989. 
!  (2 ) Jacob, D.J., and S.C. Wofsy, 1990
!
! !REVISION HISTORY: 
!  (1 ) Now reference GET_PEDGE from "pressure_mod.f".  Now reference T from 
!        "dao_mod.f".  Removed obsolete code & comments, and added new 
!         documentation header.  Now force double precision with "D" 
!         exponents.  Now compute OBK here as well.  Bundled into F90 module
!         "drydep_mod.f" (bmy, 11/20/02)
!  (2 ) Now reference CLDFRC, RADSWG, ZO, USTAR from "dao_mod.f".  Also now 
!         pass CFRAC, RADIAT, AZO, USTR back to the calling routine 
!         via the arg list. (bmy, 12/9/03)
!  (3 ) Now use explicit formula for IJLOOP to allow parallelization
!        (bmy, 7/20/04)
!  (4 ) Now compute ZH and LSNOW here instead of w/in DO_DRYDEP.  Parallelize
!        DO-loops.  Now use BXHEIGHT from "dao_mod.f" instead of computing 
!        the thickness of the 1st level here.  Remove reference to 
!        "pressure_mod.f".  Remove reference to T from "dao_mod.f".  Now
!        reference ALBD from "dao_mod.f" (bmy, 2/22/05)
!  (5 ) Now references RH from "dao_mod.f".  Now passes relative humidity
!        from the surface layer back via RHB argument. (bec, bmy, 4/13/05)
!  (6 ) Now call GET_OBK from "dao_mod.f" to get the M-O length for both
!        GEOS or GCAP met fields.  Remove local computation of M-O length
!        here.  Also now dimension AZO appropriately for GCAP or GEOS met
!        fields.  Remove obsolete variables. (swu, bmy, 5/25/05)
!  (7 ) Now make sure all USE statements are USE, ONLY (bmy, 10/3/05)
!  (8 ) Move XLTMMP function to module MEGANUT_MOD. (ccc, 11/20/09)
!  (9 ) Add sea level pressure and 10m windspeed as arguments (jaegle 5/11/11)
!  22 Dec 2011 - M. Payer    - Added ProTeX headers
!  10 Jan 2012 - M. Payer    - Added local surface pressure
!  09 Nov 2012 - M. Payer    - Replaced all met field arrays with State_Met
!                              derived type object
!  28 Nov 2012 - R. Yantosca - Add SUNCOS_MID to the argument list and 
!                              populate that with State_Met%SUNCOSmid
!EOP
!------------------------------------------------------------------------------
!BOC
!
! !LOCAL VARIABLES:
!
      INTEGER               :: I,  J,  IJLOOP
      REAL*8                :: THIK
      REAL*8                :: SP
!
! !EXTERNAL FUNCTIONS:
!
      REAL*8, EXTERNAL      :: SFCWINDSQR ! Surface wind speed (jaegle 5/11/11)

      !=================================================================
      ! METERO begins here!
      !=================================================================

      ... code goes here ...

     END SUBROUTINE METERO
!EOC

Note that the ProTeX header contains the following elements:

  1. Indicators (!BOP, !EOP) as to where ProTeX should look for source code header comments<
  2. Indicators (!BOC, !EOC) as to where the source code begins and ends. ProTeX will ignore code between !BOC and !EOC unless you tell it otherwise.
  3. A description of the routine with the original date
  4. A list of input & output arguments
  5. A list of references (if applicable)
  6. References to F90 modules (if any) after the USE keyword followed by the IMPLICIT NONE declaration
  7. Each time the file is modified the REVISION HISTORY section should be updated.
  8. Declarations for local variables
  9. Declarations for Fortran parameters (aka constants)
  10. Declarations for external functions (if necessary)

Fortran 90 allows you to declare an argument to a subroutine or function with one of the following descriptors:

  1. INTENT(IN) (read-only),
  2. INTENT(OUT) (write-only) or
  3. INTENT(INOUT) (read-and-write)

This helps you to prevent overwriting arguments which should not be overwritten.

Style Guide

Denoting code authors

Generally, we denote code authors by a 3-letter abbreviation in source code comments (bmy, 5/1/03).

The more comments, the better!

It is good practice to add copious comments to your source code. This will make sure that current and future GEOS-Chem users will be able to understand the modifications that you have added.

Keep in mind that many GEOS-Chem users (including the GEOS-Chem Support Team) will probably not be as familiar with your specific area of research as you are. While most users will probably get the "big picture" of what you are doing, they will be less knowledgeable about the little details (i.e. why does this reaction rate have a value of X, why is this parameter set to Y, why was this IF statement deleted, etc.). Providing sufficient source code documentation will eliminate any such guesswork.

Section headers

We recommend that you separate sections of source code with one or more divider comments:

!=================================================================
! 1st level header
!=================================================================

   !--------------------------------------------------------------
   ! 2nd level header
   !--------------------------------------------------------------

      !-----------------------
      ! 3rd-level header
      !-----------------------

Line up each header with the the code that follows immediately below it. You can mix and match these styles as you wish. You don't always have to extend the header all the way across the screen.

If you are modifying a fixed-format source code file, end the major section headers at column #72. This will remind your reader where the limits of the allowable source code region occurs. If you are modifying a free-format source code file, feel free to extend the ends of the major section headers to about column 76 or 77.

Capitals vs. lower-case

We used to write GEOS-Chem source code in all capitals. We chose this approach to avoid mistaking similar characters, such as the lowercase l and the number 1, which in some fonts look remarkably alike. You may at your discretion mix capitals and lowercase in variable names and subroutine names, particularly if you want to improve readability or preserve scientific abbreviations ( e.g. Rn_Pb_Be_mod.F).

We still recommend that you write Fortran reserved words in all capitals: SUBROUTINE, CALL, IF/THEN/ENDIF, DO/ENDDO, WHERE/ENDWHERE. This distinguishes Fortran language elements from your variable and routine names.

Because GEOS-Chem contains a significant amount of 3rd-party code, you will see many routines where the code is written in all lowercase letters. Feel free to leave this code as-is. Reformatting a fixed-format source code file can be a cumbersome task (and maybe not the best use of your precious time)

Good editors such as emacs can "colorize" the different words in a program (e.g. statements, variables, and quoted text are rendered in different colors), thus making the code infinitely more readable.

Indent by 3 spaces

Indent each new block of code 3 columns deeper than the last block:

         1         2         3         4
1234567890123456789012345678901234567890

     IF ( X == 0 ) THEN
        PRINT*, 'Hello, X is 0!'

        IF ( Y == 0 ) THEN
           PRINT*, 'Hello, X is 0 and Y is also 0!'
        ENDIF

     ENDIF

A good editor such as emacs will indent for you automatically. You can specify the level of indent in your editor setup file (e.g. .emacs).

Indenting nested DO loops

Omit indentation for each line of a multiple DO loop. Use this syntax:

DO N = 1, NNPAR
DO L = 1, LLPAR
DO J = 1, JJPAR
DO I = 1, IIPAR
   A(I,J,L,N) = A(I,J,L,N) / 2.0d0
ENDDO
ENDDO
ENDDO
ENDDO

instead of letting the DO loops "creep" unnecessarily across the screen:

DO N = 1, NNPAR
   DO L = 1, LLPAR
      DO J = 1, JJPAR
         DO I = 1, IIPAR
            A(I,J,L,N) = A(I,J,L,N) / 2.0d0
         ENDDO
      ENDDO
   ENDDO
ENDDO

Enhance readability with white space

Use LOTS of white space to separate code. For example instead of

IF(X.eq.0)CALL DOLOOP(X,Y,Z,A,B,C,1,2,3)

type:

IF ( X == 0 ) CALL DOLOOP( X, Y, Z, A, B, C, 1, 2, 3 )

This makes the code much more readable, which prevents bugs.. If you can't read what you've written, you will never find your mistake! Don't be afraid to continue the statement to the next line if necessary.

In the above example, we also have used the new F90 equality test operator == (see below for more information).

Line up code into columns for better readability

Always line up quantities in assignment statements. Instead of:

A=1
THISLOOP=2
C=3
D=THISLOOP*C - A

add white space so that all of the equals signs fall in the same column.

A        = 1
THISLOOP = 2
C        = 3 
D        = ( THISLOOP * C ) - A

Your eyes can make more sense out of text that is lined up into vertical columns.

Group multiple terms of an equation with parentheses (as we have done for the D equation above). The compiler evaluates evaluate expressions in the order the parentheses are placed, from innermost to outermost.

Use of white space for arrays and functions

Do not leave white space between array indices:

A = X(I,J,L)

Leave white space between arguments of functions and WRITE statements:

A = MYFUNCTION( I, J, L )

WRITE( 6, '(a)' )

Line up arguments into columns

Line up arguments in subroutine or function calls. If there are an even number of arguments, break them up symmetrically:

Fortran-77 fixed-file format:

     CALL MYSUB( THIS_A, THIS_B, THIS_C,
    &            THIS_D, THIS_E, THIS_F )

     X = MYFUNCTION( THIS_A, THIS_B, THIS_C,
    &                THIS_D, THIS_E, THIS_F )

Fortran-90 free-format:

     CALL MYSUB( THIS_A, THIS_B, THIS_C, &
                 THIS_D, THIS_E, THIS_F )

     X = MYFUNCTION( THIS_A, THIS_B, THIS_C, &

Use language features of Fortran-90

Operators

Use the new F90-style operators instead of the older F77-style operators:

>== instead of .EQ.

/= instead of .NE.

>  instead of .GT.

>= instead of .GE.

<  instead of .LT. 

>= instead of .LE.

.EQV. (or .eqv, it's case-insensitive) when comparing two logical values, i.e.
 
   IF ( MY_LOGICAL .eqv .FALSE. ) THEN ...

The new operators take up only one or two spaces (instead of four spaces) and are easier to read.

Some operators (.AND., .OR.</code, and .NOT.) are the same in F90 as in F77. This is also true of the boolean values (.TRUE. and .FALSE.).

Comment characters

Use the F90 ! comment character. This replaces the F77 comment character (a C placed in the first column of fixed-format code). You can can create very legible comments by lining up the ! with your source code.

Continuation characters

To continue an instruction from one line to the next, place an ampersand (&) at the end of the line (for free-format layout).

If you are working with a source code file that has not yet been converted to free-format, place an ampersand in column 6 of each continued line of code.

Array assignment statements

Use the F90 array assignment functionality to assign a value to all elements of an array without using array indices. For example:

INTEGER  :: ARRAY(10,10)
REAL(fp) :: ARRAY_F(10,10)

ARRAY(:,:)   = 0
ARRAY_F(:,:) = 0.0_fp

You can also leave off the default array mask (:,:), so

ARRAY   = 0
ARRAY_F = 0.0_fp

is also acceptable.

PARAMETER declaration statements

Initialize PARAMETER constants on the same line where they are declared:

REAL(fp), PARAMETER :: PI  = 3.14159265358979323_fp
REAL(fp), PARAMETER :: AVO = 6.022140857e+23_fp

Array constructors

Use F90 direct assignment statements to assign values to arrays:

REAL*8 :: A(2) = (/ 1.0d0, 2.0d0 /)

Avoid using obsolete F77 syntax:

REAL*8 A(2)
DATA A / 1.0, 2.0 /

CAVEAT! In the above example the A array will be automatically given the SAVE attribute. This means that its values will be stored from one subroutine call to the next. This may not be what you had intended to do, so just be aware of this.

The SAVE attribute

Include the SAVE attribute on the same line where the variable is declared:

LOGICAL, SAVE :: FIRSTTIME = .TRUE.

Avoid using obsolete F77 syntax:

LOGICAL FIRSTTIME 
SAVE FIRSTTIME
DATA FIRSTTIME / .TRUE. /

Note: SAVEd variables within a subroutine or function keep their values from one call to the next. This allows you to do some initialization only on the first call to a subroutine, for example:

LOGICAL, SAVE :: FIRSTTIME = .TRUE.

IF ( FIRSTTIME ) THEN
  PRINT*, 'This is the first time this routine is called!'
  FIRSTTIME = .FALSE.
ENDIF

Since the value of FIRSTTIME is saved between calls, the sentence above will only be printed out on the first call to the subroutine.

The EXTERNAL attribute

Include the EXTERNAL attribute on the same line where a function's type is declared. Use this syntax:

INTEGER, EXTERNAL :: MYFUNC

Avoid using obsolete F77 syntax:

INTEGER MYFUNC
EXTERNAL MYFUNC

NOTE: If you are referencing functions contained within a module file, then you do not need to declare it with EXTERNAL. The EXTERNAL statement is a hangover from F77.

The SELECT CASE statement

Use the F90 SELECT CASE statement to pick from a list of options. In F77 you would have had to write:

IF ( X .EQ. 1 ) THEN
 
  CALL MYSUB( X1 )

ELSE IF ( X .EQ. 2 ) THEN 

  CALL MYSUB( X2 )

ELSE

  CALL MYSUB( X0 )

ENDIF

but with F90, you can write this instead:

SELECT CASE ( X )

   CASE( 1 )
      CALL MYSUB( X1 )

   CASE( 2 ) 
      CALL MYSUB( X2 )

   CASE DEFAULT
      CALL MYSUB( X0 )

END SELECT

SELECT CASE also works with character constants:

SELECT CASE ( TRIM( TRACERNAME ) )

   CASE( 'NOx' )
      CALL MYSUB( 1 )

   CASE( 'Ox' )
      CALL MYSUB( 2 )

   CASE DEFAULT
      CALL MYSUB( 3 )

END SELECT

A note of warning: You cannot specify CASE( X ) where X is a variable, or else the compiler will balk. Then you must resort to the IF - ELSE IF block structure as shown above.

Logarithms

Use the F90 intrinsic functions LOG( X ) and LOG10( X ) to take the natural and common logarithms of X. Here X may be declared either as REAL(f4) or REAL(f8).

Avoid using the F77 intrinsic functions ALOG( X ) and ALOG10( X ). Some compilers do not support these functions.

Keyword arguments

Use F90's keyword argument capability to clarify subroutine calls. You can replace this:

 CALL READ_A1( NYMD,     NHMS, 
&              ALBEDO,   CLDTOT,   EFLUX,    EVAP,    
&              ... etc ...                         )

with:

 CALL READ_A1( NYMD     = NYMD, 
&              NHMS     = NHMS,
&              ALBEDO   = ALBD,
&              CLDTOT   = CLDTOT
&              EFLUX    = EFLUX
&              EVAP     = EVAP
&             ... etc ...      )

In the call above, you see several pairs of variable names separated by an equals sign: ALBEDO = ALBD

The name on the left of the equals sign is the name of the argument as defined in the subroutine (i.e. ALBEDO). This is often called the "dummy argument" because it is just a name for the memory that is getting passed to it from outside of the subroutine.

The name on the right of the equals sign is the name of the variable that we are passing down to the subroutine. Here we are using the ALBD variable (from GEOS-Chem module dao_mod.F and passing that down to READ_A1 as the ALBEDO argument.

                     THIS_D, THIS_E, THIS_F )

Numeric and character data types

Data types

GEOS-Chem uses the following basic data types:

Data type Fortran type Flexible precision definintion Range
TRUE or FALSE values LOGICAL
Whole numbers INTEGER -2 x 109 up to +2 x 109
4-byte (aka 32-bit) floating point real values REAL*4 REAL(f4) -1 x 1038 up to +1 x 1038
8-byte (aka 64-bit) floating point real values REAL*8 REAL(f8) -1 x 10312 up to +1 x 10312
Character strings CHARACTER(LEN=255)

However, you should use 4-byte floating point instead of 8-byte floating point in the following instances:

  1. Diagnostic arrays: GEOS-Chem diagnostic values rarely exceed 1038, since they are usually in units such as kg, v/v, ppbv, or molecules/cm2/s.
  2. Arrays and scalars required for binary or netCDF file I/O: Declaring these types of variables REAL*4 will help to keep file sizes as small as possible, thus saving disk space.

Some third-party routines used by GEOS-Chem (such as TPCORE) use the REAL datatype. Most compilers (but not all) interpret REAL as a 4-byte floating point real. If you want to force the compiler to interpret REAL as an 8-byte floating point real, you must use a special compiler switch (usually -r8, but check your compiler manual to be sure). The GEOS-Chem Makefiles already take care of this for you.

NOTE: See our Floating point math issues wiki page for more information.

Flexible precision

GEOS-Chem v10-01 and later versions use a flexible precision definition. Floating point variables can be defined with a Fortran KIND parameter that can be switched between 4-byte and 8-bytes at compile time.

Here is an example of how you can use flexible precision in GEOS-Chem:

USE PRECISION_MOD          ! Contains the fp, f4, f8 parameters used to define real values
 
REAL*4   :: VAR_REAL4      ! This was the old way to define a  4-byte real (from Fortran-77)
REAL*8   :: VAR REAL8      ! This was the old way to define an 8-byte real (from Fortran-77) 

REAL(f4) :: VAR_REAL4_NEW  ! This is the new way to define a  4-byte real (introduced in Fortran 90)
REAL(f8) :: VAR_REAL8_NEW  ! This is the new way to define an 8-byte real (introduced in Fortran 90)

REAL(fp) :: VAR_FLEXIBLE   ! This is an 8-byte real by default, but can be switched to a 4-byte real at compile time 

Declaring floating point real variables with REAL(fp) allows you to change from the default 8-byte precision to 4-byte precision by compiling GEOS-Chem with the PRECISION=4 switch. (NOTE: As described in the prior section, some floating point variables should always be declared as 4-byte reals, such as those variables that get written to disk.)

We have not yet made full use of the flexible precision in GEOS-Chem as of GEOS-Chem v11-01. This will happen in a later version.

Use the Fortran-90 exponent syntax

Use _fp (double-precision) suffixes when assigning a constant value to a REAL(fp) variable:

REAL(fp) :: PI, ONE_THOUSAND
PI           = 3.14159265358979323_fp
ONE_THOUSAND = 1.0e+3_fp

This will ensure that the value will be declared with sufficient precision.

Similarly, if you have used REAL(f4) to declare 4-byte floating point variables, then use this syntax:

REAL(f4) :: PI, ONE_THOUSAND
PI           = 3.14159265358979323_f4
ONE_THOUSAND = 1.0e+3_f4

and similarly for REAL(f8):

REAL(f9) :: PI, ONE_THOUSAND
PI           = 3.14159265358979323_f8
ONE_THOUSAND = 1.0e+3_f9

Default string length

Declare strings with 255 characters, even if you don't know a priori how long the string will be. Use the TRIM statement to strip excess white space. For example:

CHARACTER(LEN=255) :: STR

STR = 'I am the very model of a modern major general...'

WRITE( 6, '(a)' ) TRIM( STR )

Converting from number to character (and vice-versa)

Internal write statements

Several GEOS-Chem routines convert a numeric variable to a character variable (and vice-versa). This usually happens when a number representing a date or time has to be incorporated into a file path.

Use a Fortran internal write statement to convert from from INTEGER to CHARACTER. This looks like a regular WRITE statement, only that the unit number is replaced by a character variable. Think of it as "writing" your number into a character variable instead of to a file.

For example, the following code creates a string containing the date 20110701:

! Declare variables
CHARACTER(LEN=8) :: DATE_STR
INTEGER          :: DATE = 20110701

! FORTRAN internal write -- converts number to string
WRITE( DATE_STR, '(i8.8)' ) DATE

You can incorporate DATE_STR into a directory or file path. The format '(i8.8)' tells the WRITE command to make the string 8 characters long, and to change any spaces (if they exist) into preceding zeroes.

Internal read statements

Use a Fortran internal read to extract numbers from a character string. This is the reverse of the Fortran internal write described above:

! Declare variables
CHARACTER(LEN=8) :: DATE_STR = '20010101'
INTEGER :: DATE

! FORTRAN internal read -- converts string to number
READ( DATE_STR, '(i8.8)' ) DATE

Math optimizations

Write polynomials efficiently

Logarithms and exponentiation are the most computationally expensive mathematical functions. Avoid using the traditional formulation:

Y = A0 + A1*X + A2*X**2 + A3*X**3 + A4*X**4 + A5*X**5

but instead use parentheses to group the polynomial terms.

Y = A0 + X* ( A1 + X* ( A2 + X* ( A3 + X* ( A4 + X* ( A5 )))))

This modified polynomial expression replaces the costly exponentiations with more efficient multiplication operations. A Nth order polynomial will now only require N multiplications.

Exercise care with exponential approximations

Although it is tempting to replace EXP( -x ) with the first-order approximation 1 - x, this can result in negative values for certain values of x. This can cause negative tracer concentrations.

DO loops

The CYCLE statement

Always use the CYCLE statement to skip to the next iteration in a DO loop. Use this syntax:

DO I = 1, 1000

  ! Skip to next iteration
  IF ( X(I) == 0 ) CYCLE

  ! Print
  PRINT*, X(I)

ENDDO

instead of this obsolete F77 syntax:

DO 100 I = 1, 1000

   IF ( X(I) .EQ. 0 ) GOTO 100
   PRINT*, X(I)

100 CONTINUE

Notice that the F90 ENDDO statement replaces the labeled CONTINUE statement. This results in cleaner code.

The EXIT statement

Always use the EXIT statement to completely exit from a DO loop. Use this syntax:

DO I = 1, 1000

  ! Break out of this loop if X==0!
  IF ( X(I) == 0 ) EXIT

  ! Print info if we have not exited
  WRITE( 6, '(''Iteration: '', i5, f13.6)' ) I, X(I)

ENDDO 

PRINT*, 'outside loop'

instead of this obsolete F77 syntax, which requires a GOTO statement:

C Do-loop
DO 100 I = 1, 1000

   IF ( X(I) .EQ. 0 ) GOTO 200
   PRINT*, 'inside loop'

100 CONTINUE

C Continue outside loop
200 CONTINUE
PRINT*, 'outside loop'

Ordering of DO loops

Structure your DO loops so that they access memory in the most optimal manner. Fortran is a column-major language, which means that arrays are stored in memory by columns first, then by rows. If you have declared an array such as:

INTEGER :: I, J, L, N
REAL*8  :: ARRAY(IIPAR,JJPAR,LLPAR,NNPAR)

then for optimal efficiency, the leftmost dimension (I) needs to vary the fastest, and needs to be accessed by the innermost DO loop. Then the next leftmost dimension (J) should be accessed by the next innermost DO loop, and so on.

Therefore, the proper way to loop over this array is:

DO N = 1, NNPAR
DO L = 1, LLPAR
DO J = 1, JJPAR
DO I = 1, IIPAR
   ARRAY(I,J,L,N) = ARRAY(I,J,L,N) * 2.0d0
ENDDO
ENDDO
ENDDO
ENDDO

Note that the I index is varying most often, since it is the innermost DO loop, then J, L, and N. This is opposite to how a car's odometer reads.

If you loop through an array in this fashion, with leftmost indicies varying fastest, then the code minimizes the number of times it has to load subsections of the array into cache memory. In this optimal manner of execution, all of the array elements sitting in the cache memory are read in the proper order before the next array subsection needs to be loaded into the cache. But if you step through array elements in the wrong order, the number of cache loads is proportionally increased. Because it takes a finite amount of time to reload array elements into cache memory, the more times you have to access the cache, the longer it will take the code to execute. This can slow down the code dramatically.

Infinite DO loop with EXIT

Use "infinite" DO loops in situations where you need to iterate for an unknown number of iterations before some exit criterion is reached. This is most often used to read data from a file whose length you do not know a priori.

Here is an example of an infinite DO loop used to read data from disk:

INTEGER :: I, IOS, IUNIT
REAL*8 :: A(10000) 

! Open file
OPEN( IUNIT, FILE='myfile.txt', IOSTAT=IOS )

! Quit if we can't open the file
IF ( IOS /= 0 ) THEN
   PRINT*, 'Error opening the file!'
   STOP
ENDIF

! Infinite DO loop for reading data from "myfile.txt"
DO

   ! Read I and A(I) from the file
   READ( IUNIT, '(i3,f11.3)', IOSTAT=IOS ) I, A(I)
 
   ! IOS < 0 is end-of-file, so we exit the loop
   IF ( IOS < 0 ) EXIT
	
   ! If IOS < 0, then this is an I/O error,
   ! so we print an error msg and quit
   IF ( IOS > 0 ) THEN
      PRINT*, 'I/O error reading from file!
      STOP
   ENDIF
ENDDO

! Close the file after we exit the loop
CLOSE( IUNIT )

In this example, the DO loop wants to execute forever, but it is stopped by an external criterion—the end of the file. The OPEN, READ, and CLOSE functions can return the I/O status to a variable if you specify via the IOSTAT keyword. If the I/O status variable (in this case, named IOS) is negative, then this is a normal end-of-file condition, and so we can just exit the loop and close the file. However, if IOS is a nonzero, positive number, then this means that we have encountered a real error condition.

Reserved DO loop index variable names

Reserve the following variable names for DO loop and array indices:

  1. Use I for the longitude index
  2. Use J for the latitude index
  3. Use L for the altitude index
  4. Use N for the quantity index (tracer, species, etc.)

Therefore, you can assume that any DO loop that uses I, J, and L is looping over grid box longitude, latitude, and altitude dimensions, and that any DO loop which uses N is looping over species or some other quantity. These special indices should NOT be used to refer other quantities. In this way, if you should get an error message such as:

Error at grid box (I,J) = (23,34)!

you will immediately understand (I,J) are the longitude and latitude indices of the box where the error happened.

NOTE: In some 3rd-party routines, you will see K used instead of L to denote the altitude index. We recommend leaving these as-is, so as to avoid having to rewrite entire sections of 3rd-party code. However, you should use L for the altitude index in any new GEOS-Chem code that you write.

Programming techniques to promote running in HPC environments

Restrict screen output to the root CPU

When GEOS-Chem connects to an external GCM—such as NASA's GEOS-5 GCM—it will utilize MPI parallelization. Each CPU will perform all of GEOS-Chem's operations, but on a much smaller geographical domain (i.e. on a single vertical column or several vertical columns). This also means that each CPU will print informational messages to the screen (or log file, if you redirect screen output there) simultaneously. All of these I/O operations can seriously impact performance.

We have now started the process of bracketing WRITE and PRINT statements with IF blocks. In GEOS-Chem v9-01-03 and higher versions, you will see code like this:

 IF ( am_I_Root ) THEN 
    WRITE( 6, '(a  )' ) REPEAT( '=', 79 )           
    WRITE( 6, '(a,/)' ) 'G E O S - C H E M   U S E R   I N P U T'
    WRITE( 6, 100   ) TRIM( FILENAME )
100 FORMAT( 'READ_INPUT_FILE: Reading ', a )        
 ENDIF

When we use MPI parallelization, the am_I_Root variable, which is passed as an argument to subroutines and functions, determines if we are on the root CPU. This will restrict the printing of informational messages to just the root CPU. We encourage you to use this approach as you write new GEOS-Chem code.

For more information, please see this wiki post.

Do not use fixed parameters for file unit numbers

GEOS-Chem v9-01-02 and prior versions used fixed parameters (stored in module GeosUtil/file_mod.F) for Fortran logical unit numbers. Logical unit numbers, or LUN's, are numeric values that are used in Fortran OPEN, WRITE, READ, and CLOSE statements. A typical GEOS-Chem file read operation looks like this:

USE FILE_MOD, ONLY : IU_FILE

...

! Open file
OPEN( UNIT=IU_FILE, FILE=TRIM( FILENAME ) ... )
  
! Read data
READ( IU_FILE, IOSTAT=IOS ) ...

! Close file
CLOSE( IU_FILE )

But when we connect GEOS-Chem to an external GCM, we can no longer rely on pre-defined LUNs. The GCM may have already assigned the LUN that we are trying to use to a different file. In GEOS-Chem v9-01-03 and higher versions, all LUNs are determined dynamically. We now call a function (inquireMod/findFreeLUN) to return the next unused LUN. You will now see code that looks like this:

USE inquireMod, ONLY : findFreeLUN

...

INTEGER :: IU_FILE

...

! Find a free file LUN
IU_FILE = findFreeLUN()

! Open file
OPEN( UNIT=IU_FILE, FILE=TRIM( FILENAME ) ... )

! Read data
READ( IU_FILE, IOSTAT=IOS ) ...
      
! Close file
CLOSE( IU_FILE )

For more information, please see this wiki post.

Do not refer to public variables via USE statements

We used to refer to variables in modules by means of the USE statement, such as:

! Reference variables from dao_mod.F
USE DAO_MOD, ONLY : UWND, VWND, T

While many routines in GEOS-Chem still use this syntax, we are abandoning this approach. Instead of referring to module variables with USE, we shall (wherever expedient) use derived type objects to contain variables and arrays, which then can be passed from one routine to another via the argument list. We are doinFile:G this to facilitate our GEOS-Chem HP, which seeks to embed GEOS-Chem within the NASA GEOS-5 GCM.

The only things that you should obtain from modules via the USE statement will be:

  1. Subroutines
  2. Functions
  3. Derived type definitions

such as:

! Refer to derived type definition from gigc_state_met_mod.F90
USE State_Met_Mod, ONLY : MetState

! Refer to subroutines from dao_mod.F
USE DAO_MOD, ONLY: AIRQNT, COSSZA

Here are some tips for moving arrays or adding new arrays to the derived type objects:

  • Instead of finding and replacing every instance of an array with State_Chm&array throughout a module, create pointers at the top of each routine (e.g. localarray => State_Chm%array). This is the method we used when replacing the species array (Spc => State_Chm%Species).
  • Where possible, clean up routine arguments. If State_Chm is already passed, then there is no need to also pass State_Chm%array.
  • Consider adding arrays used for bpch diagnostics into State_Diag instead of State_Chm.
  • Try to avoid confusing fields names in the State_Chm object. For example, in DO_DIAG_OH State_Chm%AIR_MASS may be confused with fields in State_Met. If we use pointers at the top of routines, we can still use the existing local array name to avoid having to change code deep in subroutines (e.g. AIR_MASS => State_Chm%Sum_Air_Mass).
  • Allocate State_Chm fields in routine Init_State_Chm instead of local modules.
  • The GCST highly recommends running unit tests and difference tests often as you make these updates. Unit tests will ensure you don't break any simulations and difference tests will ensure you aren't introducing any changes to model output.

More on derived-type objects

We are now moving away from including module variables into subroutines or functions via USE statements. Please see this wiki post on our Derived type objects used by the Grid-Independent GEOS-Chem wiki page for more information about how we are passing data between subroutines via derived type objects.