
% $Header: /u/gcmpack/manual/part6/mnc.tex,v 1.13 2004/10/12 21:44:59 edhill Exp $
% $Name:  $

\section{NetCDF I/O Integration: MNC}
\label{sec:pkg:mnc}
\begin{rawhtml}
<!-- CMIREDIR:package_mnc: -->
\end{rawhtml}

The \texttt{mnc} package is a set of convenience routines written to
expedite the process of creating, appending, and reading NetCDF files.
NetCDF is an increasingly popular self-describing file format
\cite{rew:97} intended primarily for scientific data sets.  An
extensive collection of NetCDF reference papers, user guides,
software, FAQs, and other information can be obtained from UCAR's web
site at:
\begin{rawhtml} <A href="http://www.unidata.ucar.edu/packages/netcdf/"> \end{rawhtml}
\begin{verbatim}
http://www.unidata.ucar.edu/packages/netcdf/
\end{verbatim}
\begin{rawhtml} </A> \end{rawhtml}


\subsection{Using MNC}

\subsubsection{MNC Configuration}

As with all MITgcm packages, MNC can be turned on or off at compile time
using the \texttt{packages.conf} file or the \texttt{genmake2}
\texttt{-enable=mnc} or \texttt{-disable=mnc} switches.

While MNC is likely to work ``as is'', there are a few compile--time
constants that may need to be increased for simulations that employ
large numbers of tiles within each process.  Note that the important
quantity is the maximum number of tiles \textbf{per process}.  Since
MPI configurations tend to distribute large numbers of tiles over
relatively large numbers of MPI processes, these constants will rarely
need to be increased.

If MNC runs out of space within its ``lookup'' tables during a
simulation, then it will provide an error message along with a
recommendation of which parameter to increase.  The parameters are all
located within \filelink{pkg/mnc/mnc\_common.h}{pkg-mnc-mnc_common.h}
and the ones that may need to be increased are:

\begin{center}
  {\footnotesize
    \begin{tabular}[htb]{|l|r|l|}\hline
      \textbf{Name}  &  
      \textbf{Default}  &  \textbf{Description}  \\\hline
      &  &  \\
      \texttt{MNC\_MAX\_ID}  &  1000  & 
      \textbf{IDs for various low-level entities}  \\
      \texttt{MNC\_MAX\_INFO}  &   400  & 
      \textbf{IDs (mostly for object sizes)}  \\
      \texttt{MNC\_CW\_MAX\_I}  &  150  & 
      \textbf{IDs for the ``wrapper'' layer}  \\\hline
    \end{tabular}
  }
\end{center}

In those rare cases where MNC ``out-of-memory'' error messages are
encountered, it is a good idea to increase the too-small parameter by
a factor of \textbf{2--10} in order to avoid wasting time on an
iterative compile--test sequence.


\subsubsection{MNC Inputs}

For run-time configuration, most of the MNC--related model parameters
are contained within a Fortran namelist file called \texttt{data.mnc}.
If this file does not exist, then the MNC package will interpret that
as an indication that it is not to be used.  If the \texttt{data.mnc}
file does exist, then it may contain the following parameters:

\begin{center}
  {\footnotesize
    \begin{tabular}[htb]{|l|c|l|l|}\hline
      \textbf{Name}  &  \textbf{T}  &  
      \textbf{Default}  &  \textbf{Description}  \\\hline
      &  &  &  \\
      \texttt{useMNC}  &  L  & \texttt{.FALSE.}  &  
      \textbf{overall MNC ON/OFF switch}  \\
      \texttt{mnc\_echo\_gvtypes}  &  L  & \texttt{.FALSE.}  &  
      echo pre-defined ``types'' (debugging)   \\
      \texttt{mnc\_use\_outdir}  &  L  & \texttt{.FALSE.}  &  
      create a directory for output  \\
      \texttt{mnc\_outdir\_str}  &  S  & \texttt{'mnc\_'}  &  
      output directory name \\
      \texttt{mnc\_outdir\_date}  &  L  & \texttt{.FALSE.}  &  
      embed date in the output dir name  \\
      \texttt{pickup\_write\_mnc}  &  L  & \texttt{.FALSE.}  &  
      use MNC to write (create) pickup files  \\
      \texttt{pickup\_read\_mnc}  &  L  & \texttt{.FALSE.}  &  
      use MNC to read pickup files  \\
      \texttt{mnc\_use\_indir}  &  L  & \texttt{.FALSE.}  &  
      use a directory (path) for input  \\
      \texttt{mnc\_indir\_str}  &  S  & \texttt{''}  &  
      input directory (or path) name  \\
      \texttt{snapshot\_mnc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{snapshot} (instantaneous) w/MNC  \\
      \texttt{monitor\_mnc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{monitor} w/MNC  \\
      \texttt{timeave\_mnc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{timeave} w/MNC  \\
      \texttt{autodiff\_mnc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{autodiff} w/MNC  \\\hline
    \end{tabular}
  }
\end{center}
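
As an illustration, a minimal \texttt{data.mnc} that switches on MNC,
sends snapshot output to NetCDF, and collects the files in a separate
directory might look like the following sketch.  The parameter names
are those listed in the table above; the namelist group name
\texttt{MNC\_01}, the terminating line, and the directory prefix
\texttt{mnc\_out\_} are assumptions made for this example and should
be checked against the package source:
{\footnotesize
\begin{verbatim}
 &MNC_01
  useMNC=.TRUE.,
  mnc_use_outdir=.TRUE.,
  mnc_outdir_str='mnc_out_',
  snapshot_mnc=.TRUE.,
  monitor_mnc=.FALSE.,
 &
\end{verbatim}
}
Parameters that are omitted keep the defaults given in the table.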

Additional MNC--related parameters are contained within the main
\texttt{data} namelist file and in some of the namelist files for
individual packages.  These options are:
\begin{center}
  {\footnotesize
    \begin{tabular}[htb]{|l|c|l|l|}\hline
      \textbf{Name}  &  \textbf{T}  &  
      \textbf{Default}  &  \textbf{Description}  \\\hline
      \multicolumn{4}{|c|}{\ }  \\
      \multicolumn{4}{|c|}{Main namelist file: 
        ``\textbf{data}''}  \\\hline
      \texttt{snapshot\_ioinc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{snapshot} ``inclusively''  \\
      \texttt{timeave\_ioinc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{timeave} ``inclusively''  \\
      \texttt{monitor\_ioinc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{monitor} ``inclusively''  \\
      \texttt{the\_run\_name}  &  C  & ``name...''  &  
      name is included in all MNC output  \\\hline
      \multicolumn{4}{|c|}{\ }  \\
      \multicolumn{4}{|c|}{Diagnostics namelist file: 
        ``\textbf{data.diagnostics}''}  \\\hline
      \texttt{diag\_mnc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{diagnostics} w/MNC  \\
      \texttt{diag\_ioinc}  &  L  & \texttt{.FALSE.}  &  
      write \texttt{diagnostics} ``inclusively''  \\\hline
    \end{tabular}
  }
\end{center}

By default, turning on MNC for a particular output type will result in
turning off all the corresponding (usually, default) MDSIO or STDOUT
output mechanisms.  In other words, output defaults to being an
exclusive selection.  To enable multiple kinds of simultaneous output,
flags of the form \texttt{NAME\_ioinc} have been created where
\texttt{NAME} corresponds to the various MNC output flags.  When a
\texttt{NAME\_ioinc} flag is set to \texttt{.TRUE.}, then multiple
simultaneous forms of output are allowed for the \texttt{NAME} output
mechanism.  The intent of this design is that typical users will only
want one kind of output while people debugging the code (particularly
the I/O routines) may want simultaneous types of output.

This ``inclusive'' versus ``exclusive'' design is easily applied in
cases where three or more kinds of output may be generated.  Thus, it
can be readily extended to additional output types (e.g., HDF5).
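
As a worked reading of the rule above for the \texttt{snapshot}
output type (an interpretation of the description given here, not a
statement taken from the code), the two flags would combine as
follows:
\begin{center}
  {\footnotesize
    \begin{tabular}{|c|c|l|}\hline
      \texttt{snapshot\_mnc}  &  \texttt{snapshot\_ioinc}  &
      \textbf{Snapshot output written by}  \\\hline
      \texttt{.FALSE.}  &  \texttt{.FALSE.}  &
      MDSIO only (the default)  \\
      \texttt{.FALSE.}  &  \texttt{.TRUE.}  &
      MDSIO only (presumably, since MNC is not enabled)  \\
      \texttt{.TRUE.}  &  \texttt{.FALSE.}  &
      MNC only  \\
      \texttt{.TRUE.}  &  \texttt{.TRUE.}  &
      MNC and MDSIO  \\\hline
    \end{tabular}
  }
\end{center}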

Input types are always exclusive.

\subsubsection{MNC Output}

While NetCDF files are supposed to be ``self-describing'', it is
helpful to note the following:

\begin{itemize}
\item The constraints placed upon the ``unlimited'' (or ``record'')
  dimension inherent with NetCDF v3.x make it very inefficient to put
  variables written at potentially different intervals within the same
  file.  For this reason, MNC output is split into a few file ``base
  names'' which try to reflect the nature of their content.
  
\item All MNC output is currently done in a ``tile-per-file'' fashion
  since most NetCDF v3.x implementations cannot write safely within MPI
  or multi-threaded environments.  This tiling is done in a global
  fashion and the tile numbers are appended to the base names
  described above.  Some scripts to ``assemble'' output are available
  (\texttt{MITgcm/utils/matlab}).  More general manipulations can be
  accomplished with the
  \begin{rawhtml}
    <A href="http://nco.sourceforge.net"> 
  \end{rawhtml} 
\begin{verbatim}
NetCDF Operators (or "NCO") at http://nco.sourceforge.net
\end{verbatim}
  \begin{rawhtml} </A> \end{rawhtml}
  which is a very powerful and convenient set of tools for working
  with all NetCDF files.
  
\item On many systems, NetCDF has practical file size limits on the
  order of 2--4~GB (the maximum memory addressable with 32-bit
  pointers) due to a lack of operating system, compiler, and/or
  library support.  In cases where this limit is reached, it is
  generally a good idea to reduce write frequencies or restart from
  pickups.
  
\item MNC does not (yet) provide a mechanism for reading information
  from a single ``global'' file as can be done with the MDSIO
  package.  This is in progress.

\end{itemize}


\subsection{MNC Internals}

The \texttt{mnc} package is a two-level convenience library (or
``wrapper'') for most of the NetCDF Fortran API.  Its purpose is to
streamline the user interface to NetCDF by maintaining internal
relations (look-up tables) that associate strings (or names) with entities
such as NetCDF files, variables, and attributes.

The two levels of the \texttt{mnc} package are:
\begin{description}

\item[Upper level] \ 
  
  The upper level contains information about two kinds of
  associations:
  \begin{description}
  \item[grid type] is a lookup table indexed by a grid type name.
    Each grid type name is associated with a number of dimensions, the
    dimension sizes (one of which may be unlimited), and starting and
    ending index arrays.  The intent is to store all the necessary
    size and shape information for the Fortran arrays containing
    MITgcm--style ``tile'' variables (that is, a central region
    surrounded by a variably-sized ``halo'' or exchange region as
    shown in Figures \ref{fig:communication_primitives} and
    \ref{fig:tiling-strategy}).
  
  \item[variable type] is a lookup table indexed by a variable type
    name.  For each name, the table contains a reference to a grid
    type for the variable and the names and values of various
    attributes.
  \end{description}
  
  Within the upper level, these associations are not permanently tied
  to any particular NetCDF file.  This allows the information to be
  re-used over multiple file reads and writes.

\item[Lower level] \ 
  
  In the lower (or internal) level, associations are stored for NetCDF
  files and many of the entities that they contain including
  dimensions, variables, and global attributes.  All associations are
  on a per-file basis.  Thus, each entity is tied to a unique NetCDF
  file and will be created or destroyed when files are, respectively,
  opened or closed.

\end{description}


\subsubsection{MNC Grid--Types and Variable--Types}

As a convenience for users, the MNC package includes numerous routines
to aid in the writing of data to NetCDF format.  Probably the biggest
convenience is the use of pre-defined ``grid types'' and ``variable
types''.  These ``types'' are simply look-up tables that store
dimensions, indices, attributes, and other information that can all
be retrieved using a single character string.

The ``grid types'' are a way of mapping variables within MITgcm to
NetCDF arrays.  Within MITgcm, most spatial variables are defined
using two-- or three--dimensional arrays with ``overlap'' regions (see
Figures \ref{fig:communication_primitives} and
\ref{fig:tiling-strategy}), a possible vertical index, and tile
indices, such as the following ``U'' velocity:
\begin{verbatim}
      _RL  uVel (1-OLx:sNx+OLx,1-OLy:sNy+OLy,Nr,nSx,nSy)
\end{verbatim}
as defined in \filelink{model/inc/DYNVARS.h}{model-inc-DYNVARS.h}.

The grid type is a character string that encodes the presence and
types associated with the four possible dimensions.  The character
string follows the format
\begin{center}
  \texttt{H0\_H1\_H2\_\_V\_\_T}
\end{center}
where the terms \textit{H0}, \textit{H1}, \textit{H2}, \textit{V},
\textit{T} can be almost any combination of the following:
\begin{center}
  \begin{tabular}[h]{|ccc|c|c|}\hline
    \multicolumn{3}{|c|}{Horizontal} & Vertical & Time \\
    \textbf{H0}: location & \textbf{H1}: dimensions & \textbf{H2}: halo 
          & \textbf{V}: location & \textbf{T}: level  \\\hline
    \texttt{-} & xy & Hn & \texttt{-} & \texttt{-} \\
    U  &  x  &  Hy  &  i  &  t  \\
    V  &  y  &      &  c  &     \\
    Cen  &   &      &     &     \\
    Cor  &   &      &     &     \\\hline
  \end{tabular}
\end{center}
An example list of all pre-defined combinations is contained in the
file
\begin{center}
  \texttt{pkg/mnc/pre-defined\_grids.txt}.
\end{center}
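
For illustration, the grid type strings used in the example calls
later in this section split at the underscores as shown below.  This
is only a mechanical decomposition following the format above; note
that those calls write the vertical term in upper case (\texttt{C})
and use \texttt{-{}-} where no halo term is given.
\begin{center}
  {\footnotesize
    \begin{tabular}{|l|c|c|c|c|c|}\hline
      \textbf{Grid type string}  &  \textbf{H0}  &  \textbf{H1}  &
      \textbf{H2}  &  \textbf{V}  &  \textbf{T}  \\\hline
      \texttt{U\_xy\_Hn\_\_C\_\_t}  &  U  &  xy  &  Hn  &  C  &  t  \\
      \texttt{Cen\_xy\_Hn\_\_C\_\_t}  &  Cen  &  xy  &  Hn  &  C  &  t  \\
      \texttt{-\_-\_-{}-\_\_-\_\_t}  &  \texttt{-}  &  \texttt{-}  &
      \texttt{-{}-}  &  \texttt{-}  &  t  \\\hline
    \end{tabular}
  }
\end{center}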

The variable type is an association between a variable type name and the
following items:
\begin{center}
  \begin{tabular}[h]{|l|l|}\hline
    \textbf{Item}  & \textbf{Purpose}  \\\hline
    grid type  &  defines the in-memory arrangement  \\
    \texttt{bi,bj} dimensions  &  tiling indices, if present  \\\hline
  \end{tabular}
\end{center}
and is used by the \texttt{mnc\_cw\_*\_[R|W]} subroutines for reading
and writing variables.


\subsubsection{Using MNC: Examples}

Writing variables to NetCDF files can be accomplished in as few as two
function calls.  The first function call defines a variable type,
associates it with a name (character string), and provides additional
information about the indices for the tile (\texttt{bi},\texttt{bj})
dimensions.  The second function call will write the data at, if
necessary, the current time level within the model.

Examples of the initialization calls can be found in the file 
\filelink{model/src/ini\_mnc\_io.F}{model-src-ini_mnc_io.F}
where these function calls:
{\footnotesize
\begin{verbatim}
C     Create MNC definitions for DYNVARS.h variables
      CALL MNC_CW_ADD_VNAME('iter', '-_-_--__-__t', 0,0, myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('iter',1,
     &     'long_name','iteration_count', myThid)

      CALL MNC_CW_ADD_VNAME('model_time', '-_-_--__-__t', 0,0, myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('model_time',1,
     &     'long_name','Model Time', myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('model_time',1,'units','s', myThid)

      CALL MNC_CW_ADD_VNAME('U', 'U_xy_Hn__C__t', 4,5, myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('U',1,'units','m/s', myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('U',1,
     &     'coordinates','XU YU RC iter', myThid)

      CALL MNC_CW_ADD_VNAME('T', 'Cen_xy_Hn__C__t', 4,5, myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('T',1,'units','degC', myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('T',1,'long_name',
     &     'potential_temperature', myThid)
      CALL MNC_CW_ADD_VATTR_TEXT('T',1,
     &     'coordinates','XC YC RC iter', myThid)
\end{verbatim}
}
{\noindent initialize four \texttt{VNAME}s and add one or more NetCDF
  attributes to each.}
    
The four variables defined above are subsequently written at specific
time steps within
\filelink{model/src/write\_state.F}{model-src-write_state.F}
using the function calls:
{\footnotesize
\begin{verbatim}
C       Write dynvars using the MNC package
        CALL MNC_CW_SET_UDIM('state', -1, myThid)
        CALL MNC_CW_I_W('I','state',0,0,'iter', myIter, myThid)
        CALL MNC_CW_SET_UDIM('state', 0, myThid)
        CALL MNC_CW_RL_W('D','state',0,0,'model_time',myTime, myThid)
        CALL MNC_CW_RL_W('D','state',0,0,'U', uVel, myThid)
        CALL MNC_CW_RL_W('D','state',0,0,'T', theta, myThid)
\end{verbatim}
}

While it is easiest to write variables within typical 2D and 3D fields
where all data is known at a given time, it is also possible to write
fields where only a portion (\textit{e.g.}, a ``slab'' or ``slice'') is
known at a given instant.  An example is provided within
\filelink{pkg/mom\_vecinv/mom\_vecinv.F}{pkg-mom_vecinv-mom_vecinv.F}
where an offset vector is used: {\footnotesize
\begin{verbatim}
       IF (useMNC .AND. snapshot_mnc) THEN
         CALL MNC_CW_RL_W_OFFSET('D','mom_vi',bi,bj, 'fV', uCf,
     &        offsets, myThid)
         CALL MNC_CW_RL_W_OFFSET('D','mom_vi',bi,bj, 'fU', vCf,
     &        offsets, myThid)
       ENDIF
\end{verbatim}
}
to write a 3D field one depth slice at a time.

Each element in the offset vector corresponds (in order) to the
dimensions of the ``full'' (or virtual) array and specifies which are
known at the time of the call.  A zero within the offset array means
that all values along that dimension are available while a positive
integer means that only values along that index of the dimension are
available.  In all cases, the array passed is assumed to start (that
is, have an in-memory structure) coinciding with the start of the
specified slice.  Thus, using this offset array mechanism, a slice
can be written along any single dimension or combinations of
dimensions.
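
As a sketch of this convention, assume a three-dimensional field whose
first three dimensions are $x$, $y$, and depth, and let $i$, $j$, and
$k$ be positive indices along them.  Under that assumption, the
leading elements of the offset vector would select the following
portions (any remaining elements are taken to be zero):
\begin{center}
  {\footnotesize
    \begin{tabular}{|c|l|}\hline
      \textbf{Leading offsets}  &  \textbf{Portion passed in the call}  \\\hline
      $(0,\,0,\,0)$  &  the whole three-dimensional field  \\
      $(0,\,0,\,k)$  &  the horizontal slice at depth level $k$  \\
      $(0,\,j,\,0)$  &  the vertical section at $y$ index $j$  \\
      $(i,\,j,\,0)$  &  the single vertical column at $(i,j)$  \\\hline
    \end{tabular}
  }
\end{center}
The second row corresponds to the depth-slice case described above.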
