/[MITgcm]/manual/s_autodiff/text/doc_ad_2.tex

Diff of /manual/s_autodiff/text/doc_ad_2.tex

Parent Directory | Revision Log | View Revision Graph Revision Graph | View Patch Patch

-revision 1.13 by heimbach,
Fri Jan 18 22:56:45 2002 UTC
+revision 1.17 by edhill,
Sat Oct 16 03:40:17 2004 UTC
 Line 42 
 Jacobian matrices of the forward code's
  %**********************************************************************
  \section{Some basic algebra}
  \label{sec_ad_algebra}
+ \begin{rawhtml}
+ <!-- CMIREDIR:sec_ad_algebra: -->
+ \end{rawhtml}
  %**********************************************************************
  Let $ \cal{M} $ be a general nonlinear, model, i.e. a
-Line 557 
 Because of the local character of the de
+Line 560 
 Because of the local character of the de
  (a derivative is defined w.r.t. a point along the trajectory),
  the intermediate results of the model trajectory
  $\vec{v}^{(\lambda+1)}={\cal M}_{\lambda}(v^{(\lambda)})$
- are needed to evaluate the intermediate Jacobian
+ may be required to evaluate the intermediate Jacobian
  $M_{\lambda}|_{\vec{v}^{(\lambda)}} \, \delta \vec{v}^{(\lambda)} $.
+ This is the case e.g. for nonlinear expressions
+ (momentum advection, nonlinear equation of state), state-dependent
+ conditional statements (parameterization schemes).
  In the forward mode, the intermediate results are required
  in the same order as computed by the full forward model ${\cal M}$,
  but in the reverse mode they are required in the reverse order.
-Line 569 
 point of evaluation has to be recomputed
+Line 575 
 point of evaluation has to be recomputed
  A method to balance the amount of recomputations vs.
  storage requirements is called {\sf checkpointing}
- (e.g. \cite{res-eta:98}).
+ (e.g. \cite{gri:92}, \cite{res-eta:98}).
  It is depicted in \ref{fig:3levelcheck} for a 3-level checkpointing
  [as an example, we give explicit numbers for a 3-day
  integration with a 1-hourly timestep in square brackets].
-Line 580 
 In a first step, the model trajectory is
+Line 586 
 In a first step, the model trajectory is
  $ {n}^{lev3} $ subsections [$ {n}^{lev3} $=3 1-day intervals],
  with the label $lev3$ for this outermost loop.
  The model is then integrated along the full trajectory,
- and the model state stored only at every $ k_{i}^{lev3} $-th timestep
+ and the model state stored to disk only at every $ k_{i}^{lev3} $-th timestep
  [i.e. 3 times, at
  $ i = 0,1,2 $ corresponding to $ k_{i}^{lev3} = 0, 24, 48 $].
+ In addition, the cost function is computed, if needed.
  %
  \item [$lev2$]
  In a second step each subsection itself is divided into
- $ {n}^{lev2} $ sub-subsections
+ $ {n}^{lev2} $ subsections
  [$ {n}^{lev2} $=4 6-hour intervals per subsection].
  The model picks up at the last outermost dumped state
  $ v_{k_{n}^{lev3}} $ and is integrated forward in time along
  the last subsection, with the label $lev2$ for this
  intermediate loop.
- The model state is now stored at every $ k_{i}^{lev2} $-th
+ The model state is now stored to disk at every $ k_{i}^{lev2} $-th
  timestep
  [i.e. 4 times, at
  $ i = 0,1,2,3 $ corresponding to $ k_{i}^{lev2} = 48, 54, 60, 66 $].
-Line 600 
 $ i = 0,1,2,3 $ corresponding to $ k_{i}
+Line 607 
 $ i = 0,1,2,3 $ corresponding to $ k_{i}
  \item [$lev1$]
  Finally, the model picks up at the last intermediate dump state
  $ v_{k_{n}^{lev2}} $ and is integrated forward in time along
- the last sub-subsection, with the label $lev1$ for this
+ the last subsection, with the label $lev1$ for this
  intermediate loop.
- Within this sub-subsection only, the model state is stored
+ Within this sub-subsection only, parts of the model state is stored
- at every timestep
+ to memory at every timestep
  [i.e. every hour $ i=0,...,5$ corresponding to
  $ k_{i}^{lev1} = 66, 67, \ldots, 71 $].
- Thus, the  final state $ v_n = v_{k_{n}^{lev1}} $ is reached
+ The  final state $ v_n = v_{k_{n}^{lev1}} $ is reached
- and the model state of all proceeding timesteps along the last
+ and the model state of all preceding timesteps along the last
- sub-subsections are available, enabling integration backwards
+ innermost subsection are available, enabling integration backwards
- in time along the last sub-subsection.
+ in time along the last subsection.
- Thus, the adjoint can be computed along this last
+ The adjoint can thus be computed along this last
- sub-subsection $k_{n}^{lev2}$.
+ subsection $k_{n}^{lev2}$.
  %
  \end{itemize}
  %
  This procedure is repeated consecutively for each previous
- sub-subsection $k_{n-1}^{lev2}, \ldots, k_{1}^{lev2} $
+ subsection $k_{n-1}^{lev2}, \ldots, k_{1}^{lev2} $
  carrying the adjoint computation to the initial time
  of the subsection $k_{n}^{lev3}$.
  Then, the procedure is repeated for the previous subsection
-Line 627 
 $k_{1}^{lev3}$.
+Line 634 
 $k_{1}^{lev3}$.
  For the full model trajectory of
  $ n^{lev3} \cdot n^{lev2} \cdot n^{lev1} $ timesteps
  the required storing of the model state was significantly reduced to
- $ n^{lev1} + n^{lev2} + n^{lev3} $
+ $ n^{lev2} + n^{lev3} $ to disk and roughly $ n^{lev1} $ to memory
  [i.e. for the 3-day integration with a total oof 72 timesteps
- the model state was stored 13 times].
+ the model state was stored 7 times to disk and roughly 6 times
+ to memory].
  This saving in memory comes at a cost of a required
 full forward integrations of the model (one for each
  checkpointing level).
- The balance of storage vs. recomputation certainly depends
+ The optimal balance of storage vs. recomputation certainly depends
- on the computing resources available.
+ on the computing resources available and may be adjusted by
+ adjusting the partitioning among the
+ $ n^{lev3}, \,\, n^{lev2}, \,\, n^{lev1} $.
  \begin{figure}[t!]
  \begin{center}
-Line 664 
 Schematic view of intermediate dump and
+Line 674 
 Schematic view of intermediate dump and
  % \subsection{Error covariance estimate and Hessian matrix}
  % \label{sec_hessian}
  \newpage
  %**********************************************************************
- \section{AD-specific setup by example: sensitivity of carbon sequestration}
+ \section{TLM and ADM generation in general}
- \label{sec_ad_setup_ex}
+ \label{sec_ad_setup_gen}
+ \begin{rawhtml}
+ <!-- CMIREDIR:sec_ad_setup_gen: -->
+ \end{rawhtml}
  %**********************************************************************
- The MITGCM has been adapted to enable AD using TAMC or TAF.
+ In this section we describe in a general fashion
- The present description, therefore, is specific to the
+ the parts of the code that are relevant for automatic
- use of TAMC or TAF as AD tool.
+ differentiation using the software tool TAF.
- The following sections describe the steps which are necessary to
- generate a tangent linear or adjoint model of the MITGCM.
+ \input{part5/doc_ad_the_model}
- We take as an example the sensitivity of carbon sequestration
- in the ocean.
+ The basic flow is depicted in \ref{fig:adthemodel}.
- The AD-relevant hooks in the code are sketched in
+ If CPP option {\tt ALLOW\_AUTODIFF\_TAMC} is defined, the driver routine
- \ref{fig:adthemodel}, \ref{fig:adthemain}.
+ {\it the\_model\_main}, instead of calling {\it the\_main\_loop},
+ invokes the adjoint of this routine, {\it adthe\_main\_loop},
- \subsection{Overview of the experiment}
+ which is the toplevel routine in terms of automatic differentiation.
+ The routine {\it adthe\_main\_loop} has been generated by TAF.
- We describe an adjoint sensitivity analysis of out-gassing from
+ It contains both the forward integration of the full model, the
- the ocean into the atmosphere of a carbon-like tracer injected
+ cost function calculation,
- into the ocean interior (see \cite{hil-eta:01}).
+ any additional storing that is required for efficient checkpointing,
+ and the reverse integration of the adjoint model.
- \subsubsection{Passive tracer equation}
+ [DESCRIBE IN A SEPARATE SECTION THE WORKING OF THE TLM]
- For this work the MITGCM was augmented with a thermodynamically
- inactive tracer, $C$. Tracer residing in the ocean
+ In Fig. \ref{fig:adthemodel}
- model surface layer is out-gassed according to a relaxation time scale,
+ the structure of {\it adthe\_main\_loop} has been strongly
- $\mu$. Within the ocean interior, the tracer is passively advected
+ simplified to focus on the essentials; in particular, no checkpointing
- by the ocean model currents. The full equation for the time evolution
+ procedures are shown here.
- %
+ Prior to the call of {\it adthe\_main\_loop}, the routine
- \begin{equation}
+ {\it ctrl\_unpack} is invoked to unpack the control vector
- \label{carbon_ddt}
+ or initialise the control variables.
- \frac{\partial C}{\partial t} \, = \,
+ Following the call of {\it adthe\_main\_loop},
- -U\cdot \nabla C \, - \, \mu C \, + \, \Gamma(C) \,+ \, S
+ the routine {\it ctrl\_pack}
- \end{equation}
+ is invoked to pack the control vector
- %
+ (cf. Section \ref{section_ctrl}).
- also includes a source term $S$. This term
+ If gradient checks are to be performed, the option
- represents interior sources of $C$ such as would arise due to
+ {\tt ALLOW\_GRADIENT\_CHECK} is defined. In this case
- direct injection.
+ the driver routine {\it grdchk\_main} is called after
- The velocity term, $U$, is the sum of the
+ the gradient has been computed via the adjoint
- model Eulerian circulation and an eddy-induced velocity, the latter
+ (cf. Section \ref{section_grdchk}).
- parameterized according to Gent/McWilliams
- (\cite{gen-mcw:90, gen-eta:95}).
- The convection function, $\Gamma$, mixes $C$ vertically wherever the
- fluid is locally statically unstable.
- The out-gassing time scale, $\mu$, in eqn. (\ref{carbon_ddt})
- is set so that \( 1/\mu \sim 1 \ \mathrm{year} \) for the surface
- ocean and $\mu=0$ elsewhere. With this value, eqn. (\ref{carbon_ddt})
- is valid as a prognostic equation for small perturbations in oceanic
- carbon concentrations. This configuration provides a
- powerful tool for examining the impact of large-scale ocean circulation
- on $ CO_2 $ out-gassing due to interior injections.
- As source we choose a constant in time injection of
- $ S = 1 \,\, {\rm mol / s}$.
- \subsubsection{Model configuration}
- The model configuration employed has a constant
- $4^\circ \times 4^\circ$ resolution horizontal grid and realistic
- geography and bathymetry. Twenty vertical layers are used with
- vertical spacing ranging
- from 50 m near the surface to 815 m at depth.
- Driven to steady-state by climatological wind-stress, heat and
- fresh-water forcing the model reproduces well known large-scale
- features of the ocean general circulation.
- \subsubsection{Out-gassing cost function}
- To quantify and understand out-gassing due to injections of $C$
- in eqn. (\ref{carbon_ddt}),
- we define a cost function $ {\cal J} $ that measures the total amount of
- tracer out-gassed at each timestep:
- %
- \begin{equation}
- \label{cost_tracer}
- {\cal J}(t=T)=\int_{t=0}^{t=T}\int_{A} \mu C \, dA \, dt
- \end{equation}
- %
- Equation(\ref{cost_tracer}) integrates the out-gassing term, $\mu C$,
- from (\ref{carbon_ddt})
- over the entire ocean surface area, $A$, and accumulates it
- up to time $T$.
- Physically, ${\cal J}$ can be thought of as representing the amount of
- $CO_2$ that our model predicts would be out-gassed following an
- injection at rate $S$.
- The sensitivity of ${\cal J}$ to the spatial location of $S$,
- $\frac{\partial {\cal J}}{\partial S}$,
- can be used to identify regions from which circulation
- would cause $CO_2$ to rapidly out-gas following injection
- and regions in which $CO_2$ injections would remain effectively
- sequestered within the ocean.
- \subsection{Code configuration}
- The model configuration for this experiment resides under the
- directory {\it verification/carbon/}.
- The code customization routines are in {\it verification/carbon/code/}:
- %
- \begin{itemize}
- %
- \item {\it .genmakerc}
- %
- \item {\it COST\_CPPOPTIONS.h}
- %
- \item {\it CPP\_EEOPTIONS.h}
- %
- \item {\it CPP\_OPTIONS.h}
- %
- \item {\it CTRL\_OPTIONS.h}
- %
- \item {\it ECCO\_OPTIONS.h}
- %
- \item {\it SIZE.h}
- %
- \item {\it adcommon.h}
- %
- \item {\it tamc.h}
- %
- \end{itemize}
- %
- The runtime flag and parameters settings are contained in
- {\it verification/carbon/input/},
- together with the forcing fields and and restart files:
- %
- \begin{itemize}
- %
- \item {\it data}
- %
- \item {\it data.cost}
- %
- \item {\it data.ctrl}
- %
- \item {\it data.gmredi}
- %
- \item {\it data.grdchk}
- %
- \item {\it data.optim}
- %
- \item {\it data.pkg}
- %
- \item {\it eedata}
- %
- \item {\it topog.bin}
- %
- \item {\it windx.bin, windy.bin}
- %
- \item {\it salt.bin, theta.bin}
- %
- \item {\it SSS.bin, SST.bin}
- %
- \item {\it pickup*}
- %
- \end{itemize}
- %
- Finally, the file to generate the adjoint code resides in
- $ adjoint/ $:
- %
- \begin{itemize}
- %
- \item {\it makefile}
- %
- \end{itemize}
- %
- Below we describe the customizations of this files which are
+ %------------------------------------------------------------------
- specific to this experiment.
- \subsubsection{File {\it .genmakerc}}
+ \subsection{General setup
- This file overwrites default settings of {\it genmake}.
+ \label{section_ad_setup}}
- In the present example it is used to switch on the following
- packages which are related to automatic differentiation
- and are disabled by default: \\
- \hspace*{4ex} {\tt set ENABLE=( autodiff cost ctrl ecco gmredi grdchk kpp )}  \\
- Other packages which are not needed are switched off: \\
- \hspace*{4ex} {\tt set DISABLE=( aim obcs zonal\_filt shap\_filt cal exf )}
- \subsubsection{File {\it COST\_CPPOPTIONS.h,  CTRL\_OPTIONS.h}}
- These files used to contain package-specific CPP-options
- (see Section \ref{???}).
- For technical reasons those options have been grouped together
- in the file {\it ECCO\_OPTIONS.h}.
- To retain the modularity, the files have been kept and contain
- the standard include of the {\it CPP\_OPTIONS.h} file.
- \subsubsection{File {\it CPP\_EEOPTIONS.h}}
- This file contains 'wrapper'-specific CPP options.
- It only needs to be changed if the code is to be run
- in a parallel environment (see Section \ref{???}).
- \subsubsection{File {\it CPP\_OPTIONS.h}}
- This file contains model-specific CPP options
- (see Section \ref{???}).
- Most options are related to the forward model setup.
- They are identical to the global steady circulation setup of
- {\it verification/exp2/}.
- The three options specific to this experiment are \\
- \hspace*{4ex} {\tt \#define ALLOW\_PASSIVE\_TRACER} \\
- This flag enables the code to carry through the
- advection/diffusion of a passive tracer along the
- model integration. \\
- \hspace*{4ex} {\tt \#define ALLOW\_MIT\_ADJOINT\_RUN} \\
- This flag enables the inclusion of some AD-related fields
- concerning initialization, link between control variables
- and forward model variables, and the call to the top-level
- forward/adjoint subroutine {\it adthe\_main\_loop}
- instead of {\it the\_main\_loop}. \\
- \hspace*{4ex} {\tt \#define ALLOW\_GRADIENT\_CHECK} \\
- This flag enables the gradient check package.
- After computing the unperturbed cost function and its gradient,
- a series of computations are performed for which \\
- $\bullet$ an element of the control vector is perturbed \\
- $\bullet$ the cost function w.r.t. the perturbed element is
- computed \\
- $\bullet$ the difference between the perturbed and unperturbed
- cost function is computed to compute the finite difference gradient \\
- $\bullet$ the finite difference gradient is compared with the
- adjoint-generated gradient.
- The gradient check package is further described in Section ???.
- \subsubsection{File {\it ECCO\_OPTIONS.h}}
+ In order to configure AD-related setups the following packages need
+ to be enabled:
+ {\it
+ \begin{table}[h!]
+ \begin{tabular}{l}
+ autodiff \\
+ ctrl \\
+ cost \\
+ grdchk \\
+ \end{tabular}
+ \end{table}
+ }
+ The packages are enabled by adding them to your experiment-specific
+ configuration file
+ {\it packages.conf} (see Section ???).
- The CPP options of several AD-related packages are grouped
+ The following AD-specific CPP option files need to be customized:
- in this file:
  %
  \begin{itemize}
  %
- \item
+ \item {\it ECCO\_CPPOPTIONS.h} \\
- Adjoint support package: {\it pkg/autodiff/} \\
+ This header file collects CPP options for the packages
- This package contains hand-written adjoint code such as
+ {\it autodiff, cost, ctrl} as well as AD-unrelated options for
- active file handling, flow directives for files which must not
+ the external forcing package {\it exf}.
- be differentiated, and TAMC-specific header files. \\
+ \footnote{NOTE: These options are not set in their package-specific
- \hspace*{4ex} {\tt \#define ALLOW\_AUTODIFF\_TAMC} \\
+ headers such as {\it COST\_CPPOPTIONS.h}, but are instead collected
- defines TAMC-related features in the code. \\
+ in the single header file {\it ECCO\_CPPOPTIONS.h}.
- \hspace*{4ex} {\tt \#define ALLOW\_TAMC\_CHECKPOINTING} \\
+ The package-specific header files serve as simple
- enables the checkpointing feature of TAMC
+ placeholders at this point.}
- (see Section \ref{???}).
+ %
- In the present example a 3-level checkpointing is implemented.
+ \item {\it tamc.h} \\
- The code contains the relevant store directives, common block
+ This header configures the splitting of the time stepping loop
- and tape initializations, storing key computation,
+ w.r.t. the 3-level checkpointing (see section ???).
- and loop index handling.
- The checkpointing length at each level is defined in
- file {\it tamc.h}, cf. below.
- %
- \item Cost function package: {\it pkg/cost/} \\
- This package contains all relevant routines for
- initializing, accumulating and finalizing the cost function
- (see Section \ref{???}). \\
- \hspace*{4ex} {\tt \#define ALLOW\_COST} \\
- enables all general aspects of the cost function handling,
- in particular the hooks in the forward code for
- initializing, accumulating and finalizing the cost function. \\
- \hspace*{4ex} {\tt \#define ALLOW\_COST\_TRACER} \\
- includes the call to the cost function for this
- particular experiment, eqn. (\ref{cost_tracer}).
- %
- \item Control variable package: {\it pkg/ctrl/} \\
- This package contains all relevant routines for
- the handling of the control vector.
- Each control variable can be enabled/disabled with its own flag: \\
- \begin{tabular}{ll}
- \hspace*{2ex} {\tt \#define ALLOW\_THETA0\_CONTROL} &
- initial temperature \\
- \hspace*{2ex} {\tt \#define ALLOW\_SALT0\_CONTROL} &
- initial salinity \\
- \hspace*{2ex} {\tt \#define ALLOW\_TR0\_CONTROL} &
- initial passive tracer concentration \\
- \hspace*{2ex} {\tt \#define ALLOW\_TAUU0\_CONTROL} &
- zonal wind stress \\
- \hspace*{2ex} {\tt \#define ALLOW\_TAUV0\_CONTROL} &
- meridional wind stress \\
- \hspace*{2ex} {\tt \#define ALLOW\_SFLUX0\_CONTROL} &
- freshwater flux \\
- \hspace*{2ex} {\tt \#define ALLOW\_HFLUX0\_CONTROL} &
- heat flux \\
- \hspace*{2ex} {\tt \#define ALLOW\_DIFFKR\_CONTROL} &
- diapycnal diffusivity \\
- \hspace*{2ex} {\tt \#undef ALLOW\_KAPPAGM\_CONTROL} &
- isopycnal diffusivity \\
- \end{tabular}
  %
  \end{itemize}
- \subsubsection{File {\it SIZE.h}}
+ %------------------------------------------------------------------
- The file contains the grid point dimensions of the forward
+ \subsection{Building the AD code
- model. It is identical to the {\it verification/exp2/}: \\
+ \label{section_ad_build}}
- \hspace*{4ex} {\tt sNx = 90} \\
- \hspace*{4ex} {\tt sNy = 40} \\
- \hspace*{4ex} {\tt Nr = 20} \\
- It corresponds to a single-tile/single-processor setup:
- {\tt nSx = nSy = 1, nPx = nPy = 1},
- with standard overlap dimensioning
- {\tt OLx = OLy = 3}.
- \subsubsection{File {\it adcommon.h}}
- This file contains common blocks of some adjoint variables
- that are generated by TAMC.
- The common blocks are used by the adjoint support routine
- {\it addummy\_in\_stepping} which needs to access those variables:
- \begin{tabular}{ll}
- \hspace*{4ex} {\tt common /addynvars\_r/} &
- \hspace*{4ex} is related to {\it DYNVARS.h} \\
- \hspace*{4ex} {\tt common /addynvars\_cd/} &
- \hspace*{4ex} is related to {\it DYNVARS.h} \\
- \hspace*{4ex} {\tt common /addynvars\_diffkr/} &
- \hspace*{4ex} is related to {\it DYNVARS.h} \\
- \hspace*{4ex} {\tt common /addynvars\_kapgm/} &
- \hspace*{4ex} is related to {\it DYNVARS.h} \\
- \hspace*{4ex} {\tt common /adtr1\_r/} &
- \hspace*{4ex} is related to {\it TR1.h} \\
- \hspace*{4ex} {\tt common /adffields/} &
- \hspace*{4ex} is related to {\it FFIELDS.h}\\
- \end{tabular}
- Note that if the structure of the common block changes in the
- above header files of the forward code, the structure
- of the adjoint common blocks will change accordingly.
- Thus, it has to be made sure that the structure of the
- adjoint common block in the hand-written file {\it adcommon.h}
- complies with the automatically generated adjoint common blocks
- in {\it adjoint\_model.F}.
- \subsubsection{File {\it tamc.h}}
- This routine contains the dimensions for TAMC checkpointing.
+ The build process of an AD code is very similar to building
+ the forward model. However, depending on which AD code one wishes
+ to generate, and on which AD tool is available (TAF or TAMC),
+ the following {\tt make} targets are available:
+ \begin{table}[h!]
+ {\footnotesize
+ \begin{tabular}{ccll}
+ ~ & {\it AD-target} & {\it output} & {\it description} \\
+ \hline
+ \hline
+ (1) & {\tt <MODE><TOOL>only} & {\tt <MODE>\_<TOOL>\_output.f}  &
+ generates code for $<$MODE$>$ using $<$TOOL$>$ \\
+ ~ & ~ & ~ & no {\tt make} dependencies on {\tt .F .h} \\
+ ~ & ~ & ~ & useful for compiling on remote platforms \\
+ \hline
+ (2) & {\tt <MODE><TOOL>} & {\tt <MODE>\_<TOOL>\_output.f}  &
+ generates code for $<$MODE$>$ using $<$TOOL$>$ \\
+ ~ & ~ & ~ & includes {\tt make} dependencies on {\tt .F .h} \\
+ ~ & ~ & ~ & i.e. input for $<$TOOL$>$ may be re-generated \\
+ \hline
+ (3) & {\tt <MODE>all} & {\tt mitgcmuv\_<MODE>}  &
+ generates code for $<$MODE$>$ using $<$TOOL$>$ \\
+ ~ & ~ & ~ & and compiles all code \\
+ ~ & ~ & ~ & (use of TAF is set as default) \\
+ \hline
+ \hline
+ \end{tabular}
+ }
+ \end{table}
+ %
+ Here, the following placeholders are used
  %
  \begin{itemize}
  %
- \item {\tt \#ifdef ALLOW\_TAMC\_CHECKPOINTING} \\
+ \item [$<$TOOL$>$]
--level checkpointing is enabled, i.e. the timestepping
- is divided into three different levels (see Section \ref{???}).
- The model state of the outermost ({\tt nchklev\_3}) and the
- intermediate ({\tt nchklev\_2}) timestepping loop are stored to file
- (handled in {\it the\_main\_loop}).
- The innermost loop ({\tt nchklev\_1})
- avoids I/O by storing all required variables
- to common blocks. This storing may also be necessary if
- no checkpointing is chosen
- (nonlinear functions, if-statements, iterative loops, ...).
- In the present example the dimensions are chosen as follows: \\
- \hspace*{4ex} {\tt nchklev\_1      =  36 } \\
- \hspace*{4ex} {\tt nchklev\_2      =  30 } \\
- \hspace*{4ex} {\tt nchklev\_3      =  60 } \\
- To guarantee that the checkpointing intervals span the entire
- integration period the following relation must be satisfied: \\
- \hspace*{4ex} {\tt nchklev\_1*nchklev\_2*nchklev\_3 $ \ge $ nTimeSteps} \\
- where {\tt nTimeSteps} is either specified in {\it data}
- or computed via \\
- \hspace*{4ex} {\tt nTimeSteps = (endTime-startTime)/deltaTClock }.
- %
- \item {\tt \#undef ALLOW\_TAMC\_CHECKPOINTING} \\
- No checkpointing is enabled.
- In this case the relevant counter is {\tt nchklev\_0}.
- Similar to above, the following relation has to be satisfied \\
- \hspace*{4ex} {\tt nchklev\_0 $ \ge $ nTimeSteps}.
  %
- \end{itemize}
+ \begin{itemize}
- The following parameters may be worth describing: \\
  %
- \hspace*{4ex} {\tt isbyte} \\
+ \item {\tt TAF}
- \hspace*{4ex} {\tt maxpass} \\
+ \item {\tt TAMC}
- ~
- \subsubsection{File {\it makefile}}
- This file contains all relevant parameter flags and
- lists to run TAMC or TAF.
- It is assumed that TAMC is available to you, either locally,
- being installed on your network, or remotely through the 'TAMC Utility'.
- TAMC is called with the command {\tt tamc} followed by a
- number of options. They are described in detail in the
- TAMC manual \cite{gie:99}.
- Here we briefly discuss the main flags used in the {\it makefile}
  %
- \begin{itemize}
- \item [{\tt tamc}] {\tt
- -input <variable names>
- -output <variable name> -r4 ... \\
- -toplevel <S/R name> -reverse <file names>
- }
  \end{itemize}
  %
+ \item [$<$MODE$>$]
+ %
  \begin{itemize}
  %
- \item {\tt -toplevel <S/R name>} \\
+ \item {\tt ad} generates the adjoint model (ADM)
- Name of the toplevel routine, with respect to which the
+ \item {\tt ftl} generates the tangent linear model (TLM)
- control flow analysis is performed.
+ \item {\tt svd} generates both ADM and TLM for \\
- %
+ singular value decomposition (SVD) type calculations
- \item {\tt -input <variable names>} \\
- List of independent variables $ u $ with respect to which the
- dependent variable $ J $ is differentiated.
- %
- \item {\tt -output <variable name>} \\
- Dependent variable $ J $  which is to be differentiated.
- %
- \item {\tt -reverse <file names>} \\
- Adjoint code is generated to compute the sensitivity of an
- independent variable w.r.t.  many dependent variables.
- In the discussion of Section ???
- the generated adjoint top-level routine computes the product
- of the transposed Jacobian matrix $ M^T $ times
- the gradient vector $ \nabla_v J $.
- \\
- {\tt <file names>} refers to the list of files {\it .f} which are to be
- analyzed by TAMC. This list is generally smaller than the full list
- of code to be compiled. The files not contained are either
- above the top-level routine (some initializations), or are
- deliberately hidden from TAMC, either because hand-written
- adjoint routines exist, or the routines must not (or don't have to)
- be differentiated. For each routine which is part of the flow tree
- of the top-level routine, but deliberately hidden from TAMC
- (or for each package which contains such routines),
- a corresponding file {\it .flow} exists containing flow directives
- for TAMC.
  %
- \item {\tt -r4} \\
+ \end{itemize}
- ~
  %
  \end{itemize}
+ For example, to generate the adjoint model using TAF after routines ({\tt .F})
+ or headers ({\tt .h}) have been modified, but without compilation,
+ type {\tt make adtaf};
+ or, to generate the tangent linear model using TAMC without
+ re-generating the input code, type {\tt make ftltamconly}.
- \subsubsection{The input parameter files}
- \paragraph{File {\it data}}
- \paragraph{File {\it data.cost}}
- \paragraph{File {\it data.ctrl}}
- \paragraph{File {\it data.gmredi}}
- \paragraph{File {\it data.grdchk}}
- \paragraph{File {\it data.optim}}
- \paragraph{File {\it data.pkg}}
- \paragraph{File {\it eedata}}
+ A typical full build process to generate the ADM via TAF would
+ look like follows:
- \paragraph{File {\it topog.bin}}
+ \begin{verbatim}
+ % mkdir build
- \paragraph{File {\it windx.bin, windy.bin}}
+ % cd build
+ % ../../../tools/genmake2 -mods=../code_ad
- \paragraph{File {\it salt.bin, theta.bin}}
+ % make depend
+ % make adall
- \paragraph{File {\it SSS.bin, SST.bin}}
+ \end{verbatim}
- \paragraph{File {\it pickup*}}
- \subsection{Compiling the model and its adjoint}
- The built process of the adjoint model is slightly more
+ %------------------------------------------------------------------
- complex than that of compiling the forward code.
- The main reason is that the adjoint code generation requires
- a specific list of routines that are to be differentiated
- (as opposed to the automatic generation of a list of
- files to be compiled by genmake).
- This list excludes routines that don't have to be or must not be
- differentiated. For some of the latter routines flow directives
- may be necessary, a list of which has to be given as well.
- For this reason, a separate {\it makefile} is currently
- maintained in the directory {\tt adjoint/}. This
- makefile is responsible for the adjoint code generation.
- In the following we describe the build process step by step,
+ \subsection{The AD build process in detail
- assuming you are in the directory {\tt bin/}.
+ \label{section_ad_build_detail}}
- A summary of steps to follow is given at the end.
- \paragraph{Adjoint code generation and compilation -- step by step}
+ The {\tt make <MODE>all} target consists of the following procedures:
  \begin{enumerate}
  %
  \item
- {\tt ln -s ../verification/???/code/.genmakerc .} \\
+ A header file {\tt AD\_CONFIG.h} is generated which contains a CPP option
- {\tt ln -s ../verification/???/code/*.[Fh] .} \\
+ on which code ought to be generated. Depending on the {\tt make} target,
- Link your customized genmake options, header files,
+ the contents is
- and modified code to the compile directory.
+ \begin{itemize}
- %
  \item
- {\tt ../tools/genmake -makefile} \\
+ {\tt \#define ALLOW\_ADJOINT\_RUN}
- Generate your Makefile (cf. Section ???).
- %
  \item
- {\tt make depend} \\
+ {\tt \#define ALLOW\_TANGENTLINEAR\_RUN}
- Dependency analysis for the CPP pre-compiler (cf. Section ???).
- %
  \item
- {\tt make small\_f} \\
+ {\tt \#define ALLOW\_ECCO\_OPTIMIZATION}
- This is the first difference between forward code compilation
+ \end{itemize}
- and adjoint code generation and compilation.
- Instead of going through the entire compilation process
- (CPP precompiling -- {\tt .f}, object code generation -- {\tt .o},
- linking of object files and libraries to generate executable),
- only the CPP compiler is invoked at this stage to generate
- the {\tt .f} files.
  %
  \item
- {\tt cd ../adjoint} \\
+ A single file {\tt <MODE>\_input\_code.f} is concatenated
- {\tt make adtaf} or {\tt make adtamc} \\
+ consisting of all {\tt .f} files that are part of the list {\bf AD\_FILES}
- Depending on whether you have TAF or TAMC at your disposal,
+ and all {\tt .flow} files that are part of the list {\bf AD\_FLOW\_FILES}.
- you'll choose {\tt adtaf} or {\tt adtamc} as your
- make target for the {\it makefile} in the directory {\tt adjoint/}.
- Several things happen at this stage.
- %
- \begin{enumerate}
  %
  \item
- The initial template file {\it adjoint\_model.F} which is part
+ The AD tool is invoked with the {\bf <MODE>\_<TOOL>\_FLAGS}.
- of the compiling list created by {\it genmake} is restored.
+ The default AD tool flags in {\tt genmake2} can be overrwritten by
+ an {\tt adjoint\_options} file (similar to the platform-specific
+ {\tt build\_options}, see Section ???.
+ The AD tool writes the resulting AD code into the file
+ {\tt <MODE>\_input\_code\_ad.f}
  %
  \item
- All Fortran routines {\tt *.f} in {\tt bin/} are
+ A short sed script {\tt adjoint\_sed} is applied to
- concatenated into a single file (it's current name is
+ {\tt <MODE>\_input\_code\_ad.f}
- {\it tamc\_code.f}).
+ to reinstate {\bf myThid} into the CALL argument list of active file I/O.
+ The result is written to file {\tt <MODE>\_<TOOL>\_output.f}.
  %
  \item
- Adjoint code is generated by TAMC or TAF.
+ All routines are compiled and an executable is generated
- The adjoint code is written to the file {\it tamc\_code\_ad.f}.
+ (see Table ???).
- It contains all adjoint routines of the forward routines
- concatenated in {\it tamc\_code.f}.
- For a given forward routines {\tt subroutine routinename}
- the adjoint routine is named {\tt adsubroutine routinename}
- by default (that default can be changed via the flag
- {\tt -admark <markname>}).
- Furthermore, it may contain modified code which
- incorporates the translation of adjoint store directives
- into specific Fortran code.
- For a given forward routines {\tt subroutine routinename}
- the modified routine is named {\tt mdsubroutine routinename}.
- TAMC or TAF info is written to file
- {\it tamc\_code.prot} or {\it taf.log}, respectively.
  %
  \end{enumerate}
+ \subsubsection{The list AD\_FILES and {\tt .list} files}
+ Not all routines are presented to the AD tool.
+ Routines typically hidden are diagnostics routines which
+ do not influence the cost function, but may create
+ artificial flow dependencies such as I/O of active variables.
+ {\tt genmake2} generates a list (or variable) {\bf AD\_FILES}
+ which contains all routines that are shown to the AD tool.
+ This list is put together from all files with suffix {\tt .list}
+ that {\tt genmake2} finds in its search directories.
+ The list file for the core MITgcm routines is in {\tt model/src/}
+ is called {\tt model\_ad\_diff.list}.
+ Note that no wrapper routine is shown to TAF. These are either
+ not visible at all to the AD code, or hand-written AD code
+ is available (see next section).
+ Each package directory contains its package-specific
+ list file {\tt <PKG>\_ad\_diff.list}. For example,
+ {\tt pkg/ptracers/} contains the file {\tt ptracers\_ad\_diff.list}.
+ Thus, enabling a package will automatically extend the
+ {\bf AD\_FILES} list of {\tt genmake2} to incorporate the
+ package-specific routines.
+ Note that you will need to regenerate the {\tt Makefile} if
+ you enable a package (e.g. by adding it to {\tt packages.conf})
+ and a {\tt Makefile} already exists.
+ \subsubsection{The list AD\_FLOW\_FILES and {\tt .flow} files}
+ TAMC and TAF can evaluate user-specified directives
+ that start with a specific syntax ({\tt CADJ}, {\tt C\$TAF}, {\tt !\$TAF}).
+ The main categories of directives are STORE directives and
+ FLOW directives. Here, we are concerned with flow directives,
+ store directives are treated elsewhere.
+ Flow directives enable the AD tool to evaluate how it should treat
+ routines that are 'hidden' by the user, i.e. routines which are
+ not contained in the {\bf AD\_FILES} list (see previous section),
+ but which are called in part of the code that the AD tool does see.
+ The flow directive tell the AD tool
  %
- \item
+ \begin{itemize}
- {\tt make adchange} \\
- The multi-threading capability of the MITGCM requires a slight
- change in the parameter list of some routines that are related to
- to active file handling.
- This post-processing invokes the sed script {\it adjoint\_ecco\_sed.com}
- to insert the threading counter {\bf myThId} into the parameter list
- of those subroutines.
- The resulting code is written to file {\it tamc\_code\_sed\_ad.f}
- and appended to the file {\it adjoint\_model.F}.
- This concludes the adjoint code generation.
  %
- \item
+ \item which subroutine arguments are input/output
- {\tt cd ../bin} \\
+ \item which subroutine arguments are active
- {\tt make} \\
+ \item which subroutine arguments are required to compute the cost
- The file {\it adjoint\_model.F} now contains the full adjoint code.
+ \item which subroutine arguments are dependent
- All routines are now compiled.
  %
- \end{enumerate}
+ \end{itemize}
+ %
+ The syntax for the flow directives can be found in the
+ AD tool manuals.
- \paragraph{Adjoint code generation and compilation -- summary}
+ {\tt genmake2} generates a list (or variable) {\bf AD\_FLOW\_FILES}
- ~ \\
+ which contains all files with suffix{\tt .flow} that it finds
+ in its search directories.
+ The flow directives for the core MITgcm routines of
+ {\tt eesupp/src/} and {\tt model/src/}
+ reside in {\tt pkg/autodiff/}.
+ This directory also contains hand-written adjoint code
+ for the MITgcm WRAPPER (see Section ???).
+ Flow directives for package-specific routines are contained in
+ the corresponding package directories in the file
+ {\tt <PKG>\_ad.flow}, e.g. ptracers-specific directives are in
+ {\tt ptracers\_ad.flow}.
+ \subsubsection{Store directives for 3-level checkpointing}
+ The storing that is required at each period of the
+-level checkpointing is controled by three
+ top-level headers.
- \[
+ \begin{verbatim}
- \boxed{
+ do ilev_3 = 1, nchklev_3
- \begin{split}
+ #  include ``checkpoint_lev3.h''
-  ~ & \mbox{\tt cd bin} \\
+    do ilev_2 = 1, nchklev_2
-  ~ & \mbox{\tt ln -s ../verification/my\_experiment/code/.genmakerc .} \\
+ #     include ``checkpoint_lev2.h''
-  ~ & \mbox{\tt ln -s ../verification/my\_experiment/code/*.[Fh] .} \\
+       do ilev_1 = 1, nchklev_1
-  ~ & \mbox{\tt ../tools/genmake -makefile} \\
+ #        include ``checkpoint_lev1.h''
-  ~ & \mbox{\tt make depend} \\
-  ~ & \mbox{\tt make small\_f} \\
+ ...
-  ~ & \mbox{\tt cd ../adjoint} \\
-  ~ & \mbox{\tt make adtaf <OR: make adtamc>} \\
+       end do
-  ~ & \mbox{\tt make adchange} \\
+    end do
-  ~ & \mbox{\tt cd ../bin} \\
+ end do
-  ~ & \mbox{\tt make} \\
+ \end{verbatim}
- \end{split}
- }
- \]
- \newpage
+ All files {\tt checkpoint\_lev?.h} are contained in directory
+ {\tt pkg/autodiff/}.
- %**********************************************************************
- \section{TLM and ADM generation in general}
- \label{sec_ad_setup_gen}
- %**********************************************************************
- In this section we describe in a general fashion
+ \subsubsection{Changing the default AD tool flags: ad\_options files}
- the parts of the code that are relevant for automatic
- differentiation using the software tool TAMC.
- \input{part5/doc_ad_the_model}
- The basic flow is depicted in \ref{fig:adthemodel}.
+ \subsubsection{Hand-written adjoint code}
- If the option {\tt ALLOW\_AUTODIFF\_TAMC} is defined, the driver routine
- {\it the\_model\_main}, instead of calling {\it the\_main\_loop},
+ %------------------------------------------------------------------
- invokes the adjoint of this routine, {\it adthe\_main\_loop},
- which is the toplevel routine in terms of reverse mode computation.
- The routine {\it adthe\_main\_loop} has been generated using TAMC.
- It contains both the forward integration of the full model,
- any additional storing that is required for efficient checkpointing,
- and the reverse integration of the adjoint model.
- The structure of {\it adthe\_main\_loop} has been strongly
- simplified for clarification; in particular, no checkpointing
- procedures are shown here.
- Prior to the call of {\it adthe\_main\_loop}, the routine
- {\it ctrl\_unpack} is invoked to unpack the control vector,
- and following that call, the routine {\it ctrl\_pack}
- is invoked to pack the control vector
- (cf. Section \ref{section_ctrl}).
- If gradient checks are to be performed, the option
- {\tt ALLOW\_GRADIENT\_CHECK} is defined. In this case
- the driver routine {\it grdchk\_main} is called after
- the gradient has been computed via the adjoint
- (cf. Section \ref{section_grdchk}).
  \subsection{The cost function (dependent variable)
  \label{section_cost}}
-Line 1289 
 the gradient has been computed via the a
+Line 993 
 the gradient has been computed via the a
  The cost function $ {\cal J} $ is referred to as the {\sf dependent variable}.
  It is a function of the input variables $ \vec{u} $ via the composition
  $ {\cal J}(\vec{u}) \, = \, {\cal J}(M(\vec{u})) $.
- The input is referred to as the
+ The input are referred to as the
  {\sf independent variables} or {\sf control variables}.
  All aspects relevant to the treatment of the cost function $ {\cal J} $
  (parameter setting, initialization, accumulation,
  final evaluation), are controlled by the package {\it pkg/cost}.
+ The aspects relevant to the treatment of the independent variables
+ are controlled by the package {\it pkg/ctrl} and will be treated
+ in the next section.
  \input{part5/doc_cost_flow}
- \subsubsection{genmake and CPP options}
+ \subsubsection{Enabling the package}
- %
- \begin{itemize}
- %
- \item
  \fbox{
  \begin{minipage}{12cm}
- {\it genmake}, {\it CPP\_OPTIONS.h}, {\it ECCO\_CPPOPTIONS.h}
+ {\it packages.conf}, {\it ECCO\_CPPOPTIONS.h}
  \end{minipage}
  }
- \end{itemize}
+ \begin{itemize}
- %
- The directory {\it pkg/cost} can be included to the
- compile list in 3 different ways (cf. Section \ref{???}):
  %
- \begin{enumerate}
+ \item
+ The package is enabled by adding {\it cost} to your file {\it packages.conf}
+ (see Section ???)
  %
- \item {\it genmake}: \\
+ \item
- Change the default settings in the file {\it genmake} by adding
- {\bf cost} to the {\bf enable} list (not recommended).
- %
+ \end{itemize}
- \item {\it .genmakerc}: \\
- Customize the settings of {\bf enable}, {\bf disable} which are
- appropriate for your experiment in the file {\it .genmakerc}
- and add the file to your compile directory.
- %
- \item genmake-options: \\
- Call {\it genmake} with the option
- {\tt genmake -enable=cost}.
  %
- \end{enumerate}
+ N.B.: In general the following packages ought to be enabled
+ simultaneously: {\it autodiff, cost, ctrl}.
  The basic CPP option to enable the cost function is {\bf ALLOW\_COST}.
  Each specific cost function contribution has its own option.
  For the present example the option is {\bf ALLOW\_COST\_TRACER}.
  All cost-specific options are set in {\it ECCO\_CPPOPTIONS.h}
  Since the cost function is usually used in conjunction with
  automatic differentiation, the CPP option
- {\bf ALLOW\_ADJOINT\_RUN} should be defined
+ {\bf ALLOW\_ADJOINT\_RUN} (file {\it CPP\_OPTIONS.h}) and
- (file {\it CPP\_OPTIONS.h}).
+ {\bf ALLOW\_AUTODIFF\_TAMC} (file {\it ECCO\_CPPOPTIONS.h})
+ should be defined.
  \subsubsection{Initialization}
  %
  The initialization of the {\it cost} package is readily enabled
- as soon as the CPP option {\bf ALLOW\_ADJOINT\_RUN} is defined.
+ as soon as the CPP option {\bf ALLOW\_COST} is defined.
  %
  \begin{itemize}
  %
-Line 1414 
 from each contribution and sums over all
+Line 1112 
 from each contribution and sums over all
  \begin{equation}
  {\cal J} \, = \,
  {\rm fc} \, = \,
- {\rm mult\_tracer} \sum_{bi,\,bj}^{nSx,\,nSy}
+ {\rm mult\_tracer} \sum_{\text{global sum}} \sum_{bi,\,bj}^{nSx,\,nSy}
  {\rm objf\_tracer}(bi,bj) \, + \, ...
  \end{equation}
  %
-Line 1462 
 are controlled by the package {\it pkg/c
+Line 1160 
 are controlled by the package {\it pkg/c
  %
  To enable the directory to be included to the compile list,
  {\bf ctrl} has to be added to the {\bf enable} list in
- {\it .genmakerc} (or {\it genmake} itself).
+ {\it .genmakerc} or in {\it genmake} itself (analogous to {\it cost}
+ package, cf. previous section).
  Each control variable is enabled via its own CPP option
  in {\it ECCO\_CPPOPTIONS.h}.
-Line 1606 
 in the code takes on the form
+Line 1305 
 in the code takes on the form
  %
  Note, that reading an active variable corresponds
  to a variable assignment. Its derivative corresponds
- to a write statement of the adjoint variable.
+ to a write statement of the adjoint variable, followed by
+ a reset.
  The 'active file' routines have been designed
  to support active read and corresponding adjoint active write
  operations (and vice versa).
-Line 1723 
 at intermediate times can be written usi
+Line 1423 
 at intermediate times can be written usi
  {\it addummy\_in\_stepping}.
  This routine is part of the adjoint support package
  {\it pkg/autodiff} (cf.f. below).
+ The procedure is enabled using via the CPP-option
+ {\bf ALLOW\_AUTODIFF\_MONITOR} (file {\it ECCO\_CPPOPTIONS.h}).
  To be part of the adjoint code, the corresponding S/R
  {\it dummy\_in\_stepping} has to be called in the forward
  model (S/R {\it the\_main\_loop}) at the appropriate place.
+ The adjoint common blocks are extracted from the adjoint code
+ via the header file {\it adcommon.h}.
  {\it dummy\_in\_stepping} is essentially empty,
  the corresponding adjoint routine is hand-written rather
-Line 1752 
 the common blocks
+Line 1456 
 the common blocks
  {\bf /adtr1\_r/}, {\bf /adffields/},
  which have been extracted from the adjoint code to enable
  access to the adjoint variables.
+ {\bf WARNING:} If the structure of the common blocks
+ {\bf /dynvars\_r/}, {\bf /dynvars\_cd/}, etc., changes
+ similar changes will occur in the adjoint common blocks.
+ Therefore, consistency between the TAMC-generated common blocks
+ and those in {\it adcommon.h} have to be checked.
  %
  \end{itemize}

 Legend:



Removed from v.1.13
 


changed lines


 
Added in v.1.17
 Legend:



Removed from v.1.13
 


changed lines


 
Added in v.1.17
-Removed from v.1.13
+Added in v.1.17

	ViewVC Help
Powered by ViewVC 1.1.22