Diff of /MITgcm_contrib/articles/ceaice/ceaice_adjoint.tex

-revision 1.3 by heimbach,
Tue Mar 25 22:04:31 2008 UTC
+revision 1.4 by mlosch,
Wed Jun  4 13:34:41 2008 UTC
 Line 3
  \subsection{The adjoint of MITsim}
+ The adjoint model of the MITgcm has become an invaluable
- The ability to generate tangent linear and adjoint components
+ tool for sensitivity analysis as well as state estimation \citep[for a
- of a coupled ocean sea-ice system was one of the main drivers
+ recent summary, see][]{heim:08}. The code has been developed and
- behind the MITsim development.
+ tailored to be readily used with automatic differentiation tools for
- For the ocean the adjoint capability has proven to be an
+ adjoint code generation. This route was also taken in developing and
- invaluable tool for sensitivity analysis as well as state estimation,
+ adapting the sea-ice component MITsim, so that tangent linear and
- as evidenced by various adjoint-based studies
+ adjoint components can be obtained and kept up to date without
- (for a recent summary, see \cite{heim:08}).
+ excessive effort.
- The adjoint model operator (ADM) is the transpose of the tangent linear
+ The adjoint model operator (ADM) is the transpose of the tangent
- model operator (TLM)
+ linear model operator (TLM) of the full (in general nonlinear) forward
- of the full (in general nonlinear) forward model, i.e. the MITsim.
+ model, in this case the MITsim. This operator computes the gradients
- It enables very efficient computation of gradients
+ of scalar-valued model diagnostics (so-called cost function or
- of scalar-valued model diagnostics
+ objective function) with respect to many model inputs (so-called
- (so-called cost function or objective function)
+ independent or control variables).  These inputs can be two- or
- with respect to many model inputs (so-called independent or control variables).
+ three-dimensional fields of initial conditions of the ocean or sea-ice
- These inputs can be two- or three-dimensional fields of initial
+ state, model parameters such as mixing coefficients, or time-varying
- conditions of the ocean or sea-ice state, model parameters such as
+ surface or lateral (open) boundary conditions.  When combined, these
- mixing coefficients, or time-varying surface or lateral (open) boundary conditions.
+ variables span a potentially high-dimensional (e.g.  O(10$^8$))
- When combined, these variables span a potentially high-dimensional
+ so-called control space. At this problem dimension, perturbing
- (e.g. O(10$^8$)) so-called control space. Performing parameter perturbations
+ individual parameters to assess model sensitivities quickly becomes
- to assess model sensitivities quickly becomes prohibitive at these scales.
+ prohibitive. By contrast, transient sensitivities of the objective
- Alternatively, transient sensitivities of the objective function
+ function to any element of the control and model state space can be
- to any element of the  control and model state space can be computed
+ computed very efficiently in one single adjoint model integration,
- very efficiently in  one single adjoint
+ provided an adjoint model is available.
- model integration, provided an efficient adjoint model is available.
+ In analogy to the TLM and ADM components of the MITgcm we rely on the
- Following closely the development and maintenance of the
+ automatic differentiation (AD) tool ``Transformation of Algorithms in
- TLM and ADM components of the MITgcm we have relied heavily on the
+ Fortran'' (TAF) developed by Fastopt \citep{gier-kami:98} to generate
- autmomatic differentiation (AD) tool
+ TLM and ADM code of the MITsim \citep[for details see][]{maro-etal:99,
- "Transformation of Algorithms in Fortran" (TAF)
+   heim-etal:05}.  In short, the AD tool uses the nonlinear parent
- developed by Fastopt \citep{gier-kami:98}.
+ model code to generate derivative code for the specified control space
- to derive TLM and ADM code of the MITsim
+ and objective function. Advantages of this approach have been pointed
- (for details see \cite{maro-etal:99}, \cite{heim-etal:05}).
+ out, for example by \cite{gier-kami:98}.
- Briefly, the nonlinear parent model is fed to the AD tool which produces
- derivative code for the specified control space and objective function.
+ Many issues of generating efficient exact adjoint sea-ice code are
- Apart from its evident success, advantages of this approach have been
+ similar to those for the ocean model's adjoint.  Linearizing the model
- pointed out, e.g. by \cite{gier-kami:98}.
+ around the exact nonlinear model trajectory is a crucial aspect in the
+ presence of different regimes (e.g., is the thermodynamic growth term
- Many issues underlying the efficient exact adjoint sea-ice code generation
+ for sea-ice evaluated near or far away from the freezing point of the
- are similar to those arising for the ocean model's adjoint.
+ ocean surface?). Adapting the (parent) model code to support the AD
- Linearizing the model around the exact nonlinear model trajectory,
+ tool in providing exact and efficient adjoint code represents the main
- as we do, is a crucial aspect in the presence of different
+ work load initially. For legacy code, this task may become
- regimes (e.g. effect of the seaice growth term at or away from the
+ substantial, but it is fairly straightforward when writing new code
- freezing point of the ocean surface).
+ with an AD tool in mind. Once this initial task is completed,
- Adjusting the (parent) model code to support the AD tool in
+ generating the adjoint code of a new model configuration takes about
- providing exact and efficient adjoint code is the main initial work.
+ 10 minutes.
- This may be substantial for legacy code, but fairly straightforward
- when coding with "AD application in mind".
- Once in place, an adjoint model of a new model configuration
- may be derived in about 10 minutes.
  [HIGHLIGHT COUPLED NATURE OF THE ADJOINT!]
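
As an illustration of the argument in the hunk above (one adjoint integration yields the gradient with respect to every control, whereas perturbation experiments need at least one forward run per control), here is a minimal Python sketch. It is not part of either revision and is not MITgcm, MITsim, or TAF code. The three-variable model, the hand-written adjoint, and all names (`step`, `step_adjoint`, `adjoint_gradient`, `fd_gradient`) are hypothetical and chosen only for illustration. The cost function $J$ is assumed to be the first state component at the final time.

```python
# Toy illustration only: a hypothetical 3-variable nonlinear model,
# not MITgcm/MITsim code and not generated by an AD tool.

def step(x, dt=0.1):
    """One forward step of the hypothetical nonlinear model."""
    a, b, c = x
    return [a + dt * (b - a * a),
            b + dt * (c * a - b),
            c + dt * (-c + a * b)]

def step_adjoint(x, xb, dt=0.1):
    """Apply the transpose of the step's Jacobian, evaluated at x, to xb.

    Linearizing around the stored nonlinear state x is what makes the
    adjoint exact along the forward trajectory.
    """
    a, b, c = x
    ab, bb, cb = xb
    return [(1 - 2 * dt * a) * ab + dt * c * bb + dt * b * cb,
            dt * ab + (1 - dt) * bb + dt * a * cb,
            dt * a * bb + (1 - dt) * cb]

def forward(x0, nsteps=20):
    """Integrate the model and store the full trajectory."""
    traj = [list(x0)]
    for _ in range(nsteps):
        traj.append(step(traj[-1]))
    return traj

def cost(x0, nsteps=20):
    """Scalar objective J: first state component at the final time."""
    return forward(x0, nsteps)[-1][0]

def adjoint_gradient(x0, nsteps=20):
    """dJ/dx0 from a single backward (adjoint) sweep."""
    traj = forward(x0, nsteps)
    xb = [1.0, 0.0, 0.0]              # dJ/dx at the final time
    for x in reversed(traj[:-1]):     # step backward through stored states
        xb = step_adjoint(x, xb)
    return xb

def fd_gradient(x0, nsteps=20, eps=1e-6):
    """dJ/dx0 by central differences: two forward runs per control."""
    grad = []
    for i in range(len(x0)):
        xp = list(x0)
        xm = list(x0)
        xp[i] += eps
        xm[i] -= eps
        grad.append((cost(xp, nsteps) - cost(xm, nsteps)) / (2 * eps))
    return grad

if __name__ == "__main__":
    x0 = [0.5, 0.3, 0.2]
    print("adjoint:           ", adjoint_gradient(x0))
    print("finite differences:", fd_gradient(x0))
```

With three controls the finite-difference route costs six forward runs here, while the adjoint route costs one forward and one backward sweep regardless of the number of controls. That scaling is the reason perturbation experiments become prohibitive for a control space of O(10$^8$).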
-Line 106 
 NCEP/NCAR atmospheric state variables. O
+Line 102 
 NCEP/NCAR atmospheric state variables. O
  converted into air-sea fluxes via the bulk formulae of
  \citet{large04}.  Derivation of air-sea fluxes in the presence of
  sea-ice is handled by the ice model as described in \refsec{model}.
- The objective function chosen is
+ The objective function $J$ is chosen as the
  sea-ice export through
  Lancaster Sound at XX$^{\circ}$W
  averaged over an 8-month period between October 1992 and May 1993.
-Line 129 
 DISCUSS FORWARD STATE, INCLUDING SOME NU
+Line 125 
 DISCUSS FORWARD STATE, INCLUDING SOME NU
  \paragraph{Sensitivities to the sea-ice thickness}
  The most readily interpretable ice-export sensitivity is that
- to ice thickness, $\partial J / \partial heff$.
+ to effective ice thickness, $\partial{J} / \partial{h}$.
- Fig. XXX depcits transient $\partial J / \partial heff$ using free-slip
+ Fig. XXX depicts transient $\partial{J} / \partial{h}$ using free-slip
  (left column) and no-slip (right column) boundary conditions.
  Sensitivity snapshots are depicted for (from top to bottom)
 12, 24, 36, and 48 months prior to May 2003.
- The dominant features are in accordance with expectations:
+ The dominant features are\ml{ in accordance with expectations/as expected}:
  (*)
  Dominant pattern (for the free-slip run) is that of positive sensitivities, i.e.
-Line 337 
 Primary features are the effect of the h
+Line 333 
 Primary features are the effect of the h
  Atlantic current which feeds into the West Spitsbergen current,
  the circulation around Svalbard, and ...
+ \ml{[based on the movie series
+   zzz\_run\_export\_canarch\_freeslip\_4yr\_1989\_ADJ*:]} The ice
+ export through the Canadian Archipelago is highly sensitive to the
+ previous state of the ocean-ice system in the Archipelago and the
+ Western Arctic. According to the \ml{(adjoint)} sensitivities of the
+ eastward ice transport through Lancaster Sound (\reffig{arctic_topog},
+ cross-section G) with respect to ice volume (effective thickness), ocean
+ surface temperature, and vertical diffusivity near the surface
+ (\reffig{fouryearadj}) after 4 years of integration the following
+ mechanisms can be identified: near the ``observation'' (cross-section
+ G), smaller vertical diffusivities lead to lower surface temperatures
+ and hence to more ice that is available for export. Further away from
+ cross-section G, the sensitivity to vertical diffusivity has the
+ opposite sign, but temperature and ice volume sensitivities have the
+ same sign as close to the observation.
  \begin{figure}[t!]
  \centerline{
  \subfigure[{\footnotesize -12 months}]

 Legend:



-Removed from v.1.3
+Added in v.1.4
