/[MITgcm]/MITgcm_contrib/articles/ceaice/ceaice_adjoint.tex
ViewVC logotype

Annotation of /MITgcm_contrib/articles/ceaice/ceaice_adjoint.tex

Parent Directory Parent Directory | Revision Log Revision Log | View Revision Graph Revision Graph


Revision 1.2 - (hide annotations) (download) (as text)
Tue Mar 4 20:30:40 2008 UTC (17 years, 4 months ago) by mlosch
Branch: MAIN
Changes since 1.1: +6 -1 lines
File MIME type: application/x-tex
add some xemacs hooks for latex

1 dimitri 1.1 \section{Adjoint sensiivities of the MITsim}
2     \label{sec:adjoint}
3    
4     \subsection{The adjoint of MITsim}
5    
6     The ability to generate tangent linear and adjoint model components
7     of the MITsim has been a main design task.
8     For the ocean the adjoint capability has proven to be an
9     invaluable tool for sensitivity analysis as well as state estimation.
10     In short, the adjoint enables very efficient computation of the gradient
11     of scalar-valued model diagnostics (called cost function or objective function)
12     with respect to many model "variables".
13     These variables can be two- or three-dimensional fields of initial
14     conditions, model parameters such as mixing coefficients, or
15     time-varying surface or lateral (open) boundary conditions.
16     When combined, these variables span a potentially high-dimensional
17     (e.g. O(10$^8$)) so-called control space. Performing parameter perturbations
18     to assess model sensitivities quickly becomes prohibitive at these scales.
19     Alternatively, (time-varying) sensitivities of the objective function
20     to any element of the control space can be computed very efficiently in
21     one single adjoint
22     model integration, provided an efficient adjoint model is available.
23    
24     [REFERENCES]
25    
26    
27     The adjoint operator (ADM) is the transpose of the tangent linear operator (TLM)
28     of the full (in general nonlinear) forward model, i.e. the MITsim.
29     The TLM maps perturbations of elements of the control space
30     (e.g. initial ice thickness distribution)
31     via the model Jacobian
32     to a perturbation in the objective function
33     (e.g. sea-ice export at the end of the integration interval).
34     \textit{Tangent} linearity ensures that the derivatives are evaluated
35     with respect to the underlying model trajectory at each point in time.
36     This is crucial for nonlinear trajectories and the presence of different
37     regimes (e.g. effect of the seaice growth term at or away from the
38     freezing point of the ocean surface).
39     Ensuring tangent linearity can be easily achieved by integrating
40     the full model in sync with the TLM to provide the underlying model state.
41     Ensuring \textit{tangent} adjoints is equally crucial, but much more
42     difficult to achieve because of the reverse nature of the integration:
43     the adjoint accumulates sensitivities backward in time,
44     starting from a unit perturbation of the objective function.
45     The adjoint model requires the model state in reverse order.
46     This presents one of the major complications in deriving an
47     exact, i.e. \textit{tangent} adjoint model.
48    
49     Following closely the development and maintenance of TLM and ADM
50     components of the MITgcm we have relied heavily on the
51     autmomatic differentiation (AD) tool
52     "Transformation of Algorithms in Fortran" (TAF)
53     developed by Fastopt (Giering and Kaminski, 1998)
54     to derive TLM and ADM code of the MITsim.
55     Briefly, the nonlinear parent model is fed to the AD tool which produces
56     derivative code for the specified control space and objective function.
57     Following this approach has (apart from its evident success)
58     several advantages:
59     (1) the adjoint model is the exact adjoint operator of the parent model,
60     (2) the adjoint model can be kept up to date with respect to ongoing
61     development of the parent model, and adjustments to the parent model
62     to extend the automatically generated adjoint are incremental changes
63     only, rather than extensive re-developments,
64     (3) the parallel structure of the parent model is preserved
65     by the adjoint model, ensuring efficient use in high performance
66     computing environments.
67    
68     Some initial code adjustments are required to support dependency analysis
69     of the flow reversal and certain language limitations which may lead
70     to irreducible flow graphs (e.g. GOTO statements).
71     The problem of providing the required model state in reverse order
72     at the time of evaluating nonlinear or conditional
73     derivatives is solved via balancing
74     storing vs. recomputation of the model state in a multi-level
75     checkpointing loop.
76     Again, an initial code adjustment is required to support TAFs
77     checkpointing capability.
78     The code adjustments are sufficiently simple so as not to cause
79     major limitations to the full nonlinear parent model.
80     Once in place, an adjoint model of a new model configuration
81     may be derived in about 10 minutes.
82    
83     [HIGHLIGHT COUPLED NATURE OF THE ADJOINT!]
84    
85     \subsection{Special considerations}
86    
87     * growth term(?)
88    
89     * small active denominators
90    
91     * dynamic solver (implicit function theorem)
92    
93     * approximate adjoints
94    
95    
96     \subsection{An example: sensitivities of sea-ice export through Fram Strait}
97    
98     We demonstrate the power of the adjoint method
99     in the context of investigating sea-ice export sensitivities through Fram Strait
100     (for details of this study see Heimbach et al., 2007).
101     %\citep[for details of this study see][]{heimbach07}. %Heimbach et al., 2007).
102     The domain chosen is a coarsened version of the Arctic face of the
103     high-resolution cubed-sphere configuration of the ECCO2 project
104     \citep[see][]{menemenlis05}. It covers the entire Arctic,
105     extends into the North Pacific such as to cover the entire
106     ice-covered regions, and comprises parts of the North Atlantic
107     down to XXN to enable analysis of remote influences of the
108     North Atlantic current to sea-ice variability and export.
109     The horizontal resolution varies between XX and YY km
110     with 50 unevenly spaced vertical levels.
111     The adjoint models run efficiently on 80 processors
112     (benchmarks have been performed both on an SGI Altix as well as an
113     IBM SP5 at NASA/ARC).
114    
115     Following a 1-year spinup, the model has been integrated for four
116     years between 1992 and 1995. It is forced using realistic 6-hourly
117     NCEP/NCAR atmospheric state variables. Over the open ocean these are
118     converted into air-sea fluxes via the bulk formulae of
119     \citet{large04}. Derivation of air-sea fluxes in the presence of
120     sea-ice is handled by the ice model as described in \refsec{model}.
121     The objective function chosen is sea-ice export through Fram Strait
122     computed for December 1995. The adjoint model computes sensitivities
123     to sea-ice export back in time from 1995 to 1992 along this
124     trajectory. In principle all adjoint model variable (i.e., Lagrange
125     multipliers) of the coupled ocean/sea-ice model are available to
126     analyze the transient sensitivity behaviour of the ocean and sea-ice
127     state. Over the open ocean, the adjoint of the bulk formula scheme
128     computes sensitivities to the time-varying atmospheric state. Over
129     ice-covered parts, the sea-ice adjoint converts surface ocean
130     sensitivities to atmospheric sensitivities.
131    
132     \reffig{4yradjheff}(a--d) depict sensitivities of sea-ice export
133     through Fram Strait in December 1995 to changes in sea-ice thickness
134     12, 24, 36, 48 months back in time. Corresponding sensitivities to
135     ocean surface temperature are depicted in
136     \reffig{4yradjthetalev1}(a--d). The main characteristics is
137     consistency with expected advection of sea-ice over the relevant time
138     scales considered. The general positive pattern means that an
139     increase in sea-ice thickness at location $(x,y)$ and time $t$ will
140     increase sea-ice export through Fram Strait at time $T_e$. Largest
141     distances from Fram Strait indicate fastest sea-ice advection over the
142     time span considered. The ice thickness sensitivities are in close
143     correspondence to ocean surface sentivitites, but of opposite sign.
144     An increase in temperature will incur ice melting, decrease in ice
145     thickness, and therefore decrease in sea-ice export at time $T_e$.
146    
147     The picture is fundamentally different and much more complex
148     for sensitivities to ocean temperatures away from the surface.
149     \reffig{4yradjthetalev10??}(a--d) depicts ice export sensitivities to
150     temperatures at roughly 400 m depth.
151     Primary features are the effect of the heat transport of the North
152     Atlantic current which feeds into the West Spitsbergen current,
153     the circulation around Svalbard, and ...
154    
155     \begin{figure}[t!]
156     \centerline{
157     \subfigure[{\footnotesize -12 months}]
158     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim072_cmax2.0E+02.eps}}
159     %\includegraphics*[width=.3\textwidth]{H_c.bin_res_100_lev1.pdf}
160     %
161     \subfigure[{\footnotesize -24 months}]
162     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim145_cmax2.0E+02.eps}}
163     }
164    
165     \centerline{
166     \subfigure[{\footnotesize
167     -36 months}]
168     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim218_cmax2.0E+02.eps}}
169     %
170     \subfigure[{\footnotesize
171     -48 months}]
172     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim292_cmax2.0E+02.eps}}
173     }
174     \caption{Sensitivity of sea-ice export through Fram Strait in December 2005 to
175     sea-ice thickness at various prior times.
176     \label{fig:4yradjheff}}
177     \end{figure}
178    
179    
180     \begin{figure}[t!]
181     \centerline{
182     \subfigure[{\footnotesize -12 months}]
183     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim072_cmax5.0E+01.eps}}
184 mlosch 1.2 %\includegraphics*[width=.3\textwidth]{H_c.bin_res_100_lev1}
185 dimitri 1.1 %
186     \subfigure[{\footnotesize -24 months}]
187     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim145_cmax5.0E+01.eps}}
188     }
189    
190     \centerline{
191     \subfigure[{\footnotesize
192     -36 months}]
193     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim218_cmax5.0E+01.eps}}
194     %
195     \subfigure[{\footnotesize
196     -48 months}]
197     {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim292_cmax5.0E+01.eps}}
198     }
199     \caption{Same as \reffig{4yradjheff} but for sea surface temperature
200     \label{fig:4yradjthetalev1}}
201     \end{figure}
202 mlosch 1.2
203     %%% Local Variables:
204     %%% mode: latex
205     %%% TeX-master: "ceaice"
206     %%% End:

  ViewVC Help
Powered by ViewVC 1.1.22