/[MITgcm]/MITgcm_contrib/articles/ceaice/ceaice_adjoint.tex
ViewVC logotype

Contents of /MITgcm_contrib/articles/ceaice/ceaice_adjoint.tex

Parent Directory Parent Directory | Revision Log Revision Log | View Revision Graph Revision Graph


Revision 1.2 - (show annotations) (download) (as text)
Tue Mar 4 20:30:40 2008 UTC (17 years, 4 months ago) by mlosch
Branch: MAIN
Changes since 1.1: +6 -1 lines
File MIME type: application/x-tex
add some xemacs hooks for latex

1 \section{Adjoint sensiivities of the MITsim}
2 \label{sec:adjoint}
3
4 \subsection{The adjoint of MITsim}
5
6 The ability to generate tangent linear and adjoint model components
7 of the MITsim has been a main design task.
8 For the ocean the adjoint capability has proven to be an
9 invaluable tool for sensitivity analysis as well as state estimation.
10 In short, the adjoint enables very efficient computation of the gradient
11 of scalar-valued model diagnostics (called cost function or objective function)
12 with respect to many model "variables".
13 These variables can be two- or three-dimensional fields of initial
14 conditions, model parameters such as mixing coefficients, or
15 time-varying surface or lateral (open) boundary conditions.
16 When combined, these variables span a potentially high-dimensional
17 (e.g. O(10$^8$)) so-called control space. Performing parameter perturbations
18 to assess model sensitivities quickly becomes prohibitive at these scales.
19 Alternatively, (time-varying) sensitivities of the objective function
20 to any element of the control space can be computed very efficiently in
21 one single adjoint
22 model integration, provided an efficient adjoint model is available.
23
24 [REFERENCES]
25
26
27 The adjoint operator (ADM) is the transpose of the tangent linear operator (TLM)
28 of the full (in general nonlinear) forward model, i.e. the MITsim.
29 The TLM maps perturbations of elements of the control space
30 (e.g. initial ice thickness distribution)
31 via the model Jacobian
32 to a perturbation in the objective function
33 (e.g. sea-ice export at the end of the integration interval).
34 \textit{Tangent} linearity ensures that the derivatives are evaluated
35 with respect to the underlying model trajectory at each point in time.
36 This is crucial for nonlinear trajectories and the presence of different
37 regimes (e.g. effect of the seaice growth term at or away from the
38 freezing point of the ocean surface).
39 Ensuring tangent linearity can be easily achieved by integrating
40 the full model in sync with the TLM to provide the underlying model state.
41 Ensuring \textit{tangent} adjoints is equally crucial, but much more
42 difficult to achieve because of the reverse nature of the integration:
43 the adjoint accumulates sensitivities backward in time,
44 starting from a unit perturbation of the objective function.
45 The adjoint model requires the model state in reverse order.
46 This presents one of the major complications in deriving an
47 exact, i.e. \textit{tangent} adjoint model.
48
49 Following closely the development and maintenance of TLM and ADM
50 components of the MITgcm we have relied heavily on the
51 autmomatic differentiation (AD) tool
52 "Transformation of Algorithms in Fortran" (TAF)
53 developed by Fastopt (Giering and Kaminski, 1998)
54 to derive TLM and ADM code of the MITsim.
55 Briefly, the nonlinear parent model is fed to the AD tool which produces
56 derivative code for the specified control space and objective function.
57 Following this approach has (apart from its evident success)
58 several advantages:
59 (1) the adjoint model is the exact adjoint operator of the parent model,
60 (2) the adjoint model can be kept up to date with respect to ongoing
61 development of the parent model, and adjustments to the parent model
62 to extend the automatically generated adjoint are incremental changes
63 only, rather than extensive re-developments,
64 (3) the parallel structure of the parent model is preserved
65 by the adjoint model, ensuring efficient use in high performance
66 computing environments.
67
68 Some initial code adjustments are required to support dependency analysis
69 of the flow reversal and certain language limitations which may lead
70 to irreducible flow graphs (e.g. GOTO statements).
71 The problem of providing the required model state in reverse order
72 at the time of evaluating nonlinear or conditional
73 derivatives is solved via balancing
74 storing vs. recomputation of the model state in a multi-level
75 checkpointing loop.
76 Again, an initial code adjustment is required to support TAFs
77 checkpointing capability.
78 The code adjustments are sufficiently simple so as not to cause
79 major limitations to the full nonlinear parent model.
80 Once in place, an adjoint model of a new model configuration
81 may be derived in about 10 minutes.
82
83 [HIGHLIGHT COUPLED NATURE OF THE ADJOINT!]
84
85 \subsection{Special considerations}
86
87 * growth term(?)
88
89 * small active denominators
90
91 * dynamic solver (implicit function theorem)
92
93 * approximate adjoints
94
95
96 \subsection{An example: sensitivities of sea-ice export through Fram Strait}
97
98 We demonstrate the power of the adjoint method
99 in the context of investigating sea-ice export sensitivities through Fram Strait
100 (for details of this study see Heimbach et al., 2007).
101 %\citep[for details of this study see][]{heimbach07}. %Heimbach et al., 2007).
102 The domain chosen is a coarsened version of the Arctic face of the
103 high-resolution cubed-sphere configuration of the ECCO2 project
104 \citep[see][]{menemenlis05}. It covers the entire Arctic,
105 extends into the North Pacific such as to cover the entire
106 ice-covered regions, and comprises parts of the North Atlantic
107 down to XXN to enable analysis of remote influences of the
108 North Atlantic current to sea-ice variability and export.
109 The horizontal resolution varies between XX and YY km
110 with 50 unevenly spaced vertical levels.
111 The adjoint models run efficiently on 80 processors
112 (benchmarks have been performed both on an SGI Altix as well as an
113 IBM SP5 at NASA/ARC).
114
115 Following a 1-year spinup, the model has been integrated for four
116 years between 1992 and 1995. It is forced using realistic 6-hourly
117 NCEP/NCAR atmospheric state variables. Over the open ocean these are
118 converted into air-sea fluxes via the bulk formulae of
119 \citet{large04}. Derivation of air-sea fluxes in the presence of
120 sea-ice is handled by the ice model as described in \refsec{model}.
121 The objective function chosen is sea-ice export through Fram Strait
122 computed for December 1995. The adjoint model computes sensitivities
123 to sea-ice export back in time from 1995 to 1992 along this
124 trajectory. In principle all adjoint model variable (i.e., Lagrange
125 multipliers) of the coupled ocean/sea-ice model are available to
126 analyze the transient sensitivity behaviour of the ocean and sea-ice
127 state. Over the open ocean, the adjoint of the bulk formula scheme
128 computes sensitivities to the time-varying atmospheric state. Over
129 ice-covered parts, the sea-ice adjoint converts surface ocean
130 sensitivities to atmospheric sensitivities.
131
132 \reffig{4yradjheff}(a--d) depict sensitivities of sea-ice export
133 through Fram Strait in December 1995 to changes in sea-ice thickness
134 12, 24, 36, 48 months back in time. Corresponding sensitivities to
135 ocean surface temperature are depicted in
136 \reffig{4yradjthetalev1}(a--d). The main characteristics is
137 consistency with expected advection of sea-ice over the relevant time
138 scales considered. The general positive pattern means that an
139 increase in sea-ice thickness at location $(x,y)$ and time $t$ will
140 increase sea-ice export through Fram Strait at time $T_e$. Largest
141 distances from Fram Strait indicate fastest sea-ice advection over the
142 time span considered. The ice thickness sensitivities are in close
143 correspondence to ocean surface sentivitites, but of opposite sign.
144 An increase in temperature will incur ice melting, decrease in ice
145 thickness, and therefore decrease in sea-ice export at time $T_e$.
146
147 The picture is fundamentally different and much more complex
148 for sensitivities to ocean temperatures away from the surface.
149 \reffig{4yradjthetalev10??}(a--d) depicts ice export sensitivities to
150 temperatures at roughly 400 m depth.
151 Primary features are the effect of the heat transport of the North
152 Atlantic current which feeds into the West Spitsbergen current,
153 the circulation around Svalbard, and ...
154
155 \begin{figure}[t!]
156 \centerline{
157 \subfigure[{\footnotesize -12 months}]
158 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim072_cmax2.0E+02.eps}}
159 %\includegraphics*[width=.3\textwidth]{H_c.bin_res_100_lev1.pdf}
160 %
161 \subfigure[{\footnotesize -24 months}]
162 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim145_cmax2.0E+02.eps}}
163 }
164
165 \centerline{
166 \subfigure[{\footnotesize
167 -36 months}]
168 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim218_cmax2.0E+02.eps}}
169 %
170 \subfigure[{\footnotesize
171 -48 months}]
172 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim292_cmax2.0E+02.eps}}
173 }
174 \caption{Sensitivity of sea-ice export through Fram Strait in December 2005 to
175 sea-ice thickness at various prior times.
176 \label{fig:4yradjheff}}
177 \end{figure}
178
179
180 \begin{figure}[t!]
181 \centerline{
182 \subfigure[{\footnotesize -12 months}]
183 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim072_cmax5.0E+01.eps}}
184 %\includegraphics*[width=.3\textwidth]{H_c.bin_res_100_lev1}
185 %
186 \subfigure[{\footnotesize -24 months}]
187 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim145_cmax5.0E+01.eps}}
188 }
189
190 \centerline{
191 \subfigure[{\footnotesize
192 -36 months}]
193 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim218_cmax5.0E+01.eps}}
194 %
195 \subfigure[{\footnotesize
196 -48 months}]
197 {\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim292_cmax5.0E+01.eps}}
198 }
199 \caption{Same as \reffig{4yradjheff} but for sea surface temperature
200 \label{fig:4yradjthetalev1}}
201 \end{figure}
202
203 %%% Local Variables:
204 %%% mode: latex
205 %%% TeX-master: "ceaice"
206 %%% End:

  ViewVC Help
Powered by ViewVC 1.1.22