| 1 |
dimitri |
1.1 |
\section{Adjoint sensiivities of the MITsim} |
| 2 |
|
|
\label{sec:adjoint} |
| 3 |
|
|
|
| 4 |
|
|
\subsection{The adjoint of MITsim} |
| 5 |
|
|
|
| 6 |
|
|
The ability to generate tangent linear and adjoint model components |
| 7 |
|
|
of the MITsim has been a main design task. |
| 8 |
|
|
For the ocean the adjoint capability has proven to be an |
| 9 |
|
|
invaluable tool for sensitivity analysis as well as state estimation. |
| 10 |
|
|
In short, the adjoint enables very efficient computation of the gradient |
| 11 |
|
|
of scalar-valued model diagnostics (called cost function or objective function) |
| 12 |
|
|
with respect to many model "variables". |
| 13 |
|
|
These variables can be two- or three-dimensional fields of initial |
| 14 |
|
|
conditions, model parameters such as mixing coefficients, or |
| 15 |
|
|
time-varying surface or lateral (open) boundary conditions. |
| 16 |
|
|
When combined, these variables span a potentially high-dimensional |
| 17 |
|
|
(e.g. O(10$^8$)) so-called control space. Performing parameter perturbations |
| 18 |
|
|
to assess model sensitivities quickly becomes prohibitive at these scales. |
| 19 |
|
|
Alternatively, (time-varying) sensitivities of the objective function |
| 20 |
|
|
to any element of the control space can be computed very efficiently in |
| 21 |
|
|
one single adjoint |
| 22 |
|
|
model integration, provided an efficient adjoint model is available. |
| 23 |
|
|
|
| 24 |
|
|
[REFERENCES] |
| 25 |
|
|
|
| 26 |
|
|
|
| 27 |
|
|
The adjoint operator (ADM) is the transpose of the tangent linear operator (TLM) |
| 28 |
|
|
of the full (in general nonlinear) forward model, i.e. the MITsim. |
| 29 |
|
|
The TLM maps perturbations of elements of the control space |
| 30 |
|
|
(e.g. initial ice thickness distribution) |
| 31 |
|
|
via the model Jacobian |
| 32 |
|
|
to a perturbation in the objective function |
| 33 |
|
|
(e.g. sea-ice export at the end of the integration interval). |
| 34 |
|
|
\textit{Tangent} linearity ensures that the derivatives are evaluated |
| 35 |
|
|
with respect to the underlying model trajectory at each point in time. |
| 36 |
|
|
This is crucial for nonlinear trajectories and the presence of different |
| 37 |
|
|
regimes (e.g. effect of the seaice growth term at or away from the |
| 38 |
|
|
freezing point of the ocean surface). |
| 39 |
|
|
Ensuring tangent linearity can be easily achieved by integrating |
| 40 |
|
|
the full model in sync with the TLM to provide the underlying model state. |
| 41 |
|
|
Ensuring \textit{tangent} adjoints is equally crucial, but much more |
| 42 |
|
|
difficult to achieve because of the reverse nature of the integration: |
| 43 |
|
|
the adjoint accumulates sensitivities backward in time, |
| 44 |
|
|
starting from a unit perturbation of the objective function. |
| 45 |
|
|
The adjoint model requires the model state in reverse order. |
| 46 |
|
|
This presents one of the major complications in deriving an |
| 47 |
|
|
exact, i.e. \textit{tangent} adjoint model. |
| 48 |
|
|
|
| 49 |
|
|
Following closely the development and maintenance of TLM and ADM |
| 50 |
|
|
components of the MITgcm we have relied heavily on the |
| 51 |
|
|
autmomatic differentiation (AD) tool |
| 52 |
|
|
"Transformation of Algorithms in Fortran" (TAF) |
| 53 |
|
|
developed by Fastopt (Giering and Kaminski, 1998) |
| 54 |
|
|
to derive TLM and ADM code of the MITsim. |
| 55 |
|
|
Briefly, the nonlinear parent model is fed to the AD tool which produces |
| 56 |
|
|
derivative code for the specified control space and objective function. |
| 57 |
|
|
Following this approach has (apart from its evident success) |
| 58 |
|
|
several advantages: |
| 59 |
|
|
(1) the adjoint model is the exact adjoint operator of the parent model, |
| 60 |
|
|
(2) the adjoint model can be kept up to date with respect to ongoing |
| 61 |
|
|
development of the parent model, and adjustments to the parent model |
| 62 |
|
|
to extend the automatically generated adjoint are incremental changes |
| 63 |
|
|
only, rather than extensive re-developments, |
| 64 |
|
|
(3) the parallel structure of the parent model is preserved |
| 65 |
|
|
by the adjoint model, ensuring efficient use in high performance |
| 66 |
|
|
computing environments. |
| 67 |
|
|
|
| 68 |
|
|
Some initial code adjustments are required to support dependency analysis |
| 69 |
|
|
of the flow reversal and certain language limitations which may lead |
| 70 |
|
|
to irreducible flow graphs (e.g. GOTO statements). |
| 71 |
|
|
The problem of providing the required model state in reverse order |
| 72 |
|
|
at the time of evaluating nonlinear or conditional |
| 73 |
|
|
derivatives is solved via balancing |
| 74 |
|
|
storing vs. recomputation of the model state in a multi-level |
| 75 |
|
|
checkpointing loop. |
| 76 |
|
|
Again, an initial code adjustment is required to support TAFs |
| 77 |
|
|
checkpointing capability. |
| 78 |
|
|
The code adjustments are sufficiently simple so as not to cause |
| 79 |
|
|
major limitations to the full nonlinear parent model. |
| 80 |
|
|
Once in place, an adjoint model of a new model configuration |
| 81 |
|
|
may be derived in about 10 minutes. |
| 82 |
|
|
|
| 83 |
|
|
[HIGHLIGHT COUPLED NATURE OF THE ADJOINT!] |
| 84 |
|
|
|
| 85 |
|
|
\subsection{Special considerations} |
| 86 |
|
|
|
| 87 |
|
|
* growth term(?) |
| 88 |
|
|
|
| 89 |
|
|
* small active denominators |
| 90 |
|
|
|
| 91 |
|
|
* dynamic solver (implicit function theorem) |
| 92 |
|
|
|
| 93 |
|
|
* approximate adjoints |
| 94 |
|
|
|
| 95 |
|
|
|
| 96 |
|
|
\subsection{An example: sensitivities of sea-ice export through Fram Strait} |
| 97 |
|
|
|
| 98 |
|
|
We demonstrate the power of the adjoint method |
| 99 |
|
|
in the context of investigating sea-ice export sensitivities through Fram Strait |
| 100 |
|
|
(for details of this study see Heimbach et al., 2007). |
| 101 |
|
|
%\citep[for details of this study see][]{heimbach07}. %Heimbach et al., 2007). |
| 102 |
|
|
The domain chosen is a coarsened version of the Arctic face of the |
| 103 |
|
|
high-resolution cubed-sphere configuration of the ECCO2 project |
| 104 |
|
|
\citep[see][]{menemenlis05}. It covers the entire Arctic, |
| 105 |
|
|
extends into the North Pacific such as to cover the entire |
| 106 |
|
|
ice-covered regions, and comprises parts of the North Atlantic |
| 107 |
|
|
down to XXN to enable analysis of remote influences of the |
| 108 |
|
|
North Atlantic current to sea-ice variability and export. |
| 109 |
|
|
The horizontal resolution varies between XX and YY km |
| 110 |
|
|
with 50 unevenly spaced vertical levels. |
| 111 |
|
|
The adjoint models run efficiently on 80 processors |
| 112 |
|
|
(benchmarks have been performed both on an SGI Altix as well as an |
| 113 |
|
|
IBM SP5 at NASA/ARC). |
| 114 |
|
|
|
| 115 |
|
|
Following a 1-year spinup, the model has been integrated for four |
| 116 |
|
|
years between 1992 and 1995. It is forced using realistic 6-hourly |
| 117 |
|
|
NCEP/NCAR atmospheric state variables. Over the open ocean these are |
| 118 |
|
|
converted into air-sea fluxes via the bulk formulae of |
| 119 |
|
|
\citet{large04}. Derivation of air-sea fluxes in the presence of |
| 120 |
|
|
sea-ice is handled by the ice model as described in \refsec{model}. |
| 121 |
|
|
The objective function chosen is sea-ice export through Fram Strait |
| 122 |
|
|
computed for December 1995. The adjoint model computes sensitivities |
| 123 |
|
|
to sea-ice export back in time from 1995 to 1992 along this |
| 124 |
|
|
trajectory. In principle all adjoint model variable (i.e., Lagrange |
| 125 |
|
|
multipliers) of the coupled ocean/sea-ice model are available to |
| 126 |
|
|
analyze the transient sensitivity behaviour of the ocean and sea-ice |
| 127 |
|
|
state. Over the open ocean, the adjoint of the bulk formula scheme |
| 128 |
|
|
computes sensitivities to the time-varying atmospheric state. Over |
| 129 |
|
|
ice-covered parts, the sea-ice adjoint converts surface ocean |
| 130 |
|
|
sensitivities to atmospheric sensitivities. |
| 131 |
|
|
|
| 132 |
|
|
\reffig{4yradjheff}(a--d) depict sensitivities of sea-ice export |
| 133 |
|
|
through Fram Strait in December 1995 to changes in sea-ice thickness |
| 134 |
|
|
12, 24, 36, 48 months back in time. Corresponding sensitivities to |
| 135 |
|
|
ocean surface temperature are depicted in |
| 136 |
|
|
\reffig{4yradjthetalev1}(a--d). The main characteristics is |
| 137 |
|
|
consistency with expected advection of sea-ice over the relevant time |
| 138 |
|
|
scales considered. The general positive pattern means that an |
| 139 |
|
|
increase in sea-ice thickness at location $(x,y)$ and time $t$ will |
| 140 |
|
|
increase sea-ice export through Fram Strait at time $T_e$. Largest |
| 141 |
|
|
distances from Fram Strait indicate fastest sea-ice advection over the |
| 142 |
|
|
time span considered. The ice thickness sensitivities are in close |
| 143 |
|
|
correspondence to ocean surface sentivitites, but of opposite sign. |
| 144 |
|
|
An increase in temperature will incur ice melting, decrease in ice |
| 145 |
|
|
thickness, and therefore decrease in sea-ice export at time $T_e$. |
| 146 |
|
|
|
| 147 |
|
|
The picture is fundamentally different and much more complex |
| 148 |
|
|
for sensitivities to ocean temperatures away from the surface. |
| 149 |
|
|
\reffig{4yradjthetalev10??}(a--d) depicts ice export sensitivities to |
| 150 |
|
|
temperatures at roughly 400 m depth. |
| 151 |
|
|
Primary features are the effect of the heat transport of the North |
| 152 |
|
|
Atlantic current which feeds into the West Spitsbergen current, |
| 153 |
|
|
the circulation around Svalbard, and ... |
| 154 |
|
|
|
| 155 |
|
|
\begin{figure}[t!] |
| 156 |
|
|
\centerline{ |
| 157 |
|
|
\subfigure[{\footnotesize -12 months}] |
| 158 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim072_cmax2.0E+02.eps}} |
| 159 |
|
|
%\includegraphics*[width=.3\textwidth]{H_c.bin_res_100_lev1.pdf} |
| 160 |
|
|
% |
| 161 |
|
|
\subfigure[{\footnotesize -24 months}] |
| 162 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim145_cmax2.0E+02.eps}} |
| 163 |
|
|
} |
| 164 |
|
|
|
| 165 |
|
|
\centerline{ |
| 166 |
|
|
\subfigure[{\footnotesize |
| 167 |
|
|
-36 months}] |
| 168 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim218_cmax2.0E+02.eps}} |
| 169 |
|
|
% |
| 170 |
|
|
\subfigure[{\footnotesize |
| 171 |
|
|
-48 months}] |
| 172 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJheff_arc_lev1_tim292_cmax2.0E+02.eps}} |
| 173 |
|
|
} |
| 174 |
|
|
\caption{Sensitivity of sea-ice export through Fram Strait in December 2005 to |
| 175 |
|
|
sea-ice thickness at various prior times. |
| 176 |
|
|
\label{fig:4yradjheff}} |
| 177 |
|
|
\end{figure} |
| 178 |
|
|
|
| 179 |
|
|
|
| 180 |
|
|
\begin{figure}[t!] |
| 181 |
|
|
\centerline{ |
| 182 |
|
|
\subfigure[{\footnotesize -12 months}] |
| 183 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim072_cmax5.0E+01.eps}} |
| 184 |
mlosch |
1.2 |
%\includegraphics*[width=.3\textwidth]{H_c.bin_res_100_lev1} |
| 185 |
dimitri |
1.1 |
% |
| 186 |
|
|
\subfigure[{\footnotesize -24 months}] |
| 187 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim145_cmax5.0E+01.eps}} |
| 188 |
|
|
} |
| 189 |
|
|
|
| 190 |
|
|
\centerline{ |
| 191 |
|
|
\subfigure[{\footnotesize |
| 192 |
|
|
-36 months}] |
| 193 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim218_cmax5.0E+01.eps}} |
| 194 |
|
|
% |
| 195 |
|
|
\subfigure[{\footnotesize |
| 196 |
|
|
-48 months}] |
| 197 |
|
|
{\includegraphics*[width=0.44\linewidth]{\fpath/run_4yr_ADJtheta_arc_lev1_tim292_cmax5.0E+01.eps}} |
| 198 |
|
|
} |
| 199 |
|
|
\caption{Same as \reffig{4yradjheff} but for sea surface temperature |
| 200 |
|
|
\label{fig:4yradjthetalev1}} |
| 201 |
|
|
\end{figure} |
| 202 |
mlosch |
1.2 |
|
| 203 |
|
|
%%% Local Variables: |
| 204 |
|
|
%%% mode: latex |
| 205 |
|
|
%%% TeX-master: "ceaice" |
| 206 |
|
|
%%% End: |