| 54 |
|
|
| 55 |
$\bullet$ An example of \file{W2\_EXCH2\_TOPOLOGY.h} and |
$\bullet$ An example of \file{W2\_EXCH2\_TOPOLOGY.h} and |
| 56 |
\file{w2\_e2setup.F} must reside in a directory containing files |
\file{w2\_e2setup.F} must reside in a directory containing files |
| 57 |
symbolically linked when \file{genmake2} runs. The safest place to |
symbolically linked by the \file{genmake2} script. The safest place to |
| 58 |
put these is the directory indicated in the \code{-mods=DIR} command |
put these is the directory indicated in the \code{-mods=DIR} command |
| 59 |
line modifier (typically \file{../code}), or the build directory. |
line modifier (typically \file{../code}), or the build directory. |
| 60 |
The default versions of these files reside in \file{pkg/exch2} and |
The default versions of these files reside in \file{pkg/exch2} and |
| 77 |
\file{W2\_EXCH2\_TOPOLOGY.h} and \file{w2\_e2setup.F}. Domain |
\file{W2\_EXCH2\_TOPOLOGY.h} and \file{w2\_e2setup.F}. Domain |
| 78 |
decomposition issues particular to exch2 are addressed in Section |
decomposition issues particular to exch2 are addressed in Section |
| 79 |
\ref{sec:topogen} \sectiontitle{Generating Topology Files for exch2} |
\ref{sec:topogen} \sectiontitle{Generating Topology Files for exch2} |
| 80 |
and \ref{sec:exch2mpi} \sectiontitle{exch2, SIZE.h, and MPI}; a more |
and \ref{sec:exch2mpi} \sectiontitle{exch2, SIZE.h, and Multiprocessing}; a more |
| 81 |
general background on the subject relevant to MITgcm is presented in |
general background on the subject relevant to MITgcm is presented in |
| 82 |
Section \ref{sect:specifying_a_decomposition} |
Section \ref{sect:specifying_a_decomposition} |
| 83 |
\sectiontitle{Specifying a decomposition}.\\ |
\sectiontitle{Specifying a decomposition}.\\ |
| 107 |
exch2 topology files \file{W2\_EXCH2\_TOPOLOGY.h} and |
exch2 topology files \file{W2\_EXCH2\_TOPOLOGY.h} and |
| 108 |
\file{w2\_e2setup.F} in the working directory and displays a figure of |
\file{w2\_e2setup.F} in the working directory and displays a figure of |
| 109 |
the topology via Matlab -- figures \ref{fig:6tile}, \ref{fig:12tile}, |
the topology via Matlab -- figures \ref{fig:6tile}, \ref{fig:12tile}, |
| 110 |
and \ref{fig:24tile} are examples. The other m-files in the directory are |
and \ref{fig:24tile} are examples of the generated diagrams. The other |
| 111 |
subroutines of \file{driver.m} and should not be run ``bare'' except |
m-files in the directory are |
| 112 |
|
subroutines called from \file{driver.m} and should not be run ``bare'' except |
| 113 |
for development purposes. \\ |
for development purposes. \\ |
| 114 |
|
|
| 115 |
The parameters that determine the dimensions and topology of the |
The parameters that determine the dimensions and topology of the |
| 116 |
generated configuration are \code{nr}, \code{nb}, \code{ng}, |
generated configuration are \code{nr}, \code{nb}, \code{ng}, |
| 117 |
\code{tnx} and \code{tny}, and all are assigned early in the script. \\ |
\code{tnx} and \code{tny}, and all are assigned early in the script. \\ |
| 118 |
|
|
| 119 |
The first three determine the size of the subdomains and |
The first three determine the height and width of the subdomains and |
| 120 |
hence the size of the overall domain. Each one determines the number |
hence the size of the overall domain. Each one determines the number |
| 121 |
of grid points, and therefore the resolution, along the subdomain |
of grid points, and therefore the resolution, along the subdomain |
| 122 |
sides in a ``great circle'' around each the three spatial axes of the cube. At the time |
sides in a ``great circle'' around each the three spatial axes of the cube. At the time |
| 123 |
of this writing MITgcm requires these three parameters to be equal, |
of this writing MITgcm requires these three parameters to be equal, |
| 124 |
but they provide for future releases to accomodate different |
but they provide for future releases to accomodate different |
| 125 |
resolutions around the axes to allow (for example) greater resolution |
resolutions around the axes to allow subdomains with differing resolutions.\\ |
|
around the equator.\\ |
|
| 126 |
|
|
| 127 |
The parameters \code{tnx} and \code{tny} determine the width and height of |
The parameters \code{tnx} and \code{tny} determine the width and height of |
| 128 |
the tiles into which the subdomains are decomposed, and must evenly |
the tiles into which the subdomains are decomposed, and must evenly |
| 182 |
|
|
| 183 |
|
|
| 184 |
|
|
| 185 |
\subsection{exch2, SIZE.h, and multiprocessing} |
\subsection{exch2, SIZE.h, and Multiprocessing} |
| 186 |
\label{sec:exch2mpi} |
\label{sec:exch2mpi} |
| 187 |
|
|
| 188 |
Once the topology configuration files are created, the Fortran |
Once the topology configuration files are created, the Fortran |
| 190 |
Section \ref{sect:specifying_a_decomposition} \sectiontitle{Specifying |
Section \ref{sect:specifying_a_decomposition} \sectiontitle{Specifying |
| 191 |
a decomposition} provides a general description of domain |
a decomposition} provides a general description of domain |
| 192 |
decomposition within MITgcm and its relation to \file{SIZE.h}. The |
decomposition within MITgcm and its relation to \file{SIZE.h}. The |
| 193 |
current section specifies certain constraints the exch2 package |
current section specifies constraints that the exch2 package |
| 194 |
imposes as well as describes how to enable parallel execution with |
imposes and describes how to enable parallel execution with |
| 195 |
MPI. \\ |
MPI. \\ |
| 196 |
|
|
| 197 |
As in the general case, the parameters \varlink{sNx}{sNx} and |
As in the general case, the parameters \varlink{sNx}{sNx} and |
| 207 |
The parameters \varlink{nSx}{nSx}, \varlink{nSy}{nSy}, |
The parameters \varlink{nSx}{nSx}, \varlink{nSy}{nSy}, |
| 208 |
\varlink{nPx}{nPx}, and \varlink{nPy}{nPy} relate to the number of |
\varlink{nPx}{nPx}, and \varlink{nPy}{nPy} relate to the number of |
| 209 |
tiles and how they are distributed on processors. When using exch2, |
tiles and how they are distributed on processors. When using exch2, |
| 210 |
the tiles are stored in a single dimension, and so |
the tiles are stored in the $x$ dimension, and so |
| 211 |
\code{\varlink{nSy}{nSy}=1} in all cases. Since the tiles as |
\code{\varlink{nSy}{nSy}=1} in all cases. Since the tiles as |
| 212 |
configured by exch2 cannot be split up accross processors without |
configured by exch2 cannot be split up accross processors without |
| 213 |
regenerating the topology, \code{\varlink{nPy}{nPy}=1} as well. \\ |
regenerating the topology, \code{\varlink{nPy}{nPy}=1} as well. \\ |
| 215 |
The number of tiles MITgcm allocates and how they are distributed |
The number of tiles MITgcm allocates and how they are distributed |
| 216 |
between processors depends on \varlink{nPx}{nPx} and |
between processors depends on \varlink{nPx}{nPx} and |
| 217 |
\varlink{nSx}{nSx}. \varlink{nSx}{nSx} is the number of tiles per |
\varlink{nSx}{nSx}. \varlink{nSx}{nSx} is the number of tiles per |
| 218 |
processor and \varlink{nPx}{nPx} the number of processors. The total |
processor and \varlink{nPx}{nPx} is the number of processors. The total |
| 219 |
number of tiles in the topology minus those listed in |
number of tiles in the topology minus those listed in |
| 220 |
\file{blanklist.txt} must equal \code{nSx*nPx}. \\ |
\file{blanklist.txt} must equal \code{nSx*nPx}. Note that in order to |
| 221 |
|
obtain maximum usage from a given number of processors in some cases, |
| 222 |
|
this restriction might entail sharing a processor with a tile that would |
| 223 |
|
otherwise be excluded. \\ |
| 224 |
|
|
| 225 |
The following is an example of \file{SIZE.h} for the twelve-tile |
The following is an example of \file{SIZE.h} for the twelve-tile |
| 226 |
configuration illustrated in figure \ref{fig:12tile} running on |
configuration illustrated in figure \ref{fig:12tile} running on |
| 290 |
topology of twenty-four square tiles, four per subdomain (as in figure |
topology of twenty-four square tiles, four per subdomain (as in figure |
| 291 |
\ref{fig:24tile}), will have \code{exch2\_domain\_nxt=12} and |
\ref{fig:24tile}), will have \code{exch2\_domain\_nxt=12} and |
| 292 |
\code{exch2\_domain\_nyt=2}. Note that these parameters express the |
\code{exch2\_domain\_nyt=2}. Note that these parameters express the |
| 293 |
tile layout to allow global data files that are tile-layout-neutral |
tile layout in order to allow global data files that are tile-layout-neutral. |
| 294 |
and have no bearing on the internal storage of the arrays. The tiles |
They have no bearing on the internal storage of the arrays. The tiles |
| 295 |
are stored internally in a range from \code{(1:\varlink{bi}{bi})} the |
are stored internally in a range from \code{\varlink{bi}{bi}=(1:NTILES)} in the |
| 296 |
$x$ axis, and the $y$ axis variable \varlink{bj}{bj} is assumed to |
$x$ axis, and the $y$ axis variable \varlink{bj}{bj} is assumed to |
| 297 |
equal \code{1} throughout the package. \\ |
equal \code{1} throughout the package. \\ |
| 298 |
|
|
| 299 |
\subsubsection{Arrays Indexed to Tile Number} |
\subsubsection{Arrays indexed to tile number} |
| 300 |
|
|
| 301 |
The following arrays are of length \code{NTILES} and are indexed to |
The following arrays are of length \code{NTILES} and are indexed to |
| 302 |
the tile number, which is indicated in the diagrams with the notation |
the tile number, which is indicated in the diagrams with the notation |
| 306 |
\varlink{exch2\_tny}{exch2_tny} express the $x$ and $y$ dimensions of |
\varlink{exch2\_tny}{exch2_tny} express the $x$ and $y$ dimensions of |
| 307 |
each tile. At present for each tile \texttt{exch2\_tnx=sNx} and |
each tile. At present for each tile \texttt{exch2\_tnx=sNx} and |
| 308 |
\texttt{exch2\_tny=sNy}, as assigned in \file{SIZE.h} and described in |
\texttt{exch2\_tny=sNy}, as assigned in \file{SIZE.h} and described in |
| 309 |
section \ref{sec:exch2mpi} \sectiontitle{exch2, SIZE.h, and |
Section \ref{sec:exch2mpi} \sectiontitle{exch2, SIZE.h, and |
| 310 |
multiprocessing}. Future releases of MITgcm may allow varying tile |
Multiprocessing}. Future releases of MITgcm may allow varying tile |
| 311 |
sizes. \\ |
sizes. \\ |
| 312 |
|
|
| 313 |
The location of the tiles' Cartesian origin within a subdomain are |
The arrays \varlink{exch2\_tbasex}{exch2_tbasex} and |
| 314 |
determined by the arrays \varlink{exch2\_tbasex}{exch2_tbasex} and |
\varlink{exch2\_tbasey}{exch2_tbasey} determine the tiles' |
| 315 |
\varlink{exch2\_tbasey}{exch2_tbasey}. These variables are used to |
Cartesian origin within a subdomain |
| 316 |
relate the location of the edges of different tiles to each other. As |
and locate the edges of different tiles relative to each other. As |
| 317 |
an example, in the default six-tile topology (Fig. \ref{fig:6tile}) |
an example, in the default six-tile topology (Fig. \ref{fig:6tile}) |
| 318 |
each index in these arrays is set to \code{0} since a tile occupies |
each index in these arrays is set to \code{0} since a tile occupies |
| 319 |
its entire subdomain. The twenty-four-tile case discussed above will |
its entire subdomain. The twenty-four-tile case discussed above will |
| 320 |
have values of \code{0} or \code{16}, depending on the quadrant the |
have values of \code{0} or \code{16}, depending on the quadrant of the |
| 321 |
tile falls within the subdomain. The elements of the arrays |
tile within the subdomain. The elements of the arrays |
| 322 |
\varlink{exch2\_txglobalo}{exch2_txglobalo} and |
\varlink{exch2\_txglobalo}{exch2_txglobalo} and |
| 323 |
\varlink{exch2\_txglobalo}{exch2_txglobalo} are similar to |
\varlink{exch2\_txglobalo}{exch2_txglobalo} are similar to |
| 324 |
\varlink{exch2\_tbasex}{exch2_tbasex} and |
\varlink{exch2\_tbasex}{exch2_tbasex} and |
| 325 |
\varlink{exch2\_tbasey}{exch2_tbasey}, but locate the tiles within the |
\varlink{exch2\_tbasey}{exch2_tbasey}, but locate the tile edges within the |
| 326 |
global address space, similar to that used by global output and input |
global address space, similar to that used by global output and input |
| 327 |
files. \\ |
files. \\ |
| 328 |
|
|
| 331 |
standard cube topology and indicated by \textbf{\textsf{f}}$n$ in |
standard cube topology and indicated by \textbf{\textsf{f}}$n$ in |
| 332 |
figures \ref{fig:12tile} and |
figures \ref{fig:12tile} and |
| 333 |
\ref{fig:24tile}. \varlink{exch2\_nNeighbours}{exch2_nNeighbours} |
\ref{fig:24tile}. \varlink{exch2\_nNeighbours}{exch2_nNeighbours} |
| 334 |
contains a count of the neighboring tiles each tile has, and is used |
contains a count of the neighboring tiles each tile has, and sets |
| 335 |
for setting bounds for looping over neighboring tiles. |
the bounds for looping over neighboring tiles. |
| 336 |
\varlink{exch2\_tProc}{exch2_tProc} holds the process rank of each |
\varlink{exch2\_tProc}{exch2_tProc} holds the process rank of each |
| 337 |
tile, and is used in interprocess communication. \\ |
tile, and is used in interprocess communication. \\ |
| 338 |
|
|
| 341 |
\varlink{exch2\_isEedge}{exch2_isEedge}, |
\varlink{exch2\_isEedge}{exch2_isEedge}, |
| 342 |
\varlink{exch2\_isSedge}{exch2_isSedge}, and |
\varlink{exch2\_isSedge}{exch2_isSedge}, and |
| 343 |
\varlink{exch2\_isNedge}{exch2_isNedge} are set to \code{1} if the |
\varlink{exch2\_isNedge}{exch2_isNedge} are set to \code{1} if the |
| 344 |
indexed tile lies on the respective edge of a subdomain, \code{0} if |
indexed tile lies on the edge of its subdomain, \code{0} if |
| 345 |
not. The values are used within the topology generator to determine |
not. The values are used within the topology generator to determine |
| 346 |
the orientation of neighboring tiles, and to indicate whether a tile |
the orientation of neighboring tiles, and to indicate whether a tile |
| 347 |
lies on the corner of a subdomain. The latter case requires special |
lies on the corner of a subdomain. The latter case requires special |
| 374 |
The arrays \varlink{exch2\_pi}{exch2_pi} and |
The arrays \varlink{exch2\_pi}{exch2_pi} and |
| 375 |
\varlink{exch2\_pj}{exch2_pj} specify the transformations of indices |
\varlink{exch2\_pj}{exch2_pj} specify the transformations of indices |
| 376 |
in exchanges between the neighboring tiles. These transformations are |
in exchanges between the neighboring tiles. These transformations are |
| 377 |
necessary in exchanges between subdomains because the array index in |
necessary in exchanges between subdomains because a horizontal dimension |
| 378 |
one dimension may map to the other index in an adjacent subdomain, and |
in one subdomain |
| 379 |
may be have its indexing reversed. This swapping arises from the |
may map to other horizonal dimension in an adjacent subdomain, and |
| 380 |
|
may also have its indexing reversed. This swapping arises from the |
| 381 |
``folding'' of two-dimensional arrays into a three-dimensional |
``folding'' of two-dimensional arrays into a three-dimensional |
| 382 |
cube. \\ |
cube. \\ |
| 383 |
|
|
| 385 |
are the neighbor ID \code{N} and the tile number \code{T} as explained |
are the neighbor ID \code{N} and the tile number \code{T} as explained |
| 386 |
above, plus a vector of length \code{2} containing transformation |
above, plus a vector of length \code{2} containing transformation |
| 387 |
factors \code{t}. The first element of the transformation vector |
factors \code{t}. The first element of the transformation vector |
| 388 |
holds the factor to multiply the index in the same axis, and the |
holds the factor to multiply the index in the same dimension, and the |
| 389 |
second element holds the the same for the orthogonal index. To |
second element holds the the same for the orthogonal dimension. To |
| 390 |
clarify, \code{exch2\_pi(1,N,T)} holds the mapping of the $x$ axis |
clarify, \code{exch2\_pi(1,N,T)} holds the mapping of the $x$ axis |
| 391 |
index of tile \code{T} to the $x$ axis of tile \code{T}'s neighbor |
index of tile \code{T} to the $x$ axis of tile \code{T}'s neighbor |
| 392 |
\code{N}, and \code{exch2\_pi(2,N,T)} holds the mapping of \code{T}'s |
\code{N}, and \code{exch2\_pi(2,N,T)} holds the mapping of \code{T}'s |
| 401 |
\code{(1,0)}, since all tiles on the same subdomain are oriented |
\code{(1,0)}, since all tiles on the same subdomain are oriented |
| 402 |
identically. An axis that corresponds to the orthogonal dimension |
identically. An axis that corresponds to the orthogonal dimension |
| 403 |
with the same index direction in a particular tile-neighbor |
with the same index direction in a particular tile-neighbor |
| 404 |
orientation will have \code{(0,1)}. Those in the opposite index |
orientation will have \code{(0,1)}. Those with the opposite index |
| 405 |
direction will have \code{(0,-1)} in order to reverse the ordering. \\ |
direction will have \code{(0,-1)} in order to reverse the ordering. \\ |
| 406 |
|
|
| 407 |
The arrays \varlink{exch2\_oi}{exch2_oi}, |
The arrays \varlink{exch2\_oi}{exch2_oi}, |
| 487 |
\code{Tn}'s $y$ axis corresponds to \code{T}'s $x$ axis, \code{T}'s |
\code{Tn}'s $y$ axis corresponds to \code{T}'s $x$ axis, \code{T}'s |
| 488 |
northern edge exchanges with \code{Tn}'s western edge. The western |
northern edge exchanges with \code{Tn}'s western edge. The western |
| 489 |
edge of the tiles corresponds to the lower bound of the $x$ axis, so |
edge of the tiles corresponds to the lower bound of the $x$ axis, so |
| 490 |
\code{exch2\_itlo\_c} \code{exch2\_ithi\_c} are \code{0}. The range of |
\code{exch2\_itlo\_c} and \code{exch2\_ithi\_c} are \code{0}, in the |
| 491 |
|
western halo region of \code{Tn}. The range of |
| 492 |
\code{exch2\_jtlo\_c} and \code{exch2\_jthi\_c} correspond to the |
\code{exch2\_jtlo\_c} and \code{exch2\_jthi\_c} correspond to the |
| 493 |
width of \code{T}'s northern edge, plus the halo. \\ |
width of \code{T}'s northern edge, expanded by one into the halo. \\ |
| 494 |
|
|
| 495 |
|
|
| 496 |
\subsection{Key Routines} |
\subsection{Key Routines} |
| 499 |
themselves and are of the same format as those described in |
themselves and are of the same format as those described in |
| 500 |
\ref{sect:cube_sphere_communication} \sectiontitle{Cube sphere |
\ref{sect:cube_sphere_communication} \sectiontitle{Cube sphere |
| 501 |
communication}. Like the original routines, they are written as |
communication}. Like the original routines, they are written as |
| 502 |
templates which the local Makefile converts from RX into RL and RS |
templates which the local Makefile converts from \code{RX} into |
| 503 |
forms. \\ |
\code{RL} and \code{RS} forms. \\ |
| 504 |
|
|
| 505 |
The interfaces with the core model subroutines are |
The interfaces with the core model subroutines are |
| 506 |
\code{EXCH\_UV\_XY\_RX}, \code{EXCH\_UV\_XYZ\_RX} and |
\code{EXCH\_UV\_XY\_RX}, \code{EXCH\_UV\_XYZ\_RX} and |
| 515 |
the singularities at the cube corners. \\ |
the singularities at the cube corners. \\ |
| 516 |
|
|
| 517 |
The separate scalar and vector forms of \code{EXCH2\_RX1\_CUBE} and |
The separate scalar and vector forms of \code{EXCH2\_RX1\_CUBE} and |
| 518 |
\code{EXCH2\_RX2\_CUBE} reflect that the vector-handling subrouine |
\code{EXCH2\_RX2\_CUBE} reflect that the vector-handling subroutine |
| 519 |
needs to pass both the $u$ and $v$ components of the phsical vectors. |
needs to pass both the $u$ and $v$ components of the physical vectors. |
| 520 |
This arises from the topological folding discussed above, where the |
This swapping arises from the topological folding discussed above, where the |
| 521 |
$x$ and $y$ axes get swapped in some cases. This swapping is not an |
$x$ and $y$ axes get swapped in some cases, and is not an |
| 522 |
issue with the scalar version. These subroutines call |
issue with the scalar case. These subroutines call |
| 523 |
\code{EXCH2\_SEND\_RX1} and \code{EXCH2\_SEND\_RX2}, which do most of |
\code{EXCH2\_SEND\_RX1} and \code{EXCH2\_SEND\_RX2}, which do most of |
| 524 |
the work using the variables discussed above. \\ |
the work using the variables discussed above. \\ |
| 525 |
|
|