IODA
05a-ObsGroup.cpp
Go to the documentation of this file.
1 /*
2  * (C) Copyright 2020-2021 UCAR
3  *
4  * This software is licensed under the terms of the Apache Licence Version 2.0
5  * which can be obtained at http://www.apache.org/licenses/LICENSE-2.0.
6  */
7 /*! \addtogroup ioda_cxx_ex
8  *
9  * @{
10  *
11  * \defgroup ioda_cxx_ex_5a Ex 5a: ObsGroups
12  * \brief Constructing ObsGroups
13  * \see 05a-ObsGroup.cpp for comments and the walkthrough.
14  *
15  * @{
16  *
17  * \file 05a-ObsGroup.cpp
18  * \brief ObsGroup
19  *
20  * The ObsGroup class is derived from the Group class and provides some help in
21  * organizing your groups, variables, attributes and dimension scales into a cohesive
22  * structure intended to house observation data. In this case "structure" refers to the
23  * hierarchical layout of the groups and the proper management of dimension scales
24  * associated with the variables.
25  *
26  * The ObsGroup and underlying layout policies (internal to ioda-engines) present a stable
27  * logical group hierarchical layout to the client while keeping the actual layout implemented
28  * in the backend open to change. The logical "frontend" layout appears to the client to be
29  * as shown below:
30  *
31  * layout notes
32  *
33  * / top-level group
34  * nlocs dimension scales (variables, coordinate values)
35  * nchans
36  * ...
37  * ObsValue/ group: observational measurement values
38  * brightness_temperature variable: Tb, 2D, nlocs X nchans
39  * air_temperature variable: T, 1D, nlocs
40  * ...
41  * ObsError/ group: observational error estimates
42  * brightness_temperature
43  * air_temperature
44  * ...
45  * PreQC/ group: observational QC marks from data provider
46  * brightness_temperature
47  * air_temperature
48  * ...
49  * MetaData/ group: meta data associated with locations
50  * latitude
51  * longitude
52  * datetime
53  * ...
54  * ...
55  *
56  * It is intended to keep this layout stable so that the client interface remains stable.
57  * The actual layout used in the various backends can optionally be organized differently
58  * according to their needs.
59  *
60  * The ObsGroup class also assists with the management of dimension scales. For example, if
61  * a dimension is resized, the ObsGroup::resize function will resize the dimension scale
62  * along with all variables that use that dimension scale.
63  *
64  * The basic ideas is to dimension observation data with nlocs as the first dimension, and
65  * allow nlocs to be resizable so that it's possible to incrementally append data along
66  * the nlocs (1st) dimension. For data that have rank > 1, the second through nth dimensions
67  * are of fixed size. For example, brightness_temperature can be store as 2D data with
68  * dimensions (nlocs, nchans).
69  *
70  * \author Stephen Herbener (stephenh@ucar.edu), Ryan Honeyager (honeyage@ucar.edu)
71  **/
72 
73 #include <array> // Arrays are fixed-length vectors.
74 #include <iostream> // We want I/O.
75 #include <string> // We want strings
76 #include <valarray> // Like a vector, but can also do basic element-wise math.
77 #include <vector> // We want vectors
78 
79 #include "Eigen/Dense" // Eigen Arrays and Matrices
80 #include "ioda/Engines/Factory.h" // Used to kickstart the Group engine.
81 #include "ioda/Exception.h" // Exceptions and debugging
82 #include "ioda/Group.h" // Groups have attributes.
83 #include "ioda/ObsGroup.h"
84 #include "unsupported/Eigen/CXX11/Tensor" // Eigen Tensors
85 
86 int main(int argc, char** argv) {
87  using namespace ioda; // All of the ioda functions are here.
88  using std::cerr;
89  using std::endl;
90  using std::string;
91  using std::vector;
92  try {
93  // Create the backend. For this code we are using a factory function,
94  // constructFromCmdLine, made for testing purposes, which allows one to specify
95  // a backend from the command line using the "--ioda-engine-options" option.
96  //
97  // There exists another factory function, constructBackend, which allows you
98  // to create a backend without requiring the command line option. The signature
99  // for this function is:
100  //
101  // constructBackend(BackendNames, BackendCreationParameters &);
102  //
103  //
104  // BackendNames is an enum type with values:
105  // Hdf5File - file backend using HDF5 file
106  // ObsStore - in-memory backend
107  //
108  // BackendCreationParameters is a C++ structure with members:
109  // fileName - string, used for file backend
110  //
111  // actions - enum BackendFileActions type:
112  // Create - create a new file
113  // Open - open an existing file
114  //
115  // createMode - enum BackendCreateModes type:
116  // Truncate_If_Exists - overwrite existing file
117  // Fail_If_Exists - throw exception if file exists
118  //
119  // openMode - enum BackendOpenModes types:
120  // Read_Only - open in read only mode
121  // Read_Write - open in modify mode
122  //
123  // Here are some code examples:
124  //
125  // Create backend using an hdf5 file for writing:
126  //
127  // Engines::BackendNames backendName;
128  // backendName = Engines::BackendNames::Hdf5File;
129  //
130  // Engines::BackendCreationParameters backendParams;
131  // backendParams.fileName = fileName;
132  // backendParams.action = Engines::BackendFileActions::Create;
133  // backendParams.createMode = Engines::BackendCreateModes::Truncate_If_Exists;
134  //
135  // Group g = constructBackend(backendName, backendParams);
136  //
137  // Create backend using an hdf5 file for reading:
138  //
139  // Engines::BackendNames backendName;
140  // backendName = Engines::BackendNames::Hdf5File;
141  //
142  // Engines::BackendCreationParameters backendParams;
143  // backendParams.fileName = fileName;
144  // backendParams.action = Engines::BackendFileActions::Open;
145  // backendParams.openMode = Engines::BackendOpenModes::Read_Only;
146  //
147  // Group g = constructBackend(backendName, backendParams);
148  //
149  // Create an in-memory backend:
150  //
151  // Engines::BackendNames backendName;
152  // backendName = Engines::BackendNames::ObsStore;
153  //
154  // Engines::BackendCreationParameters backendParams;
155  //
156  // Group g = constructBackend(backendName, backendParams);
157  //
158 
159  // Create the backend using the command line construct function
160  Group g = Engines::constructFromCmdLine(argc, argv, "Example-05a.hdf5");
161 
162  // Create an ObsGroup object using the ObsGroup::generate function. This function
163  // takes a Group argument (the backend we just created above) and a vector of dimension
164  // creation specs.
165  const int numLocs = 40;
166  const int numChans = 30;
167 
168  // The NewDimensionsScales_t is a vector, that holds specs for one dimension scale
169  // per element. An individual dimension scale spec is held in a NewDimensionsScale
170  // object, whose constructor arguments are:
171  // 1st - dimension scale name
172  // 2nd - size of dimension. May be zero.
173  // 3rd - maximum size of dimension
174  // resizeable dimensions are said to have "unlimited" size, so there
175  // is a built-in variable ("Unlimited") that can be used to denote
176  // unlimited size. If Unspecified (the default), we assume that the
177  // maximum size is the same as the initial size (the previous parameter).
178  // 4th - suggested chunk size for dimension (and associated variables).
179  // This defaults to the initial size. This parameter must be nonzero. If
180  // the initial size is zero, it must be explicitly specified.
181  //
183  newDims.push_back(NewDimensionScale<int>("nlocs", numLocs, Unlimited, numLocs));
184  newDims.push_back(NewDimensionScale<int>("nchans", numChans, numChans, numChans));
185 
186  // Construct an ObsGroup object, with 2 dimensions nlocs, nchans, and attach
187  // the backend we constructed above. Under the hood, the ObsGroup::generate function
188  // initializes the dimension coordinate values to index numbering 1..n. This can be
189  // overwritten with other coordinate values if desired.
191 
192  // We now have the top-level group containing the two dimension scales. We need
193  // Variable objects for these dimension scales later on for creating variables so
194  // build those now.
195  ioda::Variable nlocsVar = og.vars["nlocs"];
196  ioda::Variable nchansVar = og.vars["nchans"];
197 
198  // Next let's create the variables. The variable names should be specified using the
199  // hierarchy as described above. For example, the variable brightness_temperature
200  // in the group ObsValue is specified in a string as "ObsValue/brightness_temperature".
201  string tbName = "ObsValue/brightness_temperature";
202  string latName = "MetaData/latitude";
203  string lonName = "MetaData/longitude";
204 
205  // Set up the creation parameters for the variables. All three variables in this case
206  // are float types, so they can share the same creation parameters object.
208  float_params.chunk = true; // allow chunking
209  float_params.compressWithGZIP(); // compress using gzip
210  float_params.setFillValue<float>(-999); // set the fill value to -999
211 
212  // Create the variables. Note the use of the createWithScales function. This should
213  // always be used when working with an ObsGroup object.
214  Variable tbVar = og.vars.createWithScales<float>(tbName, {nlocsVar, nchansVar}, float_params);
215  Variable latVar = og.vars.createWithScales<float>(latName, {nlocsVar}, float_params);
216  Variable lonVar = og.vars.createWithScales<float>(lonName, {nlocsVar}, float_params);
217 
218  // Add attributes to variables. In this example, we are adding enough attribute
219  // information to allow Panoply to be able to plot the ObsValue/brightness_temperature
220  // variable. Note the "coordinates" attribute on tbVar. It is sufficient to just
221  // give the variable names (without the group structure) to Panoply (which apparently
222  // searches the entire group structure for these names). If you want to follow this
223  // example in your code, just give the variable names without the group prefixes
224  // to insulate your code from any subsequent group structure changes that might occur.
225  tbVar.atts.add<std::string>("coordinates", {"longitude latitude nchans"}, {1})
226  .add<std::string>("long_name", {"ficticious brightness temperature"}, {1})
227  .add<std::string>("units", {"K"}, {1})
228  .add<float>("valid_range", {100.0, 400.0}, {2});
229  latVar.atts.add<std::string>("long_name", {"latitude"}, {1})
230  .add<std::string>("units", {"degrees_north"}, {1})
231  .add<float>("valid_range", {-90.0, 90.0}, {2});
232  lonVar.atts.add<std::string>("long_name", {"longitude"}, {1})
233  .add<std::string>("units", {"degrees_east"}, {1})
234  .add<float>("valid_range", {-360.0, 360.0}, {2});
235 
236  // Let's create some data for this example.
237  Eigen::ArrayXXf tbData(numLocs, numChans);
238  std::vector<float> lonData(numLocs);
239  std::vector<float> latData(numLocs);
240  float midLoc = static_cast<float>(numLocs) / 2.0f;
241  float midChan = static_cast<float>(numChans) / 2.0f;
242  for (std::size_t i = 0; i < numLocs; ++i) {
243  lonData[i] = static_cast<float>(i % 8) * 3.0f;
244  // We use static code analysis tools to check for potential bugs
245  // in our source code. On the next line, the clang-tidy tool warns about
246  // our use of integer division before casting to a float. Since there is
247  // no actual bug, we indicate this with NOLINT.
248  latData[i] = static_cast<float>(i / 8) * 3.0f; // NOLINT(bugprone-integer-division)
249  for (std::size_t j = 0; j < numChans; ++j) {
250  float del_i = static_cast<float>(i) - midLoc;
251  float del_j = static_cast<float>(j) - midChan;
252  tbData(i, j) = 250.0f + sqrt(del_i * del_i + del_j * del_j);
253  }
254  }
255 
256  // Write the data into the variables.
257  tbVar.writeWithEigenRegular(tbData);
258  latVar.write(latData);
259  lonVar.write(lonData);
260 
261  // Done!
262  } catch (const std::exception& e) {
264  return 1;
265  }
266  return 0;
267 }
268 
269 /// @}
270 /// @}
IODA's error system.
Definitions for setting up backends with file and memory I/O.
Interfaces for ioda::Group and related classes.
Interfaces for ioda::ObsGroup and related classes.
Groups are a new implementation of ObsSpaces.
Definition: Group.h:159
An ObsGroup is a specialization of a ioda::Group. It provides convenience functions and guarantees th...
Definition: ObsGroup.h:32
static ObsGroup generate(Group &emptyGroup, const NewDimensionScales_t &fundamentalDims, std::shared_ptr< const detail::DataLayoutPolicy > layout=nullptr)
Create an empty ObsGroup and populate it with the fundamental dimensions.
Definition: ObsGroup.cpp:72
Variables store data!
Definition: Variable.h:680
IODA_DL Group constructFromCmdLine(int argc, char **argv, const std::string &defaultFilename)
This is a wrapper function around the constructBackend function for creating a backend based on comma...
Definition: Factory.cpp:21
int main(int argc, char **argv)
string latName
Definition: 05-ObsGroup.py:118
list newDims
Definition: 05-ObsGroup.py:95
string tbName
Definition: 05-ObsGroup.py:117
string lonName
Definition: 05-ObsGroup.py:119
constexpr int Unlimited
Specifies that a dimension is resizable to infinity.
IODA_DL void unwind_exception_stack(const std::exception &e, std::ostream &out=std::cerr, int level=0)
Convenience function for unwinding an exception stack.
Definition: Exception.cpp:48
std::vector< std::shared_ptr< NewDimensionScale_Base > > NewDimensionScales_t
Used to specify Variable creation-time properties.
Definition: Has_Variables.h:57
VariableCreationParameters & setFillValue(DataType fill)
Definition: Has_Variables.h:69
bool chunk
Do we chunk this variable? Required for extendible / compressible Variables.
Definition: Has_Variables.h:84