IODA
04-VariablesAttributesAndDimensions.py
Go to the documentation of this file.
1 #
2 # (C) Copyright 2020-2021 UCAR
3 #
4 # This software is licensed under the terms of the Apache Licence Version 2.0
5 # which can be obtained at http://www.apache.org/licenses/LICENSE-2.0.
6 #
7 
8 # This example supplements the previous discussion on variables.
9 #
10 # Variables store data, but how should this data be interpreted? This is the
11 # purpose of attributes. Attributes are bits of metadata that can describe groups
12 # and variables. Good examples of attributes include tagging the units of a variable,
13 # describing a variable, listing a variable's valid range, or "coding" missing
14 # or invalid values. Other examples include tagging the source of your data,
15 # or recording when you ingested / converted data into the ioda format.
16 #
17 # Basic manipulation of attributes was already discussed in Tutorial 2. Now, we want to
18 # focus instead on good practices with tagging your data.
19 #
20 # Supplementing attributes, we introduce the concept of adding "dimension scales" to your
21 # data. Basically, your data have dimensions, but we want to attach a "meaning" to each
22 # axis of your data. Typically, the first axis corresponds to your data's Location.
23 # A possible second axis for brightness temperature data might be "instrument channel", or
24 # maybe "pressure level". This tutorial will show you how to create new dimension scales and
25 # attach them to new Variables.
26 
27 import os
28 import sys
29 
30 import ioda
31 
33  name = "Example-04-python.hdf5",
34  mode = ioda.Engines.BackendCreateModes.Truncate_If_Exists)
35 
36 # Let's start with dimensions and Dimension Scales.
37 #
38 # ioda stores data using Variables, and you can view each variable as a
39 # multidimensional matrix of data. This matrix has dimensions.
40 # A dimension may be used to represent a real physical dimension, for example,
41 # time, latitude, longitude, or height. A dimension might also be used to
42 # index more abstract quantities, for example, color-table entry number,
43 # instrument number, station-time pair, or model-run ID
44 #
45 # A dimension scale is simply another variable that provides context, or meaning,
46 # to a particular dimension. For example, you might have ATMS brightness
47 # temperature information that has dimensions of location by channel number. In ioda,
48 # we want every "axis" of a variable to be associated with an explanatory dimension.
49 #
50 # Let's create a few dimensions... Note: when working with an already-existing Obs Space,
51 # (covered later), these dimensions may already be present.
52 #
53 # Create two dimensions, "nlocs", and "ATMS Channel". Set distinct values within
54 # these dimensions.
55 
56 num_locs = 3000
57 num_channels = 23
58 
59 dim_location = g.vars.create('nlocs', ioda.Types.int32, [ num_locs ])
60 dim_location.scales.setIsScale('nlocs')
61 
62 dim_channel = g.vars.create('ATMS Channel', ioda.Types.int32, [num_channels])
63 dim_channel.scales.setIsScale('ATMS Channel')
64 
65 # Now that we have created dimensions, we can create new variables and attach the
66 # dimensions to our data.
67 #
68 # But first, a note about attributes:
69 # Attributes provide metadata that describe our variables.
70 # In IODA, we at least must to keep track of each variable's:
71 # - Units (in SI; we follow CF conventions)
72 # - Long name
73 # - Range of validity. Data outside of this range are automatically rejected for
74 # future processing.
75 #
76 # Let's create variables for Latitude, Longitude and for
77 # ATMS Observed Brightness Temperature.
78 #
79 # There are two ways to define a variable that has attached dimensions.
80 # First, we can explicitly create a variable and set its dimensions.
81 #
82 # Longitude has dimensions of nlocs. It has units of degrees_east, and has
83 # a valid_range of (-180,180).
84 
85 longitude = g.vars.create('MetaData/Longitude', ioda.Types.float, [num_locs])
86 longitude.scales.set([dim_location])
87 longitude.atts.create('valid_range', ioda.Types.float, [2]).writeVector.float([-180, 180])
88 longitude.atts.create('units', ioda.Types.str).writeVector.str(['degrees_east'])
89 longitude.atts.create('long_name', ioda.Types.str).writeVector.str(['Longitude'])
90 
91 # The above method is a bit clunky because you have to make sure that the new variable's
92 # dimensions match the sizes of each dimension.
93 # Here, we do the same variable creation, but instead use dimension scales to determine sizes.
94 latitude = g.vars.create('MetaData/Latitude', ioda.Types.float, scales=[dim_location])
95 # Latitude has units of degrees_north, and a valid_range of (-90,90).
96 latitude.atts.create('valid_range', ioda.Types.float, [2]).writeVector.float([-90,90])
97 latitude.atts.create('units', ioda.Types.str).writeVector.str(['degrees_north'])
98 latitude.atts.create('long_name', ioda.Types.str).writeVector.str(['Latitude'])
99 
100 # The ATMS Brightness Temperature depends on both location and instrument channel number.
101 tb = g.vars.create('ObsValue/Brightness_Temperature', ioda.Types.float, scales=[dim_location, dim_channel])
102 tb.atts.create('valid_range', ioda.Types.float, [2]).writeVector.float([100,500])
103 tb.atts.create('units', ioda.Types.str).writeVector.str(['K'])
104 tb.atts.create('long_name', ioda.Types.str).writeVector.str(['ATMS Observed (Uncorrected) Brightness Temperature'])
105 
106 
107 # Variable Parameter Packs
108 # When creating variables, you can also provide an optional
109 # VariableCreationParameters structure. This struct lets you specify
110 # the variable's fill value (a default value that is a placeholder for unwritten data).
111 # It also lets you specify whether you want to compress the data stored in the variable,
112 # and how you want to store the variable (contiguously or in chunks).
114 
115 # Variable storage: contiguous or chunked
116 # See https://support.hdfgroup.org/HDF5/doc/Advanced/Chunking/ and
117 # https://www.unidata.ucar.edu/blogs/developer/en/entry/chunking_data_why_it_matters
118 # for detailed explanations.
119 # To tell ioda that you want to chunk a variable:
120 p1.chunk = True
121 p1.chunks = [100]
122 
123 # Fill values:
124 # The "fill value" for a dataset is the specification of the default value assigned
125 # to data elements that have not yet been written.
126 # When you first create a variable, it is empty. If you read in a part of a variable that
127 # you have not filled with data, then ioda has to return fake, filler data.
128 # A fill value is a "bit pattern" that tells ioda what to return.
129 p1.setFillValue.float(-999)
130 
131 # Compression
132 # If you are using chunked storage, you can tell ioda that you want to compress
133 # the data using ZLIB or SZIP.
134 # - ZLIB / GZIP:
135 p1.compressWithGZIP()
136 
137 
138 # Let's create one final variable, "Solar Zenith Angle", and let's use
139 # out new variable creation parameters.
140 sza = g.vars.create(name='ObsValue/Solar Zenith Angle', dtype=ioda.Types.float, scales=[dim_location], params=p1)
141 
142 
IODA_DL Group createFile(const std::string &filename, BackendCreateModes mode, HDF5_Version_Range compat=defaultVersionRange())
Create a ioda::Group backed by an HDF5 file.
Definition: HH.cpp:120
Used to specify Variable creation-time properties.
Definition: Has_Variables.h:57