H5DataWriter Satellite#
| Name | H5DataWriter |
|---|---|
| Description | Satellite receiving data and writing it to HDF5 files |
| Category | Data Receivers |
| Language | Python |
Description#
This satellite receives data from all connected satellites and stores it in an HDF5 file.
Requirements#
The H5DataWriter satellite requires the `[hdf5]` component, which can be installed from PyPI with:

```sh
pip install "ConstellationDAQ[hdf5]"
```

or, for an editable installation from a source checkout, with:

```sh
pip install --no-build-isolation -e ".[hdf5]"
```

This requires the HDF5 development libraries, which can be installed on Debian-based systems with:

```sh
sudo apt install libhdf5-dev
```
Configuration#
Flush interval#
The configuration variable `flush_interval` is a floating point value that controls how often, in seconds, the HDF5 file is flushed, i.e. written to disk. The default value is `10.0` seconds. Smaller values ensure that the file is updated more frequently but might impact performance.
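As a minimal sketch, assuming a TOML-style Constellation configuration file with a `[satellites.<type>.<name>]` section (the satellite name used here is purely illustrative), the parameter could be set like this:

```toml
# illustrative satellite type and name; adjust to your setup
[satellites.H5DataWriter.file_writer]
# flush the HDF5 file to disk every 5 seconds instead of the default 10.0
flush_interval = 5.0
```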
Allow Concurrent Reading (aka ‘Single-Writer-Multiple-Reader’ mode)#
Usually, it is not safe to read from an HDF5 file while data is being written to it. However, a special writing mode available in recent versions of HDF5 allows such concurrent reading: “single-writer multiple-reader” (SWMR) mode.
To enable SWMR for the HDF5 data writer, set the configuration variable `allow_concurrent_reading` to `True`. The default is `False`, disabling this feature.
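Extending the configuration sketch from above (same illustrative names), SWMR mode could be switched on like this:

```toml
[satellites.H5DataWriter.file_writer]
flush_interval = 5.0
# enable "single-writer multiple-reader" mode
allow_concurrent_reading = true
```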
This mode has the following benefits:

- it allows reading the file while it is being written, and
- it improves reliability: “[SWMR] has the added benefit of leaving a file in a valid state even if the writing application crashes before closing the file properly” (see the h5py docs).
However, it also has a couple of requirements and drawbacks, of which the most important are:

- no new objects (e.g. Groups, Datasets or Attributes) can be added to the file after SWMR mode has been enabled;
- data must be appended, and to avoid frequent modification, the Dataset will be resized in larger steps, leading to slightly larger files; and
- a known issue with SWMR exists on Windows systems (see the corresponding issue).
When enabling SWMR, all objects in the file will already be created at the start of the run. When all BOR (begin-of-run) packets have been received from all connected satellites, the file will be set to SWMR mode. After this, incoming data will be appended to the `data` Dataset instead of new Datasets being created for each individual package. See the example below for how to load data from files written in this way.
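The resulting file layout can be inferred from the loading example below: one group per sending satellite, each holding the appended data and metadata Datasets plus their index Datasets (the satellite name `simple_sender` is just an example):

```
my_data_file.h5
└── simple_sender      # one group per sending satellite
    ├── data           # payload bytes of all packets, appended
    ├── data_idx       # end offset of each packet within "data"
    ├── meta           # JSON-encoded metadata of all packets, appended
    └── meta_idx       # end offset of each packet within "meta"
```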
The `H5DataWriter` sends out a Metric called `concurrent_reading_enabled`. Once this is `True`, a reader process can start processing the file. See below for an example of how to interpret the data structure. Examples of how to implement a reader process can be found on the h5py homepage.
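As a minimal sketch of such a reader, the file can be opened with h5py's SWMR support and polled for new packets. The file, group and dataset names follow the loading example below; the polling interval and end-of-data handling are illustrative only:

```python
import time

import h5py

# open the file in SWMR read mode; the writer must have enabled SWMR already,
# e.g. as signaled by the concurrent_reading_enabled Metric
h5file = h5py.File("my_data_file.h5", "r", libver="latest", swmr=True)
data_idx = h5file["simple_sender"]["data_idx"]

packets_seen = 0
while True:
    # pick up any changes the writer has flushed since the last refresh
    data_idx.refresh()
    # index entries are end offsets; entries still at 0 have not been written yet
    available = int((data_idx[:] != 0).sum())
    for i in range(packets_seen, available):
        print(f"new packet #{i}, payload ends at offset {data_idx[i]}")
    packets_seen = available
    time.sleep(1.0)
```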
Example for loading data#
The following example shows how to load data stored in SWMR mode (see the Configuration section above).
```python
import json

import h5py
import numpy as np

# open the h5 file for reading:
fn = "my_data_file.h5"
h5file = h5py.File(fn, "r")

# select the satellite of interest:
grp = h5file["simple_sender"]

# datasets we want to load from:
data = grp["data"]
meta = grp["meta"]

# datasets have been appended to, so we need to use the indices to retrieve
# individual packages:
data_idx = grp["data_idx"]
meta_idx = grp["meta_idx"]

# loop over each package:
prev_data = 0
prev_meta = 0
for i in range(data_idx.shape[0]):
    # start with the meta data!
    # this particular package's meta data ends at position:
    idx = meta_idx[i]
    if idx == 0:
        # reached the end of the received packages.
        break
    # slice the meta data out of the dataset and convert back to a dict:
    m = json.loads(bytes(meta[prev_meta:idx]).decode("utf-8"))
    # store the current index value for the next iteration:
    prev_meta = idx
    # maybe we can determine the data type from the meta data?
    if "dtype" in m:
        dtype = m["dtype"]
    else:
        # use a default; adjust this to your actual dtype!
        dtype = "int16"
    # now load the actual payload data:
    idx = data_idx[i]
    payload = np.array(data[prev_data:idx]).view(np.dtype(dtype))
    prev_data = idx
    print(f"Packet #{i} loaded with metadata '{m}' and payload '{payload}'")
```
Framework Parameters#
This satellite inherits the following framework parameters from its base classes:
Parameters inherited from DataReceiver#

| Parameter | Type | Description | Default Value |
|---|---|---|---|
|  | String | Pattern used to construct the filename for output files. This is interpreted as a Python f-string. |  |
|  | String | Output directory the data files will be stored in. Interpreted as a path which can be absolute or relative to the current directory. |  |
Parameters inherited from Satellite#
Autonomous Operation#
| Parameter | Type | Description | Default Value |
|---|---|---|---|
|  | Role name | Role this satellite should take in the Constellation. Accepted values are |  |
Transition Orchestration#
Parameters to control autonomous transition orchestration of satellites:
| Parameter | Type | Description | Default Value |
|---|---|---|---|
|  | Unsigned integer | Timeout in seconds to wait for all conditions to be satisfied for conditional transitioning |  |
|  | List of strings | List of canonical names of remote satellites for conditional transitioning in | - |
|  | List of strings | List of canonical names of remote satellites for conditional transitioning in | - |
|  | List of strings | List of canonical names of remote satellites for conditional transitioning in | - |
|  | List of strings | List of canonical names of remote satellites for conditional transitioning in | - |
|  | List of strings | List of canonical names of remote satellites for conditional transitioning in | - |
Framework Metrics#
This satellite inherits the following framework metrics from its base classes:
Metrics inherited from DataReceiver#
None
Metrics inherited from Satellite#
None