aixd.data
This package contains classes for defining data objects, data blocks, normalizations and transformations that are used to describe datasets.
Dataset
This class manages the Dataset. The data, model checkpoints and other logging information resides in the respective folder/file structure: |
Data objects
Master data object, to define each of the different building blocks that are going to be used to form the design parameters and performance attributes vectors. |
|
Real data type. |
|
Base class for the discrete type. |
|
Integer data type. |
|
Categorical data type. |
|
Boolean type, i.e., categorical type with options 'True', 'False' |
Domain definitions
Abstract base class for domains used by DataObjects. |
|
Defines a set of options, e.g., for categorical variables. |
|
Defines a closed interval [a, b], for a <= b. |
|
A domain that allows to define an interval with several options/masks, not in the interval. |
Data blocks
Generic block of data, i.e., a concatenation of different instances of data types |
|
A data block for design parameters. |
|
A data block for design parameters. |
|
Additional types, obtained from the design parameters |
|
A special data block that is transformable. |
|
A special data block used for the input of the ML model. |
|
A special data block used for the output of the ML model. |
Custom callbacks
Callback for pre or post data loading transformation and normalization. |
|
Receives the output of the NN, and performs a transformation to it, before returning the value to the user, or before feeding it again to the |
|
Transformation
Defines the decorator to register transformations. |
|
Simple resolver that returns the transformation by its name. |
|
Abstract base class to implement DataObject transformation. |
|
Implements a log10 transformation of the data. |
|
Implements a scaled sigmoid transformation of the data. |
|
Implements standardization as (x - mean) / std. |
|
Implements a min-max scaling of the data. |
|
Implements the zero-to-one (or min-max) normalization as (x - min) / (max - min). |
|
Implements the minus-one-to-one normalization as (x - min) / (max - min). |
|
Implements min-max scaling of the data with a masked domain. |
|
Implements the zero-to-one (or min-max) normalization as (x - min) / (max - min) for DataObject's with a MaskedInterval domain. |
|
Implements the minus-one-to-one normalization for DataObject's with a MaskedInterval domain. |
|
Implements encoding for string to integers. |
|
Implement a transformation to convert integers to floats. |
Utils
Takes any data format, detect the type, and convert it if neccessary. |
|
Combines data into a single output and converts them into the specified target format. |
|
Reformats data formatted as a (list of) dictionaries to a pandas dataframe. |
|
Reformats data formatted as a dataframe into a dictionary. |
|
Reformats data formatted as a dataframe into a dictionary. |
|
Reformats data stored in a dataframe to a dictionary collated accordingly to the provided list of data objects. |
|
Reformats a dataframe that may represent multidimensional data objects (cells containing lists of values, if dim>1), to a flattened dataframe. |
|
Reformats a flattened dataframe into a dataframe that may represent multidimensional data objects (cells containing n lists of values, if dim>1). |
|
Reformats data formatted as a nested list into a numpy.ndarray. |
|
Reformat numpy.ndarray into a nested list. |
|
Reformats data formatted as a nested list of data into a dataframe. |
|
Reformats a pandas dataframe into a nested list. |
|
Reformats data formatted as a nested list into a dictionary. |
|
Reformats data formatted as a nested list into a list of dictionaries. |
|
The input is a list of dictionaries, where each dictionary, e.g., corresponds to one data sample and the keys correspond to the data object names. |
|
In the input dictionary, each key (e.g. corresponding to a data object name) contains a list of items (e.g. corresponding to individual samples). |
|
Converts a numpy array to a torch tensor. |
|
Converts a torch tensor into a numpy array. |