Core Time Series Processing Objects

`MultivariateTS`

Core object for processing multivariate time series. Use this object to set up and save all necessary transformations for pre and post model processing of a time series that only contains one dimension outside of time.

Source code in timewarpy/core.py

class MultivariateTS:
    """Core object for processing multivariate time series. Use this object
    to set up and save all necessary transformations for pre and post model
    processing of a time series that only contains one dimension outside of time.
    """

    def __init__(self, train_horizon: int, pred_horizon: int, scaler: object = None,
                 roll_increment: int = 0):
        """Initializes the core class. First, this sets the values for how many
        time points should be in the training and forecasting windows.

        TODO: Need example

        Second, this also defines any scaling that needs to occur the variable changing in time.
        Scaling functionality follows the scikit-learn standards for methods needed. See an example
        of a standard scaler [here](https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html).

        Args:
            train_horizon (int): number of time steps in each training vector
            pred_horizon (int): number of time steps in each prediction vector
            scaler (object, optional): scaling function to use, usage follows scikit-learn. Defaults to None.
            roll_increment (int, optional): how many time sets to skip while rolling windows. Defaults to 0.
        """
        self.__str__ = 'Multivariate Time Series Processing Class'
        self.train_horizon = train_horizon
        self.pred_horizon = pred_horizon
        self.roll_increment = roll_increment
        if scaler is not None:
            self.scaler = scaler
        else:
            self.scaler = None

    def fit(self, df, train_columns, pred_columns):
        """Given a pandas dataframe and columns for training and prediction columns in a
        multivariate time series data, this will fit necessary preprocessing to the
        given data columns. Currently this is only fitting the scalar to
        the given column in the __init__ function.

        Args:
            df (pandas.DataFrame): multivariate time series
            train_columns (list): columns for training features to use in the dataframe
            train_columns (list): columns for prediction variables

        """
        self.train_columns = train_columns
        self.pred_columns = pred_columns
        all_cols = set(self.train_columns + self.pred_columns)
        if self.scaler is not None:
            self.scaler_dict = dict(zip(all_cols, [self.scaler() for i in all_cols]))
            for column in all_cols:
                self.scaler_dict[column] = self.scaler_dict[column].fit(df[column].to_numpy().reshape(-1, 1))
        else:
            self.scaler_dict = None
        return None

    def transform(self, df, train_columns, pred_columns):
        """Given a pandas dataframe and columns for the multivariate
        time series data, tranform the data to a neural network friendly
        set of vectors.

        Args:
            df (pandas.DataFrame): multivariate time series
            train_columns (list): columns for training features to use in the dataframe
            train_columns (list): columns for prediction variables

        Returns:
            tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
        """
        if self.scaler is not None:
            all_cols = set(self.train_columns + self.pred_columns)
            for column in all_cols:
                df[column] = self.scaler_dict[column].transform(df[column].to_numpy().reshape(-1,1)).T[0]
        X, y = preprocess.create_multivariate_windows(
            df,
            train_horizon=self.train_horizon,
            pred_horizon=self.pred_horizon,
            train_columns=train_columns,
            pred_columns=pred_columns,
        )
        return X, y

    def fit_transform(self, df, train_columns, pred_columns):
        """Runs the core.MultivariateTS fit and transform functions in one call.

        Args:
            df (pandas.DataFrame): multivariate time series
            train_columns (list): columns for training features to use in the dataframe
            train_columns (list): columns for prediction variables

        Returns:
            tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
        """
        self.fit(df, train_columns, pred_columns)
        X, y = self.transform(df, train_columns, pred_columns)
        return X, y

`init(self, train_horizon, pred_horizon, scaler=None, roll_increment=0)` `special`

Initializes the core class. First, this sets the values for how many time points should be in the training and forecasting windows.

TODO: Need example

Second, this also defines any scaling that needs to occur the variable changing in time. Scaling functionality follows the scikit-learn standards for methods needed. See an example of a standard scaler here.

Parameters:

Name	Type	Description	Default
`train_horizon`	`int`	number of time steps in each training vector	required
`pred_horizon`	`int`	number of time steps in each prediction vector	required
`scaler`	`object`	scaling function to use, usage follows scikit-learn. Defaults to None.	`None`
`roll_increment`	`int`	how many time sets to skip while rolling windows. Defaults to 0.	`0`

Source code in timewarpy/core.py

def __init__(self, train_horizon: int, pred_horizon: int, scaler: object = None,
             roll_increment: int = 0):
    """Initializes the core class. First, this sets the values for how many
    time points should be in the training and forecasting windows.

    TODO: Need example

    Second, this also defines any scaling that needs to occur the variable changing in time.
    Scaling functionality follows the scikit-learn standards for methods needed. See an example
    of a standard scaler [here](https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html).

    Args:
        train_horizon (int): number of time steps in each training vector
        pred_horizon (int): number of time steps in each prediction vector
        scaler (object, optional): scaling function to use, usage follows scikit-learn. Defaults to None.
        roll_increment (int, optional): how many time sets to skip while rolling windows. Defaults to 0.
    """
    self.__str__ = 'Multivariate Time Series Processing Class'
    self.train_horizon = train_horizon
    self.pred_horizon = pred_horizon
    self.roll_increment = roll_increment
    if scaler is not None:
        self.scaler = scaler
    else:
        self.scaler = None

`fit(self, df, train_columns, pred_columns)`

Given a pandas dataframe and columns for training and prediction columns in a multivariate time series data, this will fit necessary preprocessing to the given data columns. Currently this is only fitting the scalar to the given column in the init function.

Parameters:

Name	Type	Description	Default
`df`	`pandas.DataFrame`	multivariate time series	required
`train_columns`	`list`	columns for training features to use in the dataframe	required
`train_columns`	`list`	columns for prediction variables	required

Source code in timewarpy/core.py

def fit(self, df, train_columns, pred_columns):
    """Given a pandas dataframe and columns for training and prediction columns in a
    multivariate time series data, this will fit necessary preprocessing to the
    given data columns. Currently this is only fitting the scalar to
    the given column in the __init__ function.

    Args:
        df (pandas.DataFrame): multivariate time series
        train_columns (list): columns for training features to use in the dataframe
        train_columns (list): columns for prediction variables

    """
    self.train_columns = train_columns
    self.pred_columns = pred_columns
    all_cols = set(self.train_columns + self.pred_columns)
    if self.scaler is not None:
        self.scaler_dict = dict(zip(all_cols, [self.scaler() for i in all_cols]))
        for column in all_cols:
            self.scaler_dict[column] = self.scaler_dict[column].fit(df[column].to_numpy().reshape(-1, 1))
    else:
        self.scaler_dict = None
    return None

`fit_transform(self, df, train_columns, pred_columns)`

Runs the core.MultivariateTS fit and transform functions in one call.

Parameters:

Name	Type	Description	Default
`df`	`pandas.DataFrame`	multivariate time series	required
`train_columns`	`list`	columns for training features to use in the dataframe	required
`train_columns`	`list`	columns for prediction variables	required

Returns:

Type	Description
`tuple`	X (np.array) training vectors, y (np.array) forecasting/prediction vectors

Source code in timewarpy/core.py

def fit_transform(self, df, train_columns, pred_columns):
    """Runs the core.MultivariateTS fit and transform functions in one call.

    Args:
        df (pandas.DataFrame): multivariate time series
        train_columns (list): columns for training features to use in the dataframe
        train_columns (list): columns for prediction variables

    Returns:
        tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
    """
    self.fit(df, train_columns, pred_columns)
    X, y = self.transform(df, train_columns, pred_columns)
    return X, y

`transform(self, df, train_columns, pred_columns)`

Given a pandas dataframe and columns for the multivariate time series data, tranform the data to a neural network friendly set of vectors.

Parameters:

Name	Type	Description	Default
`df`	`pandas.DataFrame`	multivariate time series	required
`train_columns`	`list`	columns for training features to use in the dataframe	required
`train_columns`	`list`	columns for prediction variables	required

Returns:

Type	Description
`tuple`	X (np.array) training vectors, y (np.array) forecasting/prediction vectors

Source code in timewarpy/core.py

def transform(self, df, train_columns, pred_columns):
    """Given a pandas dataframe and columns for the multivariate
    time series data, tranform the data to a neural network friendly
    set of vectors.

    Args:
        df (pandas.DataFrame): multivariate time series
        train_columns (list): columns for training features to use in the dataframe
        train_columns (list): columns for prediction variables

    Returns:
        tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
    """
    if self.scaler is not None:
        all_cols = set(self.train_columns + self.pred_columns)
        for column in all_cols:
            df[column] = self.scaler_dict[column].transform(df[column].to_numpy().reshape(-1,1)).T[0]
    X, y = preprocess.create_multivariate_windows(
        df,
        train_horizon=self.train_horizon,
        pred_horizon=self.pred_horizon,
        train_columns=train_columns,
        pred_columns=pred_columns,
    )
    return X, y

`UnivariateTS`

Core object for processing univariate time series. Use this object to set up and save all necessary transformations for pre and post model processing of a time series that only contains one dimension outside of time.

Source code in timewarpy/core.py

class UnivariateTS:
    """Core object for processing univariate time series. Use this object
    to set up and save all necessary transformations for pre and post model
    processing of a time series that only contains one dimension outside of time.
    """

    def __init__(self, train_horizon: int, pred_horizon: int, scaler: object = None,
                 roll_increment: int = 0):
        """Initializes the core class. First, this sets the values for how many
        time points should be in the training and forecasting windows. See [here](/#univariate-data)
        for a visual on the training and prediction window lengths. Second, this
        also defines any scaling that needs to occur the variable changing in time.
        Scaling functionality follows the scikit-learn standards for methods needed. See an example
        of a standard scaler [here](https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html).

        Args:
            train_horizon (int): number of time steps in each training vector
            pred_horizon (int): number of time steps in each prediction vector
            scaler (object, optional): scaling function to use, usage follows scikit-learn. Defaults to None.
            roll_increment (int, optional): how many time sets to skip while rolling windows. Defaults to 0.
        """
        self.__str__ = 'Univariate Time Series Processing Class'
        self.train_horizon = train_horizon
        self.pred_horizon = pred_horizon
        self.roll_increment = roll_increment
        if scaler is not None:
            self.scaler = scaler
        else:
            self.scaler = None

    def fit(self, df, column):
        """Given a pandas dataframe and column for the univariate
        time series data, this will fit necessary preprocessing to that
        given data column. Currently this is only fitting the scalar to
        the given column in the __init__ function.

        Args:
            df (pandas.DataFrame): univariate time series
            column (str): column to use in the dataframe
        """
        self.column = column
        if self.scaler is not None:
            self.scaler = self.scaler().fit(df[column].to_numpy().reshape(-1, 1))

    def transform(self, df, column):
        """Given a pandas dataframe and column for the univariate
        time series data, tranform the data to a neural network friendly
        set of vectors.

        Args:
            df (pandas.DataFrame): univariate time series
            column (str): column to use in the dataframe

        Returns:
            tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
        """
        time_series = self.scaler.transform(df[[column]])
        X, y = preprocess.create_univariate_windows(
            time_series, self.train_horizon, self.pred_horizon
        )
        return X, y

    def fit_transform(self, df, column):
        """Runs the core.UnivariateTS fit and transform functions in one call.

        Args:
            df (pandas.DataFrame): univariate time series
            column (str): column to use in the dataframe

        Returns:
            tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
        """
        self.fit(df, column)
        X, y = self.transform(df, column)
        return X, y

    def inverse_transform(self, vec):
        """Takes a vector and applies all non-windowing processing inversely
        to get back to the original windowed matrix. This is useful for error
        measurement that is unscaled.

        Args:
            vec (array_like): time series set of vectors (univariate)

        Returns:
            array_like: inverse processes set of vectors
        """

        if len(vec.shape) > 2:
            vec_inv = self.scaler.inverse_transform(vec[:, :, 0])
        else:
            vec_inv = self.scaler.inverse_transform(vec)
        return vec_inv

`init(self, train_horizon, pred_horizon, scaler=None, roll_increment=0)` `special`

Initializes the core class. First, this sets the values for how many time points should be in the training and forecasting windows. See here for a visual on the training and prediction window lengths. Second, this also defines any scaling that needs to occur the variable changing in time. Scaling functionality follows the scikit-learn standards for methods needed. See an example of a standard scaler here.

Parameters:

Name	Type	Description	Default
`train_horizon`	`int`	number of time steps in each training vector	required
`pred_horizon`	`int`	number of time steps in each prediction vector	required
`scaler`	`object`	scaling function to use, usage follows scikit-learn. Defaults to None.	`None`
`roll_increment`	`int`	how many time sets to skip while rolling windows. Defaults to 0.	`0`

Source code in timewarpy/core.py

def __init__(self, train_horizon: int, pred_horizon: int, scaler: object = None,
             roll_increment: int = 0):
    """Initializes the core class. First, this sets the values for how many
    time points should be in the training and forecasting windows. See [here](/#univariate-data)
    for a visual on the training and prediction window lengths. Second, this
    also defines any scaling that needs to occur the variable changing in time.
    Scaling functionality follows the scikit-learn standards for methods needed. See an example
    of a standard scaler [here](https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html).

    Args:
        train_horizon (int): number of time steps in each training vector
        pred_horizon (int): number of time steps in each prediction vector
        scaler (object, optional): scaling function to use, usage follows scikit-learn. Defaults to None.
        roll_increment (int, optional): how many time sets to skip while rolling windows. Defaults to 0.
    """
    self.__str__ = 'Univariate Time Series Processing Class'
    self.train_horizon = train_horizon
    self.pred_horizon = pred_horizon
    self.roll_increment = roll_increment
    if scaler is not None:
        self.scaler = scaler
    else:
        self.scaler = None

`fit(self, df, column)`

Given a pandas dataframe and column for the univariate time series data, this will fit necessary preprocessing to that given data column. Currently this is only fitting the scalar to the given column in the init function.

Parameters:

Name	Type	Description	Default
`df`	`pandas.DataFrame`	univariate time series	required
`column`	`str`	column to use in the dataframe	required

Source code in timewarpy/core.py

def fit(self, df, column):
    """Given a pandas dataframe and column for the univariate
    time series data, this will fit necessary preprocessing to that
    given data column. Currently this is only fitting the scalar to
    the given column in the __init__ function.

    Args:
        df (pandas.DataFrame): univariate time series
        column (str): column to use in the dataframe
    """
    self.column = column
    if self.scaler is not None:
        self.scaler = self.scaler().fit(df[column].to_numpy().reshape(-1, 1))

`fit_transform(self, df, column)`

Runs the core.UnivariateTS fit and transform functions in one call.

Parameters:

Name	Type	Description	Default
`df`	`pandas.DataFrame`	univariate time series	required
`column`	`str`	column to use in the dataframe	required

Returns:

Type	Description
`tuple`	X (np.array) training vectors, y (np.array) forecasting/prediction vectors

Source code in timewarpy/core.py

def fit_transform(self, df, column):
    """Runs the core.UnivariateTS fit and transform functions in one call.

    Args:
        df (pandas.DataFrame): univariate time series
        column (str): column to use in the dataframe

    Returns:
        tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
    """
    self.fit(df, column)
    X, y = self.transform(df, column)
    return X, y

`inverse_transform(self, vec)`

Takes a vector and applies all non-windowing processing inversely to get back to the original windowed matrix. This is useful for error measurement that is unscaled.

Parameters:

Name	Type	Description	Default
`vec`	`array_like`	time series set of vectors (univariate)	required

Returns:

Type	Description
`array_like`	inverse processes set of vectors

Source code in timewarpy/core.py

def inverse_transform(self, vec):
    """Takes a vector and applies all non-windowing processing inversely
    to get back to the original windowed matrix. This is useful for error
    measurement that is unscaled.

    Args:
        vec (array_like): time series set of vectors (univariate)

    Returns:
        array_like: inverse processes set of vectors
    """

    if len(vec.shape) > 2:
        vec_inv = self.scaler.inverse_transform(vec[:, :, 0])
    else:
        vec_inv = self.scaler.inverse_transform(vec)
    return vec_inv

`transform(self, df, column)`

Given a pandas dataframe and column for the univariate time series data, tranform the data to a neural network friendly set of vectors.

Parameters:

Name	Type	Description	Default
`df`	`pandas.DataFrame`	univariate time series	required
`column`	`str`	column to use in the dataframe	required

Returns:

Type	Description
`tuple`	X (np.array) training vectors, y (np.array) forecasting/prediction vectors

Source code in timewarpy/core.py

def transform(self, df, column):
    """Given a pandas dataframe and column for the univariate
    time series data, tranform the data to a neural network friendly
    set of vectors.

    Args:
        df (pandas.DataFrame): univariate time series
        column (str): column to use in the dataframe

    Returns:
        tuple: X (np.array) training vectors, y (np.array) forecasting/prediction vectors
    """
    time_series = self.scaler.transform(df[[column]])
    X, y = preprocess.create_univariate_windows(
        time_series, self.train_horizon, self.pred_horizon
    )
    return X, y

Core Time Series Processing Objects

MultivariateTS

__init__(self, train_horizon, pred_horizon, scaler=None, roll_increment=0) special

fit(self, df, train_columns, pred_columns)

fit_transform(self, df, train_columns, pred_columns)

transform(self, df, train_columns, pred_columns)

UnivariateTS

__init__(self, train_horizon, pred_horizon, scaler=None, roll_increment=0) special

fit(self, df, column)

fit_transform(self, df, column)

inverse_transform(self, vec)

transform(self, df, column)

`MultivariateTS`

`init(self, train_horizon, pred_horizon, scaler=None, roll_increment=0)` `special`

`fit(self, df, train_columns, pred_columns)`

`fit_transform(self, df, train_columns, pred_columns)`

`transform(self, df, train_columns, pred_columns)`

`UnivariateTS`

`init(self, train_horizon, pred_horizon, scaler=None, roll_increment=0)` `special`

`fit(self, df, column)`

`fit_transform(self, df, column)`

`inverse_transform(self, vec)`

`transform(self, df, column)`