Skip to contents

An S4 class to store the parameters used in the preprocessing pipeline.

Slots

id_col

Character vector specifying the name of the ID column.

target_col

Character vector specifying the name of the target variable column.

split_col

Character vector specifying the name of the column used for train/test split.

scale_option

Character string specifying the scaling option ("train", "test", "all").

scale_method

Character string specifying the scaling method ("scale", "norm").

mean_vals

Numeric vector storing the mean values for z-score scaling.

sd_vals

Numeric vector storing the standard deviation values for z-score scaling.

min_vals

Numeric vector storing the minimum values for minmax normalization.

max_vals

Numeric vector storing the maximum values for minmax normalization.

t_mean_vals

Numeric vector storing the mean of transformed values for z-score scaling.

t_sd_vals

Numeric vector storing the standard deviation of transformed values for z-score scaling.

t_min_vals

Numeric vector storing the minimum values of transformed for minmax normalization.

t_max_vals

Numeric vector storing the maximum values of transformed for minmax normalization.

fun_transform

List of functions for data transformation.

fun_inverse

List of functions for inverse data transformation.

transformed

Boolean value, `TRUE` if object contains transformed values, `NA` if it is the result of an inverse transformation.