Preprocessing#
Any transformation of the data matrix that is not a tool.
Preprocessing steps usually perform a basic transformation on the data array in .X or .layers.
Basic preprocessing#
Encode categoricals of a data object. |
|
Computes a principal component analysis. |
|
Regress out (mostly) unwanted sources of variation. |
|
Sample a fraction or a number of observations / variables with or without replacement. |
|
Annotate highly variable features. |
|
Returns a Winsorized version of the input array. |
|
Clips (limits) features. |
|
Summarizes numerical measurements into minimum, maximum and average values. |
Filtering#
Filter features based on missing data thresholds. |
|
Filter observations based on missing data thresholds (features/measurements). |
Quality control#
Calculates various quality control metrics. |
|
Examines lab measurements for reference ranges and outliers. |
|
Statistical hypothesis test for Missing Completely At Random (MCAR). |
|
Detects biases in the data using feature correlations, standardized mean differences, and feature importances. |
Imputation#
Replaces all missing values in all columns or a subset of columns specified by the user with the passed replacement value. |
|
Impute missing values in numerical data using mean/median/most frequent imputation. |
|
Imputes missing values in the input data object using K-nearest neighbor imputation. |
|
Impute data using the MissForest strategy. |
|
Impute data using the miceforest method. |
Normalization#
Apply log normalization. |
|
Apply max-abs normalization. |
|
Apply min-max normalization. |
|
Apply power transformation normalization. |
|
Apply quantile normalization. |
|
Apply robust scaling normalization. |
|
Apply scaling normalization. |
|
Offsets negative values into positive ones with the lowest negative value becoming 0. |
Dataset Shift Correction#
Partially overlaps with dataset integration. Note that a simple batch correction method is available via pp.regress_out().
Neighbors#
Compute a neighborhood graph of observations [MHM18]. |