API reference

Data

german_data()

Return german data as a data frame.

Exploratory data analysis

missing_stat(data[, columns, show_print, …])

Return missing values’ statistics.

numeric_stat(data[, percentiles])

Describe numeric columns.

categorical_stat(data)

Generate descriptive statistics for categorical columns.

describe(data[, percentiles])

Generate descriptive statistics.

corr_analysis(data[, tight_layout, …])

Correlation analysis.

Preprocessing

General precessing

replace_blank(data[, inplace])

Replace blank strings in data if any.

Handle missing values

rf_fill_missing(data, column[, target, n_digits])

Fill missing values with predicted values output by a random forest regressor.

Score card

check_target(Y[, inplace])

Check validity of target.

mono_bin(Y, X[, n, precision, duplicates])

Generate monotonous bins.

Plot utilities

rocplot(y_true, y_preds[, equal_aspect])

Plot a ROC curve.

ksplot(preds, labels[, data, n, is_prob, …])

Plot distributions of good and bad clients, including an estimate of the KS statistics.

woebinplot(data[, stacked, grouped, width, …])

Visulize the binning.