datto.Eda

class datto.Eda[source]

Exploratory data analysis (EDA)

__init__()

Methods

__init__()

bar_graphs_by_col(df[, path, group_by_var])

Makes a bar graph for each categorical column.

check_for_mistyped_cols(numerical_vals, ...)

Check for columns coded incorrectly

check_unique_by_identifier_col(df, ...)

Check if there are duplicates by entity (e.g.

find_cols_to_exclude(df)

Returns columns that may not be helpful for model building.

find_correlated_features(df)

Find & sort correlated features

sample_unique_vals(df)

Examine a few unique vals in each column

separate_cols_by_type(df)

Split the DataFrame into two groups by type

violin_plots_by_col(df[, path, group_by_var])

Makes a violin plot for each numerical column.