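This function creates a correlation heatmap for the selected columns. Continuous features are used as they are, and each discrete feature is expanded into one indicator (dummy) column per category.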
Usage
plot_correlation(
  data,
  type = c("all", "discrete", "continuous"),
  maxcat = 20L,
  cor_args = list(),
  geom_text_args = list(),
  title = NULL,
  ggtheme = theme_gray(),
  theme_config = list(legend.position = "bottom", axis.text.x = element_text(angle = 90))
)
Arguments
- data
input data
- type
column types to include in the correlation calculation: "all" for all columns, "discrete" for discrete features only, "continuous" for continuous features only. The default is "all".
- maxcat
maximum number of categories allowed for each discrete feature. Discrete features with more categories than this are left out of the calculation. The default is 20.
- cor_args
a list of other arguments to cor
- geom_text_args
a list of other arguments to geom_text
- title
plot title
- ggtheme
a complete ggplot2 theme. The default is theme_gray().
- theme_config
a list of configurations to be passed to theme.
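The type argument lists its choices as a character vector, which is the usual R pattern for match.arg. A minimal sketch of how such an argument resolves, assuming plot_correlation follows that pattern (resolve_type is a hypothetical helper, not part of the package):

```r
# Hypothetical helper: mimics an argument declared as a vector of choices.
# Assumption: the real function resolves `type` in a similar way.
resolve_type <- function(type = c("all", "discrete", "continuous")) {
  match.arg(type)  # picks the first choice by default, allows unique prefixes
}

resolve_type()       # no value supplied: falls back to the first choice
resolve_type("c")    # a unique prefix is expanded to the full choice
resolve_type("disc") # any unambiguous abbreviation works
```

Under that assumption, an abbreviated value such as "c" is accepted, although spelling out "continuous" is easier to read.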
Details
For discrete features, the function first dummifies all categories, then calculates the correlation matrix (see cor) and plots it.
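The dummify-then-correlate idea can be approximated with base R alone. This is only an illustrative sketch on iris with a single discrete column, not the package's actual implementation; the variable names are made up for the example.

```r
# Sketch only: one discrete column (Species) plus the numeric columns of iris.
df <- iris

# One indicator column per level of Species; "- 1" drops the intercept so
# every level gets its own 0/1 column.
dummies <- model.matrix(~ Species - 1, data = df)

# Keep the continuous columns as they are.
numeric_part <- df[, sapply(df, is.numeric), drop = FALSE]

# Combine both parts and compute the correlation matrix.
combined <- cbind(numeric_part, dummies)
cor_matrix <- cor(combined)

dim(cor_matrix)       # rows and columns: 4 numeric features + 3 indicators
colnames(cor_matrix)  # indicator columns are named after the factor levels
```

The resulting matrix is what a heatmap of this kind would display, one tile per pair of columns.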
Examples
plot_correlation(iris)
plot_correlation(iris, type = "c")
plot_correlation(airquality, cor_args = list("use" = "pairwise.complete.obs"))
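The last example forwards use to cor through cor_args. A rough illustration, using base R only, of why that matters for airquality, which has missing values in some columns:

```r
# Default handling (use = "everything"): a pair involving a column with
# missing values yields NA in the correlation matrix.
default_cor <- cor(airquality)
anyNA(default_cor)

# Pairwise deletion: each coefficient is computed from the rows that are
# complete for that particular pair of columns.
pairwise_cor <- cor(airquality, use = "pairwise.complete.obs")
anyNA(pairwise_cor)
```

Other values of use, such as "complete.obs", can be passed the same way if dropping incomplete rows entirely is preferred.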