sparklyr
packagesparklyr
for machine learning using the Titanic data setInstall sparklyr
and H2O on your local computer. Run the code below to install all necessary packages and set the correct options.
install.packages(c("sparklyr", "rsparkling"))
options(rsparkling.sparklingwater.version = "2.1.0")
library(sparklyr)
spark_install(version = "2.1.0")
Last year, 70% of students were able to successfully install these packages without problems. The others ran into problems. Make sure to attempt installing these packages before class so if you have errors we can debug them before you need to use the packages.
dbplyr
plyr
package is now deprecated in favor of dplyr
and the other tidyverse
packagesdplyr
bigrquery
- instructions for setting up an account to access Google Bigquery databasessparklyr
- introduction to the sparklyr
interface for Spark
This work is licensed under the CC BY-NC 4.0 Creative Commons License.