Monday, April 24, 2017 at 6:00:00 PM Pinsight Media 1100 Main St #1500, Kansas City, MO
Presenters:
Yotabites Consulting -
Atul Khachar & Yeshwanth Jagini
Agenda:
To make the most of Big Data, and to reveal the hidden stories, we need to analyze the data.
In this meetup we are going to walk you through tools and framework to boost data scientists stack
Sparklyr - An R interface to run R code in Spark
Rstudio - Open Source and Enterprise Ready Professional Software for R
In Specific we are going to cover:
1) A little intro about HDFS and SPARK
2) What is Sparklyr?
3) Difference between SparklyR/SparkR/Sparkling water
4) A little intro on RStudio
5) Deep dive into sparklyr with a use case
-> Overview of environment and
-> Install and Setup
-> Explore sparklyR package
-> Reading and Writing Data
-> Exploring dplyr package support for sparklyr
-> Demo on analysis of diabetes dataset & build some models
6) Conclusion
If you arrive after 6pm the elevators require a badge. Please sign in at the guard station and they will let you up. Click here for event
0 Response to "April 24: Data Science KC - Creating sparks with Rstudio - Analysis of diabetes dataset"
Post a Comment