Monday, April 24, 2017 at 6:00 PM

Presenters:


Yotabites Consulting: Atul Khachar & Yeshwanth Jagini

Agenda:

To make the most of Big Data and reveal its hidden stories, we need to analyze the data.

In this meetup, we will walk you through tools and frameworks that strengthen a data scientist's stack:

sparklyr - An R interface for running R code on Spark

RStudio - Open source and enterprise-ready professional software for R

Specifically, we are going to cover:

1) A brief intro to HDFS and Spark

2) What is sparklyr?

3) Differences between sparklyr, SparkR, and Sparkling Water

4) A brief intro to RStudio

5) Deep dive into sparklyr with a use case

    -> Overview of environment and 

    -> Install and Setup

    -> Explore the sparklyr package

    -> Reading and Writing Data

    -> Exploring dplyr support in sparklyr

    -> Demo: analyzing a diabetes dataset and building some models

6) Conclusion


