Wednesday, June 11, 2014

6:30 PM

  • Centriq Training & Centriq's TechSmart KC Program

    8700 State Line Road #200, Leawood, KS (map)
    38.969131 -94.609184

  • The Apache Crunch Java library provides a framework for writing, testing, and running MapReduce pipelines. Its goal is to make pipelines that are composed of many user-defined functions simple to write, easy to test, and efficient to run.
    Agenda
    • 6:30 PM - Professional Networking and Social Time
    • 7:00 PM - Presentation Starts
    Bio:
            Micah is a committer on the Apache Crunch project as well as a Software Architect for Cerner Corporation, a leading provider of healthcare technology. For almost a decade he has worked on building infrastructure and reusable assets for a number of healthcare solutions. In the last few years his focus has shifted towards enabling the adoption of Big Data technologies at Cerner helping to build infrastructure for ingestion of Big Data and efficient processing in both a batch and near real time environment.

    Abstract:
            The MapReduce framework is a proven method for processing large volumes of data but even simple problems require expertise. Tackling the learning curve for Big Data and efficient processing is a daunting task for developers just getting started.  The Apache Crunch project helps to break down complex processing problems into simple concepts which can be utilized on industry standard frameworks such as Hadoop and Spark. Apache Crunch is being used as an integral part of building processing pipelines for healthcare data allowing for quick development of new solutions and architectures.  The talk will also cover how the core concepts of Apache Crunch enable first class integration, rapid scaling of development across teams, and development of extensible processing infrastructure.
    Sponsor: Cerner
  • 0 Response to "June 11th: KCJava - Kansas City Java User Group Meeting - Apache Crunch"

    Post a Comment

    Blog Archive

    Followers