Speaker: Sridhar Reddy

 

Topic

Build a Time Series Application with Spark Streaming and HBase Workshop

Abstract

More and more applications have to store and process time series data; Internet of Things (IoT) applications are a very good example.
 
This hands-on tutorial will help you get a jump-start on scaling distributed computing by taking an example time series application and coding through different aspects of working with such a dataset. We will cover building an end-to-end distributed processing pipeline using various distributed stream input sources, Apache Spark, and Apache HBase to rapidly ingest, process, and store large volumes of high-speed data.
 
Participants will use Scala to work on exercises that teach the features of Spark Streaming for processing live data streams ingested from sources such as Apache Kafka, sockets, or files, and for storing the processed data in HBase.
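
To give a feel for the kind of pipeline described above, here is a minimal sketch of a socket-to-HBase ingest job in Scala. It is not the workshop's actual exercise code. The record layout ("sensorId,epochMillis,value"), the host and port, the table name "sensor_readings", and the column family "d" are all assumptions made for illustration. It also assumes Spark Streaming (DStream API) and the HBase 1.x or later client library are on the classpath.

```scala
import org.apache.hadoop.hbase.{HBaseConfiguration, TableName}
import org.apache.hadoop.hbase.client.{ConnectionFactory, Put}
import org.apache.hadoop.hbase.util.Bytes
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

// Hypothetical record layout: "sensorId,epochMillis,value"
case class Reading(sensorId: String, timestamp: Long, value: Double)

object Reading {
  // Returns None for malformed lines so that bad input is dropped rather than failing the job.
  def parse(line: String): Option[Reading] = line.split(",").map(_.trim) match {
    case Array(id, ts, v) if id.nonEmpty =>
      scala.util.Try(Reading(id, ts.toLong, v.toDouble)).toOption
    case _ => None
  }

  // Row key: sensor id followed by a zero-padded timestamp, so that rows
  // for one sensor sort chronologically in HBase's lexicographic key order.
  def rowKey(r: Reading): String = f"${r.sensorId}%s_${r.timestamp}%013d"
}

object TimeSeriesIngest {
  def main(args: Array[String]): Unit = {
    // local[2]: one thread for the socket receiver, one for processing.
    val conf = new SparkConf().setAppName("TimeSeriesIngest").setMaster("local[2]")
    val ssc = new StreamingContext(conf, Seconds(2)) // 2-second micro-batches

    // Assumed source: a text socket on localhost:9999 (for example, fed by `nc -lk 9999`).
    val readings = ssc
      .socketTextStream("localhost", 9999)
      .flatMap(line => Reading.parse(line).toList)

    readings.foreachRDD { rdd =>
      rdd.foreachPartition { partition =>
        // Open one HBase connection per partition, on the executor.
        val connection = ConnectionFactory.createConnection(HBaseConfiguration.create())
        // Assumed table "sensor_readings" with column family "d", created beforehand.
        val table = connection.getTable(TableName.valueOf("sensor_readings"))
        try {
          partition.foreach { r =>
            val put = new Put(Bytes.toBytes(Reading.rowKey(r)))
            put.addColumn(Bytes.toBytes("d"), Bytes.toBytes("value"), Bytes.toBytes(r.value))
            table.put(put)
          }
        } finally {
          table.close()
          connection.close()
        }
      }
    }

    ssc.start()
    ssc.awaitTermination()
  }
}
```

Parsing and row-key construction are kept as pure functions so they can be checked without a cluster. Swapping the socket source for Kafka or a file directory would change only the line that creates the input stream.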

Profile

Sridhar is Director of Professional Services at MapR. He leads the application development practice at MapR and helps customers build Hadoop and HBase solutions and migrate data from relational databases. Sridhar worked as a Technology Evangelist at Sun Microsystems for over 10 years, where he presented at many technical conferences worldwide and helped increase awareness and adoption of Java technology in the global developer community. Sridhar holds a BS in Mechanical Engineering from Osmania University in India and an MS in Computer Science from the Florida Institute of Technology.