January 20 to 22 2016, Santa Clara, USA.
Speaker "Sridhar Reddy" Details
Build a Time Series Application with Spark Streaming and HBase Workshop
More and more applications have to store and process time series data, a very good example of this are all the Internet of Things -IoT- applications.
This hands on tutorial will help you get a jump-start on scaling distributed computing by taking an example time series application and coding through different aspects of working with such a dataset. We will cover building an end to end distributed processing pipeline using various distributed stream input sources, Apache Spark, and Apache HBase, to rapidly ingest, process and store large volumes of high speed data.
Participants will use Scala to work on exercises intended to teach them the features of Spark Streaming for processing live data streams ingested from sources like Apache Kafka, sockets or files, and storing the processed data in HBase.
Sridhar is Director of Professional Services at MapR. He leads the Application development practice at MapR and helps customers in building Hadoop & HBase solutions and migrating data from RDBMS databases. Sridhar worked as a Technology Evangelist at Sun Microsystems for over 10 years, where he presented at many Technical conferences world wide and helped increase awareness and adoption of Java technology in the worldwide developer community. Sridhar holds a BS in Mechanical Engineering from Osmania University in India, and an MS in Computer Science from the Florida Institute of Technology.
Get latest updates of Big Data Bootcamp
sent to your inbox.
Weekly insight from industry insiders.
Plus exclusive content and offers.