March 07 to 09 2016, Santa Clara, USA.


Speaker "Seshu Adunuthula" Details

Name :
Seshu Adunuthula
Company :
eBay
Title :
Director of Analytics Platform
Topic :

Role of Spark in transforming eBay’s Enterprise Data Platform

Abstract :

eBay has one of the most mature Enterprise Data Platforms in the industry, with over 200 PB of data stored in our Hadoop and Teradata warehouses. On average, 30 TB of transactional and behavioral data is extracted daily, and thousands of metrics are computed, analyzed and monitored for decision making and anomaly detection. eBay has embarked on an ambitious project to transform its batch-oriented ETL processes, which can take 24 to 48 hours to compute metrics, into a near-real-time infrastructure based on Kafka for messaging, Spark Streaming for stream processing and Spark SQL for data preparation.

Profile :
Seshu Adunuthula is Director of Analytics Platform at eBay, responsible for managing some of the world's largest deployments of Hadoop, Teradata and ETL ingest infrastructure. He is an industry veteran with over 20 years of distributed computing and analytics experience. Most recently, he managed MapR's San Jose development team, responsible for the MapReduce, MapR-DB and MapR Control System teams. Prior to that, he held individual contributor and managerial roles at Microsoft, on the SQL Server BI team, and at Oracle, on the BPEL Workflow team.
