
Speaker "Seshu Adunuthula" Details Back

-
Name
Seshu Adunuthula
-
Company
Ebay Inc
-
Designation
Director
Topic
Role of Spark in transforming eBay’s Enterprise Data Platform
Abstract
eBay has one of the most mature Enterprise Data Platform’s in the industry with over 200PBs of data stored in our Hadoop and Teradata Warehouses. On average 30 TB of transactional and behavioral data is extracted on a daily basis and thousands of metrics are computed, analyzed and monitored for decision making and detecting anomalies. eBay has embarked on an ambitious project to transform the batch oriented ETL processes which could take 24 to 48 hour for metric computation to near real time infrastructure based on Kafka for messaging, Spark Streaming for stream processing and Spark SQL for data preparation.