Speaker "Vladimir Bacvanski" Details Back
-
Name
Vladimir Bacvanski
-
Company
Paypal
-
Designation
Principal Architect
Topic
Fast Data: Big Data Analytics Using Streams
Abstract
Fast Data is the next step in the evolution of Big Data technologies. Traditional Big Data techniques are dealing with large volumes of data, but Fast Data introduces a new dimension: Velocity, which demands near real-time response to Big Data.
We begin with characteristics of Stream Processing, and then discuss the dominant architectures for Streaming systems and continue into examples of technologies and application solutions for streaming applications. We compare the important technologies, such as Storm, Spark, Flink and Apache Beam. We then discuss the use new area of Fast Data in the AI space. We conclude with implementation guide and a summary of best practices for Fast Data.
Outline:
- Fast Data and Stream Processing: What is it?
- Architectures for Stream Processing
- Apache Storm: The Dedicated Stream system
- Apache Spark Streaming: A versatile in-memory batch/streaming system
- Apache Flink: Novel integration of batch and streaming
- Apache Beam: The common API layer
- NoSQL Data stores for Fast Data
- Using Fast Data with AI systems
- Best practices of mastering Fast Data