March 27 to 29 2017, Santa Clara, USA.

Speakers

Speaker "David Talby" Details

Name :
david talby
Company :
Title :
Chief Technology Officer
Topic :

Architecting a predictive, petabyte-scale, self-learning fraud detection system

Abstract :

Fraud detection is a classic adversarial analytics challenge: As soon as an automated system successfully learns to stop one scheme, fraudsters move on to attack another way. Each scheme requires looking for different signals (i.e. features) to catch; is relatively rare (one in millions for finance or e-commerce); and may take months to investigate a single case (in healthcare or tax, for example) – making quality training data scarce.

This talk covers key lessons learned while building such real-world software systems over the past few years. We’ll be looking for fraud signals in public email datasets, using popular Python based open-source data science libraries to generate graph based, rule based, language based and time series based features, tied together with ensemble learning algorithms.

Apache Spark is used to run these models at scale – in batch mode for model training and with Spark Streaming for production use. We’ll discuss the data model, computation, and feedback workflows, as well as some tools and libraries built on top of the open-source components to enable faster experimentation, optimization and productization.

Profile :

David Talby is a chief technology officer at Pacific AI, helping fast-growing companies apply big data and data science techniques to solve real-world problems in healthcare, life science, and related fields. David has extensive experience in building and operating web-scale data science and business platforms, as well as building world-class, Agile, distributed teams. Previously, he was with Microsoft’s Bing Group, where he led business operations for Bing Shopping in the US and Europe, and worked at Amazon both in Seattle and the UK, where he built and ran distributed teams that helped scale Amazon’s financial systems. David holds a PhD in computer science and master’s degrees in both computer science and business administration.

x

Get latest updates of Global Predictive Analytics Conference
sent to your inbox.

Weekly insight from industry insiders.
Plus exclusive content and offers.