Speaker "John Ternent" Details

Topic

Text Analytics with Spark: Hands-On Workshop

Abstract

This hands-on workshop will introduce the tools and techniques for analyzing textual data using Spark and Python, from data acquisition and ingestion to parsing and manipulating document elements in order to apply machine learning algorithms to the data. We will also cover integration with approaches like clustering, classification, recommenders, and sentiment analysis to extract value from unstructured textual data sources. Participants who want to follow along with the exercises should bring their own laptop with a current version of a Hadoop sandbox, such as Hortonworks or Cloudera, installed.
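
As a rough illustration of the parse-then-model flow described above, the sketch below tokenizes a few short documents, weights terms with TF-IDF, and compares documents by cosine similarity. It is not workshop material: it uses plain Python rather than Spark so that it runs without a cluster, and the function names (`tokenize`, `tf_idf`, `cosine`), the smoothed IDF formula, and the sample sentences are all assumptions made for this example. In Spark itself, the `Tokenizer`, `HashingTF`, and `IDF` classes in `pyspark.ml.feature` cover comparable steps.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters (a deliberately simple parser)."""
    return re.findall(r"[a-z']+", text.lower())


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Uses relative term frequency and a smoothed IDF,
    log((1 + n) / (1 + df)) + 1, which is an assumption of this sketch.
    """
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1  # guard against empty documents
        vectors.append({
            term: (count / total) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample documents: two hotel reviews and one unrelated sentence.
docs = [
    "The room was clean and the staff were friendly",
    "Friendly staff and a clean room",
    "The flight was delayed for three hours",
]
vectors = tf_idf(docs)
sim_reviews = cosine(vectors[0], vectors[1])
sim_unrelated = cosine(vectors[0], vectors[2])
print(f"review vs. review:    {sim_reviews:.3f}")
print(f"review vs. unrelated: {sim_unrelated:.3f}")
```

The two hotel reviews share most of their vocabulary and should score as more similar to each other than either does to the unrelated sentence. Similarity scores of this kind are one possible input to the clustering and recommender approaches mentioned in the abstract.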

Profile

Mr. Ternent is a director for RCG Global Services focusing on big data and advanced analytics. He has over 20 years' experience as a consultant, architect, and technology executive specializing in data and analytics for the hospitality, retail, and healthcare industries. He has led multiple Agile development teams and has held the Certified Scrum Professional designation in addition to being an INFORMS Certified Analytics Professional. A resident of Orlando, Florida, he volunteers on multiple boards and is a co-organizer of the Orlando Data Science Meetup group.