Large Scale Machine Learning with Apache Spark

Online Webinar | Wednesday, May 21, 2014 | 10:00am PT / 1:00pm ET

Spark offers a number of advantages over its predecessor MapReduce that make it ideal for large-scale machine learning. For example, Spark includes MLLib, a library of machine learning algorithms for large data. The presentation will cover the state of MLLib and the details of some of the scalable algorithms it includes.


Sandy Ryza – Software Engineer, Cloudera

Sandy Ryza is an engineer on the data science team at Cloudera. He is a committer on Apache Hadoop and recently led Cloudera's Apache Spark development.

Yes, I would like to start receiving email updates from Cloudera.