Building Predictive Models Over Big Data Using Elastic MapReduce

Earlier this week, Robert Grossman and Collin Bennett from Open Data Group gave a lecture as part of a tutorial at the SC 12 Conference in Salt Lake City about big data. They described some of the ways of building predictive models over big data using Hadoop streams and Hadoop’s implementation of MapReduce.

They illustrated the lecture with an example of building a predictive model over data provided by the City of Chicago about CTA busses using Amazon’s Elastic MapReduce.

You can find some of the materials for the lecture on the web page The materials also contain links to some best practices for deploying analytics in operational systems using PMML and PMML-compliant scoring engines, such as Augustus.

This entry was posted in analytic models, big data, Blog, PMML. Bookmark the permalink.