I will talk about recent developments in Mahout and real-time learning. In particular, I will cover the results from quality and speed testing of Mahout's new super-fast k-means clustering algorithms (hint, quality is very good and speed is phenomenal). I will also dive deep into a design for a on-line clustering facility that can cluster the full content of the Twitter fire-hose into thousands of clusters in real-time.
Ted Dunning is Chief Application Architect for MapR Technologies. He also is a PMC member for Apache Zookeeper and Mahout projects and is a commiter and champion for the new Apache Drill project. Opinionated about software and data mining and passionate about open source, he is an active participant of Hadoop and its community and loves helping projects get going with new technologies. Ted holds a Ph.D. in computer science from the University of Sheffield and an M.S. from New Mexico State University.
|When:||03.05.2013 | 6:30 pm|
|Where:||28 West 23rd Street, New York, NY|