Strata & Hadoop World, London
London
Tuesday, May 5, 2015 to Thursday, May 7, 2015
Explore solutions to your most challenging problems, connect with the brightest minds in data, find out what’s new in emerging technologies and Apache Hadoop—and see for yourself what data can do.

Talks

How (the internet of) Things are Turning the Internet Upside Down

Ted Dunning

Wednesday, May 6, 2015
Just when we thought the last mile problem was solved, the Internet of Things is turning the last mile problem of the consumer internet into the first mile problem of the industrial internet. Previously, we had servers moving bits to billions of consumers, with almost perfect conservation of bits in the transmission from server to consumer. Now we are faced with a new pattern in which billions of machines send measurements to servers. This inversion affects every aspect of the design of networked applications, from the switching hardware to the algorithms used to store and understand the data. I will show how to use existing Hadoop ecosystem tools, such as Spark, Drill and others, to deal successfully with this inversion. This will involve a tour of key algorithms, techniques and tools used in such systems. This will be a practical talk that details real problems and real solutions.
Users love Spark. Does Spark love (multiple) users?

Michael Hausenblas

Apache Spark is a powerful, unified data processing engine offering a number of APIs, from batch and SQL through streaming to graph processing. The core architecture of Spark was not necessarily designed with a multi-user environment in mind. We will review existing and emerging approaches to using Spark in multi-user environments, such as the Tachyon project.
Using the Zeta Architecture: To Become A Hero

Jim Scott

Wednesday, May 6, 2015
In this session, I’ll explain how to move your business to the next level by implementing your enterprise architecture. I’ll lay out which products can fit into this architecture in order to fulfill the needs of your business, as well as the benefits that can be derived from these components, both independently and together. With this architecture, log shipping is a thing of the past. It is no longer needed, because this architectural simplification removes work that most businesses currently require in order to function and drive revenue. Log shipping is one less thing that employees will have to worry about in the wee hours of the night, and one less thing that can break. By storing the data where it is created in a distributed file system and processing it in place, you can remove all the required transport technologies and simplify the application architectures of many enterprise applications, because the enterprise architecture solves these problems.

Speakers

Ted Dunning

Ted Dunning is Chief Application Architect at MapR Technologies and a committer and PMC member of the Apache Mahout, Apache ZooKeeper, and Apache Drill projects. Ted has been very active in mentoring new Apache projects and is currently serving as vice president of incubation for the Apache Software Foundation. Ted was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems. He built fraud detection systems for ID Analytics (later purchased by LifeLock) and he has 24 patents issued to date and a dozen pending. Ted has a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. He also bought the beer at the first Hadoop user group meeting.

Michael Hausenblas

Michael is Chief Data Engineer, EMEA, for MapR, where he helps people tap the potential of Big Data by bridging the technical side (architecture, scalability, etc.) and the business side (RoI, TCO, etc.). His background is in large-scale data integration, the Internet of Things, and Web applications, and he is experienced in advocacy and standardization (World Wide Web Consortium). Michael shares his experience with the Lambda Architecture, distributed systems and polyglot persistence through blog posts and public speaking engagements, and is a contributor to Apache Drill.

Jim Scott

Jim drives enterprise architecture and strategy at MapR. Jim Scott is the cofounder of the Chicago Hadoop Users Group. As cofounder, Jim has helped build the Hadoop community in Chicago over the past four years. He has implemented Hadoop at three different companies, supporting a variety of enterprise use cases, from managing Points of Interest for mapping applications to Online Transactional Processing in advertising, as well as full data center monitoring and general data processing. Prior to MapR, Jim was SVP of Information Technology and Operations at SPINS, the leading provider of retail consumer insights, analytics reporting and consulting services for the Natural, Organic and Specialty Products industry. Additionally, he served as Lead Engineer/Architect for dotomi, one of the world’s largest and most diversified digital marketing companies. Prior to dotomi, Jim held several architect positions with companies such as aircell, NAVTEQ, Classified Ventures, Innobean, Imagitek, and Dow Chemical, where his work with high-throughput computing was a precursor to more standardized big data concepts like Hadoop.