June 27, 2016 at 6:30pm - Room LL21C
The computing world seems lately to be all a quiver about the novelty deep learning models and how they seem so mysterious.
In fact, the basic ideas behind these systems are very closely related to commonly known algorithms like k-means clustering.
In this session, Ted will present a simple example of an anomaly detector built using k-means clustering and show how it provides a insight into how much more advanced models such as neural networks and recurrent networks.
June 28, 2016 at 5:50pm - Ballroom C
The growing adoption of Spark SQL and Apache Drill presents an interesting conundrum- who should use which tool for what and when? Can they potentially be used together? This session will compare and contrast these technologies across a variety of parameters – architectures, ease of use, data types and data sources supported. We will demo a rich set of queries on some very interesting datasets to bring the representative use cases to life for both Spark SQL and Apache Drill.
June 29, 2016 at 12:20pm - Ballroom B
Real-time requirements are becoming increasingly common for new applications. Developers and architects need a reliable way to move data streams fast from the point of creation across different systems in order to respond to events as they happen. But a successful architecture requires more than just the use of an event-based system; a sustainable solution needs to incorporate best practices to ensure its viability over the long-term.
Application developers and architects need to have foresight; they must anticipate common pitfalls and roadblocks from the start. Getting a real-time system right requires planning and provisioning for:
• Scaling issues, from increased message volumes to increased numbers of producers/consumers or topics
• Security and authentication needs
• The possibility of needing to incorporate systems across multiple data centers
• The ability to support multiple query engines and other multi-tenancy considerations.
In this talk, we will describe how to plan for a successful and sustainable real-time system that can grow with your organization's needs.
Sridhar Reddy View Bio
June 29, 2016 at 4:10pm - Room 230C
The Internet of Things is rapidly evolving in several directions, for several use cases, and on several platforms and protocols. Like the Betamax-VHS war and many others, it's not clear what the best way to build your IoT architecture today while the dust hasn't yet settled. Regardless of the underlying framework or use case, there are a few common components and criteria that must be satisfied for any IoT solution. The primary purpose of this presentation is to identify what are the lowest common denominator must-have functionalities in your solution. Open source software is becoming more and more mainstream as a de facto technology choice in all levels of organization. Contributing to and consuming open source software is especially important in cutting-edge technologies such as IoT.
Will Ochandarena View Bio
June 29, 2016 at 5:00pm - Room 212
Co-presented with Brad Anderson, Liaison Technologies
Building a robust, responsive, secure data service for healthcare is tricky. For starters, healthcare data lends itself to multiple models: Document representation for patient profile view or update Graph representation to query relationships between patients, providers, and medications Search representation for advanced lookups Keeping these different systems up to date requires an architecture that can synchronize them in real time as data is updated. Furthermore, meeting audit requirements in Healthcare requires the ability to apply granular cross-datacenter replication policies to data and be able to provide detailed lineage information for each record. In this talk Will Ochandarena will describe how stream-first architectures can solve these challenges, and Brad Anderson will talk about how this has been implemented at Liaison Technologies.
Ted Dunning is Chief Application Architect at MapR Technologies and committer and PMC member of the Apache Mahout, Apache ZooKeeper, and Apache Drill projects. Ted has been very active in mentoring new Apache projects and is currently serving as vice president of incubation for the Apache Software Foundation. Ted was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems. He built fraud detection systems for ID Analytics (later purchased by LifeLock) and he has 24 patents issued to date and a dozen pending. Ted has a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. He also bought the beer at the first Hadoop user group meeting..
Sridhar is a Director of Professional Services for MapR Technologies. Sridhar has over 20 years of experience working with Java and JaveEE in many roles of the software development life cycle, including design, development, management, training and technology evangelism. Prior to MapR, Sridhar managed the Java Platform development team at PayPal, where he led a team of Java developers to build the next generation of the Java platform. Prior to PayPal, Sridhar worked as a Technology Evangelist at Sun Microsystems for over 10 years, where he increased awareness and adoption of Java technology in the worldwide developer community. While at Sun, Sridhar also managed the JavaOne Hands-On Labs as well as Sun Tech Days, a worldwide developer conference. Sridhar holds a BS in Mechanical Engineering from Osmania University in India, and an MS in Computer Science from the Florida Institute of Technology.