NoSQL Now! 2015
San Jose, CA
Tuesday, August 18, 2015
to
Thursday, August 20, 2015
The fifth annual NoSQL Now! Conference is the largest vendor-neutral forum focused on NoSQL (Not Only SQL) technologies.

Talks

Analytics and Data Warehousing with NoSQL Databases

Tomer Shiran View Bio

August 18m 2015 1:00pm - 1:45pm

NoSQL databases like MongoDB have emerged as popular datastores for operational applications such as Web and mobile apps. They offer scalable architectures and flexible data models, making them ideal for managing large volumes of rapidly evolving data. With the recent release of Apache Drill 1.0, these datastores can now be used for data warehousing, BI and advanced analytics. 

In this talk we'll provide an overview of Apache Drill and explain how to build a next-generation data warehouse with MongoDB and Apache Drill. We'll also provide a live demo leveraging Tableau to visualize data in a MongoDB cluster without sacrificing performance. 

 

Apache Drill Bootcamp

Jacques Nadeau & Tomer Shiran View Bio

August 20, 2015 1:00pm - 04:15pm

In this tutorial you’ll learn how to use Apache Drill, the open source, distributed, schema-free SQL engine. Drill enables SQL-based analytics and BI on MongoDB, HBase and other NoSQL databases. At the end of this tutorial, you’ll be able to explore and analyze your NoSQL data in situ with standard SQL queries and BI tools such as Tableau, MicroStrategy, Qlikview, Spotfire and Excel.

Agenda:

  • Hello World!
  • Data Model and Data Types
  • Data Sources: Storage Plugin Architecture; Using Storage Plugins; Navigating the Namespace; HBase; MongoDB
  • Metadata in Drill: Decentralized Metadata; Optional Schemas; Information Catalog
  • Exploring, Analyzing and Transforming Data: Exploration (SELECT) LIMIT 10 and Drill Explorer); Analysis (SELECT); Transformation (CREATE TABLE AS)
  • Using Virtual Datasets (Views): Why Virtual Datasets?; Creating Virtual Datasets (CLI and Drill Explorer); Virtual Dataset Internals (.drill); How Virtual Datasets are Exposed
  • APIs: ODBC; JDBC; REST; C; Java
  • Clients: CLI; BI (Excel, Tableau, etc.); Python (PyData, Pandas); R
  • Querying Complex and/or Schemaless Data: Handling Schemaless Data; Traditional BI on Complex Data

Note that this is a hands-on tutorial, so attendees will benefit from having their own laptop as they'll be able to run all the examples throughout the tutorial.

Speakers

Tomer Shiran

Tomer Shiran heads the product management team at MapR and is responsible for product strategy, roadmap and requirements. Prior to MapR, Tomer held numerous product management and engineering roles at Microsoft, most recently as the product manager for Microsoft Internet Security & Acceleration Server (now Microsoft Forefront). He is the founder of two websites that have served tens of millions of users, and received coverage in prestigious publications such as The New York Times, USA Today and The Times of London. Tomer is also the author of a 900-page programming book. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion - Israel Institute of Technology.

Jacques Nadeau & Tomer Shiran

Jacques Nadeau leads Apache Drill development efforts at MapR Technologies. He is an industry veteran with over 15 years of big data and analytics experience. Most recently, he was cofounder and CTO of search engine startup YapMap. Before that, he was director of new product engineering with Quigo (contextual advertising, acquired by AOL in 2007). He also built the Avenue A | Razorfish analytics data warehousing system and associated services practice (acquired by Microsoft).

Tomer Shiran heads the product management team at MapR and is responsible for product strategy, roadmap and requirements. Prior to MapR, Tomer held numerous product management and engineering roles at Microsoft, most recently as the product manager for Microsoft Internet Security & Acceleration Server (now Microsoft Forefront). He is the founder of two websites that have served tens of millions of users, and received coverage in prestigious publications such as The New York Times, USA Today and The Times of London. Tomer is also the author of a 900-page programming book. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion - Israel Institute of Technology.