Video
Streaming with MapR
MapR Streams is a global publish-subscribe event streaming system for big data. It connects data producers and consumers worldwide in real-time, with unlimited scale. Publishers (data producers) write data to one or more topics in MapR Streams. Subscribers (data consumers) to the topic can read the data instantaneously, anywhere across the globe.
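The publish-subscribe pattern described above can be illustrated with a small in-memory sketch. This is not the MapR Streams API: the `InMemoryBroker` class, its methods, and the `sensor-readings` topic are hypothetical names used only for illustration, and replaying earlier messages to late subscribers is an assumption of the sketch.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, List


class InMemoryBroker:
    """Toy broker (hypothetical): one ordered log per topic, pushed to subscribers."""

    def __init__(self) -> None:
        self._logs: DefaultDict[str, List[str]] = defaultdict(list)
        self._subscribers: DefaultDict[str, List[Callable[[str], None]]] = defaultdict(list)

    def publish(self, topic: str, message: str) -> None:
        # Append to the topic log, then deliver to every current subscriber.
        self._logs[topic].append(message)
        for callback in self._subscribers[topic]:
            callback(message)

    def subscribe(self, topic: str, callback: Callable[[str], None]) -> None:
        # Replay messages already in the log, then register for future ones.
        for message in self._logs[topic]:
            callback(message)
        self._subscribers[topic].append(callback)


broker = InMemoryBroker()
received_a: List[str] = []
received_b: List[str] = []

broker.subscribe("sensor-readings", received_a.append)
broker.publish("sensor-readings", "temp=21.5")
broker.subscribe("sensor-readings", received_b.append)  # late subscriber gets a replay
broker.publish("sensor-readings", "temp=22.0")
```

Producers and consumers only share a topic name, so either side can be added or removed without changing the other.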
 
Video
MapR Ecosystem Packs for Updating Ecosystem Components | Whiteboard Walkthrough
In this week’s Whiteboard Walkthrough, Rachel Silver, Ecosystem Product Manager at MapR, talks about MapR Ecosystem Packs or MEPs that give you a convenient way to upgrade open source ecosystem components without having to upgrade the core MapR platform.
 
Video
MPP Database and Data Warehouse vs Data Lake | Whiteboard Walkthrough
Sameer Nori, Senior Product Marketing Manager at MapR Technologies, compares a traditional data warehouse or MPP database versus a modern data lake.
 
Video
Apache Drill SQL Query Optimization | Whiteboard Walkthrough
In this week's Whiteboard Walkthrough video, Neeraja Rentachintala, Senior Director of Product Management at MapR Technologies, explains how Apache Drill optimization achieves interactive performance for low latency SQL queries on very large data sets when working with familiar BI tools such as Tableau, Microstrat
 
Infographic
4 Hot Business Intelligence Trends: The Big Data Expansion
Big data is growing exponentially, and so is its usefulness.
 
Video
How to Replicate Streaming Data Across Data Centers with MapR Streams | Whiteboard Walkthrough
In this week's Whiteboard Walkthrough, Jorge Geronimo, Solutions Architect at MapR, explains how, with a single line of code, you can create a replica of a MapR data stream within the same cluster or on another cluster, even in another part of the world.
 
Video
Anomaly Detection in Telecommunications Using Complex Streaming Data | Whiteboard Walkthrough
In this week's Whiteboard Walkthrough, Ted Dunning, Chief Application Architect at MapR, explains in detail how to use streaming IoT sensor data from handsets and devices, as well as cell tower data, to detect anomalies.
 
Video
Apache Drill SQL Queries on Parquet Data | Whiteboard Walkthrough
In this Whiteboard Walkthrough, Parth Chandra, Chair of the PMC for the Apache Drill project and member of the MapR engineering team, describes how the Apache Drill SQL query engine reads data in Parquet format and shares some best practices for getting maximum performance from Parquet.
 
Video
Connected vs. Converged Big Data Environments | Whiteboard Walkthrough
Nick Amato, Director Technical Marketing at MapR, explains the advantages of a converged environment for streaming applications vs. running these services in separate clusters.
 
Video
State vs. Flow Data Architecture in the Financial Sector | Whiteboard Walkthrough
In this Whiteboard Walkthrough, MapR’s Chief Application Architect, Ted Dunning, explains the move from state to flow and shows how it works in a financial services example.
 
Video
Best practices for Hadoop in production - Panel discussion facilitated by Forrester analyst
 
Video
MapR 5.2: Getting More Value from the MapR Converged Community Edition
Thank you for using the MapR Converged Community Edition. We hope you have enjoyed great success with your big data projects with the MapR Platform.
 
Video
MapR Converged Data Platform: What's Important about "Converged?"
In this week's Whiteboard Walkthrough, Ellen Friedman, Solutions Consultant at MapR, describes what happens when certain fundamental big data capabilities are engineered together as a part of the same technology. This brief overview compares the converged data platform as a foundation for big data projects versus building solutions on a base of separate pieces.
 
Video
ETL and Interactive Analytics with Apache Spark and Apache Drill
Vinay Bhat, Solution Architect at MapR Technologies, takes you step-by-step through a widespread big data use case: data warehouse offload and building an interactive analytics application using Apache Spark and Apache Drill. Vinay explains how the MapR Converged Data Platform provides unique capabilities to make this process easy and efficient, including support for multi-tenancy.
 
Video
Why MapR Monitoring is Different
Dale Kim, Sr. Director of Industry Solutions at MapR, describes the monitoring capabilities of the MapR Converged Data Platform, which easily give you a single view of all cluster operations. Leveraging popular open source technologies, the monitoring system is customizable and extensible to address the challenges of your big data deployment requirements.
 
Video
Big Data SQL: Overview of Apache Drill Query Execution Capabilities
In this Whiteboard Walkthrough, Neeraja Rentachintala, Senior Director of Product Management at MapR Technologies, gives an overview of how open source Apache Drill achieves low latency for interactive SQL queries carried out on large datasets. With Drill, you can use familiar ANSI SQL BI tools, such as Tableau or MicroStrategy, plus do exploration directly on big data.
 
Video
Machine Learning with Apache Spark
The goal of Spark’s machine learning (ML) library is to make practical machine learning scalable and easy.
 
Video
ESS 101 - Apache Hadoop Essentials
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "ESS 101 - Apache Hadoop Essentials."
 
Video
ESS 100 - Introduction to Big Data
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "ESS 100 - Introduction to Big Data"
 
Video
Terbium Labs Panel at Spark Summit 2016
Hear the CEO of Terbium Labs describe their use case and how they utilize Spark on the MapR platform.
 
Video
Security Log Analytics Explainer Video
Watch this explainer video on security log analytics.
 
Video
Better and Faster Machine Learning Classifiers in Python
Building a good classifier in Python can be a tedious process. There are multiple models to try, and each one has its own set of hyperparameters that can affect the results. The most common way to approach this is to do an exhaustive search of all possible combinations with something like GridSearchCV.
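The exhaustive search mentioned above can be sketched with the standard library alone. This is not scikit-learn's `GridSearchCV`: `grid_search` and `toy_score` are hypothetical helpers, and `toy_score` stands in for a real cross-validated score that you would compute by training and evaluating a model.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple


def grid_search(
    param_grid: Dict[str, List[float]],
    score: Callable[[Dict[str, float]], float],
) -> Tuple[Dict[str, float], float]:
    """Try every combination in param_grid and return the best-scoring one."""
    names = sorted(param_grid)
    best_params: Dict[str, float] = {}
    best_score = float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        current = score(params)
        if current > best_score:
            best_params, best_score = params, current
    return best_params, best_score


def toy_score(params: Dict[str, float]) -> float:
    # Hypothetical stand-in for a cross-validated score; highest at depth=4, rate=0.1.
    return -((params["depth"] - 4) ** 2) - abs(params["rate"] - 0.1)


grid = {"depth": [2, 4, 8], "rate": [0.01, 0.1, 1.0]}
best_params, best_score = grid_search(grid, toy_score)
n_combinations = len(list(product(*grid.values())))
```

The number of combinations is the product of the list lengths, which is why exhaustive search becomes slow as more hyperparameters are added.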
 
Video
Supercharging Machine Learning Regression in Python
Getting useful results from machine learning is a process of experimentation, fine-tuning, and trying multiple models and algorithms. Each type of estimator has its own set of hyperparameters, which must be exhaustively searched and cross-validated. This process can take a while, even on a machine with lots of cores and memory.
 
Video
How to Code for the Apache Kafka 0.9 API
The new Kafka API in 0.9 is much easier to use and provides better control over how data moves from message producer to consumer. As a side benefit, it also hides the details of the implementation of Kafka which opens the door for better performance and new implementation strategies such as the ones used in MapR Streams.
 
Video
March Madness Meets Graph Theory
 
Video
Gartner BI Summit 2016 – Continuous Analytics: The Time is Now
To get value out of today’s big and fast data, organizations must evolve beyond traditional analytic cycles that are heavy with data transformation and schema management. Achieving an “as-it-happens” business requires flexible, real-time data access, collapsing data silos and automating data-to-action for immediate operational benefits.
 
Video
Securing Your Data: How to Audit a MapR Cluster
In this week's Whiteboard Walkthrough, Mitesh Shah, Product Management at MapR, describes how you can track administrative operations and data accesses in your MapR Converged Data Platform in a seamless and efficient way with the built-in auditing feature.
 
Video
What is MapR-FS?
In this week's Whiteboard Walkthrough, Fabian Wilckens, EMEA Solutions Architect at MapR, discusses some of the key themes, including "real-time" and "standard interfaces," that are important in a big data environment for driving business value.
 
Video
Das Publish-Subscribe Models für Echtzeit Datastreams [GER]
In this Whiteboard Walkthrough, Fabian Wilckens, EMEA Solutions Architect at MapR, explains the advantages of the publish-subscribe model for real-time data streams.
 
Video
The Publish-Subscribe Model for Real-Time Data Streams
In this week's whiteboard walkthrough, Tugdual Grall, technical evangelist at MapR, explains the advantages of a publish-subscribe model for real-time data streams.
 
Video
Securing Files in Hadoop at the Right Levels
In this week's Whiteboard Walkthrough, Mitesh Shah, Product Management at MapR, describes how you can make sure you aren’t opening more access permissions to your sensitive data in Hadoop than you intended, using File Access Control Expressions in MapR.
 
Demo
Demo: Access Control Expressions in MapR 5.1
This video shows an example scenario where you can use volume-level ACEs to provide secure access to match your requirements, and how ACEs are superior to ACLs used by other platforms.
 
Demo
Demo: Security Audit Logs in MapR 5.1
Securing data in Hadoop and Spark is easy in MapR. This video shows some of the new auditing and tracking capabilities in 5.1, including ways you can detect unauthorized data access and use any BI tool you might be using now (such as Tableau) to audit security and ensure compliance.
 
Video
MapR-DB Explainer Video
MapR-DB is an enterprise-grade, high performance, in-Hadoop NoSQL (“Not Only SQL”) database management system. It is used to add real-time, operational analytics capabilities to Hadoop. NoSQL primarily addresses two critical data architecture requirements:
 
Video
MSA Putting the Power of Data to Work with Cisco and MapR
Management Science Associates (MSA) uses the MapR Converged Data Platform for big data and analytics on Cisco UCS Integrated Infrastructure to power insights for better decisions.
 
Video
Genome Analysis Pipelines with Spark and ADAM
In this presentation with Dr. Allen Day, we’ll explore how a step that is common to many bioinformatics workflows, sequence alignment, can be done with Bowtie and ADAM inside a Spark environment to quickly align short reads to a reference genome.
 
Video
Super Bowl Predictions with Apache Spark
Data scientist Joe Blue shares betting tips by demonstrating how he used Apache Spark to extract features from raw NFL games and applied the K-nearest-neighbor (KNN) machine learning algorithm to find a nugget the sharps in Vegas may have missed.
 
Video
Analyzing and Storing JSON Data with MapR-DB and Python
With the native JSON support in MapR-DB, application development is greatly simplified. In this session we'll look at how to natively persist large JSON data sets on the MapR platform, using Python, without any transformations or pipelines required.
 
Video
Import Large JSON Data Sets with OJAI
Do you have large, complex JSON data sets you need to store and analyze? With the easy-to-use open source OJAI (Open JSON Application Interface) API, you can combine the flexibility of JSON with the scale and performance of NoSQL to quickly write the modern applications your users need. Learn the basics of coding in OJAI on MapR-DB in this session of Free Code Fridays.
 
Video
Apache Spark vs. Apache Flink
In this week's whiteboard walkthrough, Balaji Narasimhalu, product manager at MapR, explains the difference between Apache Spark and Apache Flink and how to decide which one to use.
 
Video
Drill 201 - Hands-on Tutorial for Apache Drill
If you've been using SQL for years or are more familiar with BI tools such as Tableau, MicroStrategy, or Qlik, you may have experienced some lag in the data-to-action cycle. With on-the-fly schema discovery, Apache Drill leverages your current skill set and tools so that you don't have to wait on IT to transform and load your data.
 
Video
MapR Streams vs. Kafka
In this week's Whiteboard Walkthrough, Jim Scott, Director of Enterprise Strategy and Architecture at MapR, discusses a business use case that leverages the power of MapR Streams.
 
Video
Fine-Grained Scaling with Apache Myriad
In this week's Whiteboard Walkthrough, Santosh Marella, committer on the Apache Myriad project, explains how Apache Myriad enables fine-grained scaling in Mesos environments alongside YARN, the resource management framework for Apache Hadoop.
 
Video
MapR Streams Under the Hood
In this week's Whiteboard Walkthrough, Will Ochandarena, Director of Product Management at MapR, explains how we are able to build the MapR Streams capabilities that differentiate us from similar products in the market.
 
Video
The MapR CTO's Perspective on the Converged Data Platform
MC Srivas, MapR Co-Founder and CTO, walks you through the MapR Converged Data Platform that has been in the making for the last 6 years and is now finally complete with MapR Streams.
 
Video
Drill 101 - Basics of Apache Drill
Want to discover how you can get self-service data exploration capabilities on data stored in multiple formats in files or NoSQL databases? Watch this session of Free Code Fridays to get a basic understanding of Apache Drill.
 
Video
The Evolution of Business Intelligence and Self-Service Analytics
In this week's Whiteboard Walkthrough, Sameer Nori, Business Intelligence Expert at MapR, explains how BI has evolved over the last three decades from being IT-driven to analyst-driven with self-service tools.
 
Video
Discover the Best Time to Post a Cute Cat Picture on Reddit
One of the beauties of Apache Drill is how it enables users to leverage existing skills, such as coding in different programming languages and using BI tools, while providing agility and flexibility to analyze complex, semi-structured data.
 
Video
MapR SVP Highlights the Benefits of Cisco UCS for Big Data
MapR SVP of Worldwide Field Operations, Steve Fitz, comments on the benefits of Cisco UCS Integrated Infrastructure for Big Data.
 
Video
How to Turn Raw Data into Insights in Minutes with Apache Drill
One of the key use cases for Apache Drill is exploring and analyzing the raw data coming into Hadoop/NoSQL systems, using SQL. Along with meeting the table stakes for SQL-on-Hadoop, which is to achieve low latency performance at scale, Drill allows users to analyze the data without any ETL or upfront schema definitions.
 
Video
Connecting to Apache Drill from Applications and Scripts
One of the challenges of connecting to multiple data sources in Hadoop is managing the proliferation of modules that must be loaded into an application to access them -- each with their own implementations to learn, code libraries to manage, and caveats to consider. There is (very often) more than one way to connect to the same data, in the same database, within the same language.
 
Video
Recommendation Engines 101
Most of you have experienced the power of data-driven recommendations. Maybe you found a former colleague through LinkedIn’s “People You May Know” feature or you watched a movie because Netflix suggested it to you. In all of these instances, recommendation engines help narrow your choices to those that best meet your particular needs.
 
Video
Introduction to Spark Web UI
Ready to get started with Spark? Join us for Free Code Fridays where Carol McDonald, HBase Hadoop Instructor at MapR, will demonstrate how to start using the Apache Spark Web UI to track useful information about how your Spark application is executing on a Hadoop cluster.
 
Video
Practical Uses and Methods for Synthetic Data
Synthetic data is remarkably useful for many data science tasks and can even improve security. Join us for Free Code Fridays where Ted Dunning, Chief Application Architect at MapR, will use log-synth, an open-source program, to generate interesting randomized data. Watch this 30-min demo to see how you can use log-synth to:
 
Video
Discover Drill Custom Functions
Apache Drill allows users to explore any type of data using ANSI SQL – and takes it a step further by allowing you to create custom functions to extend the query engine. While these custom functions have all the performance of any of the Drill primitive operations, writing these functions may be a little trickier than you'd expect.
 
Video
Parallel and Iterative Processing for Machine Learning Recommendations with Spark
Recommendation systems help narrow your choices to those that best meet your particular needs. They are among the most popular applications of big data processing. In this Free Code Friday session, you’ll learn how to build a recommendation model from movie ratings using an iterative algorithm and parallel processing with Apache Spark MLlib.
 
Video
Real-Time Profiles with Spark and Python
An appealing aspect of Spark for handling application workloads is its support for Python as a first-class citizen.  This can be especially useful for quickly aggregating and summarizing large datasets and feature generation for machine learning models. Join this 30-minute Free Code Friday with Nick Amato, Director of Technical Marketing for MapR. He'll show you how to:
 
Video
Document Classification with Apache Spark
There are copious tutorials, demos and walk-throughs that illustrate how to apply machine learning algorithms to perfectly-manicured data sets. But this doesn’t reflect real-life situations for those who have big opportunities to find big value. What happens when your dataset is massive and unformatted, such as the internet search history for…everyone?
 
Video
Simplify Hadoop Cluster Management
Managing complex distributed systems like Hadoop is really hard without some kind of configuration management or automation tool. Ansible is a great way to manage your Hadoop cluster that's easy to get started with. In this Free Code Friday session, Vince Gonzalez, Systems Engineer at MapR, will share:
 
Video
Spark Streaming with HBase
Spark Streaming is an extension of the core Spark API that enables continuous data stream processing. It is particularly useful when data needs to be processed in real-time. Carol McDonald, HBase Hadoop Instructor at MapR, will cover:
 
Video
Simplify Application Development with MapR-DB JSON and OJAI, a JSON-based API
The beauty of JSON lies in its flexible schema nature. The Open JSON Application Interface (OJAI), a JSON-based API, simplifies application development, especially when managing complex, evolving, hierarchical data types. Tugdual Grall, Technical Evangelist at MapR, will show you how to build a REST API using Java and quickly add new features to an application.
 
Video
On-demand YARN Clusters with Isolated Compute, Network, & Storage
Businesses strive for higher utilization and better returns from their data center investments. This need is causing businesses to explore consolidating their infrastructure into one giant cluster instead of several silos. It is not uncommon for an organization to have several Hadoop/YARN clusters that need to be isolated from each other.
 
Video
Identify Your Data Breach with Apache Drill
Numerous big data methods have been unable to eradicate fraud completely. It’s important to score customer transactions to prevent the takeover, but crucial information about where the accounts were intercepted may be lurking in plain sight, completely overlooked.
 
Demo
Building a Dashboard with 300M Events: Teradata ThinkBig on MapR
How do you make sense of 27 years of data? Is that even possible? Watch this video to see an example of a real-time analytics dashboard using the MapR Distribution and ThinkBig Analytics.
 
Video
The OJAI Document Lifecycle in MapR-DB
In this week's Whiteboard Walkthrough, Bharat Baddepudi, engineer on the MapR-DB team, explains how documents in MapR-DB are inserted and updated.
 
Video
Free Hadoop Training: Spark Essentials
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DEV 360 - Spark Essentials"
 
Video
Free Hadoop Training: Apache Hive Essentials
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DA 440: Apache Hive Essentials"
 
Video
The MapR CTO/Founder on MapR-DB and Project Kudu
In this week's Whiteboard Walkthrough, MC Srivas, MapR CTO and Co-Founder, explains what vision he had in mind when architecting MapR-DB, and how MapR delivers on the vision of being much faster than other technologies like Project Kudu.
 
Video
Apache Spark Use Case for Better Drug Discovery
In this week's Whiteboard Walkthrough, Steve Wooledge, VP of Industry Solutions at MapR, talks about an Apache Spark + Hadoop use case for drug discovery that one of our customers is currently running in production.
 
Video
Horizontal Scaling with MapR-DB
In this week's Whiteboard Walkthrough, Anurag Choudhary, Engineer on the MapR-DB team, explains how horizontal scaling in MapR-DB works and how hot spotting is automatically avoided.
 
Video
Joe Blue – Data Science for the Health Care Industry
Hear from Joe Blue, a Data Scientist with expertise in Health Care & Life Sciences industries.
 
Video
Joe Blue – Data Science for Financial Services
Hear from Joe Blue, a Data Scientist with expertise in Financial Services.
 
Video
Ben Sadeghi - Data Science for Ad-tech and Telco
Hear from Ben Sadeghi, a Data Scientist with expertise in Telecommunications & the Advertising, Media and Entertainment industries.
 
Video
Apache Drill – Enabling High-Performance SQL with a JSON Data Model
Tomer Shiran, PMC member and Apache Drill committer, walks you through the magic behind Drill.
 
Video
Apache Drill – The Rise of the Non-Relational Datastore
Tomer Shiran, PMC member and Apache Drill committer, walks you through the history of the non-relational datastore and why Apache Drill is so important for this type of technology.
 
Video
Apache Drill – How to Deploy Apache Drill and Connect to BI Tools
Tomer Shiran, PMC member and Apache Drill committer, walks you through the deployment of Apache Drill with different storage systems and the connection with BI tools.
 
Video
Apache Drill – How Drill Connects to Data Sources
Tomer Shiran, PMC member and Apache Drill committer, explains how the Drill execution engine connects to different data sources through its storage plugins.
 
Video
Apache Spark vs. MapReduce #WhiteboardWalkthrough
In this week's Whiteboard Walkthrough, Anoop Dawar, Senior Product Director at MapR, shows you the basics of Apache Spark and how it is different from MapReduce.
 
Demo
Cluster Auditing Demo in MapR 5.0
The ability to audit Hadoop cluster operations, including filesystem, ACLs, and modifications to the cluster, is critical to maintaining data governance and security standards. This video shows a quick example of how to see whether data is being modified after hours.
 
Video
HDFS vs. MapR FS – 3 Numbers for a Superior Architecture
In this week's Whiteboard Walkthrough, Ted Dunning, Chief Application Architect at MapR, talks about the architectural differences between HDFS and MapR FS that boil down to three numbers.
 
Video
Free Hadoop Training: Developing HBase Applications – Advanced
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "Developing HBase Applications: Advanced"
 
Video
Free Hadoop Training: Developing HBase Applications
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DEV 330 - Developing HBase Applications: Basics"
 
Demo
MapR Quick Start Solution - Recommendation Engine Demo
This Quick Start Solution template can help you get started making recommendations with Hadoop and machine learning.
 
Demo
MapR Quick Start Solution -- Data Warehouse Optimization Demo
This Quick Start Solution demo shows how you can make the most of your data warehouse resources and start to get value from new data in your enterprise.
 
Video
LucidWorks Search and MapR Integration
Grant Ingersoll, CTO of LucidWorks, and Ted Dunning, Chief Application Architect at MapR, talk about LucidWorks Search, its ease of use, and its integration with MapR.
 
Video
Introduction to the Zeta Architecture #WhiteboardWalkthrough
In this week's Whiteboard Walkthrough, Jim Scott, Director of Enterprise Strategy and Architecture at MapR, gives you an introduction to the Zeta Architecture, a high-level enterprise architectural construct which enables simplified business processes and defines a scalable way to increase the speed of integrating data into the business.
 
Demo
Drill Configuration Options - Demo
 
Demo
3 Ways to Analyze Packet Captures with BI Tools and Drill
Using Drill you can analyze packet data from your network directly, without loading it into a database or knowing all the details about how the data is structured. This video shows 3 examples of how you can analyze Wireshark data with SAP Lumira and Tableau.
 
Video
Apache Drill Meetup – We are 1.0!
Rewatch the first meetup session after the Apache Drill 1.0 release!
 
Video
Spark & Hadoop at Production Scale – Spark Summit Keynote
How are leading companies deploying Spark with Hadoop in production? What insights have they gained, and what should you consider to put your innovative Spark-based app to work faster?
 
Demo
Twitter Analytics with Apache Drill and MicroStrategy - Demo
Follow this demo and learn how you can perform your own social media analysis by loading complex, nested JSON data from Twitter directly into MicroStrategy with Apache Drill. No schema needed!
 
Video
Using the Zeta Architecture – Jim Scott at Strata London 2015
Rewatch Jim Scott's presentation at Strata+Hadoop London 2015 on "Using the Zeta Architecture".
 
Video
Free Hadoop Training: MapR Distribution
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "HDE 110 - MapR Distribution Essentials"
 
Demo
Apache Drill with Tableau - Demo
This video walks you through a practical scenario where the business analyst can use Apache Drill via a visualization tool such as Tableau to gain instant insights from a variety of data including complex nested data.
 
Demo
Drill Explorer Demo
Use the Drill Explorer to quickly discover and analyze semi-structured and structured data sources together. This demo shows how you can save time analyzing JSON, HBase and Parquet, without building any schema, writing scripts, or doing any upfront transformations.
 
Video
Fully Real-Time Recommendation – Ted Dunning at SF Data Mining
Ted Dunning talks about fully real-time recommendation engines, best practices, and use cases.
 
Video
Hadoop On-Demand Training Preview: DA 415 Drill Architecture
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DA 415 Drill Architecture"
 
Demo
Apache Spark on MapR with MLlib - Demo
In this demo we are using Spark and PySpark to process and analyze the data set, calculate aggregate statistics about the user base in a PySpark script, persist all of that back into MapR-DB for use in Spark and Tableau, and finally use MLlib to build logistic regression models.
 
Video
Better Anomaly Detection with the T-Digest #WhiteboardWalkthrough
In this week's Whiteboard Walkthrough, Ted Dunning, Chief Application Architect at MapR, gets you up to speed on the t-digest, an algorithm you can add to any anomaly detector to set the number of alarms that you get as a percentage of the total samples. It estimates percentiles very accurately, especially high or low percentiles, and allows you to set a threshold for alarms.
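The idea of choosing an alarm threshold from a percentile can be sketched as follows. Note that this sketch does not implement the t-digest: it sorts all samples and picks an exact percentile, which only works when the data fits in memory, whereas the t-digest estimates percentiles from a compact summary. The function name `percentile_threshold` and the sample data are hypothetical.

```python
from typing import List, Sequence


def percentile_threshold(samples: Sequence[float], alarm_fraction: float) -> float:
    """Return a value that roughly `alarm_fraction` of the samples exceed."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < alarm_fraction < 1:
        raise ValueError("alarm_fraction must be between 0 and 1")
    ordered = sorted(samples)
    n = len(ordered)
    # Number of samples allowed above the threshold, kept below n so the index is valid.
    alarm_count = min(round(alarm_fraction * n), n - 1)
    return ordered[n - alarm_count - 1]


# Hypothetical measurements: the integers 1..1000.
measurements: List[float] = [float(x) for x in range(1, 1001)]
threshold = percentile_threshold(measurements, 0.01)  # alarm on about 1% of samples
alarms = [x for x in measurements if x > threshold]
```

Setting the threshold this way fixes the alarm rate as a share of all samples instead of requiring a hand-picked cutoff value.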
 
Video
Anomaly Detection with Poisson Distribution #WhiteboardWalkthrough
In this week's Whiteboard Walkthrough, Ted Dunning, Chief Application Architect at MapR, gets you up to speed on anomaly detection, a simple and easy to implement technique to "figure out stuff that just happened but shouldn't have". Learn more about Poisson Distribution on Wikipedia.
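One common way to apply the Poisson distribution to anomaly detection is sketched below; the source does not spell out the exact method used in the video, so treat this as an assumption. The sketch flags an event count as anomalous when the probability of seeing at least that many events, given the expected rate, falls below a cutoff. The names `poisson_tail` and `is_anomalous` and the default cutoff `alpha=0.001` are hypothetical.

```python
import math


def poisson_tail(count: int, rate: float) -> float:
    """P(X >= count) for X ~ Poisson(rate)."""
    # Sum the probability mass for 0 .. count-1, then take the complement.
    below = sum(math.exp(-rate) * rate**i / math.factorial(i) for i in range(count))
    return 1.0 - below


def is_anomalous(count: int, expected_rate: float, alpha: float = 0.001) -> bool:
    """Flag a count that would be very unlikely at the expected rate."""
    return poisson_tail(count, expected_rate) < alpha


# Hypothetical example: about 5 events per interval are expected.
typical = is_anomalous(5, 5.0)
spike = is_anomalous(20, 5.0)
```

With an expected rate of 5 events per interval, a count of 5 is unremarkable, while a count of 20 is far out in the tail.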
 
Video
Free Hadoop Training: HBase Schema Design
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DEV 325 - HBase Schema Design"
 
Video
Internet of Things – Big Data Outbreak #WhiteboardWalkthrough
In this week's Whiteboard Walkthrough, Jay Margalus, Demo Specialist at MapR, shows you an awesome project he developed last year called "Big Data Outbreak". The code is available on GitHub for both the badges and the server.
 
Demo
Hadoop On-Demand Training: Drill - Demo
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DA 410 Drill Essentials"
 
Video
NoSQL and Hadoop for Solving Big Data #WhiteboardWalkthrough
In this week's Whiteboard Walkthrough, Dale Kim, Director of Industry Solutions at MapR, gets you up to speed on Apache Hadoop and NoSQL. He talks about the similarities and differences between the two, but most importantly how both technologies should be a requirement for any true big data environment.
 
Video
The MapR + Teradata Partnership
MapR and Teradata have partnered to deliver the top-ranked, enterprise-grade Hadoop distribution with the industry-leading data warehouse platform to address business-critical use cases. MapR shares a best-of-breed view on workload-specific systems, and fully supports the Teradata Unified Data Architecture (UDA).
 
Video
The MapR + Attunity Partnership
Attunity Ltd. is a leading enterprise-class software provider empowering some of the world’s largest companies with the solutions to seamlessly and efficiently connect, transfer and join to and from virtually any data source.
 
Video
The MapR + Avlino Partnership
Avlino is an industry-leading Big Data solutions provider. As it spreads across all market segments, Big Data is expected to completely change enterprise and cloud IT infrastructure in the next 5-10 years. The challenge is to select, from a myriad of technology options, those that exactly fit end users' exacting corporate goals.
 
Video
The MapR + AtScale Partnership
AtScale delivers the power of Hadoop to ordinary, human data analysts. Analyze Hadoop data with the tools you already know, including Tableau, Microsoft Excel, or your web browser. AtScale masks the complexity of Hadoop and exposes powerful SQL and MDX interfaces. Enjoy interactive performance with automatic aggregations and cube optimization.
 
Video
The MapR + Alpine Data Partnership
Alpine Data Labs is the leader in data science for Hadoop and big data. The company’s products uniquely combine intuitive interfaces, native analytic processing in Hadoop, high performance in-database analytics, and the efficiencies of cloud computing to define the new paradigm in advanced analytics: accessible, easy to use and built for big data. Alpine’s customers include A.T.
 
Video
The MapR + Waterline Data Partnership
Waterline Data is an early-stage Big Data software company, founded in December 2013, backed by Menlo Ventures and Sigma West. Waterline Data Inventory builds a complete inventory of your data assets in Hadoop, automatically and securely, and lets enterprise users find, understand, and help govern Hadoop data.
 
Video
The MapR + RedPoint Partnership
RedPoint Global empowers marketers to bring together all the customer data they need to create precise one-to-one interactions with customers across any and all marketing channels.
 
Video
The MapR + Splice Machine Partnership
Splice Machine delivers a database solution that incorporates the scalability of Hadoop, the standard ANSI SQL and ACID transactions of an RDBMS and the distributed computing power of HBase.
 
Video
The MapR + Simba Partnership
Simba Technologies is the recognized world leader in standards-based data access and analytics products and solutions for both relational and multi-dimensional data sources. Simba provides the world’s leading companies with data connectivity solutions running on multiple platforms, including Windows, Mac, UNIX, Linux and many mobile platforms.
 
Video
The MapR + Skytree Partnership
Machine learning involves the creation of algorithms that allow for the analysis of massive data sets that traditional BI tools cannot handle. The more data that is analyzed over time, the smarter and more accurate the algorithms become. It’s an iterative learning process of continuous improvement.
 
Video
The MapR + Syncsort Partnership
Moving workloads from the data warehouse and mainframe to Apache Hadoop can be intimidating. How do you know where to begin, and what will deliver the most savings? Big data specialists Syncsort and MapR have teamed up to create a unique end-to-end approach to solve these challenges economically and efficiently.
 
Video
The MapR + Talend Partnership
The combination of Talend® and MapR® provides a new generation of highly scalable data management processing across vast amounts of structured and unstructured data, so your big data projects can be completed faster, at a lower cost, and using existing skillsets.
 
Video
The MapR + Protegrity Partnership
Protegrity is the only enterprise data security software platform that leverages scalable, data-centric encryption, tokenization and masking to help businesses secure sensitive information while maintaining data usability.
 
Video
The MapR + BlueData Partnership
BlueData is the pioneer in Big Data private clouds. The company is democratizing Big Data by streamlining and simplifying Big Data infrastructure and eliminating complexity as a barrier to adoption.
 
Video
The MapR + Centrify Partnership
Centrify provides unified identity management across cloud, mobile and data center environments that delivers single sign-on (SSO) for users and a simplified identity infrastructure for IT.
 
Video
The MapR + Information Builders Partnership
Unleash the potential of your organization’s data with Information Builders and MapR. Information yields higher returns the more it is used, shared, and managed effectively.
 
Video
The MapR + Dataiku Partnership
Founded in January 2013, Dataiku is the technology startup behind Data Science Studio (DSS) whose first version, DSS 1.0, was launched in February 2014. Data Science Studio (DSS) is a software platform designed for developers and analysts that aggregates all the steps and big data tools necessary to get from raw data to production ready applications.
 
Video
The MapR + Voltage Partnership
Voltage Security®, Inc. is the leading expert in data-centric encryption and tokenization, providing MapR customers with secure, scalable, and proven security and stateless key management solutions for both structured and unstructured data.
 
Video
The MapR + Dataguise Partnership
Dataguise delivers data privacy protection and risk assessment analytics that allow organizations to safely leverage and share enterprise data. The Dataguise solutions simplify governance as they proactively locate sensitive data, automatically protect it with appropriate remediation policies, and provide actionable compliance intelligence to decision makers, in real-time.
 
Video
Overview of an Enterprise Data Hub on MapR
This video provides an overview of an Enterprise Data Hub and highlights the critical capabilities that MapR provides to enable customers to build an enterprise grade data hub. The video provides examples of 2 customers who have optimized their data architecture with MapR.
 
Video
Hadoop On-Demand Training – Anytime, Anywhere
Free online Hadoop courses are now available anytime, anywhere for developers, data analysts and administrators. The in-depth Hadoop curriculum with interactive labs and quizzes leads to certification.
 
Video
Comparing MapR FS and HDFS NFS and Snapshots
This demo by Bruce Penn, Principal Solution Architect at MapR, compares NFS and Snapshots between MapR FS and HDFS.
 
Video
Hadoop On-Demand Training Preview: ADM 201 - Hadoop Operations
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "ADM 201 - Hadoop Operations: Cluster Administration"
 
Video
Hadoop On-Demand Training Preview: DEV 301 - Developing Hadoop Applications
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "DEV 301 - Developing Hadoop Applications"
 
Demo
SQL Queries on data in Amazon S3 storage with Drill - Demo
Did you know you can connect Tableau and other BI tools to data stored in Amazon S3, and run SQL queries directly on JSON and other types of data? This example walks through how to configure the S3 storage plugin for Drill, and an example query in Tableau.
 
Video
Hadoop On-Demand Training Preview: HDE 100 - Hadoop Essentials
Get a glimpse of what free Hadoop on-demand training is like in this preview of the course "HDE 100 - Hadoop Essentials"
 
Video
Why MapR? John Schroeder, MapR CEO, explains the MapR approach to enterprise Hadoop.
The MapR Distribution including Hadoop is the only distribution built from the ground up for your business-critical production applications. Built-in enterprise-grade features such as high availability, disaster recovery, security, and consistent snapshots let you deploy production-ready systems.
 
Video
Apache Drill Introduction
Inspired by Google Dremel and a vision to support modern big data applications, Drill provides the agility, flexibility and the familiarity required for users to derive timely insights from big data and to build the next generation big data applications.
 
Demo
Apache Drill: Redefining SQL on Hadoop - Demo
Multiple data sources. One SQL Query Engine. See how Apache Drill is used with the latest visualization tools to query multiple data sets from your big data platform.
 
Video
Whiteboard Walkthrough – Time Series Databases in the Upside-down Internet
In this week's Whiteboard Walkthrough, Ted Dunning, Chief Application Architect at MapR, talks about how current trends are turning the internet upside down. He also explains how this creates a need for very high-speed time series databases, and how practical designs based on modern NoSQL architectures can implement them.
 
Video
Impacting Business as it Happens - Anil Gadre (Strata + Hadoop 2015)
To get value out of today’s big and fast data, organizations must evolve beyond traditional analytic cycles that are heavy with data transformation and schema management. The Hadoop revolution is about merging business analytics and production operations to create the ‘as-it-happens’ business.
 
Demo
MapR Quick Start Solution - Security Log Analytics - Demo
This Quick Start Solution enables you to start bringing security logs into Hadoop for analysis and better detection and prevention of security threats.
 
Video
Whiteboard Walkthrough - Apache Mesos vs Hadoop YARN
In this week's Whiteboard Walkthrough, Jim Scott, Director of Enterprise Strategy and Architecture at MapR, walks you through the basics of Mesos, how the scheduler varies from YARN, and why one is better for global resource management than the other.
 
Video
Whiteboard Walkthrough - Append-only vs. Read-Write File System
Jim Scott, Director of Enterprise Strategy and Architecture at MapR, explains how an append-only versus a random read-write capable file system impacts downstream projects. He uses HBase, one of the most well-known applications that run on top of HDFS, to demonstrate this concept.
 
Video
Whiteboard Walkthrough - HBase Key Design with OpenTSDB
In this week's Whiteboard Walkthrough, Jim Scott, Director of Enterprise Strategy and Architecture at MapR, walks you through HBase key design with OpenTSDB.
 
Video
Whiteboard Walkthrough - Handling Disk Failure in MapR FS
Abizer Adenwala, Technical Support Engineer at MapR, walks you through what a storage pool is, why disks are striped, why a disk might be marked as failed, what happens when a disk is marked as failed, what to watch out for before reformatting or re-adding a disk, and the best path to recover from disk failure.
 
Video
Whiteboard Walkthrough - Container Location Databases (CLDB) vs. NameNode
Jon Allen, Instructional Designer at MapR, walks you through how the CLDB differs from the NameNode in standard Hadoop.
 
Demo
Multi-Tenancy in MapR Overview - Demo
The MapR Distribution features true multi-tenancy for Hadoop, including the ability to manage storage, I/O, CPU and jobs according to policies. This video gives an overview of multi-tenancy and an example demo with simulated users from the "retail" and "trading" lines of business within a company, using the same cluster.
 
Video
Whiteboard Walkthrough - How to Configure the Network for the MapR Sandbox for Hadoop
 
Demo
Tableau Analytics on MongoDB using Apache Drill - Demo
This video example walks through how to query and analyze data with the MongoDB storage plugin for Apache Drill.
 
Demo
Gaining insight with Drill and MicroStrategy Analytics Desktop - Demo
Drill enables self-service data discovery with data stored in Hive, JSON, CSV and other formats. This shows how to use Drill with MicroStrategy and a demo of solving a sales problem.
 
Demo
Analyzing JSON and packet data with SAP Lumira and Apache Drill - Demo
This video demo shows an example scenario where JSON and packet capture data (from Wireshark) can be analyzed together in SAP Lumira using the Drill ODBC driver. No transformations or schemas are required -- the data itself can be examined.
 
Video
Teradata and MapR announce partnership
Scott Gnau, President, Teradata Labs and John Schroeder, CEO, MapR discuss the Teradata-MapR partnership
 
Video
Apache Drill: Redefining SQL-on-Hadoop
Multiple data sources. One SQL Query Engine. See how Apache Drill is used with the latest visualization tools to query multiple data sets from your big data platform.
 
Video
MapR and the Easiest Access to Hadoop Data
See how MapR provides the easiest access to your data in Hadoop.
 
Video
Hunk MapR Setup
A step-by-step guide to setting up the MapR Hadoop VMware image with Hunk (Splunk Analytics for Hadoop).
 
Demo
Self-Service SQL Exploration with MongoDB and Drill - Demo
Apache Drill queries against MongoDB data
 
Video
Whiteboard Walkthrough - Spark Streaming vs. Storm Trident
Abhinav Chawade, Data Engineer at MapR, gives an introduction for people who are wondering which stream or real time data processing framework to use. The video offers some comparison points between Storm Trident and Spark Streaming.
 
Video
Jack Norris (MapR) Interview -- Strata + Hadoop 2014
From the 2014 Strata Conference + Hadoop World in New York City, MapR's Chief Marketing Officer discusses the company's Apache Drill project, the idea of data gravity, and how the industry will change within the next year.
 
Demo
Comparing MapR-FS and HDFS NFS snapshots - Demo
This demo by Bruce Penn, Principal Solution Architect at MapR, compares NFS and Snapshots between MapR FS and HDFS.
 
Video
Recommendation Systems on MapR
Learn how MapR can be used to make recommendations to your customers on products/services that are likely most interesting to them.
 
Video
Detecting Anomalies Using MapR
Learn how MapR can be used to detect phishing attacks on your secure website to protect your business and your customers.
 
Video
WATF - What Are The Facts - NFS
Jack Norris of MapR talks to Donnie Berkholz, PhD, of RedMonk about NFS
 
Video
WATF - What Are The Facts - Snapshots
Jack Norris of MapR talks to Donnie Berkholz, PhD, of RedMonk about Snapshots
 
Video
WATF - What Are The Facts - Disaster Recovery
Jack Norris of MapR talks to Donnie Berkholz, PhD, of RedMonk about Disaster Recovery (DR)
 
Video
John Schroeder, MapR CEO, explains the MapR approach
 
Video
Dave Normandeau, Big Data Alliance Manager, Syncsort
 
Video
Best Practices for Hadoop in Production: Panel Discussion with MapR Customers
MapR customers, Cisco, Rubicon Project, Solutionary, and Climate Corporation talk about their experience with Hadoop in production using the MapR distribution for Hadoop with Forrester analyst, Mike Gualtieri.
 
Video
Chris Selland, VP of Marketing & Business Development, HP Vertica
 
Video
Brad Nelson, Technical Sales, Alpine Data Labs
 
Video
Jeff Hartson, VP of Business Development, LucidWorks
 
Video
Martin Hack, CEO & Co-Founder of Skytree
 
Video
Tom Aliotti, Senior Vice President, Global Sales, Dataguise
 
Video
Steve Garrou, VP of Global Solutions Management, CenturyLink
 
Video
Reena Tiwari, Senior IT Manager, Cisco
 
Video
Kandarp Desai, Sr Manager of Engineering, Xactly
 
Video
Jan Gelin, VP, Engineering and Chief Systems Architect, Rubicon Project
 
Video
Andy Sautins, CTO of Return Path
 
 
Video
Scott Russmann, Director of Software Development, Solutionary
 
Video
Mike Brown, CTO of comScore
 
 
Video
John Schroeder, MapR CEO at Strata 2014: "Hadoop in 5 Minutes or Less"
John Schroeder, MapR CEO and co-founder, talks about Hadoop at Strata 2014.
 
Video
John Schroeder, MapR and Colin Mahony, HP Vertica at BigDataSV 2014
Colin Mahony, HP Vertica & John Schroeder, MapR at BigDataSV 2014 with John Furrier and Dave Vellante
 
Video
Jack Norris, MapR CMO, speaks at BigDataSV 2014 - theCUBE
Jack Norris, CMO, MapR, at BigDataSV 2014 with John Furrier and Dave Vellante
 
Video
Customer Video: Saum Mathur, Global CIO, HP Software
 
Video
MapR Snapshots and the Big Data Ecosystem
Jack Norris discusses a few items that are important for the development of the Hadoop ecosystem and what he hopes to see in 2014.
 
Tech Brief
HP Reference Architecture for MapR M7
This white paper provides several performance-optimized configurations for deploying the MapR M7 distribution of Apache Hadoop on HP infrastructure, in clusters of varying sizes, that significantly reduce complexity and increase value and performance.
 
Video
MapR Heatmaps
 
Video
MapR Consistent Snapshots
 
Video
John Schroeder, CEO, MapR on Google Compute Engine at Google I/O 2012
 
Video
MinuteSort Record using MapR with Google Compute Engine
 
Video
Read-write NFS Access
 
Video
MapR Snapshots
 
Video
MapR Makes Hadoop Highly Available
 
Video
Separating Hadoop Myths from Reality
Jack Norris, MapR CMO, talks about Hadoop myths vs realities in his keynote at Strata+HadoopWorld.
 
Video
Hadoop’s Power to Transform Business
Learn how Hadoop allows organizations to better leverage data to improve business results and gain a competitive edge.
 
Video
MapR on Google Compute Engine
The combination of MapR's Distribution for Hadoop and Google enables customers to quickly provision large MapR clusters on demand and take advantage of a cloud-based solution.
 
Video
John Schroeder, at Google I/O
MapR is becoming the de facto standard for Hadoop in the cloud. John Schroeder, MapR CEO, discusses the benefits of MapR's Hadoop distribution on Google Compute Engine at Google I/O.
 
Video
MapR NFS
MapR NFS: A radically simpler way to get your data out of a Hadoop cluster.
 
Video
MapR Snapshots
MapR's snapshot capability uniquely builds data protection into the fundamental architecture of our Hadoop distribution. Recover to a given point in time with a simple file copy.
 
Video
MapR: Faster with less hardware
With MapR your jobs will run faster with less hardware than required by other Apache Hadoop distributions.
 
Video
MapR Introduction
CEO and Co-Founder John Schroeder discusses MapR's enterprise Hadoop distribution for big data analytics.
 
Video
MapR: The Answer
Learn why MapR is the most open, enterprise-grade distribution for Hadoop. MapR is The Answer for all of your Hadoop and Big Data questions.
 
Video
MapR-Terasort Record
MapR Technologies set a new world record for the Apache Hadoop TeraSort benchmark on a 1003-node cluster running on Google Compute Engine on the Google Cloud Platform.
 
Tech Brief
RHadoop and MapR
RHadoop is an open source collection of three R packages created by Revolution Analytics that allow users to manage and analyze data with Hadoop from an R environment. It allows data scientists familiar with R to quickly utilize the enterprise-grade capabilities of the MapR Hadoop distribution directly with the analytic capabilities of R.
 
Tech Brief
IBM System x Reference Architecture for Hadoop: MapR
The MapR-validated reference architecture solution from IBM for Hadoop big data analytics is built around powerful, affordable, scalable System x servers and IBM networking solutions so you can deploy your MapR-validated solution more quickly.
 
Tech Brief
Cisco UCS CPA for Big Data with MapR
As part of the Cisco Validated Design program, consisting of systems and solutions designed, tested, and documented to facilitate faster, more reliable, and more predictable customer deployments, this document is intended to assist solution architects, sales engineers, field consultants, professional services, IT managers, partner engineering and customers in deploying MapR on the Cisco Common Platform Architecture (CPA) for Big Data.