Apache Spark Tour: Westlake Village Data Science
Westlake Village, CA
Tuesday, May 13, 2014

The Westlake Village Data Science group brings together data science enthusiasts and those looking to network with professionals practicing data science.  

The Apache Spark Tour is bringing MapR experts to 10 cities over the next 3 months to talk about Spark and how you can run programs 10-100x faster on Hadoop.


Malware Detection using Spark on MapR

Sungwook Yoon View Bio


Data Science is commonly used to analyze activity and access data to uncover unknown risks. This talk discusses how Spark can be used in MapR for malware detection applications. MapR is an enterprise software company that develops and sells Apache Hadoop-derived software. The company contributes to Apache Hadoop projects like HBase, Pig (programming language), Apache Hive, and Apache ZooKeeper. MapR's Apache Hadoop distribution claims to provide full data protection, no single points of failure, improved performance, and dramatic ease of use advantages. Apache Spark is an open-source data analytics cluster computing framework originally developed in the AMPLab at UC Berkeley. Spark fits into the Hadoop open-source community, building on top of the Hadoop Distributed File System (HDFS).


Sungwook Yoon

Sungwook is a Data Scientist at MapR. Sungwook's data experience includes malware detection algorithms for packet stream analysis, mobile network signaling analysis, social network analysis, job title analysis as well as call center data analysis. Before joining MapR, Sungwook worked as an architect for Seven Networks, a company that delivers device-centric mobile traffic management and analytics for wireless carriers. Previously, Sungwook worked as a Research Scientist at Palo Alto Research Center, where he worked on projects for both DARPA and Xerox. Sungwook's main technical background lies in Artificial Intelligence and Machine Learning. His Artificial Intelligence reserach has been published in top-tier conferences and journals, including AAAI, ICAPS, NIPS, UAI, ICML, JAIR, and JMLR.

Sungwook holds a Ph.D. in Computer Engineering from Purdue University,and M.S.and B.S. degrees in Electrical Engineering from Seoul National University.