Академический Документы
Профессиональный Документы
Культура Документы
• This webinar is being recorded. Later this week, you will receive
an email on how to get the recording and slide deck.
• If you have any audio problems, please let us know in the chat
window and we’ll try to resolve them quickly.
• If you have any questions during the webinar, please type them in
the chat window.
Dale Kim
Sr. Director, Industry Solutions
MapR Technologies
Alex Gorelik
Founder and CEO
Waterline Data
Data Lake
Alex Gorelik
Founder and CEO, Waterline Data
Waterline Data Overview
Alex Gorelik Oliver Claude Jason Chen Ravi Ramachandran Mohan Sadashiva
Founder, CEO Marketing Engineering Sales Product
Founded Exeros (IBM) VP SAP, VP Informatica, VP Teradata, Acta, CSC Infochimps, AppLabs, Narus (Boeing), Intel,
and Acta (SAP), IBM DE, IBM Siebel, Nova Sybase. USC PhD CS. Xchanging. Scient-Razorfish. Synchronoss, Trimble
Informatica GM, MSCS Southeastern MS MIS MBA Clark, BS Delhi University. Navigation. MBA Columbia,
Stanford, Columbia BSCS MSCS Queens University
Healthcare Insurance
Fortune 500 Fortune 500 Health Insurer
Healthcare Provider & Global Insurer
Government Automotive
Government Agency in EMEA Leading US Vehicle
Remarketing Provider
Consumer Marketing
Leading Market Research Firm in EMEA
Data Lakes Power Data Driven Decision Making
Business Value
Data
Data Lake
Data Puddles
Warehouse
Data Off-loading
Swamp
Limited Scope
No Value Cost Savings Enterprise Impact
and Value
Value
Data Swamps
Raw data
Data Exhaust
Clean, trusted,
prepared data
Raw data
Clean, trusted,
prepared data
Raw data
Clean, trusted,
prepared data
Raw data
Find and
Understand
Provision Prep Analyze
Finding, understanding and governing data in a data lake
is like shopping at a flea market
“We have 100 million fields of data – how can anyone find or trust
anything?” – AT&T Executive
I need data to use with
I can’t govern and trust I can’t inventory all
self-service tools but I
the data (metadata, the data manually and
can’t explore everything
data quality, PII, data keep up with data
manually to find and
lineage) provisioning
understand it
Inventory
GOVERNANCE
Waterline Data is like Amazon for Data in Hadoop
– an Enterprise Data Marketplace
Provision
Inventory
GOVERNANCE
Find and
Finding and Understanding Data Understand
Tooling
• Many great dedicated data wrangling tools on the horizon
• Some capabilities in BI/data visualization tools
• SQL and scripting languages for the more technical analysts
Data Analysis Analyze