Вы находитесь на странице: 1из 11

Real-time analytics with Azure

Databricks
Agenda

1. Real-time analytics scenarios

2. Real-time analytics with Azure Databricks


Real-time analytics scenarios
Real-time analytics scenarios

AdTech - Real time actionable insights on Ad by joining impressions and clicks


through data.

Telco - Real time analysis of data for Fraud Detection.

Industrial Automation - Preventative Maintenance in Real time. Applying ML models to


incoming real time data to detect and fix problems before they happen.
Real-time analytics with Azure
Databricks
What is Azure Databricks ?

Azure Databricks is a fast, easy, and collaborative Apache Spark-based analytics service. For a big data pipeline, the
data (raw or structured) is ingested into Azure through Azure Data Factory in batches, or streamed near real-time
using Kafka, Event Hub, or IoT Hub. This data lands in a data lake for long term persisted storage, in Azure Blob
Storage or Azure Data Lake Storage. As part of your analytics workflow, use Azure Databricks to read data from
multiple data sources.
Introduction to Spark

Spark Unifies:
▪ Batch Processing
▪ Interactive SQL
▪ Real-time processing
▪ Machine Learning
▪ Deep Learning
▪ Graph Processing
Architecture Azure Databricks

INGEST STORE PREP & TRAIN MODEL & SERVE

Cosmos DB –
Spark Connector
Azure Cosmos
DB Intelligent Apps

(ML Scoring with


Spark ML, SparkR,
SparklyR)
Logs, files and Event Hub
media IoT Hub Real-time
(unstructured) HDInsight Analysis
(Kafka) Azure Databricks Power BI
(Stream Processing
with Spark Structured
Sensors and Streaming)
IoT
(unstructured
)

Azure Blob
Storage
Thank You !

Вам также может понравиться