Академический Документы
Профессиональный Документы
Культура Документы
aka.ms/latamai
Agenda
• Best Practices
• Predictions/trends
• AI on Azure
Agenda
• Best Practices
• Predictions/trends
• AI on Azure
Not Best Practices – Canonical Cavestatistician
kill data
Best Practices – working with a team
• Working as a team means working together
• Pick a framework and use it
• CRISP-DM
• Microsoft Team Data Science Process
Best Practices - Frameworks
Best Practices - Frameworks
Best Practices – Source Control
• Key for collaboration
• The company is paying for these assets, make sure they're
usable
• Not always a normal skill for a statistician
• Some options
• Git
• Visual Studio Team Foundation Services/Server -> Azure DevOps
Best Practices – Source Control
Best Practices – Org Chart
• Should team include Data Engineering?
• Where should Analytics reside:
• IT: bias towards hardware or DevOps?
• LOB: multiple Analytic groups which all need to be linked
by a COP
• Analytics: need close relationship with the business for
domain expertise
Best Practices – building a team
• Talent level
• Headcount cost
• Level of resource needed
• Grow your own?
Best Practices – building a team
• Backgrounds
• all analytics/stats?
• size of team
• Roles
• business analysts, data engineers, developers, architects, machine learning
engineers, DevOps specialists, compliance specialists, security professionals
• Strengths
• Sales
• Consulting
Best Practices – building a team
• Languages/tools
• Data Engineering
• Front end/consumption
Best Practices – leveraging data
Best Practices – leveraging data (data protection)
• PII information
• GDPR
• bias blog post
https://en.wikipedia.org/wiki/List_of_data_breaches
Best Practices – prepping data
Before we begin remember:
• Think about:
(1) Value
(2) Difficulty
(3) Time
Best Practices - Operationalization
• The best models are meaningless if you don't
do something with them
• Model scores can be used directly, but think
about decisioning (where we tie in with rules
engines or optimization)
Agenda
• Best Practices
• Predictions/trends
• AI on Azure
Predictions/trends - AI
Not everything is AI, BUT it is truly disruptive
https://www.telegraph.co.uk/technology/2016/03/24/microsofts-teen-girl-ai-turns-into-a-hitler-loving-sex-robot-wit/
AI has great potential, but also risk
https://www.cnet.com/news/what-happens-when-ai-bots-invent-their-own-language/
AI has great potential, but also risk
https://www.propublica.org/article/facebook-enabled-advertisers-to-reach-jew-haters
AI has great potential, but also risk
https://www.reuters.com/article/us-amazon-com-jobs-automation-insight/amazon-scraps-secret-
ai-recruiting-tool-that-showed-bias-against-women-idUSKCN1MK08G
AI has great potential, but also risk
https://www.techrepublic.com/article/google-home-mini-spied-on-user-thousands-of-times-a-day-sent-recordings-to-google/
Predictions/trends – Open Source
The battle has won been won;
the empire is still growing
Usage options
• Auto-scale / Serverless /
PaaS
Modeling
Predictions/trends - Automation
• Tracking changes to input stream
• Model management
• Push inference to event
Custom
Pre-built
Training |
Deployment
Hardware
AI
Azure Conversational
Agents
Bots
Search
On-premises Edge
Architectural Patterns
The Azure Data/Analytics Landscape
AZURE
AZURE AZURE IMPORT AZURE SQL DB AZURE COSMOS DB AZURE SQL DATA WAREHOUSE POWER BI
ANALYSIS SERVICES
DATA FACTORY EXPORT SERVICE
AZURE EXPRESSROUTE AZURE AZURE NETWORK AZURE KEY OPERATIONS AZURE FUNCTIONS
VISUAL STUDIO
ACTIVE DIRECTORY SECURITY GROUPS MANAGEMENT SERVICE MANAGEMENT SUITE
The Azure BIG Data/Analytics Landscape
AZURE
AZURE AZURE IMPORT AZURE SQL DB AZURE COSMOS DB AZURE SQL DATA WAREHOUSE POWER BI
ANALYSIS SERVICES
DATA FACTORY EXPORT SERVICE
AZURE EXPRESSROUTE AZURE AZURE NETWORK AZURE KEY OPERATIONS AZURE FUNCTIONS
VISUAL STUDIO
ACTIVE DIRECTORY SECURITY GROUPS MANAGEMENT SERVICE MANAGEMENT SUITE
DevOps Clients
Management
App Frameworks
& Tools
Databases &
Middleware
Infrastructure
Customers System integrators ISVs Training partners
A D VA N C E D A N A LY T I C S PAT T E R N I N A Z U R E
Performing data collection/understanding, modeling and deployment
APPLICATIONS
SQL DB
r
LOGS, FILES AND MEDIA
(UNSTRUCTURED) DATA LAKE AZURE COSMOS DB SQL DB DATA LAKE ANALYTICS AZURE DATABRICKS HDINSIGHT
SQL DW
STORE STORAGE
AZURE
ANALYSIS
SERVICES DASHBOARDS
BUSINESS / CUSTOM
APPS AZURE CONTAINER SQL Server
(STRUCTURED) DATA SERVICE (In-database ML)
FACTORY
Batch scoring on Azure for deep learning models
Big Data Real Time Architecture
AZURE DATABRICKS
(Spark ML, SparkR, sparklyr)
AZURE HDINSIGHT
(Kafka)
Polybase
ANALYTICAL DASHBOARDS
Improved text prediction
Deep learning and natural language processing boosts search efficacy and tagging accuracy
SQL
Azure
Microsoft Azure Machine Azure
Machine
SQL Server Learning Kubernetes
Learning
managed Service
deployment
Predictive web
SDK for AML service Jupyter application
Python Notebook
Machine learning
model
Case Studies
Drone-based electric grid
inspector powered by deep
learning
Challenge
• Traditional power line inspection services are costly
• Demand for low cost image scoring and support for
multiple concurrent customers
• Needed powerful AI to execute on a drone solution
Solution
• Deep learning to analyze multiple streaming data
feeds
• Azure GPUs support Single Shot multibox detectors
• Reliable, consistent, and highly elastic scalability
with Azure Batch Shipyards
eSmart architecture
Data Sources Ingest Prepare Analyze Publish Consume
Azure
10
01
Functions
10
01
Azure
Drone Blob On-prem
collected Azure Blob Azure Cosmos DB command
images Raw storage Batch Contain center
Batch upload inventory
of drone
Cosmos results and
images DB state
Docker Image
changes
DNN contained in
a Docker image
In Development*
Project BrainWave
A Scalable FPGA-powered DNN Serving Platform
vs
Freely Available Imagery Labeled Training Data Inferred Land Cover Map
Real-Time Low-Latency Inferencing with FPGAs
Setup:
800 FPGAs on Azure
195 Million Images; 20TB
Real-time inferencing 1 image at a time
Results: 415K
images/sec
Satellite Images
for the Entire US
NAIP Data
20TB, 200M images
Azure Batch AI
Stored on
10 Visual Studio
01 Azure Premium
Storage Tools for AI
¡Muchas Gracias!