Вы находитесь на странице: 1из 27

Leveraging Jupyter to

build an Excel-Python bridge


JupyterCON 2017

Christine Doig, Senior Product Manager & Data Scientist


Fabio Pliger, Tech Lead

August 24th, 2017


© 2016 Continuum Analytics - Confidential & Proprietary
Overview & Examples

2
How to increase by 100x the impact of
data science in your organization?

Data Scientist Business Analysts


Jupyter Users Excel Users
Est. 3-6 million Est. 750 Million

© 2016 Continuum Analytics - Confidential & Proprietary 3


Business analysts vs Data scientists

Works with: Excel, Tableau, SQL Python, Hadoop, Spark


Data: spreadsheets, tables dataframes, arrays
Delivers: Reports, dashboards, Notebooks, code, interactive
spreadsheets visualizations

© 2016 Continuum Analytics - Confidential & Proprietary 4


Business analysts are being left out of the
data science revolution

Interactive Data
Visualizations
Statistics and
Advanced Analytics

Machine Learning
Big Data & ETL

© 2016 Continuum Analytics - Confidential & Proprietary 5 5


Anaconda Fusion is a bridge between Excel & Python

Interactive Data
Visualizations
Statistics and
Advanced Analytics

Machine Learning
Big Data & ETL

© 2016 Continuum Analytics - Confidential & Proprietary 6 6


Analysts and Data Scientists can keep using their
preferred tools

© 2016 Continuum Analytics - Confidential & Proprietary 7 7


Self-service Big Data analytics
Jupyter
notebook

Interactive Data Visualizations Head node

Compute
nodes
Machine Learning Predictions

Extract, transform
and query data

© 2016 Continuum Analytics - Confidential & Proprietary 8


“No Code” Data Science Example

Select Anaconda Select function Click “Run” Data is loaded


1 2 3 4
Fusion Notebook you wish to run into spreadsheet
and click “Upload”

© 2016 Continuum Analytics - Confidential & Proprietary 9


Just change one line of code in your
notebook

© 2016 Continuum Analytics - Confidential & Proprietary 10


Anaconda Fusion use cases

• Extract data - pull data directly into Excel to


perform analysis
• Machine Learning – use trained models
created by Data Scientists and plug them
into your spreadsheet data
• Interactive Visualizations – create custom
advanced interactive graphs, charts and
plots from Excel data Figure: Anaconda Fusion on Mac

• Big Data – analyze, transform, model and


query data stored in Hadoop and Spark

© 2016 Continuum Analytics - Confidential & Proprietary 11


Examples
• Run Queries
• Run Predictive Models
• Running Big Data Text Analytics

© 2016 Continuum Analytics - Confidential & Proprietary 12


© 2016 Continuum Analytics - Confidential & Proprietary 13
© 2016 Continuum Analytics - Confidential & Proprietary 14
© 2016 Continuum Analytics - Confidential & Proprietary 15
Features & architecture

16
Features – Formula bar

© 2016 Continuum Analytics - Confidential & Proprietary 17


Features – Write back to Excel

© 2016 Continuum Analytics - Confidential & Proprietary 18


Features – Interactive visualizations

© 2016 Continuum Analytics - Confidential & Proprietary 19


Features – Save custom advanced visualizations

© 2016 Continuum Analytics - Confidential & Proprietary 20


Jupyter as a Platform
• OSS – base of most successful modern software
• Maturity – long history
• Diversity
• 100s of projects
• 1000s of contributors
• Vision
• Jupyterlab
• Community & Support
• Popularity

© 2016 Continuum Analytics - Confidential & Proprietary 21


Jupyter As a Tech Choice
• The jupyter ecosystem
• https://github.com/jupyter
• https://github.com/jupyterlab
• https://github.com/phosphorjs
• Great community/support
• Very pluggable*
• Perfect for our use case
• I.e.: why can’t excel have ML?
• i.e.: why can’t excel do things that numpy/pandas do?
• i.e.: we need better graphics (ala bokeh ;) ) for a dashboard of our
metrics in excel

© 2016 Continuum Analytics - Confidential & Proprietary 22


Use Case

fusion Fusion
Server
Excel

Notebook

kernels

© 2016 Continuum Analytics - Confidential & Proprietary 23


Use case Excel Apps
fusion

API Anaconda Platform

Notebooks, Apps, …

kernels

© 2016 Continuum Analytics - Confidential & Proprietary 24


A marketplace for Anaconda Fusion notebooks

https://anaconda.org

© 2016 Continuum Analytics - Confidential & Proprietary 25


THANK YOU!

Christine Doig
cdoig@continuum.io
@ch_doig

Fabio Pliger
fpliger@continuum.io
@b_smoke

© 2016 Continuum Analytics - Confidential & Proprietary


QUESTIONS?

© 2016 Continuum Analytics - Confidential & Proprietary

Вам также может понравиться