Академический Документы
Профессиональный Документы
Культура Документы
Before class even begins, students start an at-home pre-work phase. When they convene in class, students spend the rst eight
weeks doing iterative, project-centered skill acquisition. Over the course of five data science projects, they develop skills across
key aspects of data science, and results from each project are added to the students' portfolios. In the last four weeks, students
build out and complete their individual final projects, culminating in a presentation of their work to representatives from the Metis
Hiring Network.
ONLINE PRE-WORK
Students work through a curated collection of tutorials that cover the basics
so they can hit the ground running. First, they're guided through initial
software setup. Introductory materials then start with productivity at the
command line, using an editor effectively, and becoming familiar with Python
basics. Students reinforce their statistics knowledge through a set of readings
with exercises that start to blend the statistical and computational. Metis
teaching assistants review these preparatory exercises and provide feedback
online.
INSTALLING PACKAGES
COMMAND LINE
CODE EDITOR
PYTHON
STATISTICS
WEEK 1
UNIT ONE
Introduction to the Data Science Toolkit
Review probability and statistics, including distributions, bootstrapping, hypothesis testing, maximum
likelihood estimation, and Bayes theorem (This review
spans the first three weeks.)
Use UNIX, Git, and IPython to organize data science
project resources
Load and manipulate data with the pandas Python
package
Visualize results using the matplotlib Python package
Communicate data science results
PRO J EC T O5
PRO J EC T O3
PRO J EC T O4
P ROJ E CT O 1
PR OJ EC T O 2
CODENAME
LUTHER
CODENAME
MCNULTY
CODENAME
CODENAME
CODENAME
BENSON
FLETCHER
KOJAK (AKA PASSION PROJECT)
first
pass
atwork
machine
learning,
students
dive
prediction
Students
form
small
groups
that
each
work
as
an internal
data
science
team
at awith
fictional
In theFor
firstthe
week,
Thestudents
last
Students
guided
are
project
in
free
small
to
focuses
use
groups
anything
on
using
unsupervised
covered
MTA
turnstile
in deep
class
learning
data
orinto
toand
learn
estimate
NLP
something
algorithms,
the
new
NoSQL
to answer
regression
models.
They
experience
the
beauty
of
at
les,
and
learn
to scrape
informain the
insurance
industry
(details
are
leftwork
to and
the
students
to
determine).
volume
ofcompany
people
databases,
the
on
questions
and
street,
APIthey
sodata
that
want
collection.
(theoretical)
to
address.
Students
nonprofits
Some
students
individually
companies
know
what
and
can
will
have
be very
theirfew
passion
tionstreet
fromconstraints
websites
using
tools
likeofPython
Requests,
Beautiful
Soup,
and
Selenium.
Supervised
learning
algorithms
and
relational
databases
been
covered
in class.
deploy
teams
project
efficiently.
for
atthe
the
The
design
admissions
students
thisstage.
are
project.
provided
Others
with
embark
thehave
data
on
entirely
and
guided
new turf.
Every student
Afterexploratory
scrapingworks
together
some
movie
box office
data,
students
and
scrape
more
Students
work
on
their
own
classification
models
that
within
overall
goals
of the
through
data
intensely
analysis
and
and
challenges
plotting
so
him
they
or herself
canfitfocus
to find
create
onthe
new
something
tools,
cool,
interesting,
resources
on
own
and
present
their movie
industry
regression
predictions
to the
company
and
theorteam.
During McNulty,
students
perform
a deep
dive into the
visualibrainstorming,
andtheir
useful,
communication.
worthwhile.
class.zation package D3 and create their own APIs on the Python Flask micro framework to
serve data from their databases to their visualizations.
thisismetis.com
WEEK 2
WEEK 3
PRO J EC T O5
PRO J EC T O3
PRO J EC T O4
P ROJ E CT O 1
PR OJ EC T O 2
CODENAME
LUTHER
CODENAME
MCNULTY
CODENAME
CODENAME
CODENAME
BENSON
FLETCHER
KOJAK (AKA PASSION PROJECT)
first
pass
atwork
machine
learning,
students
prediction
Students
form
small
groups
that
each
work
as
an dive
internal
data
science
team
at awith
fictional
In theFor
firstthe
week,
Thestudents
last
Students
guided
are
project
in
free
small
to
focuses
use
groups
anything
on
using
unsupervised
covered
MTA
turnstile
in deep
class
learning
data
orinto
toand
learn
estimate
NLP
something
algorithms,
the
new
NoSQL
to answer
regression
models.
experience
beauty
of left
at
les,
and
learn
scrape
in questions
insurance
industry
(details
are
to
the
students
determine).
volume
ofcompany
people
databases,
the
on
theThey
and
street,
API
they
sodata
that
want
collection.
(theoretical)
tothe
address.
Students
Some
nonprofits
students
work
and
individually
know
companies
whatto
and
will
can
be
have
their
very
finalfew
project
information
from
web
sites
using
tools
likeprovided
Python
Requests,
Beautiful
Soup,
and
Supervised
learning
algorithms
and
relational
databases
have
been
in class.
deploy
street
constraints
teams
at efficiently.
the
for
admissions
the
The
design
students
stage.
of
thisOthers
are
project.
embark
with
on the
entirely
data
new
and covered
guided
turf.
Every
student works
Selenium.
After
scraping
some
boxtothat
office
data,
students
find
andof the useful, or
Students
work
on their
ownand
classification
models
fitfocus
within
thenew
overall
goals
through
exploratory
intensely
data
analysis
andtogether
challenges
plotting
himmovie
or herself
so
they
can
create
something
on
cool,
tools,
interesting,
scrape
more
on their
own
and present
their
movie
industry
company
and
the team.
During
McNulty,
students
perform
a deep
diveregression
into the visualibrainstorming,
andresources
worthwhile.
communication.
predictions
to the class.
zation package
D3 and create their own APIs on the Python Flask micro framework to
serve data from their databases to their visualizations.
thisismetis.com
WEEK 4
WEEK 5
WEEK 6
PRO J EC T O5
PRO J EC T O3
PRO J EC T O4
P ROJ E CT O 1
PR OJ EC T O 2
CODENAME
LUTHER
CODENAME
MCNULTY
CODENAME
CODENAME
CODENAME
BENSON
FLETCHER
KOJAK (AKA PASSION PROJECT)
first
pass
atwork
machine
learning,
students
prediction
Students
form
small
groups
that
each
work
as
an dive
internal
data
science
team
at awith
fictional
In theFor
firstthe
week,
Thestudents
last
Students
guided
are
project
in
free
small
to
focuses
use
groups
anything
on
using
unsupervised
covered
MTA
turnstile
in deep
class
learning
data
orinto
toand
learn
estimate
NLP
something
algorithms,
the
new
NoSQL
to answer
regression
models.
They
experience
the(details
beauty
of left
atwork
les,
and
learn
scrape
in the
insurance
industry
are
to
the
students
to
determine).
volume
ofcompany
people
databases,
the
on
questions
and
street,
API
they
sodata
that
want
collection.
(theoretical)
to
address.
Students
nonprofits
Some
students
and
individually
companies
know
what
and
can
will
have
be very
theirfew
passion
information
from
web
using
tools
likeprovided
Python
Requests,
Beautiful
Soup,
and
Supervised
learning
algorithms
and
relational
databases
been
in class.
deploy
street
constraints
teams
project
efficiently.
for
atsites
the
the
The
design
admissions
students
of
thisstage.
are
project.
Others
with
embark
thehave
data
on
entirely
and covered
guided
new turf.
Every student
Selenium.
After
scraping
together
some
movie
box
office
data,
students
find
and
Students
work
on
their
own
classification
models
that
within
thenew
overall
goals
of the
through
exploratory
works
data
intensely
analysis
and
and
challenges
plotting
so
himthey
or herself
canfitfocus
to create
on
something
tools,
cool,
interesting,
scrape
more
on their
own
and present
their
movie
industry
company
and
theorteam.
During
McNulty,
students
perform
a deep
diveregression
into the visualibrainstorming,
andresources
useful,
communication.
worthwhile.
predictions
to the class.
zation package
D3 and create their own APIs on the Python Flask micro framework to
serve data from their databases to their visualizations.
thisismetis.com
WEEK 7
The project for the fourth unit involves text data. Students
round out data acquisition methods with APIs and online
database servers. Students also learn about NoSQL
databases and start using MongoDB.
WEEK 8
PRO J EC T O5
PRO J EC T O3
PRO J EC T O4
P ROJ E CT O 1
PR OJ EC T O 2
CODENAME
LUTHER
CODENAME
MCNULTY
CODENAME
CODENAME
CODENAME
BENSON
FLETCHER
KOJAK (AKA PASSION PROJECT)
first
pass
atwork
machine
learning,
students
prediction
Students
form
small
groups
that
each
work
as
an dive
internal
data
science
team
at awith
fictional
In theFor
firstthe
week,
Thestudents
last
Students
guided
are
project
in
free
small
to
focuses
use
groups
anything
on
using
unsupervised
covered
MTA
turnstile
in deep
class
learning
data
orinto
toand
learn
estimate
NLP
something
algorithms,
the
new
NoSQL
to answer
regression
models.
They
experience
the(details
beauty
of left
atwork
les,
and
learn
scrape
in the
insurance
industry
are
to
the
students
to
determine).
volume
ofcompany
people
databases,
the
on
questions
and
street,
API
they
sodata
that
want
collection.
(theoretical)
to
address.
Students
nonprofits
Some
students
and
individually
companies
know
what
and
can
will
have
be very
theirfew
passion
information
from
web
using
tools
likeprovided
Python
Requests,
Beautiful
Soup,
and
Supervised
learning
algorithms
and
relational
databases
been
in class.
deploy
street
constraints
teams
project
efficiently.
for
atsites
the
the
The
design
admissions
students
of
thisstage.
are
project.
Others
with
embark
thehave
data
on
entirely
and covered
guided
new turf.
Every student
Selenium.
After
scraping
together
some
movie
box
office
data,
students
find
and
Students
work
on
their
own
classification
models
that
within
thenew
overall
goals
of the
through
exploratory
works
data
intensely
analysis
and
and
challenges
plotting
so
himthey
or herself
canfitfocus
to create
on
something
tools,
cool,
interesting,
scrape
more
on their
own
and present
their
movie
industry
company
and
theorteam.
During
McNulty,
students
perform
a deep
diveregression
into the visualibrainstorming,
andresources
useful,
communication.
worthwhile.
predictions
to the class.
zation package
D3 and create their own APIs on the Python Flask micro framework to
serve data from their databases to their visualizations.
thisismetis.com
WEEKS 9-12
UNIT FIVE
Final Project
PRO J EC T O5
PRO J EC T O3
PRO J EC T O4
P ROJ E CT O 1
PR OJ EC T O 2
CODENAME
LUTHER
CODENAME
MCNULTY
CODENAME
CODENAME
CODENAME
BENSON
FLETCHER
KOJAK (AKA PASSION PROJECT)
first
pass
atwork
machine
learning,
students
prediction
Students
form
small
groups
that
each
work
as
an dive
internal
data
science
team
at awith
fictional
In theFor
firstthe
week,
Thestudents
last
Students
guided
are
project
in
free
small
to
focuses
use
groups
anything
on
using
unsupervised
covered
MTA
turnstile
in deep
class
learning
data
orinto
toand
learn
estimate
NLP
something
algorithms,
the
new
NoSQL
to answer
regression
models.
experience
beauty
of left
at
les,
and
learn
scrape
in questions
insurance
industry
(details
are
to
the
students
determine).
volume
ofcompany
people
databases,
the
on
theThey
and
street,
API
they
sodata
that
want
collection.
(theoretical)
tothe
address.
Students
Some
nonprofits
students
work
and
individually
know
companies
whatto
and
will
can
be
have
their
very
finalfew
project
information
from
web
sites
using
tools
likeprovided
Python
Requests,
Beautiful
Soup,
and
Supervised
learning
algorithms
and
relational
databases
have
been
in class.
deploy
street
constraints
teams
at efficiently.
the
for
admissions
the
The
design
students
stage.
of
thisOthers
are
project.
embark
with
on the
entirely
data
new
and covered
guided
turf.
Every
student works
Selenium.
After
scraping
some
boxtothat
office
data,
students
find
andof the useful, or
Students
work
on their
ownand
classification
models
fitfocus
within
thenew
overall
goals
through
exploratory
intensely
data
analysis
andtogether
challenges
plotting
himmovie
or herself
so
they
can
create
something
on
cool,
tools,
interesting,
scrape
more
on their
own
and present
their
movie
industry
company
and
the team.
During
McNulty,
students
perform
a deep
diveregression
into the visualibrainstorming,
andresources
worthwhile.
communication.
predictions
to the class.
zation package
D3 and create their own APIs on the Python Flask micro framework to
serve data from their databases to their visualizations.
thisismetis.com