Академический Документы
Профессиональный Документы
Культура Документы
Ever since the term “data scientist” came onto the tech scene, there’s
been a cross-generational debate raging, attempting to define and
distinguish newly branded data scientists and traditional statisticians. I
personally adopted the data scientist title around 2012, and I recall a
rather pithy definition float across the Twittersphere around this time:
It seems that the designation “data scientist” has taken the world by
storm. It’s a title that conjures up almost mystical abilities of a person
https://medium.com/@ODSC/data-scientists-versus-statisticians-8ea146b7a47f 1/6
19/6/2019 Data Scientists Versus Statisticians – ODSC - Open Data Science – Medium
garnering information from deep data lakes with ease. It comes from a
belief that a data scientist can wave his or her hand like a 21st century
Houdini and effortlessly extract insights from the data.
What’s intriguing about the field of data science is its perceived threat
to other disciplines, specifically statistics. I don’t see this threat as real
however as the two fields are quite distinct and complementary. In the
past decade, it’s clear that though the two fields can exist separately on
their own, each is weak without the other. Statisticians need to
understand the modeling and structure of data, while data scientists
need to understand applied statistics.
Data scientists on the other hand, closely follow the “data science
process” that is more approachable; data ingest, data transformation,
exploratory data analysis, model selection, model evaluation, and data
storytelling. Sure, many of these steps follow statistical methods
behind the scene, but they’re sealed in a more engaging and
understandable wrapper. Many more people can embrace data science.
https://medium.com/@ODSC/data-scientists-versus-statisticians-8ea146b7a47f 2/6
19/6/2019 Data Scientists Versus Statisticians – ODSC - Open Data Science – Medium
[Related article: What Will the Next Generation of Data Scientists Look
Like?]
https://medium.com/@ODSC/data-scientists-versus-statisticians-8ea146b7a47f 3/6
19/6/2019 Data Scientists Versus Statisticians – ODSC - Open Data Science – Medium
Conclusion
Given time, the fields of data science and statistics likely will converge
to a common end-point. Statisticians have gone about gathering data
and performing analysis techniques like linear regressions for several
centuries. Eventually, as more statisticians pick up on skills like
implementing algorithms that learn from data, and provide predictions
and actions and more data scientists pick up on statistical science
(sampling, experimental design, confidence intervals, p-values, etc.)
the boundary between data scientists and statisticians will eventually
blur.
. . .
https://medium.com/@ODSC/data-scientists-versus-statisticians-8ea146b7a47f 4/6
19/6/2019 Data Scientists Versus Statisticians – ODSC - Open Data Science – Medium
https://medium.com/@ODSC/data-scientists-versus-statisticians-8ea146b7a47f 5/6
19/6/2019 Data Scientists Versus Statisticians – ODSC - Open Data Science – Medium
https://medium.com/@ODSC/data-scientists-versus-statisticians-8ea146b7a47f 6/6