Академический Документы
Профессиональный Документы
Культура Документы
Scientist/Analyst?
Already crowned as the best job in America for 2016, the definition and skill set required to be a
data scientist is in a constant state of flux. Advancements in technology and business demand
drive its evolution in an ever-changing industry. In this article, we take a closer look at the role
of a Data Scientist in 2016.
Dave Holtz writes that the title 'data scientist' is often used as a blanket title to describe a set of
jobs that are drastically different. He attributes this to the fact that the field of data science is still
in its infancy and so is ill-defined. Adopting the all-encompassing sub-title of being part of an
interdisciplinary field, a data scientist works to extract knowledge or insights from large
volumes of data in various forms.
The age of big data is upon us, and its here to stay. With more data being collected than ever
before, extracting value from this data is only going to become more intricate and demanding as
time goes on. The logic behind the big data economy is shaping our personal lives in ways that
we probably cant even conceive or predict; every electronic move that we make produces a
statistic and insight into our life.
As participants in the consumer economy, we are mined for data when we connect to any
website or electronic service, and a data scientist is there to collect, clean, analyse and predict the
data that we provide by using a combination of computer science, statistical analysis and
intricate business knowledge.
The following diagram shows the skillsets required for a Data Scientist. As we can see, this
responsibility is a combination of multiple skillsets and expertise compared to a typical Big Data
Developer or Business Analyst.
The success of business networking site LinkedIn is a prime example of the crucial benefit that
data scientists are bringing to business intelligence. As an enterprise that relies almost solely on
the data transferred by its 380,000,000 users making connections with each other, LinkedIn is
utilising those professionals with the training and curiosity to make discoveries in the world of
big data.
LinkedIn, alongside other large knowledge industries such as Facebook and Google, is utilising
the role of data scientists to bring structure to large quantities of formless data and to determine
significance in its value, and systematic relationships between the variables.
A recent survey of C-suite executives by KPMG found 99% of respondents thought analysis of
big data was important to their strategy next year. In an age where enterprise data is expected to
exceed 240 exabytes per day by 2020, the need for data scientists with the skills to extract
valuable insights from this data is more important than ever. . However, an article by Travis
Wright for Venture Beat suggests that demand for data scientists is very much outstripping
supply and that companies in the United States alone will need to hire between 140,000
190,000 data scientists if they are to keep up with the new data economy.
Ironically, there is a great deal of conflicting data on the average salary for a data scientist,
however, what is clear is that the average salary does tend to be inherently concurrent with the
high demand level for data scientists. Not surprisingly, if employers are asking candidates to be
experienced with data mining algorithms, able to work comprehensively in languages like R and
Python, experienced in working with large databases (SQL or similar), implementing Java
applications, manipulating NoSQL databases (to quote about 10% of a job specification) all
with the ability to communicate all of this to a non-technical audience, an average salary of about
$120,000 doesnt seem too far fetched.
Successful data analytics rely on one being able to clean, integrate and transform the data and
this is the crucial combination of skills all data scientists must possess. By combining a scientific
background with computational and analytical skills, you can put yourself a cut above the rest.
Figure 3 below shows the several areas of focus for typical data science discipline.
But lets dig deeper into the actual skills required to become a data scientist. Mark van
Rijmenam, CEO at Data Floq, recommends that data scientists possess the following skills:
statistical, mathematical and ethical, as well as a high degree of predictive modelling experience
in order to build the algorithms necessary to ask the right questions and find the right answers.
Ferris Jumah from LinkedIn goes further to neatly group the skills required, despite the huge
array of skills and different job roles a data scientist might perform.
A data scientist must:
Look at data with a mathematical mind-set. Learning skills such as machine learning,
data mining, data analysis and statistics are crucial. A data scientist will need to interpret
and represent data mathematically.
Use a common language to access, explore and model data. Knowledge of a statistical
programming language will be critical. Languages like R, Python or MATLAB, and a
database querying language like SQL are some of the most popular skills in demand.
Data extraction, exploration and hypothesis testing are central to the data science
practice.
Develop strong computer science and software engineering backgrounds. This
involves developing a skill set which could include Java, C++ or knowledge of
algorithms and Hadoop. These skills will be used to leverage data to architect systems.