
Internship Applications

Name : ...
Table of Contents

1. Clue
2. Descriptive
3. Data Cleansing
4. Data Analysis
5. Data Storytelling
6. Structured Thinking
Clue
At Datanest, we believe problem solving is the key to data science
work: the ability to resourcefully solve a business problem through data
matters more than any isolated technical skill.

Please solve the problems effectively and minimize the work spent on
creating synthetic data, code, visualizations, etc. Bring a minimal
answer that you can defend effectively to both technical and
non-technical people.

The problems are not hard, but they require you to be resourceful and
to take a closer look at each problem.
Dataset 1

Here is a sample of the dataset:

outlet  timeframe  Areas   TipeOutlet   Distributor  Freq Order  Juml Sales   Jarak (m)
out_1   Jan 19     DKI     warung       Dist A       8           40.000.000   28.000
out_2   Feb 19     Jabar   minimarket   Dist B       10          150.000.000  10.000
out_3   Mar 19     Banten  supermarket  Dist C       5           250.000.000  5.000
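
The sample appears to use Indonesian number formatting, where a dot separates thousands, so `40.000.000` would mean forty million rather than a decimal. The sketch below shows one way to load the three sample rows with that in mind. It is only an illustration: the tab-separated layout, the helper names `parse_id_int` and `load_rows`, and the reading of `Jarak (m)` values such as `28.000` as whole meters are assumptions, not part of the original dataset description.

```python
import csv
import io

# The three sample rows from Dataset 1, re-typed as tab-separated text.
# The tab-separated layout is an assumption made for this sketch.
SAMPLE = (
    "outlet\ttimeframe\tAreas\tTipeOutlet\tDistributor\tFreq Order\tJuml Sales\tJarak (m)\n"
    "out_1\tJan 19\tDKI\twarung\tDist A\t8\t40.000.000\t28.000\n"
    "out_2\tFeb 19\tJabar\tminimarket\tDist B\t10\t150.000.000\t10.000\n"
    "out_3\tMar 19\tBanten\tsupermarket\tDist C\t5\t250.000.000\t5.000\n"
)

NUMERIC_COLUMNS = ("Freq Order", "Juml Sales", "Jarak (m)")


def parse_id_int(text):
    """Parse an integer written with dots as thousands separators (assumed)."""
    return int(text.replace(".", ""))


def load_rows(raw):
    """Read the tab-separated sample and convert the numeric columns to int."""
    reader = csv.DictReader(io.StringIO(raw), delimiter="\t")
    rows = []
    for row in reader:
        for column in NUMERIC_COLUMNS:
            row[column] = parse_id_int(row[column])
        rows.append(row)
    return rows


rows = load_rows(SAMPLE)
for row in rows:
    print(row["outlet"], row["Juml Sales"], row["Jarak (m)"])
```

Parsing the values as plain integers up front avoids a common pitfall where a generic reader treats `28.000` as the decimal 28.0.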
Dataset 2

Here is a sample of the dataset:


Dataset 3
[Charts not reproduced: a left chart and a right chart. Surviving axis label: Number of Buyers]
Dataset 4

Phone Number   Status
085674872274   Real
085612341234   Unreal
081243579357   Real
081328648738   Real
081122334455   Unreal
081234567890   Unreal
081726842689   Real
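
When working with this sample, it helps to keep each phone number as a string, because converting it to an integer silently drops the leading zero. The sketch below holds the seven labelled rows and groups them by status so the two groups can be compared side by side. The helper name `group_by_status` is hypothetical, and the code makes no claim about what distinguishes the two groups.

```python
from collections import defaultdict

# The labelled sample from Dataset 4, kept as strings so that the
# leading zero of each phone number is preserved.
SAMPLE = [
    ("085674872274", "Real"),
    ("085612341234", "Unreal"),
    ("081243579357", "Real"),
    ("081328648738", "Real"),
    ("081122334455", "Unreal"),
    ("081234567890", "Unreal"),
    ("081726842689", "Real"),
]


def group_by_status(pairs):
    """Collect phone numbers into one list per status label."""
    groups = defaultdict(list)
    for number, status in pairs:
        groups[status].append(number)
    return dict(groups)


groups = group_by_status(SAMPLE)
for status, numbers in groups.items():
    print(status)
    for number in numbers:
        print("  ", number)

# Round-tripping through int loses the leading zero.
print(str(int("085674872274")))
```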
Problem 1: Descriptive

1. What is the difference in purpose between a pie chart and a bar chart?
2. In a pie chart, what would you do if there are 30 categories?
3. Explain what a waterfall chart is.
4. Give one use case for a bar chart with Dataset 1.
5. Give one use case for a bubble plot with Dataset 1.
6. Give one use case for a 100% stacked area chart with Dataset 1.
7. Which chart would you use to compare Juml Sales and Jarak (m) over
time?
Problem 2: Data Cleansing

1. In Dataset 2, how would you make the description columns easier to
analyze?
2. What do you think is the purpose of the `label` column?
3. If the `label` column is empty in 10 million rows, what would
you do to fill in the missing data?
4. What would you do to deal with abbreviations and misspelled words?
5. How would you deal with unwanted observations?
Problem 3: Data Analysis

1. How do you determine whether a sample is statistically significant?
2. What is the difference between bias and variance?
3. How do you know if one algorithm is better than another?
4. How do we multiply matrices?
5. What is the difference between convex and non-convex?
6. What is the difference between closed-form and non-closed-form?
7. What is the difference between a feature, a parameter, an intercept,
and a variable?
Problem 4: Data Storytelling

1. Based on the left chart in Dataset 3, how many of the people who came
in May 2018 still came in July 2018?
2. What data is needed to make the chart in Dataset 3?
3. How would you make the left chart in Dataset 3?
4. How would you make the right chart in Dataset 3?
5. If we make chart based on left chart in Data
Problem 5: Structured Thinking

1. Based on Dataset 4, what pattern determines whether a
number is real or unreal?
2. Write pseudocode to determine whether a number is real or
unreal.
Be resourceful!
Closing

There is nothing wrong with making mistakes, but one should
always make new ones.
