Data Warehousing & Data Mining

Uploaded by: Anurag Singh

Contains some lab programs for Data Warehousing & Data Mining.

EXPERIMENT NO. 11

OBJECTIVE: Implementation of the binning method of data cleaning.

Binning, or discretization, is the process of transforming numerical variables into categorical
counterparts. An example is to bin values for Age into categories such as 20-39, 40-59, and
60-79. Numerical variables are usually discretized in modeling methods that are based on
frequency tables (e.g., decision trees). Moreover, binning may improve the accuracy of
predictive models by reducing noise or non-linearity. Finally, binning allows easy
identification of outliers and of invalid or missing values of numerical variables.
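The Age example above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `age_category`, the labels "missing" and "out of range", and the sample ages are hypothetical and do not come from the lab handout.

```python
def age_category(age):
    """Map a numeric age to one of the categories named in the text."""
    # Missing values are flagged explicitly instead of being binned.
    if age is None:
        return "missing"
    # Bin edges follow the example categories 20-39, 40-59 and 60-79.
    bins = [(20, 39, "20-39"), (40, 59, "40-59"), (60, 79, "60-79")]
    for low, high, label in bins:
        if low <= age <= high:
            return label
    # Anything outside every bin is reported as a possible outlier.
    return "out of range"


ages = [23, 45, 67, 39, None, 85]  # hypothetical sample values
print([age_category(a) for a in ages])
```

Flagging values that fall outside every bin is one simple way to surface the outliers and missing values mentioned above.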

There are two types of binning, unsupervised and supervised.

Binning methods smooth a sorted data value by consulting its “neighborhood,” that is, the values
around it. The sorted values are distributed into a number of “buckets,” or bins.

For example

Price = 4, 8, 15, 21, 21, 24, 25, 28, 34

Partition into (equal-frequency) bins:

Bin a: 4, 8, 15

Bin b: 21, 21, 24

Bin c: 25, 28, 34

In this example, the data for price are first sorted and then partitioned into equal-frequency
bins of size 3.
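The partitioning step can be sketched as follows. The helper name `equal_frequency_bins` is hypothetical, and the handling of a shorter last bin is an assumption, since the example divides evenly.

```python
def equal_frequency_bins(values, bin_size):
    """Sort the values and split them into consecutive bins of bin_size items."""
    ordered = sorted(values)
    # Slice the sorted list into chunks; the last bin is shorter
    # when len(values) is not a multiple of bin_size (an assumption).
    return [ordered[i:i + bin_size] for i in range(0, len(ordered), bin_size)]


price = [4, 8, 15, 21, 21, 24, 25, 28, 34]
bins = equal_frequency_bins(price, 3)
for name, b in zip("abc", bins):
    print(f"Bin {name}: {b}")
```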

Smoothing by bin means:

Bin a: 9, 9, 9

Bin b: 22, 22, 22

Bin c: 29, 29, 29

In smoothing by bin means, each value in a bin is replaced by the mean value of the bin.
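A minimal sketch of smoothing by bin means, starting from the three bins of the price example. The function name `smooth_by_means` is hypothetical.

```python
def smooth_by_means(bins):
    """Replace every value in a bin with the mean of that bin."""
    smoothed = []
    for b in bins:
        bin_mean = sum(b) / len(b)  # arithmetic mean of the bin
        smoothed.append([bin_mean] * len(b))
    return smoothed


bins = [[4, 8, 15], [21, 21, 24], [25, 28, 34]]
means_smoothed = smooth_by_means(bins)
for name, b in zip("abc", means_smoothed):
    print(f"Bin {name}: {b}")
```

The means come out as floating-point values equal to 9, 22 and 29, matching the bins listed above.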

Smoothing by bin boundaries:

Bin a: 4, 4, 15

Bin b: 21, 21, 24

Bin c: 25, 25, 34

In smoothing by bin boundaries, the minimum and maximum values in each bin are taken as the
bin boundaries, and each bin value is replaced by the closest boundary value.
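Smoothing by bin boundaries can be sketched the same way. The function name `smooth_by_boundaries` is hypothetical, and sending ties to the lower boundary is an assumption, because the text does not say how a value exactly halfway between the two boundaries should be treated.

```python
def smooth_by_boundaries(bins):
    """Replace each value with the nearer of its bin's minimum and maximum."""
    smoothed = []
    for b in bins:
        low, high = min(b), max(b)
        # Ties go to the lower boundary (an assumed tie-breaking rule).
        smoothed.append([low if v - low <= high - v else high for v in b])
    return smoothed


bins = [[4, 8, 15], [21, 21, 24], [25, 28, 34]]
boundary_smoothed = smooth_by_boundaries(bins)
for name, b in zip("abc", boundary_smoothed):
    print(f"Bin {name}: {b}")
```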

EXPERIMENT NO. 12

OBJECTIVE: Implementation of the z-score method of data cleaning.

Z-scores are linearly transformed data values having a mean of zero and a standard
deviation of 1. Z-scores are also known as standardized scores: they are scores (or data
values) that have been given a common standard, namely a mean of zero and
a standard deviation of 1.

Z-Scores - Standardization

Giving scores a common standard of zero mean and unit standard deviation facilitates
their interpretation. We can do just that by

• first subtracting the mean over all scores from each individual score and
• then dividing each remainder by the standard deviation over all scores.

These two steps are the same as the following formula:

$$Z_x = \frac{X_i - \bar{X}}{S_x}$$
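The two steps can be sketched in Python as follows. The text does not say whether the standard deviation is the population or the sample version; this sketch assumes the population standard deviation (`statistics.pstdev`), and `statistics.stdev` can be substituted if the sample version is intended. The function name `z_scores` and the sample data are hypothetical.

```python
from statistics import mean, pstdev


def z_scores(scores):
    """Standardize scores: subtract the mean, then divide by the standard deviation."""
    m = mean(scores)
    s = pstdev(scores)  # population standard deviation (an assumption)
    if s == 0:
        raise ValueError("standard deviation is zero; z-scores are undefined")
    return [(x - m) / s for x in scores]


scores = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data: mean 5, standard deviation 2
z = z_scores(scores)
print(z)
print(mean(z), pstdev(z))  # the standardized values have mean 0 and standard deviation 1
```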

Example.

A group of 100 people took an IQ test. My score was 5. Is that good or bad? At this
point, there is no way of telling, because we don't know what people typically score on this
test. However, if my score of 5 corresponds to a z-score of 0.91, you'll know it was pretty
good: it's roughly one standard deviation above the average (which is always zero for
z-scores).
What we see here is that standardizing scores facilitates the interpretation of a single test
score. Let's see how that works.

Our 100 scores have a mean of 3.45 and a standard deviation of 1.70. By entering these
numbers into the formula, we see why a score of 5 corresponds to a z-score of 0.91:

$$Z_x = \frac{5 - 3.45}{1.70} = 0.91$$
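The same arithmetic can be checked in Python. The helper names `z_score` and `score_from_z` are hypothetical; the inverse function is added only to show that a z-score can be mapped back to the raw score when the mean and standard deviation are known.

```python
def z_score(x, mean, sd):
    """Return how many standard deviations x lies above the mean."""
    return (x - mean) / sd


def score_from_z(z, mean, sd):
    """Invert the transformation: recover the raw score from its z-score."""
    return mean + z * sd


my_z = z_score(5, 3.45, 1.70)
print(round(my_z, 2))                  # rounded to two decimals, as in the text
print(score_from_z(my_z, 3.45, 1.70))  # maps back to the raw score
```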

In a similar vein, the screenshot below shows the z-scores for all distinct values of our first
IQ test added to the data.
