
Week 1, Video 5

Classifiers, Part 3

Classification
There is something you want to predict (the label)
The thing you want to predict is categorical

The answer is one of a set of categories, not a number

In a Previous Class
Step Regression
Logistic Regression
J48/C4.5 Decision Trees

Today

More Classifiers

Decision Rules

Sets of if-then rules which you check in order

Decision Rules Example


IF time < 4 and knowledge > 0.55 then CORRECT
ELSE IF time < 9 and knowledge > 0.82 then CORRECT
ELSE IF numattempts > 4 and knowledge < 0.33 then INCORRECT
OTHERWISE CORRECT
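
Rules are checked in order, and the first one that matches fires. A minimal Python sketch of that ordering semantics, using the example rules above (the feature names and thresholds are just the illustration's):

    def classify(time, knowledge, numattempts):
        # Rules are checked top to bottom; the first match wins
        if time < 4 and knowledge > 0.55:
            return "CORRECT"
        elif time < 9 and knowledge > 0.82:
            return "CORRECT"
        elif numattempts > 4 and knowledge < 0.33:
            return "INCORRECT"
        else:  # the OTHERWISE (default) rule
            return "CORRECT"

    print(classify(time=5, knowledge=0.9, numattempts=1))  # CORRECT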

Many Algorithms

Differences are in terms of how rules are generated and selected

The most popular subcategory (including JRip and PART) repeatedly creates decision trees and distills the best rules

Generating Rules from Decision Tree


1. Create a decision tree
2. If there is at least one path that is worth keeping, go to step 3; else go to step 6
3. Take the best single path from root to leaf and make that path a rule
4. Remove all data points classified by that rule from the data set
5. Go to step 1
6. Take all remaining data points
7. Find the most common value for those data points
8. Make an "otherwise" rule using that value
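
A minimal sketch of this loop in Python, assuming scikit-learn for the tree-building step. The support and purity thresholds that decide whether a path is "worth keeping" are assumptions for illustration; JRip and PART use more careful pruning criteria:

    import numpy as np
    from sklearn.tree import DecisionTreeClassifier

    def best_leaf(tree, X, y, min_support=5, min_purity=0.9):
        # Step 2: look for a leaf (a root-to-leaf path) worth keeping
        leaf_ids = tree.apply(X)
        best = None
        for leaf in np.unique(leaf_ids):
            mask = leaf_ids == leaf
            labels, counts = np.unique(y[mask], return_counts=True)
            purity = counts.max() / mask.sum()
            if mask.sum() >= min_support and purity >= min_purity:
                if best is None or purity > best[0]:
                    best = (purity, leaf, mask, labels[counts.argmax()])
        return best  # None means no path is worth keeping

    def learn_rules(X, y, max_rules=20):
        # X, y: numpy arrays; y holds non-negative integer class labels
        rules = []  # each rule: (tree, leaf_id, label)
        while len(y) > 0 and len(rules) < max_rules:
            tree = DecisionTreeClassifier(max_depth=4).fit(X, y)  # step 1
            found = best_leaf(tree, X, y)                         # step 2
            if found is None:
                break                                             # go to step 6
            _, leaf, mask, label = found
            rules.append((tree, leaf, label))  # step 3: path becomes a rule
            X, y = X[~mask], y[~mask]          # step 4: drop covered points
        # Steps 6-8: the "otherwise" rule is the most common remaining label
        default = np.bincount(y).argmax() if len(y) else 0  # 0: arbitrary fallback
        return rules, default

To classify a new point, walk the rules in order and fire the first one whose tree routes the point to the stored leaf; if none fires, the "otherwise" label applies.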

Relatively conservative

Leads to simpler models than most decision trees

Very interpretable models

Unlike most other approaches

Good when multi-level interactions are common

Just like decision trees

K*

Predicts a data point from neighboring data points


Weights points more strongly if they are nearby

Good when data is very divergent

Lots of different processes can lead to the same result
Intractable to find general rules
But data points that are similar tend to be from the same group
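
K* proper weights neighbors with an entropy-based distance measure (Cleary & Trigg, 1995). As a rough stand-in, a distance-weighted nearest-neighbor classifier captures the same "nearby points count more" idea; a minimal sketch assuming scikit-learn:

    from sklearn.datasets import load_iris
    from sklearn.neighbors import KNeighborsClassifier

    X, y = load_iris(return_X_y=True)

    # weights="distance": closer neighbors get proportionally more vote
    knn = KNeighborsClassifier(n_neighbors=10, weights="distance").fit(X, y)
    print(knn.predict(X[:3]))

Note that fit() here mostly just stores the training data; prediction searches it, which is exactly the drawback noted below.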

Big Advantage

Sometimes works when nothing else works


Has been useful for my group in detecting emotion from log files (Baker et al., 2012)

Big Drawback

To use the model, you need to have the whole data set

Bagged Stumps
Related to decision trees
Lots of trees with only the first feature (single-split "stumps")
Relatively conservative

A close variant is Random Forest
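
A minimal sketch of both, assuming scikit-learn; the ensemble sizes are arbitrary:

    from sklearn.datasets import load_iris
    from sklearn.ensemble import BaggingClassifier, RandomForestClassifier
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_iris(return_X_y=True)

    # Bagged stumps: many depth-1 trees, each fit on a bootstrap sample
    # (the parameter is named base_estimator in scikit-learn < 1.2)
    stumps = BaggingClassifier(
        estimator=DecisionTreeClassifier(max_depth=1),
        n_estimators=100,
    ).fit(X, y)

    # The close variant: deeper trees plus random feature subsets per split
    forest = RandomForestClassifier(n_estimators=100).fit(X, y)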

Common Thread

So far, all the classifiers I've discussed are conservative
Find simple models
Don't over-fit

These algorithms appear to do better for most educational data mining than less conservative algorithms
In brief, educational data has lots of systematic noise

Some less conservative algorithms

Support Vector Machines


Conducts dimensionality reduction on the data space and then fits a hyperplane that splits the classes
Creates very sophisticated models
Great for text mining
Great for sensor data
Not optimal for most other educational data

Logs, grades, interactions with software
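
A minimal sketch of fitting one, assuming scikit-learn; the RBF kernel here is just a common default:

    from sklearn.datasets import load_iris
    from sklearn.svm import SVC

    X, y = load_iris(return_X_y=True)

    # A kernelized SVM: implicitly remaps the data space, then fits the
    # maximum-margin hyperplane that splits the classes in that space
    svm = SVC(kernel="rbf", C=1.0).fit(X, y)
    print(svm.predict(X[:3]))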

Genetic Algorithms
Uses mutation, combination, and natural selection to search the space of possible models
Can produce inconsistent answers
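
A toy sketch of the idea: evolve the weights of a linear classifier by selection, crossover, and mutation. Population size, mutation scale, and generation count are arbitrary choices, and different random seeds can give different models, which is the inconsistency noted above:

    import numpy as np
    from sklearn.datasets import load_iris

    X, y = load_iris(return_X_y=True)
    X, y = X[y < 2], y[y < 2]  # two classes, for a simple linear model
    rng = np.random.default_rng(0)

    def accuracy(w):
        # Candidate model: predict class 1 when the weighted sum is positive
        return np.mean((X @ w[:-1] + w[-1] > 0).astype(int) == y)

    pop = rng.normal(size=(30, X.shape[1] + 1))  # random initial population
    for generation in range(50):
        fit = np.array([accuracy(w) for w in pop])
        parents = pop[np.argsort(fit)[-10:]]     # natural selection
        kids = []
        for _ in range(len(pop) - len(parents)):
            a, b = parents[rng.integers(10, size=2)]
            mask = rng.random(a.shape) < 0.5     # combination (crossover)
            child = np.where(mask, a, b)
            child = child + rng.normal(scale=0.1, size=child.shape)  # mutation
            kids.append(child)
        pop = np.vstack([parents, kids])

    best = max(pop, key=accuracy)
    print(accuracy(best))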

Neural Networks
Composes extremely complex relationships by combining perceptrons
Finds very complicated models

Soller & Stevens (2007)
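
A minimal sketch of a small feed-forward network, assuming scikit-learn; the layer sizes are arbitrary:

    from sklearn.datasets import load_iris
    from sklearn.neural_network import MLPClassifier

    X, y = load_iris(return_X_y=True)

    # Two hidden layers of perceptron-style units; composing layers is
    # what lets the model express very complicated relationships
    net = MLPClassifier(hidden_layer_sizes=(16, 16), max_iter=2000).fit(X, y)
    print(net.predict(X[:3]))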

In fact

The difficulty of interpreting non-linear models is so well known that they put up a sign about it in New York City

Note

Support Vector Machines, Genetic Algorithms, and Neural Networks are great for some problems
For most types of educational data, they have not historically produced the best solutions

Later Lectures

Goodness metrics for comparing classifiers

Validating classifiers

Classifier conservatism and over-fitting

Next Lecture

A case study in classification
