Академический Документы
Профессиональный Документы
Культура Документы
Contact:are
wlzhao@xmu.edu.cn
All rights
reserved by Wan-Lei zhao
1 / 42
Outline
Syllabus
Course plan
Major subjects
Deal with information such as text, image and video
Text retrieval, content-based image retrieval and video retrieval
Focus on how to retrieve above mentioned information
Popular machine learning approaches will be covered
K-means, SVM and decision tree
Popular model tting approaches will be covered
RANSAC and Hough transform
Popular algorithms in computer vision will be covered
SIFT, BoVW and Hamming Embedding
Objectives
Bring you into this interesting topic
Get you familiar with basic & popular algorithms in this eld
Able to build a simple but workable search engine on your own
Able to apply algorithms to solve the problems in your eld
Syllabus
4 / 42
Syllabus
Model Fitting
RANSAC
Hough Transform
Image & Video Retrieval (22 hours)
Challenges & Trends
Image Features: SIFT and et al.
BoVW Framework
Fisher Kernel Framework
Challenges in Video Retrieval
Temporal Verication Approach
Image Classication and MISC (12 hours)
Challenges & Trends
One-against-all Framework
Tricks in model training
Convolutional Neural Network
Syllabus
Syllabus
Syllabus
Mr. Zhihui Chen will be in charge of the course project related issues
Miss Haihui Liu helps to do proofreading on the course materials
Experiment lectures are held in Labotrary building, Room 501
Time slot: 2:30pm -4:20pm, in the 6th, 8th and 10th weeks
I will remind you one week ahead
Syllabus
Course website
Platform of online teaching in XMU
URL: l.xmu.edu.cn, please go to there and register the course
Password: 007
Syllabus
the beginning
Me too:)
Several advantages:
Computer science is dened in
English
Get you guys used to English
Syllabus
Syllabus
Syllabus
two aspects
1
Computer Vision
Machine Learning
2
All rights
are reserved by Wan-Lei zhao
13 / 42
Course plan
Course plan
Be an Active Learner
Level 1
Catch the concept
Level 2
Understand the idea
Know how to use it
Level 3
Able to re-implement the algorithms
Knows where it works
Knows where it fails
Outline
Syllabus
Course plan
Language
Mandarin
English
Hindi
Spanish
Russian
Population
1.2 billion
508 million
497 million
392 million
277 million
Category
isolating language
reecting language
reecting language
reecting language
reecting language
Region
China
UK, North America
India & Pakistan
Span & South America
Russia & East Europe
18 / 42
&
&
&
&
countries
All rights
arebyreserved
by Wan-Lei zhao
1
Conducted
Webb.
19 / 42
Pay attention that not all the languages have their written forms
Egyptian papyrus2
Babylonian clay tablet (3000 B.C.)
Chinese Oracle (1400 B.C.)
In 105 A.D., paper was invented in China
2
It is not are
paper
in real sense.
All rights
reserved
by Wan-Lei zhao
21 / 42
22 / 42
on papyrus
The rst library (as far as we know) was established in north Syria,
around 3000 BC
Later, Empire Assyria built Library Nineveh (current Mosul) in 612
BC
Best well-known library was built by Alexander the Great about 350
BC in Egypt
25 / 42
Media
Publishing
Storage
Indexing
Interface
before WWW
text document, TV, lm & CD
months or years
books & papers
title, author, keywords and date
library
WWW era
in electronic forms
hours
disc, DVD and etc & web
and contents
browser
information/knowledge
WWW is everywhere
Ubiquitous web (2002-present)
Introduction of Web 2.0 is the milestone
Wikipedia was born in 2001
Flickr was born in 2004
Facebook was born in 2004
Youtube was born in 2006
Twitter was born in 2006
Smartphone was released in 2007
All technologies and media are intertwined to reshape the world
Impact on our daily life of many aspects
IR becomes the main interface to them all
Semantic Web
Web 3.0 (20??)
Proposed by Berners-Lee3
Websites are linked by semantic meta data
Machine builds the link automatically
Requires technology of natural language understanding
Still a vague concept
Automatic documenting, e.g. books and recipes
Weaving the Web: The Original Design and Ultimate Destiny of the World Wide
Web,
in American
Scientic, 2000
All
rights
are reserved
by Wan-Lei zhao
32 / 42
Statistics on WWW
Num. of websites and users (2000-1013)
Num. of sites
Num. of users
Number
2B
1B
100M
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013
Year
4
Statisticsare
wasreserved
collected on by
Apr.Wan-Lei
28th 2010. zhao
All rights
35 / 42
All rights
are reserved by Wan-Lei zhao
Given the thickness of one photo: 0.2 mm
36 / 42
38 / 42
Observations
Information are highly distributed in Internet
The indexer (search engine) keeps information in a centralized manner
Structure of a crawler
Observations
Crawler plays very important role
Experiences of using Baidu and Google
Q&A