Computer Vision: From 3D Reconstruction to Recognition
Required:
- [FP] D. A. Forsyth and J. Ponce. Computer Vision: A Modern Approach (2nd Edition). Prentice Hall, 2011.
- [HZ] R. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Academic Press, 2002.
Recommended:
- R. Szeliski. Computer Vision: Algorithms and Applications. Springer, 2011.
- D. Hoiem and S. Savarese. Representations and Techniques for 3D Object Recognition and Scene Interpretation, Synthesis Lecture on Artificial Intelligence and Machine Learning. Morgan Claypool Publishers, 2011.
- G. Bradski and A. Kaehler. Learning OpenCV. O'Reilly Media, 2008.
Silvio Savarese, Lecture 1
Course Projects
- Replicate an interesting paper
- Compare different methods on a test bed
- Develop a new approach to an existing problem
- Original research
- Write a 10-page paper summarizing your results
- Release the final code
- Give a final in-class presentation
- SCPD students can send videos instead
Collaboration policy
- Read the student code book; understand what counts as collaboration and what counts as an academic infraction.
- Discussing the project assignment with each other is allowed, but coding must be done individually.
- Homework and class project coding policy: using online code or other students' or researchers' code is not allowed in general. Exceptions can be made, and individual cases will be discussed with the instructor.
Lecture 1: Introduction
[Diagram: Image/video → Computer vision → Object 1 ... Object N (semantics, geometry) and Scene (semantics, geometry)]
[Diagram: Computer vision. Sensing device, computational device; information extraction, interpretation]
- Fingerprint biometrics
- Augmentation with 3D computer graphics
- 3D object prototyping (EosSystems, Photomodeler)
Computer vision and applications
[Figure labels: EosSystems, Autostitch]
- Face detection
- Web applications (Photometria)
- Panoramic photography (Kolor)
- 3D modeling of landmarks
Computer vision and applications
- Efficient SLAM/SFM
- Large-scale image repositories
- Deep learning (e.g. ImageNet)
- Better clouds
- More bandwidth
- Increased computational power
[Figure labels: EosSystems, Autostitch, Kinect, A9, Google Goggles, Kooaba]
- Image search engines (Google Goggles)
- Visual search and landmark recognition
- Augmented reality
- Motion sensing and gesture recognition
- Autonomous navigation and safety
Computer vision and applications
[Figure labels: Kinect, A9, Google Goggles, Kooaba, EosSystems, Autostitch; 2D: Google Goggles; 3D: EosSystems]
Current state of computer vision
- 3D shape recovery
- 3D scene reconstruction
- Camera localization
- Pose estimation
- Object detection
- Texture classification
- Target tracking
- Activity recognition
Perceiving the World in 3D!
Visual processing in the brain
- "Where" pathway (dorsal stream)
- V1
- Pre-frontal cortex
- "What" pathway (ventral stream)
CS 231A course overview
1. Geometry
2. Semantics
Geometry:
- How to extract 3D information?
- Which cues are useful?
- What are the mathematical tools?
Camera systems
- Establish a mapping from 3D to 2D
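As a rough illustration of what a 3D-to-2D mapping looks like, the sketch below uses the ideal pinhole model, which the slide does not name; it is an assumption chosen for simplicity. The function name `project_pinhole`, the focal length of 800 pixels, and the principal point (320, 240) are all hypothetical values, and lens distortion is ignored.

```python
def project_pinhole(point_3d, focal_length, principal_point=(0.0, 0.0)):
    """Project a 3D point given in camera coordinates onto the image plane.

    Assumes an ideal pinhole camera: no lens distortion, square pixels,
    and the optical axis along +z. All names here are illustrative.
    """
    x, y, z = point_3d
    if z <= 0:
        raise ValueError("point must lie in front of the camera (z > 0)")
    cx, cy = principal_point
    # Perspective division: image coordinates shrink as depth z grows.
    u = focal_length * x / z + cx
    v = focal_length * y / z + cy
    return (u, v)


# The same lateral offset seen at two depths (hypothetical numbers).
near = project_pinhole((1.0, 0.5, 2.0), focal_length=800.0,
                       principal_point=(320.0, 240.0))
far = project_pinhole((1.0, 0.5, 4.0), focal_length=800.0,
                      principal_point=(320.0, 240.0))
print(near, far)
```

Doubling the depth halves the offset from the principal point, which is why a single image alone does not determine how far away a point is.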
How to calibrate a camera
- Estimate camera parameters such as pose or focal length
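To make "estimating a camera parameter" concrete, here is a deliberately simplified sketch that recovers only the focal length by least squares. It assumes an ideal pinhole camera whose pose is already known (the 3D points are given in camera coordinates) and whose principal point is at the pixel origin. Full calibration estimates pose and intrinsics together, and this sketch is not a substitute for that. The function name `estimate_focal_length` and the sample points are hypothetical.

```python
def estimate_focal_length(points_3d, pixels):
    """Least-squares focal length from 3D-2D correspondences.

    Model (an assumption): u = f * x / z and v = f * y / z, with the
    principal point at (0, 0). Minimizing the squared reprojection error
    over f gives f = sum(a * p) / sum(a * a), where a is the normalized
    coordinate (x/z or y/z) and p the matching pixel coordinate.
    """
    numerator = 0.0
    denominator = 0.0
    for (x, y, z), (u, v) in zip(points_3d, pixels):
        ax, ay = x / z, y / z
        numerator += ax * u + ay * v
        denominator += ax * ax + ay * ay
    if denominator == 0.0:
        raise ValueError("points on the optical axis do not constrain f")
    return numerator / denominator


# Synthetic check: generate pixels with a known focal length, then recover it.
points = [(1.0, 0.5, 2.0), (-0.5, 1.0, 4.0), (2.0, -1.0, 8.0)]
true_f = 800.0
pixels = [(true_f * x / z, true_f * y / z) for x, y, z in points]
f_hat = estimate_focal_length(points, pixels)
print(f_hat)
```

With noise-free synthetic data the estimate matches the focal length used to generate the pixels; with real measurements it would be the best fit in the least-squares sense under the stated assumptions.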
Single view metrology
- Estimate 3D properties of the world from a single image
Multiple view geometry
- Estimate 3D properties of the world from multiple views
- Mathematical tools: epipolar geometry
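One way to see what epipolar geometry provides is the epipolar constraint: for two calibrated views related by a rotation R and translation t, corresponding normalized image points satisfy x2ᵀ E x1 = 0 with E = [t]× R. The sketch below checks this numerically. The convention X2 = R·X1 + t, the helper names, and the sample point are assumptions made for illustration, not taken from the slides.

```python
def cross_matrix(t):
    """Skew-symmetric matrix [t]x such that [t]x v equals t cross v."""
    tx, ty, tz = t
    return [[0.0, -tz, ty], [tz, 0.0, -tx], [-ty, tx, 0.0]]


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def matvec(m, v):
    """Product of a 3x3 matrix and a 3-vector."""
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]


def normalize(point):
    """Normalized image coordinates (x/z, y/z, 1) of a camera-frame point."""
    x, y, z = point
    return (x / z, y / z, 1.0)


def epipolar_residual(x1, x2, essential):
    """Value of x2^T E x1; zero for a true correspondence."""
    ex1 = matvec(essential, x1)
    return sum(a * b for a, b in zip(x2, ex1))


# Hypothetical setup: second camera shifted sideways, no rotation.
R = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t = (-1.0, 0.0, 0.0)
E = matmul(cross_matrix(t), R)

X1 = (0.5, 0.2, 4.0)                                  # point in camera-1 frame
X2 = tuple(a + b for a, b in zip(matvec(R, X1), t))   # same point in camera-2 frame
x1, x2 = normalize(X1), normalize(X2)

residual = epipolar_residual(x1, x2, E)
mismatch = epipolar_residual(x1, (x2[0], x2[1] + 0.1, 1.0), E)
print(residual, mismatch)
```

The true correspondence gives a residual of zero up to rounding, while moving the second point off its epipolar line gives a clearly nonzero value. This is what lets a matcher restrict its search for x2 to a line rather than the whole image.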
Semantics:
- How to recognize objects?
- How to classify images or understand a scene?
- How to segment out critical semantics?
- How to estimate 3D properties (pose, size, shape)?
Object recognition and categorization
[Figure: image of downtown Chicago with labeled objects: building, clock, person, car]
- Classification: Does this image contain a building? [yes/no] Yes!
- Detection: Does this image contain a car? [where?]
- Detection: Which objects does this image contain? [where?] (building, clock, person, car)
- Detection: Accurate localization (segmentation) (clock)
- Detection: Estimating 3D geometrical properties (building, 45 degrees)
Joint reconstruction and recognition
[Figure: input images]
3D geometry
1. Introduction
2. Camera models
3. Camera calibration
4. Single view metrology
5. Epipolar geometry
6. Multi-view geometry
April
Recognition
11. Intro to Recognition; Object classification I
12. Object classification II
13. 2D Object detection
May
14. 3D Object recognition
15. Scene understanding & segmentation
16. 3D Scene understanding
June
Project presentations
CS231 Introduction to Computer Vision