44
1 CS6550 Computer Vision Class Meeting: W3W4 (10:10am 12pm), F3 (10:10am 11am). Rm 127 EECS Bldg., 資電館 127Instructor: Prof. Shang-Hong Lai, Rm. 639 EECS Bldg., 賴尚宏, 資電館639, Tel: ext. 42958, Email: [email protected] , URL: http://www.cs.nthu.edu.tw/~lai Office Hours: M4, F4 or by appointment Teaching Assistant: 李東穎、喬彥豪, 台達館 719721, CV lab. Email: [email protected]

CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: [email protected] 2 Prerequisite •Linear

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

1

CS6550 Computer Vision

• Class Meeting:

W3W4 (10:10am – 12pm), F3 (10:10am – 11am).

Rm 127 EECS Bldg.,

資電館 127室

• Instructor: Prof. Shang-Hong Lai, Rm. 639 EECS Bldg.,

賴尚宏, 資電館639室,

Tel: ext. 42958, Email: [email protected],

URL: http://www.cs.nthu.edu.tw/~lai

• Office Hours: M4, F4 or by appointment

• Teaching Assistant:

李東穎、喬彥豪, 台達館 719、721, CV lab.

Email: [email protected]

Page 2: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

2

Prerequisite

• Linear Algebra

• Probability and Statistics

• Basic Programming

Page 3: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

3

Course Contents1. Image Formation (1 week)

2. Image Features (2 weeks)

3. Image Segmentation (2 weeks)

4. Camera Calibration (1 week)

5. Two-View Geometry (1 week)

6. Stereo Reconstruction (1 week)

7. Image Matching (1 week)

8. Motion Analysis (1 weeks)

9. Texture analysis (1 weeks)

10. Object Recognition (1 week)

11. Object Detection (1 week)

Page 4: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

4

Textbooks• Primary:

Computer Vision: Algorithms and Applications,

by Richard Szeliski, draft (9/3/2010 version)

http://szeliski.org/Book/

•Secondary:

Computer Vision: A Modern Approach, by

David Forsyth and Jean Ponce, Prentice Hall,

2003.

Image Processing, Analysis, and Machine

Vision, by M. Sonka, V. Hlavac, R. Boyle,

Thomson Engineering, 3rd Edition, 2007(8).

• Lecture slides distributed in class.

Page 5: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Sample Contents

5

Page 6: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

6

Grading

Midterm Exam. (11/23) 30%

Final Project 20%

Homeworks (4) 40%

Class Participation 5%

Quizzes 5%

Page 7: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

7

Homework Policy

• Discussion of homework is encouraged, but you

have to write your own. No copying is strictly

enforced.

• Homework should be delivered before the

announced due time, normally before the lecture.

Late homework will be degraded by 25% per day.

• No make-up homeworks 4 days after the deadline.

Page 8: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

8

Course Webpage

• http://cv.cs.nthu.edu.tw/courses.php

• It contains the course slides, basic course

information, and class announcement.

• Important course announcement will also be

posted on this webpage.

Page 9: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

9

Class Participation

• Class attendance is required and treated as the

basic requirement for class participation.

• Asking questions is strongly encouraged.

• Extra credit will be given for finding mistakes or

asking questions.

Page 10: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

10

CS 6550 Classroom Rule

• No eating is permitted in class.

• No sleeping is allowed in class.

• Disturbance to others in class should be

minimized.

• Cell phone should be turned off during the class.

Page 11: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

11

What is Computer Vision?

• To extract useful information about real physical

objects and scenes from sensed images/video.

– 3D reconstruction from images

– Object detection/recognition

• Automatic understanding of images and video

– Computing properties of the 3D world from visual data

(measurement)

– Algorithms and representations to allow a machine to

recognize objects, people, scenes, and activities.

(perception and interpretation)

Page 12: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Vision for measurement

Real-time stereo Structure from motion

NASA Mars Rover

Pollefeys et al.

Multi-view stereo for

community photo collections

Goesele et al.

Slide credit: L. Lazebnik

Page 13: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

sky

wate

r

Ferris

wheel

amusement park

Cedar

Point

12 E

tree

tree

tree

carouseldeck

people waiting in

line

rid

e

rid

e rid

e

umbrellas

pedestrians

maxair

bench

tree

Lake Erie

people sitting on

ride

Objects

Activities

Scenes

Locations

Text / writing

Faces

Gestures

Motions

Emotions…

The

Wicked

Twister

Vision for perception, interpretation

Page 14: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Related disciplines

Cognitive

science

Algorithms

Image

processing

Artificial

intelligence

GraphicsMachine

learningComputer

vision

Page 15: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Vision and Graphics

ModelImages Vision

Graphics

Inverse problems: analysis and synthesis.

Page 16: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Why computer vision?

• As image sources multiply, so do applications

– Relieve humans of boring, easy tasks

– Enhance human abilities: human-computer interaction,

visualization

– Perception for robotics / autonomous agents

– Organize and give access to visual content

Page 17: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Why computer vision?• Images and video are everywhere!

Personal photo albums

Surveillance and security

Movies, news, sports

Medical and scientific images

Slide credit; L. Lazebnik

Page 18: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Again, what is computer vision?

• Mathematics of geometry of image formation?

• Statistics of the natural world?

• Models for neuroscience?

• Engineering methods for matching images?

• Science Fiction?

Page 19: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

19

Applications of Computer Vision

• Robot Vision / Autonomous Vehicles

• Biometric Identification / Recognition

• Industrial Inspection

• Video Surveillance

• Digital Camera

• Medical Image Analysis/Processing

• Remote Sensing

• Multimedia Retrieval

Page 20: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Consumer Applications

20

(a) image stitching: merging different views (Szeliski and Shum

1997) (b) exposure bracketing: merging different exposures.

Page 21: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

21

Real-Time Stereo Camera

Bumblebee

Point Grey Research makes video

rate stereo camera (640 x 480 at

30 fps).

Page 22: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

3D Reconstruction from Images

22

Page 23: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Earth viewers (3D modeling)

Image from Microsoft’s Virtual Earth

(see also: Google Earth)

23

Page 24: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Photosynth

http://photosynth.net/

24

Page 25: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

25

Object Detection

Page 26: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Optical Character Recognition (OCR)

Digit recognition, AT&T labs

http://www.research.att.com/~yann/

Technology to convert scanned docs to text• If you have a scanner, it probably came with OCR software

License plate readershttp://en.wikipedia.org/wiki/Automatic_number_plate_recognition

26

Page 27: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Face Detection

• Many new digital cameras now detect faces

– Canon, Sony, Fuji, …

27

Page 29: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Face Detection and Recognition

29

Face detection algorithms, coupled with color-based clothing and hair detection

algorithms, can locate and recognize the individuals in this image (Sivic,

Zitnick, and Szeliski 2006)

Page 30: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Face recognition

Who is she? 30

Page 31: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Vision-based biometrics

How the Afghan Girl was Identified by Her Iris Patterns?

http://www.cl.cam.ac.uk/~jgd1000/afghan.html

31

Page 32: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Login without a password…

Fingerprint scanners on

many new laptops,

other devices

Face recognition systems now

beginning to appear more widelyhttp://www.sensiblevision.com/

32

Page 33: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Object recognition (in mobile phones)

• This is becoming real:

– Microsoft Research

– Point & Find, Nokia

33

Page 34: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Special effects: shape capture

34

Page 35: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Sports – Augmented Reality

Sportvision first down line

Nice explanation on www.howstuffworks.com 35

Page 36: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Smart cars

• Mobileye

– Vision systems currently in high-end BMW, GM, Volvo models

36

Page 37: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Google street view

Page 38: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Google street view

Page 39: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Assisted driving

meters

met

ers

Ped

Ped

Car

Lane detection

Pedestrian and car detection

• Collision warning

systems with adaptive

cruise control,

• Lane departure

warning systems,

• Rear object detection

systems,

Page 40: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Vision-based interaction (and games)

Nintendo Wii has camera-based IR

tracking built in.

40

Control games with your own body

motion/gesture and create immersive

experiences by combining 3D personal

image into the game scene

Page 41: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Vision in space

Vision systems (JPL) used for several tasks• Panorama stitching

• 3D terrain modeling

• Obstacle detection, position tracking

• For more, read “Computer Vision on Mars” by Matthies et al.

NASA'S Mars Exploration Rover Spirit captured this westward view from atop

a low plateau where Spirit spent the closing months of 2007.

41

Page 42: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Robotics

http://www.robocup.org/NASA’s Mars Spirit Rover

http://en.wikipedia.org/wiki/Spirit_rover

42

Page 43: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

Medical imaging

Image guided surgery

Grimson et al., MIT3D imaging

MRI, CT

43

Page 44: CS6550 Computer Visioncv.cs.nthu.edu.tw/upload/courses/17/uploads/CS6550...李東穎、喬彥豪, 台達館719、721, CV lab. Email: d9562818@oz.nthu.edu.tw 2 Prerequisite •Linear

• Read Chap. 1 (Szeliski)

• Next class

– Image formation (Chap. 2, Szeliski)

44

Things to Do