C032703/F032528 Computer Vision 计算机视觉

 

课程名称 (Course Name) Computer Vision

课程代码 (Course Code)   C032703/F032528

学分/学时 (Credits/Credit Hours) 3/48

开课时间 (Course Term )  Spring

开课学院(School Providing the Course:  School of ElectronicInformation and Electrical Engineering

任课教师(Teacher:  Xu Zhao

课程讨论时数(Course Discussion Hours:  4

课程实验数(Lab Hours:   8

课程内容简介(Course Introduction):

Computer vision aims to recover useful information about a 3D scene from its 2D projections (images), such as the depth and structure, motion, surfaces curvature and orientation of 3D objects and status and meaning of the actions of 3D scene. In this course, basic concept, theories and algorithms of computer vision are introduced. First, how an image could be formed is systematically analyzed, from the viewpoints of geometric camera models, light, shading and digital camera. Then, the theories and algorithms about image filtering, binary image processing and local image features will be reviewed. Multiple-view based techniques as stereo vision, 3D motion analysis and image alignment are then studied.  Mid-level and high-level vision topics, such as optical flow, tracking, segmentation, object detection and recognition will be discussed in details.

教学大纲(Course Teaching Outline):

Part 0: Introduction

Course 1: Introduction

1)      Concept of computer vision

2)      Related fields

3)      Human vision

4)      State of the art

Part I: Image formation

Course 2: Geometric camera models

1)      Geometric primitives and transformations

2)      Projection models

3)      Intrinsic and extrinsic parameters

4)      Camera calibration

Course 3: Light and shading

1)      Lighting

2)      Reflectance and shading

3)      Shape from shading

Course 4: Digital camera

1)      Image sensing pipeline

2)      Sampling and aliasing

3)      Color

Part II: Early vision – single view

Course 5: Image filtering

1)      Linear filters and convolution

2)      Non-linear filters

3)      Spatial frequency and Fourier transforms

4)      Histogram equalization

5)      Image pyramids

Course 6: Binary image processing

1)      Binary image generation

2)      Binary image representation

3)      Morphological processing

Course 7: Local image features

1)      Image gradient

2)      Edges

3)      Lines

4)      Points and patch

5)      SIFT and HOG

6)      Texture

Part III: Early vision – multiple views

Course 8: Stereopsis

1)      Binocular camera geometry

2)      Epipolar constraint and fundamental matrix

3)      Stereo of arbitrary camera arrangement

4)      Structured Lighting

Course 9: Motion

1)      Motion from 3D PCs

2)      Motion from 2D PC

3)      Motion from LC’s

4)      Motion from other image clues

Course 10: Alignment and warping

1)      Background

2)      Mosaics and warping

3)      Outlier processing: RANSAC

Part IV: Mid-level vision

Course 11: Tracking

1)      Optical flow

2)      Linear dynamical models and Kalman filters

3)      Particle filtering

Course 11: Segmentation

1)      K-means and EM

2)      Graph cuts and energy-based methods

Course 12: Grouping and fitting

1)      Hough transform

2)      Deformable contours

Part V: High-level vision

Course 13: Object detection

1)      Sliding window method

2)      Pyramid match kernel

3)      Part-based models

Course 14: Recognition

1)      Generative categorization: Naïve Bayes model for classification

2)      Discriminative classifiers: SVM

3)      Bag-of-words models

课程进度计划(Course Schedule):

Week num.

Topic

Assignments

2

Intro.

3

Geometric camera models

4

Light and shading

5

Digital camera

Problem set 1 out

6

Image filtering

7

Binary image processing

8

Local image features

9

Stereopsis

Problem set 2 out

10

Motion

11

Alignment and warping

12

Tracking

13

Segmentation

Problem set 3 out

14

Grouping and fitting

15

Object detection and Recognition

16

Exam. review

17

Final exam.

课程考核要求(Course Assessment Requirements)

Problem set:           40%

Final exam:             60%

参考文献(Course References)

[1] Computer Vision: Algorithms and Applications. Richard Szeliski.

[2] Computer Vision: A Modern Approach. David A Forsyth

[3] Multiple View Geometry in Computer Vision. Richard Hartley et al

[4] Machine Vision. Ramerh Jian, et al

[5] Vision. David Marr.

[6] Pattern Recognition and Machine Learning. Bishop.

[7] 机器视觉,贾云得 编著

[8] 计算机视觉的理论和实践, 李介谷著

[9] 计算机视觉——计算机理论与算法基础, 马颂德

预修课程(Prerequisite Course

(1)  Digital image processing

(2)  Mathematics: Matrix, Probability theory

(3)  Programming: C, C++, Matlab

[ 2015-11-26 ]