Intro to Computer Vision

NYU, Spring 2023



Source: Willow

Course webpage for the NYU Spring 2023 Course Special Topics in Data Science, DS-GA 3001-009 (Introduction to Computer Vision). This course aims to cover broad topics in computer vision, and is not primarily a deep learning course. We will cover topics in traditional computer vision such as camera geometry, image formation, segmentation, object recognition, classification, and detection (see Syllabus).

Logistics

  • DS-GA 3001-009 (Lecture)
    Mondays 4:55pm-6:35pm
    Location: 208, Silver Center for Arts and Science, 32 Waverly Pl, New York, NY 10011 (SILV 208, location link)

    The class will be only in-person. Slides will be available after the class on this webpage and the Brightspace. (see Schedule).

  • DS-GA 3001.010 (Lab)
    Thursdays 5:55pm-6:45pm
    Location: 60 Fifth Avenue, New York, NY 10011 (60FA 150, location link)

    The labs will be in-person. Labs will be used to cover additional materials or to work through practical exercises with the TA.

  • Office Hours:
    Jean Ponce (jean.ponce@inria.fr)
    Wednesdays, 2:00pm-3:30pm
    Room No. 500, 60 5th Ave, New York, NY 10011

  • Campuswire Link: Link
    Please try first to post any questions about course logistics and material, HWs and final project on Campuswire. We also highly encourage you to help each other out (but please do not reveal answers). For additional questions, please email course staff.

Instructors

Jean Ponce (jean.ponce@inria.fr)

TAs and Graders

Ayush Jain (aj3152@nyu.edu)
Zuhaib Akhtar (za2023@nyu.edu)

Grading

Four programming assignments (50% of the grade) + final project (40% of the grade) + class participation and attendance (5%) + lab participation and attendance (5%). Homeworks should be submitted using the Brightspace site.

  • Homework 1 on camera calibration. Link Due on Feb. 17.
  • Homework 2 on Canny edge detector. Link Due on Mar. 10.
  • Homework 3 on Epipolar Line Estimation. Link Due on Apr. 14.
  • Homework 4 on Mean Shift Algorithm. Link Due on May 4.
  • Final project: Fill in the course project survey form and submit the project abstract by March 13 on Brightspace. Draft report submissions by April 14. Final presentations will be held during the last lecture (Room 208, Silver Center for Arts and Science, 4:55 PM - 6:35 PM, May 8) and in final exam slot (Room 269, Global Center for Academic and Spiritual Life, 4:00 PM - 5:50 PM, May 12). The project report will be due May 7, 11.59 PM EST.

Participation and Attendance

You are expected to attend and participate in classes and labs in person. Class attendance will count for 5% of your grade and lab attendance will count for 5% of your grade.

Syllabus

  • Introduction
  • Camera geometry and calibration
  • Filtering and feature detection
  • Radiometry and color
  • Texture and image segmentation
  • Stereopsis
  • Structure from motion and 3D models from images
  • Object recognition - historical perspective
  • CNNs for object classification and detection
  • 3D CNNs, Applications in Medical Imaging
  • Weakly-supervised and unsupervised approaches to image and video interpretation

References:

We do not require purchase of any textbooks and the course will be self-contained. You may wish to consult the resources below for additional material formalization.

  • D.A. Forsyth and J. Ponce, “Computer Vision: A Modern Approach”, second edition, Pearson, 2011. (Link)
  • R. Szeliski, “Computer Vision: Algorithms and Applications”. (PDF)
  • R. Hartley and A. Zisserman, “Multiple View Geometry in Computer Vision”, Cambridge University Press, 2004. (Link)
  • M.F. Land and D.E. Nilsson, “Animal Eyes”, Oxford University Press, 2012.
  • O. Faugeras, Q.T. Luong, and T. Papadopoulo, “Geometry of Multiple Images,” MIT Press, 2001.

Schedule:

Note: lecture slides will be posted after each lecture.

Date Lecture Topic Link
01/23 Lecture 1 Introduction to Computer Vision - 1 Slides (PPTX)
01/26 Lab 1 Introduction to Computer Vision - 1.5 Slides (PPTX)
01/30 Lecture 2 Introduction to Computer Vision - 2 Slides (PPTX)
02/02 Lab 2 Class Logistics + HW1 review Slides (PPTX), HW1 (Link)
02/06 Lecture 3 Camera geometry and calibration Slides (PPTX)
02/09 Lab 3 Introduction to Computer Vision - 3.5 Slides (PPTX)
02/13 Lecture 4 Image Filtering Slides (PPTX)
02/16 Lab 4 Canny Edge Detectors and HW2 Review Slides (PPTX)
02/17 Homework 1 DUE HW on Camera Calibration HW1 (Link)
02/20 No class (Presidents’ Day)  
02/23 Lab 5 Radiometry Slides (PPTX)
02/27 Lecture 5 Color and Radiometry Slides (PPTX)
03/02 Lab 6 Color and Radiometry No slides
03/06 Lecture 6 Color Slides (PPTX)
03/02 Lab 7 Radiometry and Reflectance Slides (PPTX)
03/10 Homework 2 DUE HW on Canny Edge Detector HW2 (Link)
03/13 No class (spring break)  
03/13 Course Project DUE Project abstract DUE Submission form (link)
03/20 Lecture 7 Guest Lecture - Visual Learning in the Open World Slides (PPTX)
03/23 Lab 8 NMS and RANSAC Implementation Notebook (PPTX)
03/27 Lecture 8 Stereopsis Slides (PPTX)
03/23 Lab 9 HW3 Review: Epipolar Line Computation Slides (PPTX)
04/03 Lecture 9 Structure from Motion Slides (PPTX)
04/10 Lecture 10 Self-Calibration Slides (PPTX)
04/13 Lab 10 Introduction to computer vision - 11.5 Slides (PPTX)
04/14 Course Project DUE Draft Report DUE Brightspace
04/17 Lecture 11 Introduction to computer vision - 12 Slides (PPTX)
04/20 Lab 11 HW4 Review: Mean Shift Slides (PPTX)
04/24 Lecture 12 Guest Lecture by Albert Bietti on Generative Modeling Slides (PPTX)
04/27 Lab 12 Introduction to computer vision - 13 (Deep Learning) Slides (PPTX)
05/01 Lecture 13 Introduction to computer vision - 14 (Deep Learning) Slides (PPTX)
05/04 Lab 13 Introduction to computer vision - 14.5 Slides (PPTX)
05/07 Course Project Final Project Report DUE Brightspace
05/08 Course Project Project Presentations - I 4:55 PM - 6:35 PM, Room 208, Silver Center for Arts and Science
05/12 Course Project Project Presentations - II 4:00 PM - 5:50 PM, Room 269, Global Center for Academic and Spiritual Life

Acknowledgements

Much of the material for this course relies on the Computer Vision course given at ENS Paris by Mathieu Aubry, Karteek Alahari, Ivan Laptev, and Josef Sivic. Many of the slides are taken from James Hays, Svetlana Lazebnik, and Derek Hoeim. Website was originally designed by Matthew Trager.