Computer Vision 2025 Spring
Quick links: schedule,
policies,
Ketangpai (assignment submission)
|
Computer vision is a science which uses cameras and computers to "see" and "understand" the
world. Research related to computer vision attempts to establish artificial intelligence
systems to obtain "information" from image data. The "information" here is defined by
Shannon which can be used to help make a "decision". Because perception can be regarded as
extracting information from sensory signals, computer vision can be considered as a science
that study how to make artificial system "perceive" from images or multi-dimensional data.
This year we will focus on an elementary hands-on introduction to neural networks and deep
learning. Topics covered will include: linear classifiers; multi-layer neural networks;
back-propagation and stochastic gradient descent; convolutional neural networks and their
applications to computer vision tasks like object detection and dense image labeling;
generative models (generative adversarial networks and diffusion models); sequence models
like recurrent networks and transformers; applications of transformers for language and
vision; and deep reinforcement learning.
Instructor: QI Lin (qilin -at-
ouc.edu.cn)
Lectures: Friday, 10:30AM-12:15PM
412 Learning Complex South Building
Labs: Friday (odd-numbered weeks), 13:30PM-15:20PM
B227 Information South Building
TAs:
Instructor and TA office hours: Friday (odd-numbered weeks), 15:30PM-16:30PM
Contacting the course staff: For emergencies and special circumstances, please
email the instructor. For questions about lectures and assignments, use Ketangpai. For
questions about your scores (including regrade requests), email the responsible TAs.
|
Prerequisites: Multi-variable calculus, linear algebra, data structures, statistics. No
previous exposure to machine learning is required.
Grading scheme:
- Programming assignments: 40% of the grade
- SOLO work
- Will be using Python, PyTorch, PaddlePaddel, MindSpore, Baidu AI Studio, Huawei Cloud,
Google Colab, and Google Cloud
- Attendence: 10% of the grade
- Project: 50% of the grade
Be sure to read the course policies!
Schedule (tentative)
| Date
| Topic
| Assignments
|
| February 28
| Introduction: PPTX, PDF
| Self-study: Python/numpy
tutorial
|
| March 7
| Linear classifiers: PPTX, PDF
|
|
| March 14
| Linear classifiers cont.: PPTX, PDF
| Assignment 1 is out
|
| March 21
| Linear classifiers cont.
|
|
| March 28
| Multi-layer networks: PPTX, PDF
|
|
| April 11
| Backpropagation: PPTX, PDF
|
|
| April 25
| Convolutional networks: PPTX, PDF
| Assignment 2 is out
|
| Aprial 27
| Convolutional architectures: PPTX,
PDF
|
|
| May 9
| Dense prediction: PPTX, PDF
|
|
| May 16
| Training in detail: PPTX, PDF
|
|
| May 23
| PyTorch tutorial: Jupyter notebook
| Assignment 3 is out
|
| May 30
| Object detection: PPTX, PDF
|
|
| June 6
| Recurrent networks: PPTX, PDF
|
|
| June 13
| Transformers: PPTX, PDF
|
|
| June 18
| Deep generative models: PPTX, PDF
|
|
| June 20
| Deep reinforcement learning: PPTX, PDF
|
|
Resources
Other deep learning courses with useful materials
Tutorials
Useful textbooks available online
|