Computer Vision 2025 Spring
Quick links: schedule,
policies,
Ketangpai (assignment submission)
|
Computer vision is a science which uses cameras and computers to "see" and "understand" the world. Research related to computer vision attempts to establish artificial intelligence systems to obtain "information" from image data. The "information" here is defined by Shannon which can be used to help make a "decision". Because perception can be regarded as extracting information from sensory signals, computer vision can be considered as a science that study how to make artificial system "perceive" from images or multi-dimensional data. This year we will focus on an elementary hands-on introduction to neural networks and deep learning. Topics covered will include: linear classifiers; multi-layer neural networks; back-propagation and stochastic gradient descent; convolutional neural networks and their applications to computer vision tasks like object detection and dense image labeling; generative models (generative adversarial networks and diffusion models); sequence models like recurrent networks and transformers; applications of transformers for language and vision; and deep reinforcement learning.
Instructor: QI Lin (qilin -at- ouc.edu.cn)
Lectures: Friday, 10:30AM-12:15PM
412 Learning Complex South Building
Labs: Friday (odd-numbered weeks), 13:30PM-15:20PM
B227 Information South Building
TAs:
Instructor and TA office hours: Friday (odd-numbered weeks), 15:30PM-16:30PM
Contacting the course staff: For emergencies and special circumstances, please email the instructor. For questions about lectures and assignments, use Ketangpai. For questions about your scores (including regrade requests), email the responsible TAs.
|
Prerequisites: Multi-variable calculus, linear algebra, data structures, statistics. No previous exposure to machine learning is required.
Grading scheme:
- Programming assignments: 40% of the grade
- SOLO work
- Will be using Python, PyTorch, PaddlePaddel, MindSpore, Baidu AI Studio, Huawei Cloud, Google Colab, and Google Cloud
- Attendence: 10% of the grade
- Project: 50% of the grade
Be sure to read the course policies!
Schedule (tentative)
Date
| Topic
| Assignments
|
February 28
| Introduction: PPTX, PDF
| Self-study: Python/numpy tutorial
|
March 7
| Linear classifiers: PPTX, PDF
|
|
March 14
| Linear classifiers cont.: PPTX, PDF
| Assignment 1 is out
|
March 21
| Linear classifiers cont.
|
|
March 28
| Multi-layer networks: PPTX, PDF
|
|
April 11
| Backpropagation: PPTX, PDF
|
|
April 25
| Convolutional networks: PPTX, PDF
| Assignment 2 is out
|
Aprial 27
| Convolutional architectures: PPTX, PDF
|
|
May 9
| Dense prediction: PPTX, PDF
|
|
May 16
| Training in detail: PPTX, PDF
|
|
May 23
| PyTorch tutorial: Jupyter notebook
| Assignment 3 is out
|
May 30
| Object detection: PPTX, PDF
|
|
June 6
| Recurrent networks: PPTX, PDF
|
|
June 13
| Transformers: PPTX, PDF
|
|
June 18
| Deep generative models: PPTX, PDF
|
|
June 20
| Deep reinforcement learning: PPTX, PDF
|
|
Resources
Other deep learning courses with useful materials
Tutorials
Useful textbooks available online
|