Skip to content

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

License

Notifications You must be signed in to change notification settings

virtualphoton/deep_vision_and_graphics

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

87 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deep Vision and Graphics

This repo supplements course "Deep Vision and Graphics" taught at YSDA @fall'22. The course is the successor of "Deep Learning" course taught at YSDA in 2015-2021. New course focuses more on applications of deep learning for computer vision.

Lecture and seminar materials for each week are in ./week* folders. Homeworks are in ./homework* folders.

General info

  • Telegram chat room (russian).
  • YSDA deadlines & admin stuff can be found at the YSDA LMS (ysda students only).
  • Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue

Syllabus

  • week01 Intro, recap of Neural network basics, optimization, backprop, biological networks
  • week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
  • week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
  • week04 Dense prediction: semantic segmentation, superresolution/image synthesis, perceptual losses
  • week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
  • week06 Visualizing and understanding deep architectures, adversarial examples
  • week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
  • week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
  • week09 Generative adversarial networks
  • week10 Latent models (GLO, AEs, VQ-VAE, generative transformers)
  • week11 Flow models, diffusion models, generative transformers, CLIP, DALL-E
  • week12 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
  • week13 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching performed by

About

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.5%
  • Python 1.4%
  • Other 0.1%