====== Computer Vision Methods ====== [[https://intranet.fel.cvut.cz/cz/education/rozvrhy-ng.B232/public/html/predmety/46/84/p4684506.html|Schedule on FEL (CZ course)]] [[https://intranet.fel.cvut.cz/cz/education/rozvrhy-ng.B232/public/html/predmety/46/85/p4685206.html|Schedule on FEL (EN course)]]\\ [[https://cw.felk.cvut.cz/upload/|Upload system]]\\ [[https://cw.felk.cvut.cz/forum/forum-1875.html|Discussion forum]] [[https://cw.felk.cvut.cz/forum/forum-1819.html|Discussion forum archive (2022/2023)]]\\ [[https://cw.fel.cvut.cz/wiki/courses/mpv/labs/start|Labs]] ===== Course Description ===== This course focuses on the following computer vision problems: finding correspondences between images using image features and their robust invariant descriptors, image retrieval, object detection and recognition, and visual tracking. ===== Pre-requisites ===== The course has no formal pre-requisites. However, certain skills and knowledge are assumed, and it is the responsibility of the student to get to the required level. The assignments are implemented in the Python, numpy, [[https://pytorch.org| pytorch]] computing environment, mostly in form of [[https://jupyter.org|jupyter notebooks]], and familiarity with it will help. The programing assignments, involving either implementing, modifying or testing computer vision methods, are a substantial part of the [[https://cw.fel.cvut.cz/wiki/courses/mpv/labs/start|labs]]. Knowledge of the basics of digital image processing as convolution, filtration, intensity transformations, image function interpolations and basic geometric transformations of the image (see the first lab) is assumed. Knowledge of linear algebra and probability theory is needed to understand the presented computer vision methods. ===== Lectures: Monday 9:15 - 10:45, KN:E-107 ===== Lecturers: JM Jiří Matas, JC Jan Čech, DM Dmytro Mishkin, GT Giorgos Tolias, MS Milan Šulc Note: some of the lectures have changed, but the 2021 recordings mostly provide a good idea about the content.\\ Lectures will be streamed on YouTube, live link: https://www.youtube.com/playlist?list=PLQL6z4JeTTQkqF6KkcZZDi2KFwky9SQpq \\ **Recorded lectures** - playlist: https://www.youtube.com/playlist?list=PLQL6z4JeTTQnozHfghnzq3AK-lIBZmQdP \\ **Recorded lectures** - playlist (2023): https://www.youtube.com/playlist?list=PLQL6z4JeTTQneuiXekoB639gEOzuen79_ Changes compared to the last year are //in italics//. ^Week ^ Date ^ Lecturer ^ Slides ^ Topic ^ | 1| 19.2.|JM| [[https://drive.google.com/file/d/1vJte0ibXB-8AZgneiYEwKBWgadxbSDvE/view?usp=share_link| Slides 2023 ( 1st lecture slides 1-65)]], [[https://www.youtube.com/watch?v=-v_Ex2pL7Tc&list=PLQL6z4JeTTQneuiXekoB639gEOzuen79_&index=1|recording 2023]],[[https://drive.google.com/file/d/1MD6oDfQKXkwUu9_8gblDwQ2M8PDsYpYW/view?usp=sharing|recording 2021]]|Correspondences and wide baseline stereo. Motivation and applications. //Intro in the image processing. Perspective pinhole camera model//. Interest point and distinguished regions detection: Moravec detector (corner detection)| | 2| 26.2.|DM| [[https://drive.google.com/file/d/1vJte0ibXB-8AZgneiYEwKBWgadxbSDvE/view?usp=share_link| Slides 2023 ( 2nd lecture slides 65--178)]], [[http://cmp.felk.cvut.cz/~mishkdmy/MPV2022/MPV2021_local_features_day2.pdf| Slides 2022]], [[https://www.youtube.com/watch?v=v5U1d_AklT8&list=PLQL6z4JeTTQneuiXekoB639gEOzuen79_&index=2|recording 2023]] [[https://drive.google.com/file/d/1Xk50BJeC8fywLiOHxNqOTMw4y2pxIrAg/view?usp=sharing|recording 2021]] | Harris operator. //Image gradient//. Laplace operator and its approximation by difference of Gaussians, Hessian detector, affine covariant version. Descriptors of measurement regions: SIFT (scale invariant feature transform), RootSIFT. | | 3| 4.3.|JM|{{ :courses:mpv:2021_ransac.pdf |RANSAC}} [[https://drive.google.com/file/d/1QwY98vUtvSNrkb7LsaDQ9jCQXJJgoa2a/view?usp=sharing|recording 2021]] | RANSAC. | | 4| 11.3.|DM| [[https://drive.google.com/file/d/1vJte0ibXB-8AZgneiYEwKBWgadxbSDvE/view?usp=share_link| Slides 2023 ( 3rd lecture slides: 179 -- end)]] [[http://cmp.felk.cvut.cz/~mishkdmy/MPV2022/MPV2021_local_features_day3.pdf| Slides 2022]], [[https://drive.google.com/file/d/10qGr4Eek6W3w7trElCg45brC2JCtFLqW/view?usp=sharing|recording 2021]] | Matching. Deep learned features: R2D2, //SuperGlue//. | | 5| 18.3. | GT| [[http://ptak.felk.cvut.cz/personal/toliageo/share/mpv/2024/mpv24_retrieval.pdf | Image Retrieval]] [[https://www.youtube.com/watch?v=WoXBbx8TgOU&list=PLQL6z4JeTTQnozHfghnzq3AK-lIBZmQdP&index=5&ab_channel=CVUTFEL|recording]] | task formulation, evaluation metrics, Bag-of-Words, VLAD, spatial verification, special objectives: zoom in/out . | | 6| 25.3. |JC|{{deep_learning_MPV_2024.pdf|Deep learning}}\\ [[https://youtu.be/zO26Ik4v4aA?list=PLQL6z4JeTTQnozHfghnzq3AK-lIBZmQdP|recording 2024]] |A shallow introduction into the deep machine learning. Convolutional Neural Networks, Transformers. Principles, layers, architectures for image recognition. | | 7| 1.4.|^ Easter Monday | | 8| 8.4.|JC|{{deep_learning_2_mpv_2024.pdf|Deep learning II}}\\ [[https://youtu.be/E8JSg1WFFT0?list=PLQL6z4JeTTQnozHfghnzq3AK-lIBZmQdP|recording 2024]] | Deep architectures for object detection and semantic segmentation. Further insights into the deep nets. Foundation models (CLIP, DINO, Segment Anything, Depth Anything). | | 9| 15.4.| GT| [[http://ptak.felk.cvut.cz/personal/toliageo/share/mpv/2024/mpv24_deepmetriclearning.pdf|Deep Metric Learning]] [[https://www.youtube.com/watch?v=vfhCf9ArBu8&list=PLQL6z4JeTTQnozHfghnzq3AK-lIBZmQdP&index=8&ab_channel=CVUTFEL| recording]] | architectures, losses, types of supervision | | 10| 22.4.| GT| [[ http://ptak.felk.cvut.cz/personal/toliageo/share/mpv/2024/mpv24_ssl.pdf | Self-supervised Representation Learning ]] [[https://www.youtube.com/watch?v=Zv6qt8nKAfg&list=PLQL6z4JeTTQnozHfghnzq3AK-lIBZmQdP&index=9&ab_channel=CVUTFEL|recording]] | tasks with self-supervisory signal, auto-encoders, learning via augmentations, contrastive approaches | | 11| 29.4.| JM| {{ :courses:mpv:matas-2018.04-klt-only.pdf |KLT}}, [[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-04-26%2010.56.37%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording 2021]]| Tracking I. Introduction. Kanade-Lucas-Tomasi tracker. | | 12| 6.5.| JM| {{kcf_lecture2016.pdf|KCF Tracking}} {{TLD.pdf|TLD}}, {{Tracking_by_Segmentation.pdf|Tracking_by_Segmentation}}[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-05-03%2011.11.59%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording 2021]]| Tracking II. KCF Kernel Correlation Filter. Long-term Tracking, TLD: Tracking-Learning-Detection, Tracking by Segmentation. Introduction to [[https://cw.fel.cvut.cz/b212/courses/mpv/labs/4b_tracking/start| KCF lab task]].| | 13| 13.5.| JM |{{2016.05_hough-transform.pdf|Hough Transform}}\\ [[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-05-10%2011.05.25%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording 2021]]| Hough transform .| | 14| 20.5.| JC| | Generative modelling for Computer Vision| ===== Evaluation ===== Work during the semester 50%, written part of the exam 40%, oral part of the exam 10%. \\ Note that the points from the labs are reweighted using a "normalization factor" so that they correspond to 50% of your evaluation. That means, the points from the labs that contribute to your exam, are ''(your total number of points from semester including bonus points)/(sum of points available from all non-bonus tasks) * 50''. ===== Exam ===== Examples of exam [[courses:mpv:labs:exam_questions|questions]]. There will be 4-5 similar questions at the written part of the exam. The oral part of the questions takes place after the written part and will focused on discussion of your answers. ===== Literature ===== Lecture slides constitute the main source of study literature in this course. ===== Further Info ===== Further information is available in next sections of this page. We would appreciate your feedback on the contents and organization on the discussion [[https://cw.felk.cvut.cz/forum/forum-1819.html|forum]] of the course. ----- \\ Good luck to all participants of the course. | Lecturers |||| | [[http://cmp.felk.cvut.cz/~matas|{{./jmatas.jpg?120}}]] | [[http://cmp.felk.cvut.cz/~cechj|{{jcech.jpg?120}}]] | [[https://cmp.felk.cvut.cz/~toliageo/index.html|{{./gtolias.jpg?120}}]] | [[http://dmytro.ai/|{{./dmishkin.jpg?120}}]] | | Jiří Matas | Jan Čech | Giorgos Tolias | Dmytro Mishkin | Consultations are possible upon request. /* | [[http://cmp.felk.cvut.cz/~drbohlav/|{{http://cmp.felk.cvut.cz/~drbohlav/ondrej_drbohlav.jpg?120}}]] */