Indian Institute of Information Technology, Allahabad

Computer Vision and Biometrics Lab (CVBL)

Visual Recognition

July-Dec 2023 Semester


Previous Offerings


Course Information

Objective of the course: Visual recognition has become part of our lives, with applications in self-driving cars, satellite monitoring, surveillance, and video analytics, particularly scene understanding, crowd behaviour analysis, and action recognition. It eases human lives by acquiring, processing, analyzing, and understanding digital images, and by extracting high-dimensional data from the real world to produce numerical or symbolic information. Visual recognition encompasses image classification, localization, and detection. This course will help students understand the new tools, techniques, and methods that are influencing the field.

Outcome of the course: At the end of this course, students will be able to apply the concepts to solve real problems in recognition. They will be able to use computational visual recognition for problems ranging from extracting features and classifying images to detecting and outlining objects and activities in an image or video, using machine learning and deep learning concepts. Students will also be able to invent new methods in visual recognition for various applications.



Class meets
Thursday: 09.00 AM - 11.00 AM, Thursday: 03.00 PM - 05.00 PM

Course Ethics
  • Students are strictly advised to avoid unethical practices in the course, including in review tests and practice components.
  • The project component will be done in teams. The course instructors will form the teams and allot the projects.
  • Students are not allowed to claim existing solutions available in the public domain as their own work in this course.
  • If you have already done a project similar to the one allotted to you, in any other course or with any other faculty member, inform us immediately; repeating a project you have done previously is not allowed in this course.
  • It is best to try to solve problems on your own, since problem solving is an important component of the course.
  • You are not allowed to do or continue the same project in any other course or with any other faculty member.
  • You are allowed to discuss class material, problems, and general solution strategies with your classmates. However, when it comes to formulating or writing solutions, you must work and implement them by yourself.
  • You may use free and publicly available sources, such as books, journal and conference publications, and web pages, as research material for your answers. (You will not lose marks for using external sources.) This does not mean that you may claim these existing resources as your own work.
  • You may not use any paid service and you must clearly and explicitly cite all outside sources and materials that you made use of.
  • Students are not allowed to post the code, report, or any other material from the course project in the public domain, or share it with anyone else, without written permission from the course instructors.
  • We consider the use of uncited external sources as portraying someone else's work as your own, and as such it is a violation of the Institute's policies on academic dishonesty.
  • Such instances will be dealt with harshly and will typically result in a failing course grade.

Schedule

Schedule Topic Resources
L01:Course Introduction Slide
L02:Local Features: What, Why and How Slide
L03:Corner Detection Slide
L04:Harris Detector and Invariance Property Slide
L05:Blob and Region Detection Slide
L06:Region Descriptors Slide
L07:Local Descriptors Slide
L08:Image Categorization Slide
L09:Image Classifiers Slide
L10:Neural Networks Slide
L11:Convolutional Neural Networks Slide
L12:CNN Training 1 Slide
L13:CNN Training 2 Slide
L14:CNN Architectures 1 Slide
L15:CNN Architectures 2 Slide
L16:Object Detection Slide
L17:Semantic Segmentation Slide
L18:Adversarial Attack Slide
L19:Generative Models Slide
L20:Transformer Models Slide
L21:Video Recognition Slide

Computational Projects Added to Teaching Laboratories

Project ID Team Project Title Abstract
VLR23-P01 IIT2020011 ANKIT KUMAR Image Super-resolution
VLR23-P02 IIB2020008 SAMRIDDHI V WALIA, IIB2020014 MOHAN LAL AGARWALA, IIB2020502 ANIRUDDH SHARMA, IIT2020166 SHANTANU CHAUDHARY Human Counting in Crowded Scenario using DETR
VLR23-P03 MML2022001 RUPESH G, MML2022004 RAJ AHAMED SHAIK, MML2022016 ASHUTOSH VERMA Image Dehazing
VLR23-P04 IIT2020018 BOTTE SHREYA, IIT2020040 KATAM BALA PRASANNA BABU, IIT2020199 VELPULA VAMSHI, IIT2020217 VELAGANA NAGENDRA, IIT2020255 DONTHOJU RAGHAVA Cross Day-Night Image Classification
VLR23-P05 IIT2020173 ANISH JAIN, IIT2020181 JINIYA SINGAL, IIT2020182 DABERAO AKSHAY GAJANAN, IIT2020185 PATEL SAURABH, IIT2020188 SOLANKI TANMAY MOHANBHAI Hand Gesture Recognition
VLR23-P06 IIT2020031 RAUNAK KRISHAN JAISWAL, IIT2020033 ADITYA BISWAKARMA, IIT2020055 SAURABH KUMAR, IIT2020106 NEEL PATEL, IIT2020243 AKULA ABHIRAM Facial Micro-Expression Recognition using 3D CNNs
VLR23-P07 IIT2020005 PUSHKAL MADAAN, IIT2020006 RITEJ DHAMALA, IIT2020008 AVISHKAR SINGH, IIT2020077 ANUSHKA AJIT DANDAWATE, IIT2020252 KAVITA Self-Supervised Image Retrieval
VLR23-P08 MML2022002 HARSH, MML2022011 UMESH MAURYA Tiny Face Detection
VLR23-P09 IIB2020030 MANISH KUMAR, IIT2020021 HARSHITA VYAS, IIT2020037 SAKSHI, IIT2020095 AMBIKESH ARMAN, IIT2020134 SHAH KRISHNA DINESHKUMAR Image Denoising using Image-to-Image Translation
VLR23-P10 IIB2020036 MIRIYALA POOJITHA, IIT2020144 PRANAV RAJ, IIT2020151 SHIVAM KATIYAR, IIT2020163 SARTHAK DALMIA, IIT2020205 ADITYA RAJ Drowsiness Detection using Faces
VLR23-P11 IIT2020227 MOHD WASIF, IIT2020242 MOHD SARFARAZ, IIT2020247 SANJAY RAM, IIT2020254 CHAUDHARI YOGIRAJ PRAKASH, IIT2020259 ANKIT KUMAR Photo ID Retrieval from Arbitrary Face Query
VLR23-P12 IIB2020016 ANURAG HARSH, IIB2020018 ABHISHEK KUMAR, IIB2020024 VAIDIK SHARMA, IIB2020027 AMAN UTKARSH, IIT2020140 AYUSHI Image Caption Generation
VLR23-P13 IIT2020052 SANJEET, IIT2020053 SAMEER AHMED, IIT2020082 HARSH GARG, IIT2020218 JITU RAJAK, IIT2020244 RAHUL Selfie vs Non-selfie Classification
VLR23-P14 MML2022009 MANISH KUMAR, MML2022013 BHAVESH KUMAR BOHARA, MML2022014 KAVATHIYA KHYATI HARESHBHAI Image Deraining
VLR23-P15 IIT2020158 S ANURAG REDDY, IIT2020164 SAVALA DEEPIKA, IIT2020213 ANKADALA JEEVAN, IIT2020250 PULUKURI JAGADEESH, IIT2020266 NENAVATH ABHIRAM NAIK Homography Matrix Computation between Images using Deep Learning
VLR23-P16 IIT2020044 PRIYA DEVI, IIT2020060 PERISETLA SRI SATWIK, IIT2020065 DASA AKSHITHA, IIT2020196 KALYANI BHUSHAN PHARKANDEKAR, IIT2020208 MARPINA SRUJANA Face Recognition from Partial Faces
VLR23-P17 MHC2022001 AMIT ROY, MHC2022011 BHARGAV BURMAN, MHC2022013 HARSHIT GUPTA, MML2022003 DIPANKAR KARMAKAR Plant Disease Classification
VLR23-P18 MHC2022005 DASAROJU JAGANNADHACHARI, MML2022005 PRAGATI, MML2022007 SAYANTAN CHAKRABORTY, MRM2022006 BEHERA JYOTHIKRISHNA Image Inpainting Using GAN
VLR23-P19 IIT2020025 MANPREET SINGH, IIT2020032 KARTIK GUPTA, IIT2020219 Tanu Shree Suthar, IIT2020221 TUSHAR AGGARWAL Visual Grounding using CNNs
VLR23-P20 IIT2020009 AASHISH AGRAWAL, IIT2020010 RAJ CHHARI, IIT2020183 LOKESH MEHTA, IIT2020209 AADITYA RATHOD, IIT2020505 AKSHAT GHARIYA Viewpoint Invariant Scene Recognition of IIITA Campus using Deep Learning
VLR23-P21 IIB2020021 GAGAN BANSAL, MML2022006 MOHD FAIZ ANSARI, MML2022008 RAKSHIT SANDILYA, MML2022010 NIKHIL RAJPUT, MML2022012 HIMANSHU MITTAL Thermal to Visible Image Translation
VLR23-P22 IIT2020154 SHIVEK PAMNANI, IIT2020160 ANUSHKA ARUN KALWALE, IIT2020179 KARUS MANISHA, IIT2020189 ROUNAK DEV, IIT2020190 MALYALA MEGHAMSH Clothing Outfit Rating using CNNs
VLR23-P23 MRM2022002 AKASH TYAGI, MRM2022003 ANKIT RAJ RAVI, MRM2022004 ADITYA, MRM2022005 HIMANSHU MISHRA Impact of Different Activation Functions on ViT Model
VLR23-P24 IIT2020007 SHUBHAM KUMAR BHOKTA, IIT2020022 RAHUL MAHTO, IIT2020024 SHASHIKANT THAKUR, IIT2020043 ROHIT CHOWDHURY, IIT2020220 MOHIT KUMAR Identification of Artificially Generated Images
VLR23-P25 IIT2020067 ADITYA SINGH, IIT2020070 EKAGRA SINHA, IIT2020089 DEVESH KUMAR PARTE, IIT2020101 LUKESH NITIN PATIL, IIT2020105 JAMBHULE SAHAS DEVIDAS Student Counting in Classroom
VLR23-P26 RSI2022502 AJAY KUMAR YADAV Analysis of Robustness in Deep Learning Models
VLR23-P27 RSI2023001 AKASH VERMA Natural Disguise Detection

Grading

Prerequisites

Books

Disclaimer

The content (text, images, and graphics) used in these slides is adapted from many sources for academic purposes. Broadly, the sources have been given due credit. However, some original primary sources may have been missed. The authors of this material do not claim any copyright over such material.