Useful Links: GitHub / LinkedIn / Google Scholar
I am currently a PhD student in the Elmore Family School of Electrical and Computer Engineering at Purdue University, and a Graduate Research Assistant affiliated with the Video Analytics for Daily Living (VADL) lab. Previously, I was a research assistant at the Indian Institute of Science.
PhD in Electrical and Computer Engineering (Communications, Networking, Signal & Image Processing)
- Performed a systematic analysis of face image quality evaluators such as SER-FIQ and SDD-FIQA.
- Leveraged specific quality-estimation methods and datasets to gain a deeper understanding of learning-based quality estimators.
- Employed prevalent distortions in face images to develop superior and more robust face quality estimators.
- Introduced a new, operational evaluation protocol that minimizes the computational complexity of assessing face image quality estimators.
- Devised a task-oriented CU Frame Partitioning procedure for video encoders like HM and VVC.
- Employed lightweight, edge-based neural networks that predict task-dependent frame partitioning to systematically aid the encoder, i.e., for region-specific video encoding.
- Achieved bit-rate savings during transmission and reduced encoding time while preserving the performance of learning-based analytics.
- Unifying the representation of audio and video with a single Neural Field Representation.
- Worked on methods of Model Pruning and Quantization instead of focusing on modality-specific compression.
- Investigated the impact of compression on computer vision tasks such as pedestrian detection and face recognition.
- Assessed compression's influence on task performance in a variety of conditions, including differing light, resolution, camera models (e.g., fisheye), camera streams (such as RGB vs IR), facial skin tones, object dimensions, etc.
- Appraised the impact of various encoders and their configurations on task performance.
- Pre-processing real-world data and eliminating evaluation-scenario bias to create interpretable results
- Creating consistent annotations for fair performance evaluation
- Exploring the performance co-dependence that exists between face detection and recognition tasks
- Using multi-view, multi-modal data to help with end-to-end detection and recognition in different illumination conditions and recording environments
- Encoding neural network features using existing video codecs to see whether this is a better alternative to encoding images/frames
- Working to understand if neural networks can be split effectively such that intermediate features can be encoded and transmitted
- Finding network splits that achieve the best balance between bit-rate savings and task accuracy
- Developing Feature-To-Image mapping and Auto-Encoder architectures for dimensionality reduction of features before encoding
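The split-computing idea above can be sketched with a toy example: run the "head" of a network on the edge device, quantize the intermediate features before transmission, and run the "tail" on the server. A minimal NumPy sketch, in which the two-stage network, its shapes, and the 8-bit quantizer are all illustrative stand-ins rather than the actual models used:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical two-stage "network" (random weights, not a real trained model)
W1 = 0.1 * rng.standard_normal((256, 64))   # head: 256-d input -> 64-d features
W2 = 0.1 * rng.standard_normal((64, 10))    # tail: features -> 10 logits

def head(x):                                # runs on the edge device
    return np.maximum(x @ W1, 0.0)          # linear layer + ReLU

def tail(feats):                            # runs on the server
    return feats @ W2

def quantize_u8(feats):
    """8-bit uniform quantizer for the intermediate features."""
    lo, hi = feats.min(), feats.max()
    q = np.round((feats - lo) / (hi - lo + 1e-12) * 255.0).astype(np.uint8)
    return q, lo, hi

def dequantize_u8(q, lo, hi):
    return q.astype(np.float32) / 255.0 * (hi - lo) + lo

x = rng.standard_normal((1, 256)).astype(np.float32)
feats = head(x)
q, lo, hi = quantize_u8(feats)              # this is what would be encoded/sent
logits_split = tail(dequantize_u8(q, lo, hi))
logits_full = tail(feats)                   # reference: no split, no quantization
```

Because the split point here reduces dimensionality (256 floats in, 64 bytes out), the transmitted payload is far smaller than the raw input while the tail's outputs stay close to the unquantized ones, which is the bit-rate vs. accuracy trade-off the bullets above describe.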
- Developed a Robust-PCA-based saliency predictor that helps with the background-foreground segmentation, identification, and population estimation of fauna in camera-trap images from Senegal
- The RPCA method does not require training, and its performance is comparable to learning-based models like R3-Net
- The saliency predictor is used to detect and track animals, estimate population density and regular animal activity patterns
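The core decomposition behind such a predictor is Robust PCA: split a data matrix into a low-rank part (static background across frames) and a sparse part (moving animals). A minimal sketch of principal component pursuit via an ADMM-style iteration; the stopping criteria and default parameters are illustrative choices, not necessarily those used in the project:

```python
import numpy as np

def rpca(M, lam=None, mu=None, tol=1e-7, max_iter=500):
    """Decompose M ~ L (low-rank) + S (sparse) via principal component pursuit."""
    m, n = M.shape
    lam = lam if lam is not None else 1.0 / np.sqrt(max(m, n))
    mu = mu if mu is not None else m * n / (4.0 * np.abs(M).sum())
    S = np.zeros_like(M)
    Y = np.zeros_like(M)          # dual variable
    norm_M = np.linalg.norm(M)
    for _ in range(max_iter):
        # Singular-value thresholding -> low-rank estimate
        U, sig, Vt = np.linalg.svd(M - S + Y / mu, full_matrices=False)
        L = (U * np.maximum(sig - 1.0 / mu, 0.0)) @ Vt
        # Elementwise soft-thresholding -> sparse estimate
        T = M - L + Y / mu
        S = np.sign(T) * np.maximum(np.abs(T) - lam / mu, 0.0)
        R = M - L - S             # residual of the constraint M = L + S
        Y = Y + mu * R
        if np.linalg.norm(R) < tol * norm_M:
            break
    return L, S

# Toy data: rank-2 "background" plus a 5%-dense sparse "foreground"
rng = np.random.default_rng(0)
L0 = rng.standard_normal((60, 2)) @ rng.standard_normal((2, 60))
S0 = np.zeros((60, 60))
mask = rng.random((60, 60)) < 0.05
S0[mask] = 10.0 * rng.standard_normal(mask.sum())
L_hat, S_hat = rpca(L0 + S0)
```

In the camera-trap setting, each column of `M` would be a vectorized frame; the recovered sparse component `S` then acts as the saliency map for detecting and tracking animals.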
- Worked with Dr. Reshmi Mitra from Southeast Missouri State University on developing a novel framework for detecting DDoS attacks on edge devices using Recurrent Neural Networks
- Achieved SOTA performance on the UNSW 2015 dataset with a minimal model architecture, i.e., one that can run on edge devices
- Worked with Dr. Abhay Sharma and Dr. Raghu Krishnapuram on developing solutions for traffic analytics, involving tasks such as vehicle counting, license-plate detection, speed estimation, and queue-length estimation
- Built a real-time front-end web-server system that serves live video streams in RTMP and HLS formats. Implemented the server with methods for discovery, content sharing, routing, congestion handling, and load balancing
- The entire framework has been deployed in Electronic City, Bangalore, India
- Enabling simpler marker insertion into H.264 video streams; such markers can not only hold additional metadata, such as sensor values corresponding to frames and bounding-box values, but can also help with live-stream synchronisation.
- Investigating whether second-order quasi-Newton methods can be used for practical deep learning applications and, if so, whether they can compete with popular first-order methods like SGD.