Aniruddha Sinha

IIT Kharagpur, Department of Electronics & Electrical Communication Engineering, Senior scientist

Papers by Aniruddha Sinha

Recognition of channel logos from streamed videos for value added services in connected TV

by Arpan Pal and Aniruddha Sinha

2011 IEEE International Conference on Consumer Electronics (ICCE), 2011

ABSTRACT This paper presents a novel method for recognizing the channel logos from the streamed v... more

Bandwidth Efficient Advanced Multimedia Applications for Home entertainment based on H.264 CODEC

by Aniruddha Sinha and Arpan Pal

2009 IEEE International Advance Computing Conference, 2009

ABSTRACT This paper introduces three embedded solutions based on the H.264 video codec on the TI DaVinci processor. These solutions are a remote video consultation system for the healthcare industry, and a point-to-point video chat and a place-shifting system for the consumer industry. The main value additions in these solutions are bandwidth efficiency and, at the same time, better video quality, as these systems are built on top of the H.264 video codec.

Recognition of Characters from Streaming Videos

by Arpan Pal and Aniruddha Sinha

Character Recognition, 2010

Logo Recognition

by Arpan Pal and Aniruddha Sinha

Region-of-interest based compressed domain video transcoding scheme

We propose a fast video transcoding technique based on Region-of-Interest (ROI) determination. Th... more

A fast algorithm to find the region-of-interest in the compressed MPEG domain

We propose a fast, yet simple algorithm to find the region of interest (ROI) from a compressed MPEG video bitstream, with partial decoding. We have used the properties of the human visual system (HVS), applied in the DCT domain, to extract the ROIs. Though a lot of work has been done to obtain ROIs in images based on the HVS, all these systems work in the pixel domain. However, finding the ROIs in the compressed domain has wide applications such as emphasizing perceptually important regions while transrating, defining descriptors for encoded video (MPEG-7), changing image size to adapt to heterogeneous client displays, etc.

Pattern based robust digital watermarking scheme for images

Digital watermarking is an effective and popular technique to discourage illegal copying and distribution of copyrighted digital image information. The important attributes are the picture quality of the watermarked image (similarity to the original) and robustness to attacks such as cropping. We propose a transform-domain robust digital watermarking technique which uses a pattern-based compression of the watermark image, an intelligent

SYSTEM FOR AN SMS VALUE-ADDED SERVICE FOR ACTIVE TELEVISION PROGRAMS RECEIVED VIA A SET-TOP BOX

A robust heart rate detection using smart-phone video

An Iterative Methodology to Improve TV Onscreen Keyboard Layout Design through Evaluation of User Studies

Certain applications (e.g., web browsing and email) on television (TV) require text entry by the user, much as on computers. User experience plays a major role in the success of such applications. This paper discusses an on-screen keyboard with hierarchical character or symbol organization, operated by an accompanying remote control to allow navigation with a reduced number of keystrokes. The keyboard has been designed for television and set-top box users. It enhances user experience by allowing users to type easily and quickly. It also reduces cost by eliminating the need for a separate physical keyboard. This work has adopted iterative design cycles, in which users' feedback is collected and analysed in each cycle, leading to user-driven usability improvement. As the main contribution of this paper, we present a methodology for evaluating different on-screen keyboard layouts for television and set-top box users. To do this, we have extended and applied the KLM-GOMS model. Lastly, we have incorporated a predictive text entry technique into the proposed layout. We have extended the KLM-GOMS model further to include a new parameter called the dynamic mental operator (DM), which takes into account the additional cognitive load on users while using predictive text entry techniques.

Low Computational Approach for Road Condition Monitoring Using Smartphones

Recognition of trademarks from sports videos for channel hyperlinking in consumer end

In this paper, the authors propose a system to automatically recognize trademarks in sports videos for channel hyperlinking at the client end. The method takes the Set Top Box (STB) video stream output in YUV 4:2:2 format as input. Text regions are first localized using characteristics of text, and the trademark is then recognized from a restricted trademark database using shape-invariant features and color features. Experimental results show that the proposed approach can work in real time on any commercially available DSP platform and can mark the trademarks in the video successfully. On different types of sports videos, the system gives a recall rate of 86.6% and a precision rate of 85.42%.

UbiHeld: ubiquitous healthcare monitoring system for elderly and chronic patients

ABSTRACT Once the person's identity is established, the most important aspects of ubiquitous healthcare monitoring of elderly and chronic patients are location, activity, and physiological and psychological parameters. Since smartphones have become the most pervasive computing platform today, it is a logical extension to use them in the healthcare domain to bring ubiquity. Besides the smartphone, skeleton-based activity detection and localization using a depth sensor such as Kinect make ubiquitous monitoring effective without compromising privacy to a large extent. Finally, sensing mental condition is made possible by analysis of the subject's social network feed. This paper presents an end-to-end healthcare monitoring system, code-named UbiHeld (Ubiquitous Healthcare for Elderly), using the techniques mentioned above and an IoT (Internet of Things) based back-end platform.

Recognition of channel logos from streamed videos for value added services in connected TV

IEEE Internet Computing, 2011

ABSTRACT This paper presents a novel method for recognizing the channel logos from the streamed v... more

Recognition of channel logo from analog video: An embedded realization

This paper presents the hardware implementation of an algorithm for recognizing channel logos from analog video in real time on a commercially available DSP platform. This solution can be used for different value-added services for connected TV, such as providing an EPG in real time, real-time TV viewership analysis, and user view-time analysis. This paper mainly focuses on the issues in optimizing a dual-core embedded realization of an algorithm that performs efficiently on an x86 platform. The optimization techniques we have used can save up to 150% of CPU cycles, and the proposed ARM-DSP communication also makes the system efficient.

Adaptive rate control for H.264 based video conferencing over a low bandwidth wired and wireless channel

This paper presents a method for adaptive rate control in a video conferencing solution over a variable bandwidth channel. The bandwidth is estimated using the round-trip delay of probe packets. Based on the estimated bandwidth, the video rate adaptation is done in two ways: one in the H.264 video encoder, based on adaptive basic unit selection, and the other by generating fragments for encoding data and controlling their transmission delay. The audio adaptation is done with the voice activity detection (VAD) of the NB-AMR speech codec. A system architecture is proposed for a generic video conferencing solution to implement the multimode adaptive rate control for a variable quality of service (QoS) wired channel and a low bandwidth wireless channel. The implementation is tested with ADSL and CDMA 1xRTT channels. The proposed method gives an improvement in image quality (PSNR) compared to the reference H.264 (JM9.5) encoder, demonstrating improved adaptive rate control behavior in the heterogeneous network.
