Papers by Aniruddha Sinha
2011 IEEE International Conference on Consumer Electronics (ICCE), 2011
ABSTRACT This paper presents a novel method for recognizing the channel logos from the streamed v... more ABSTRACT This paper presents a novel method for recognizing the channel logos from the streamed videos in real time, which has various applications for value added services in the connected TV space. The results presented are based on the accuracy and performance in terms of ...
2009 IEEE International Advance Computing Conference, 2009
ABSTRACT This paper introduces three embedded solutions based on H.264 video codec on TI DaVinci ... more ABSTRACT This paper introduces three embedded solutions based on H.264 video codec on TI DaVinci processor. These solutions are remote video consultation system for the healthcare industry, point to point video chat and place-shifting system for the consumer industry. The main value additions in these solutions are bandwidth efficiencies and at the same time a better video quality as these systems are built on top of the H.264 video codecs.
Character Recognition, 2010
Page 1. Recognition of Characters from Streaming Videos 21 Recogn Tanus X Recognition of Characte... more Page 1. Recognition of Characters from Streaming Videos 21 Recogn Tanus X Recognition of Characters from Streaming Videos Tanushyam Chattopadhyay, Arpan Pal and Aniruddha Sinha Innovation Lab, Kolkata, Tata Consultancy Services Ltd. India 1. Introduction ...
We propose a fast video transcoding technique based on Region-of-Interest (ROI) determination. Th... more We propose a fast video transcoding technique based on Region-of-Interest (ROI) determination. The ROIs are identified using the properties of the Human Visual System (HVS), applied in the compressed domain. We use the edge, motion and spatial ...
We propose a fast, yet simple algorithm to find the region of interest (ROI) from a compressed MP... more We propose a fast, yet simple algorithm to find the region of interest (ROI) from a compressed MPEG video bitstream, with partial decoding. We have used the properties of the human visual system (HVS), applied in the DCT domain, to extract the ROIs. Though a lot of work has been done to obtain ROIs in images based on the HVS, all these systems work in the pixel domain. However, finding the ROIs in the compressed domain has wide applications such as emphasizing perceptually important regions while transrating, defining descriptors for encoded video (MPEG-7), changing image size to adapt to heterogeneous client displays, etc.
We propose a fast, yet simple algorithm to find the Region of Interest (ROD from a compressed MPE... more We propose a fast, yet simple algorithm to find the Region of Interest (ROD from a compressed MPEG video bitstream, with partial decoding. We have used the properties of the Human Visual System (HVS), applied in the DCT domain, to extract the ROIs. Though a lot of welt has ...
Digital Watermarking is an effective and popular technique to discourage illegal copying and dist... more Digital Watermarking is an effective and popular technique to discourage illegal copying and distribution of copyrighted digital image information. The important attributes are the picture quality of the watermarked image (similarity to the original) and robustness to attacks such as cropping. We propose a transform-domain robust digital watermarking technique which uses a pattern-based compression of the watermark image, an intelligent
Digital Watermarking is an effective and popular technique to discourage illegal copying and dist... more Digital Watermarking is an effective and popular technique to discourage illegal copying and distribution of copyrighted digital image information. The important attributes are the picture quality of the watermarked image (similarity to the original) and robustness to attacks such as cropping. We propose a transform-domain robust digital watermarking technique which uses a pattern-based compression of the watermark image, an intelligent

Certain applicat ions (e.g web browsing, email etc.) in Television (TV) demand for text entry by ... more Certain applicat ions (e.g web browsing, email etc.) in Television (TV) demand for text entry by the user in a similar fashion as done in co mputers. User experience p lays a majo r role in the success of such applications. This paper discusses about an on-screen keyboard with hierarchical character or sy mbol organization which is operated by an accompanying remote control to allow navigation with reduced number of key strokes. The keyboard has been designed for Television and set top box users. It enhances user experience by allowing users to type easily and quickly. It also reduces cost by eliminating the need for a separate physical keyboard. This work has adopted iterative design cycles, where users' feedback arecollected and analysed in each cycle, and those have led to user-driven usability imp rovement. As the main contribution in this paper, we present a methodology for evaluating different on-screen keyboard layouts for telev ision and set-top box users. In order to do this we have extended and applied KLM-GOMS model. Lastly we have incorporated predictive text entry technique with the proposed layout. We have extended the KLM-GOM S model further to include a new parameter called dynamic mental operator(DM) which takes into account the additional cognitive load on the users while using predictive text entry techniques.
ABSTRACT This paper introduces three embedded solutions based on H.264 video codec on TI DaVinci ... more ABSTRACT This paper introduces three embedded solutions based on H.264 video codec on TI DaVinci processor. These solutions are remote video consultation system for the healthcare industry, point to point video chat and place-shifting system for the consumer industry. The main value additions in these solutions are bandwidth efficiencies and at the same time a better video quality as these systems are built on top of the H.264 video codecs.
In this paper authors have proposed a system to automatically recognize the Trademarks from sport... more In this paper authors have proposed a system to automatically recognize the Trademarks from sports video for channel hyperlinking in client end. In this method we have used the output of Set Top Box (STB) video stream in YUV 4:2:2 formats as input to our application. In this work we have first localized the text regions using some characteristic of text and then recognized the trademark using the shape invariant features and color features from the restricted trademark database. Experimental results show that the proposed approach can work in real time in any commercially available DSP platform and can mark the trademarks in the video successfully. The system on different type of sports videos gives a recall rate of 86.6% and a precision rate of 85.42%.

ABSTRACT Once the person's identity is established, the most important aspects of ubiquit... more ABSTRACT Once the person's identity is established, the most important aspects of ubiquitous healthcare monitoring of elderly and chronic patients are location, activity, physiological and psychological parameters. Since smartphones have become the most pervasive computing platform today, it is only a logical extension to use the same in healthcare domain for bringing ubiquity. Besides smartphone, skeleton based activity detection and localization using depth sensor like Kinect make ubiquitous monitoring effective without compromising privacy to a large extent. Finally sensing mental condition is made possible by analysis of the subject's social network feed. This paper presents an end-to-end healthcare monitoring system code named UbiHeld (Ubiquitous Healthcare for Elderly) using the techniques mentioned above and an IoT (Internet of Things) based back-end platform.
IEEE Internet Computing, 2011
ABSTRACT This paper presents a novel method for recognizing the channel logos from the streamed v... more ABSTRACT This paper presents a novel method for recognizing the channel logos from the streamed videos in real time, which has various applications for value added services in the connected TV space. The results presented are based on the accuracy and performance in terms of ...
This paper presents the hardware implementation of the algorithm for recognizing the channel logo... more This paper presents the hardware implementation of the algorithm for recognizing the channel logos from the analog videos in real time in a commercially available DSP platform. This solution can be used for different value added services for connected TV like providing EPG on real time, real time TV viewership analysis, user view time analysis. This paper mainly focuses on the issues in optimization of dual core embedded realization of the algorithm that is performing efficiently in an x86 platform. The optimization techniques we have used can save up to 150% of CPU cycles and also the proposed ARM DSP communication made the system an efficient one.

This paper presents a method for adaptive rate control in a video conferencing solution over a va... more This paper presents a method for adaptive rate control in a video conferencing solution over a variable bandwidth channel. The bandwidth is estimated using the round-trip delay of probe packets. Based on the estimated bandwidth, the video rate adaptation is done in two folds; one in the H.264 video encoder based on adaptive basic unit selection and the other by generation of fragments for encoding data and controlling the transmission delay of the same. The audio adaptation is done with the voice activity detection (VAD) of NB-AMR speech codec. System architecture is proposed for a generic video conferencing solution to implement the multimode adaptive rate control for a variable quality of service (QoS) wired channel and a low bandwidth wireless channel. The implementation is tested with ADSL and CDMA 1xRTT channels. The proposed method gives an improvement in image quality (PSNR) compared to the reference H.264 (JM9.5) encoder demonstrating an improved adaptive rate control behavior in the heterogeneous network.
Uploads
Papers by Aniruddha Sinha