Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2020, International journal for research in applied science and engineering technology ijraset
https://doi.org/10.22214/ijraset.2020.5320…
8 pages
1 file
Engineering science is widely accustomed to detect the movement of lips. The data generated through visual motion of mouth and corresponding audio are highly correlated. This fact has been exploited for lip reading and for improving speech recognition. A CNN(Convolutional Neural Network) shall detect the movement of lips and judge the words spoken. This trained CNN detects the words that are spoken within the video and displayed within the text format. The CNN also relies on information provided by the context, knowledge of the language, and any residual hearing. The aim is to verify whether the utilization of engineering science methods, namely DNN(Deep Neural Network), could also be an appropriate candidate for solving this problem. within the sensible part, the most target is on presenting the results both in terms of the accuracy of the trained neural network on test data.
Indian Scientific Journal Of Research In Engineering And Management, 2023
2019
Lip reading is a technique to understand words or speech by visual interpretation of face, mouth, and lip movement without the involvement of audio. This task is difficult as people use different dictions and various ways to articulate a speech. This project verifies the use of machine learning by applying deep learning and neural networks to devise an automated lip-reading system. A subset of the dataset was trained on two separate CNN architectures. The trained lip reading models were evaluated based on their accuracy to predict words. The best performing model was implemented in a web application for real-time word prediction.
2020
Previous studies on human-machine interaction have determined that visual information can augment the speech recognition accuracy especially in noisy surroundings. Here we show a model for predicting words from video data without audio. Although already existing models have succeeded in incorporating visual data into speech recognition, all of them contained some or the other deficiency. To overcome this, we have pre-processed the data by using the haar-cascade model [2] to detect and crop around the subject's face in all frames of the video data and then use the sequence of frames as input to the model. Our proposed lipreading classification model is unique in its usage for all ranges of speakers.
International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 2020
In the world of development and advancement, deep learning has made its significant impact in certain tasks in such a way which seemed impossible a few years ago. Deep learning has been able to solve problems which are even complex for machine learning algorithms. The task of lip reading and converting the lip moments to text is been performed by various methods, one of the most successful methods for the following is Lip-net they provide end to end conversion form lip to text. The end to end conversion of lip moments to the words is possible because of availability of huge data and development of different deep learning methods such as Convolution Neural Network and Recurrent Neural Networks. The use of Deep Learning in lip reading is a recent concept and solves upcoming challenges in real-world such as Virtual Reality system, assisted driving systems, sign language recognition, movement recognition, improving hearing aid via Google lens. Various other approaches along with different datasets are explained in the paper.
IJCSNS, 2009
Lip-Reading has been practised over centuries for teaching deaf and dumb to speak and communicate effectively with the other people. In this study, the use of neural networks in lip reading is explored. We convert the video of the subject speaking different words into images and then images are further selected manually for processing. As per the research the horizontal and the vertical distance between the lips varies for each and every word considering the close proximities of similar sounding words. Based on this research we can create the database of commonly used words and our neural network model can form clusters of words based on its intelligent approach. This approach can be associated with various voice recognition softwares and help in increasing their efficiency readily even in a noisy environment, and creating new dimensions for human computer interaction..
SEEC, 2009
Nowadays, Speech recognition is a basic component in several research projects. To understand a speech, hearing is not enough, it is sometimes necessary to see it. Several domains like E-learning, Human-Machine Interaction, etc are concerned only with the use of visual information, whereas Lip-reading provides useful information in speech perception and language understanding, especially when the auditory speech is degraded. Employing neural network techniques to detect the motion of lips have been widely used to improve the performance of speech reading algorithms. Automatic visual feature extraction, however, is difficult for both lip reading and speech recognition. A relatively new type of neural networks, Spatio-Temporal NN, is used for lip-reading compared with other classical methods like Spiking NN, Hidden Markovin Model (HMM) , Time Delayed Neural Network (TDNN) and Ergodic HMM. In this paper for visual lip-reading, the available neural network models and related methods that can be used and discuss the pros and cons of each method.
The Routledge Handbook of Chinese Language and Culture, 2024
A relatively small number of keywords are so important to the study of classical Chinese philosophy that any insight into their etymology and semantic range would amply repay the effort of inquiry. The familiar difficulties of Chinese historical linguistics have impeded the comprehension of these keywords just as they have impeded the comprehension of every other aspect of the language. Philosophical texts in other languages rarely present commensurate hurdles. Most keywords of classical Greek and Roman philosophy, for example, are well understood from a linguistic point of view. Even Sanskrit philosophical terms usually pose fewer linguistic problems than Chinese ones. As research in the history of the Chinese language progresses, however, some keywords are slowly but surely beginning to reveal their mysteries. Decades having passed since the pioneering research by linguists such as Peter A. Boodberg (1979: 26-40) and Mei Tsu-lin (1994), the time is ripe for review. The following aperçu relies primarily on the Old Chinese reconstruction system of William H. Baxter and Laurent Sagart (2014), but many of the relevant phenomena would be discernible in competing systems as well.
Revista de Teoria da História, 2024
Neste texto pretendemos abordar as possíveis contribuições dos estudos críticos da branquitude para a revitalização do cânone racializado/generificado de “intérpretes do Brasil”. Para isso, abordaremos,em um primeiro momento,a relação entre racialização e cânone em um contexto mais amplo para em seguida abordar a discussão a partir do levantamento de coletâneas de intérpretes, evidenciando também algumas possibilidades alternativas ao cânone
Se podrán disponer libremente de los artículos y otros materiales contenidos en la revista solamente en el caso de que se usen con propósito educativo o científico y siempre u cuando sean citados correctamente. Queda expresamente penado por la ley cualquier aprovechamiento comercial. HISPANIA NOVA. Revista de Historia Contemporánea. Número 10 (2012) http://hispanianova.rediris.es DOSSIER De Genocidios, Holocaustos, Exterminios… Sobre los procesos represivos en España durante la Guerra Civil y la Dictadura Julio ARÓSTEGUI, Jorge MARCO y Gutmaro GÓMEZ BRAVO (Coord.) Prácticas genocidas en guerra, represión sistémica y reducación social en posguerra Genocidal practice in war, systematic repression and social reeducation in postwar. Matilde EIROA (Universidad Carlos III) [email protected] HISPANIA NOVA. Revista de Historia Contemporánea. Número 10 (2012) http://hispanianova.rediris.es HISPANIA NOVA http://hispanianova.rediris.es Matilde EIROA Prácticas genocidas en guerra, represión sistémica y reeducación social en posguerra. Título en inglés: Genocidal practice in war, systematic repression and social reeducation in postwar.
Etika Komunikasi di Media Soisal, 2019
VII Jornadas Internacionales de Innovación Universitaria, 2010
En la España Medieval, 2009
The Voice - Sawt Almaghreb, 2024
Rev Med Chile, 2023
Revista de la Universidad del Zulia, 2021
Baikal Research Journal, 2016
Cell Communication and Signaling, 2013
Editorial Jurídica Venezolana, 2024
ABCD. Arquivos Brasileiros de Cirurgia Digestiva (São Paulo)
Phytochemistry Reviews
Journal of the Brazilian Society of Mechanical Sciences and Engineering, 2019
Empresarial, 2018
Applied Mechanics and Materials, 2014