2018, International Journal of Intelligent Systems Technologies and Applications
The handwritten text reader is designed to help the visually impaired listen to an audio read-back of printed and handwritten scanned text. A hand-held page scanner is used to scan the text to be read. The image from the scanner is sent over Bluetooth to the application on the paired Android phone. Tesseract, an open-source optical character recognition (OCR) engine, is used to extract the text from the image, and the extracted text is converted to speech. The Tesseract engine is further trained on handwritten datasets so that it recognises the handwriting of a specific user. In addition to English, the application supports two regional languages: Hindi and Bengali.
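A minimal sketch of how the language selection described above might be expressed, assuming the Tesseract command-line tool rather than the Android integration the paper uses. The codes `eng`, `hin`, and `ben` are Tesseract's standard language data names; the function name, the mapping, and the file name `scan.png` are hypothetical and do not come from the paper.

```python
# Hypothetical mapping from the languages named in the abstract to
# Tesseract language codes (the names of its traineddata files).
LANG_CODES = {"english": "eng", "hindi": "hin", "bengali": "ben"}


def build_tesseract_command(image_path, language="english"):
    """Return an argv list asking the tesseract CLI to print recognised text to stdout."""
    try:
        code = LANG_CODES[language.lower()]
    except KeyError:
        raise ValueError(f"unsupported language: {language}") from None
    # "stdout" as the output base makes tesseract write the text to standard output.
    return ["tesseract", image_path, "stdout", "-l", code]


if __name__ == "__main__":
    # Only builds and prints the command; running it requires tesseract
    # and the matching language data to be installed.
    print(" ".join(build_tesseract_command("scan.png", "Hindi")))
```

The returned list could be passed to `subprocess.run` on a machine where Tesseract is installed; a user-specific handwriting model would need its own trained language data, which this sketch does not cover.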
IRJET, 2021
Text in images is everywhere around us, and we read it in our day-to-day lives: bus names, bus numbers, hotel names, newspapers, and so on. The question is how visually impaired or blind people can recognize this text; they need some assistance to read it. In this research, images are converted into text and the text is converted into audio output, mainly so that people with low vision or blindness can recognize the text. The work spans character recognition, speech recognition, and computer vision. Recognition is carried out using OCR, a Raspberry Pi, MATLAB, and the OpenCV library, together with an API, the eSpeak speech engine, Python, and Java. This paper explains the purpose, implementation, and test results of the device. The project consists of image capture, text localization, and text-to-audio conversion.
International Journal of Engineering Research and Technology (IJERT), 2018
https://www.ijert.org/drushti-a-smart-reader-for-visually-impaired-people https://www.ijert.org/research/drushti-a-smart-reader-for-visually-impaired-people-IJERTCONV6IS15008.pdf According to the World Health Organization (WHO), an estimated 285 million people worldwide are visually impaired, of whom 90% live in developing countries and 45 million are blind. Although there are many existing solutions that help blind individuals read, there is still a need for a portable text reader that is affordable and readily available to the blind community. This project proposes a smart reader for visually challenged people using a Raspberry Pi. The paper addresses the integration of a complete text read-out system designed for the visually challenged. A camera is used for input, and a speaker and an LCD for output. The system consists of a webcam interfaced with a Raspberry Pi, which accepts a page of printed text. The OCR (Optical Character Recognition) package installed on the Raspberry Pi scans the page into a digital document. Once the page is scanned, the text is read out by a text-to-speech (TTS) engine installed on the Raspberry Pi. The output is fed to an audio amplifier before it is read out. Image-to-text and text-to-speech conversion are both performed by the OCR and TTS software installed on the Raspberry Pi. The system has applications in libraries, auditoriums, and offices where instructions and notices are to be read, and it also assists in filling out application forms.
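The capture, OCR, and text-to-speech stages described above can be sketched as a small pipeline. This is an illustrative outline only: the function `read_aloud` and its three parameters are hypothetical names, and the stages are passed in as plain callables so that a real camera, OCR package, and TTS engine (none of which are specified in detail by the abstract) could be substituted.

```python
from typing import Callable


def read_aloud(capture: Callable[[], bytes],
               ocr: Callable[[bytes], str],
               speak: Callable[[str], None]) -> str:
    """Run one capture -> OCR -> speech cycle and return the recognised text."""
    image = capture()            # e.g. a frame grabbed from the webcam
    text = ocr(image).strip()    # image-to-text conversion
    if text:                     # stay silent when nothing was recognised
        speak(text)              # text-to-speech conversion
    return text


if __name__ == "__main__":
    # Stand-in stages for demonstration; real hardware and engines would replace these.
    spoken = []
    result = read_aloud(
        capture=lambda: b"fake-image-bytes",
        ocr=lambda image: "  Notice: the library closes at 5 pm\n",
        speak=spoken.append,
    )
    print(result)
```

Keeping the stages as injected callables is one possible design choice; it lets each stage be tested without a camera or speaker attached.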
2018 2nd International Conference on Inventive Systems and Control (ICISC), 2018
A large number of people in India are visually impaired or blind. This gives rise to the need for devices that could bring them relief. This paper aims to study image recognition technology combined with speech synthesis and to develop a cost-effective, user-friendly image-to-speech conversion system with the help of MATLAB. The system has a small built-in camera that scans text printed on paper and converts it to audio using a synthesized voice (text-to-speech, TTS), quickly reading out books, documents, and other materials for daily living, especially away from home or the office. A finger-tracking-based virtual mouse application has also been designed and implemented using a regular webcam. The system not only saves time and energy but also makes life better for the visually impaired by increasing their independence.
2015
A majority of the visually impaired use Braille to read documents and books, which are difficult to produce and not readily available. This gives rise to the need for devices that could ease the laborious tasks the visually impaired have to go through. With the digitization of books there have been many excellent attempts at building robust document analysis systems in industry, academia, and research labs, but these serve only those who are able to see. This project aims to study image recognition technology combined with speech synthesis and to develop a cost-effective, user-friendly image-to-speech conversion system with the help of a Raspberry Pi. The device has a small built-in camera that scans text printed on paper and converts it to audio using a synthesized voice, quickly reading out books, documents, and other materials for daily living, especially away from home or the office. This not only saves time and energy but also makes life better for the visually impaired by increasing their independence.
Journal of emerging technologies and innovative research, 2019
Communication plays a major role in human life, and vision is of the utmost importance in making communication complete. A person needs vision to access information from text or an image, whereas visually impaired people gather information from voice only. In many situations, accessing information is very difficult for them because of their disability. This paper implements an idea for image-to-speech conversion with the help of a Raspberry Pi, a webcam, and a speaker or earphone. The proposed idea uses Tesseract OCR (Optical Character Recognition) to produce speech from an image captured by the camera. This helps visually impaired people access the text in printed materials such as books, as well as in handwritten notes. The implemented prototype is portable, since a power bank is used for battery backup, and it gives speech output in multiple languages with the help of a Speech API and Microsoft Translator. This idea can help milli...
International Journal of Research, 2018
Real-time hardware implementations of text-to-speech and speech-to-text conversion systems now play a crucial role in several real-time applications, such as reading aids for blind people, talking aids for the vocally handicapped, and robotics. This paper describes the design and implementation of a system that converts text in an image to speech and converts speech given by the user into text. The Raspberry Pi has been chosen as the hardware platform for the proposed method. To implement the proposed system, a Logitech C170 camera module and an HC-05 Bluetooth module were interfaced to the Raspberry Pi. The project uses Tesseract OCR (Optical Character Recognition), the eSpeak TTS (Text to Speech) engine, and the AMR (Android Meets Robots) voice-to-text application. The code for the proposed system is written in the Python programming language. The prop...
Visually impaired people face many challenges in day-to-day life, and among these, reading printed text is often not well understood. This work is based on a prototype that helps the user listen to the contents of text images in English. Blind people have methods such as Braille, which was introduced as a way of reading embossed text, but it has some limitations. The main idea is to develop a system that reads text aloud for blind people, so that they can read the text in an image without human help. The image is captured by a camera module connected to the Raspberry Pi. Different operations are performed on the captured image using OpenCV. Text is then extracted from the image using OCR, and that text is converted into speech.
The biggest challenge faced by blind people is their inability to view real-life objects and to read. The only efficient system that exists so far to enable the blind to read is Braille, but it is time-consuming, and recognizing the text takes a long time. Our aim is to reduce the time taken to read. In this work, we have designed a smart reader based on a Raspberry Pi so that blind people may read. The module uses either a webcam or a mobile camera linked to the Raspberry Pi to focus on a range of printed text. The OCR (Optical Character Recognition) package installed on the Raspberry Pi scans the text into a digital document, which is then subjected to skew correction and segmentation before feature extraction and classification. The proposed system automatically focuses on the regions of text in the object, after which the text characters are localized using a localization algorithm based on feature recognition and edge pixel distribution with an artificial neural network. The text characters are then binarized and converted into machine-readable form using the Tesseract OCR engine. The recognized characters are finally converted into audio so that blind users can understand them.
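To illustrate what binarizing text pixels can look like in practice, here is a small sketch using Otsu's global threshold on 8-bit grayscale values. Otsu's method is a common choice swapped in for illustration; the abstract does not say which binarization method the authors or Tesseract use, and the function names and sample pixel values below are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the 8-bit threshold that maximises between-class variance."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(value * count for value, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_dark = 0     # number of pixels at or below the candidate threshold
    sum_dark = 0   # sum of their intensities
    for t in range(256):
        w_dark += hist[t]
        if w_dark == 0:
            continue
        w_light = total - w_dark
        if w_light == 0:
            break
        sum_dark += t * hist[t]
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance, up to a constant factor.
        var = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(rows):
    """Map a grayscale image (list of rows) to 1 for dark ink, 0 for background."""
    flat = [p for row in rows for p in row]
    t = otsu_threshold(flat)
    return [[1 if p <= t else 0 for p in row] for row in rows]


if __name__ == "__main__":
    # A tiny made-up image: a dark vertical stroke on a light page.
    page = [[250, 20, 245],
            [240, 30, 255]]
    print(binarize(page))
```

A global threshold like this assumes fairly even lighting; camera images with shadows often need an adaptive (local) threshold instead.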
IRJET, 2020
This paper describes AMBERT, an innovative, beneficial, and cost-effective voice-controlled smart reader for visually impaired people. Visual impairment is a major problem faced by humanity. People with complete or partial blindness are unable to read written or printed content on their own. Optical Character Recognition (OCR) is a popular and widely used technique for extracting and recognizing characters from an image. AMBERT combines the benefits of OCR with a voice command system, which makes it efficient and effective for use by the visually impaired. The title AMBERT is a German name meaning bright, shining light, signifying that it will bring light into the lives of the visually impaired. AMBERT is designed to assist visually impaired people by reading books, articles, newspapers, magazines, etc. to them. It is a hardware model based on the Raspberry Pi version 2, with a 1.2 GHz processor and 1 GB of RAM, that uses OCR software, the OpenCV (Open Source Computer Vision) library, a web camera, and other components.
International Journal of Engineering Research and Technology (IJERT), 2019
https://www.ijert.org/an-assistive-reading-system-for-visually-impaired-and-blinds-using-ocr-and-tts-techniques-on-labview https://www.ijert.org/research/an-assistive-reading-system-for-visually-impaired-and-blinds-using-ocr-and-tts-techniques-on-labview-IJERTCONV7IS10073.pdf Extracting knowledge just by listening to sounds is a distinctive ability. Although text is a medium of communication, speech is a more powerful means of communication than text. Optical character recognition has become one of the most successful technologies in the field of pattern recognition and artificial intelligence. To improve access to textual information, an assistive system is used that reads text from handwritten and scanned documents and converts it to speech. The speech signals produced can be saved and reproduced for later use. The main objective of this paper is to develop a cost-effective and user-friendly speech synthesis system based on optical character recognition. The paper integrates text recognition and a speech synthesizer using the Laboratory Virtual Instrument Engineering Workbench (LabVIEW, 2017 version).