Raspberry-Pi & OCR-Based Image-Text to Speech Conversion

Authors

  • Shivkanya V. Dahiphale
  • S. J. Nandedkar

Keywords

Image-text-to-speech conversion, OpenCV, Optical Character Recognition (OCR), Raspberry Pi, Text-To-Speech (TTS), Text takeout, Text translator

Abstract

For people with visual impairments, converting the text in an image to speech is an essential tool that helps them navigate the world more independently. This research explores an image-text-to-speech system designed for visually impaired users. The proposed system captures an image, recognizes any text it contains, and reads that text aloud. The hardware consists of a Raspberry Pi 3 B, an 8MP Raspberry Pi camera, and an earphone; the software is written in Python using the OpenCV and Pytesseract libraries. The system was tested on a variety of images. The proposed method shows promise for future development and has the potential to improve quality of life for people who are blind or visually impaired by giving them greater access to information and their environment.
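
The pipeline described in the abstract (image in, recognized text out, text spoken aloud) might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the image is read from a file rather than from the Pi camera, Otsu thresholding is an assumed preprocessing step, and the `espeak` command-line synthesizer is an assumed TTS engine, since the abstract does not name one.

```python
import re
import subprocess


def clean_ocr_text(raw: str) -> str:
    """Collapse runs of whitespace (including the form feed that
    Tesseract output often ends with) into single spaces."""
    return re.sub(r"\s+", " ", raw).strip()


def read_image_aloud(image_path: str) -> str:
    """Recognize the text in an image file, speak it, and return it."""
    # Imported lazily so clean_ocr_text works without the OCR stack installed.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)

    # Assumed preprocessing: grayscale, then Otsu binarization.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)

    text = clean_ocr_text(pytesseract.image_to_string(binary))
    if text:
        # Assumed TTS engine: espeak, fed through stdin so that text
        # starting with "-" is not mistaken for a command-line option.
        subprocess.run(["espeak", "--stdin"], input=text, text=True, check=True)
    return text
```

Calling `read_image_aloud("sign.jpg")` (a hypothetical file name) on a Raspberry Pi with Tesseract and espeak installed would speak the recognized text through the connected earphone.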

Published

2024-10-14

How to Cite

Shivkanya V. Dahiphale, & S. J. Nandedkar. (2024). Raspberry-Pi & OCR-Based Image-Text to Speech Conversion. Journal of Electronics and Telecommunication System Engineering, 30–42. Retrieved from https://www.matjournals.net/engineering/index.php/JoETSE/article/view/1013