Speech to Text – Text to Speech Transcription for Indian Languages
Authors- Vishal Jadhav, Siddhesh Khedkar, Siddhesh Auti, Kushmeet Suri, Ms. Sapna Bhuskute, Dr. Nandini C. Nag
Abstract-– The project aims to develop a comprehensive system for Text-to-Speech (TTS) and Speech-to-Text (STT) conversion, providing users with a platform to seamlessly convert between text and speech formats. By leveraging user-friendly interfaces and powerful backend processing, the system offers multiple functionalities, including language translation, text synthesis, and speech recognition. This project is designed to cater to a wide range of user needs, from accessibility solutions for individuals with visual impairments to content creators looking to generate voiceovers or audiobooks. The system architecture integrates several modules, each tailored for specific functionalities. The TextShift module allows users to input text, select a language, and translate it to another language, while the TalkApp module focuses on converting text to speech using a speech synthesis engine. In parallel, the AudioBook module enables users to upload documents, which are then processed and converted into spoken audio. Furthermore, the BlendMaster module supports video processing, where users can input video files, add captions, and merge them with synthesized speech or translated text.
International Journal of Science, Engineering and Technology