Skip to content

Android app that helps visually impaired users understand English and Turkish texts in images using OCR, TTS, and Gemini LLM API

Notifications You must be signed in to change notification settings

zaker-amin/PicToSpeech

Repository files navigation

PicToSpeech (2024)

PicToSpeech is an Android app designed to help visually impaired users understand English and Turkish texts in images. The app uses Google’s ML Kit for character recognition, TTS (Text-to-Speech) libraries for audio output, and extracts context from text using the Gemini LLM API. Users can take photos of text, and the app reads it aloud in a natural voice.

Features

  • 📷 Image Text Recognition – Extracts text from images using Google ML Kit
  • 🔊 Text-to-Speech – Reads the extracted text aloud in English and Turkish
  • 🤖 Contextual Understanding – Uses Gemini LLM API for better comprehension
  • 🌐 Bilingual Support – English and Turkish texts supported
  • 📥 Offline APK Installation – Users can download and install the APK

Technologies Used

  • Android Studio
  • Java
  • Google ML Kit
  • Gemini API
  • OCR, TTS, Prompt Engineering

Getting Started

Download & Install

The app APK can be downloaded directly from the repository:

Instructions:

  1. Download both .z01 and .zip files.
  2. Extract the files using an archive tool (WinRAR, 7-Zip, etc.).
  3. Install the APK on your Android device (enable “Install from unknown sources” if required).

Build from Source

  1. Clone the repository:
    git clone https://github.com/yourusername/PicToSpeech.git
    
  2. Open the project in Android Studio.
  3. Sync Gradle and build the project.
  4. Run on a physical device or emulator.

Project Status

✅ Completed (2024) – Available for download and installation.


Made with ❤️ using Android Studio, ML Kit, and Gemini API

Do you want me to do that?

About

Android app that helps visually impaired users understand English and Turkish texts in images using OCR, TTS, and Gemini LLM API

Topics

Resources

Stars

Watchers

Forks

Languages