PicToSpeech (2024)

PicToSpeech is an Android app designed to help visually impaired users understand English and Turkish texts in images. The app uses Google’s ML Kit for character recognition, TTS (Text-to-Speech) libraries for audio output, and extracts context from text using the Gemini LLM API. Users can take photos of text, and the app reads it aloud in a natural voice.

Features

📷 Image Text Recognition – Extracts text from images using Google ML Kit
🔊 Text-to-Speech – Reads the extracted text aloud in English and Turkish
🤖 Contextual Understanding – Uses Gemini LLM API for better comprehension
🌐 Bilingual Support – English and Turkish texts supported
📥 Offline APK Installation – Users can download and install the APK

Technologies Used

OCR, TTS, Prompt Engineering

Getting Started

Download & Install

The app APK can be downloaded directly from the repository:

Instructions:

Download both .z01 and .zip files.
Extract the files using an archive tool (WinRAR, 7-Zip, etc.).
Install the APK on your Android device (enable “Install from unknown sources” if required).

Build from Source

Clone the repository:

git clone https://github.com/yourusername/PicToSpeech.git

Open the project in Android Studio.
Sync Gradle and build the project.
Run on a physical device or emulator.

Project Status

✅ Completed (2024) – Available for download and installation.

Made with ❤️ using Android Studio, ML Kit, and Gemini API

Do you want me to do that?

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.idea		.idea
app		app
gradle/wrapper		gradle/wrapper
.gitignore		.gitignore
PicToSpeech.z01		PicToSpeech.z01
PicToSpeech.zip		PicToSpeech.zip
README.md		README.md
build.gradle.kts		build.gradle.kts
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
image-outline.svg		image-outline.svg
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PicToSpeech (2024)

Features

Technologies Used

Getting Started

Download & Install

Build from Source

Project Status

About

Uh oh!

Languages

zaker-amin/PicToSpeech

Folders and files

Latest commit

History

Repository files navigation

PicToSpeech (2024)

Features

Technologies Used

Getting Started

Download & Install

Build from Source

Project Status

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages