AI-powered smartglass ecosystem with real-time object detection, face recognition, translation, and AR-assisted interaction.
This monorepo contains the Flutter mobile app, FastAPI backend, and ML inference services powering the S.A.G.E wearable.
S.A.G.E is a software-first, modular smartglass system designed for:
- Real-time object detection
- Real-time face recognition
- Hands-free voice assistant interactions
- OCR + Translation (Google Vision + LibreTranslate)
- AR-like display through a mobile-assisted HUD pipeline
The system offloads heavy AI workloads to a mobile app + hosted backend, keeping the hardware minimal and efficient.
┌──────────────────┐
│ Smartglass (Pi) │
│ - Camera │
│ - Mic/Speaker │
│ - HUD Display │
│ - Lightweight │
│ FastAPI │
└─────────┬────────┘
│
Wi-Fi Local Network
│
┌────────▼────────┐
│ Flutter Mobile │
│ App (UI + I/O) │
│ - Voice Input │
│ - Camera Relay │
│ - TTS/STT │
│ - API Bridge │
└────────┬────────┘
│
REST API Calls
│
┌─────────▼──────────┐
│ Core Backend │
│ FastAPI │
│ - Translation │
│ - OCR │
│ - Gemini LLM │
│ - Orchestration │
└─────────┬──────────┘
│
ML Microservices (FastAPI)
│
┌────────────┼────────────┐
│ │ │
┌──▼───┐ ┌────▼────┐ ┌───────┐
│ Face │ │ Object │ │ Future │
│ Rec │ │ Detect │ │ Models │
└──────┘ └─────────┘ └───────┘
- Flutter UI/UX
- Camera streaming & communication with Pi
- STT/TTS integration
- Device pairing workflow
- Routing backend results to HUD preview
- FastAPI backend
- Google Vision OCR integration
- LibreTranslate pipeline
- Gemini integration
- Orchestration logic
- Redis caching / async tasks
- Dataset preparation & training
- ArcFace/FaceNet embeddings
- Faiss/Annoy nearest-neighbor search
- FastAPI inference server
- ONNX/TorchScript export
- YOLO/EfficientDet training
- Dataset annotation & augmentation
- Fast inference server (FastAPI)
- Model quantization / ONNX export
- Flutter
- HTTP (Dio)
- TTS / STT plugins
- Local WiFi communication
- FastAPI
- Redis
- Gemini API
- Google Vision OCR
- LibreTranslate
- PyTorch / ONNX Runtime
- YOLOv8 / EfficientDet
- ArcFace / FaceNet embeddings
- Faiss / Annoy
- Object Detection
- Facial Recognition
- Translation (OCR + LibreTranslate)
- Gemini Voice Assistant
- Hardware-light, software-heavy
- ML offloaded to backend
- Free/open-source friendly
- Modular microservices
- AR via reflective HUD