tarifin is a full-stack LLM-powered recipe assistant that lets users request recipes by voice and receive personalized, culturally diverse, and health-conscious suggestions in real time, as both text and speech.
The system comprises a fine-tuned Nous Hermes 2 - Mistral 7B model, a streaming Flask API backend, and a Flutter mobile client supporting voice input/output (STT/TTS).
This is the home screen showing the list of saved or past recipe conversations. Users can tap on any item to view the full response or start a new request using the floating action button.
**Example Prompt:**
"All I have are chickpeas, carrots, and some tahini. I want to make a healthy but different dinner with these. What can I make?"
**Model Response:**
A full recipe titled "Chickpea and Carrot Tahini Salad" including ingredients and step-by-step instructions.
The response is rendered with `flutter_markdown` and read aloud using `flutter_tts`. Input can be provided by voice using `speech_to_text`, making the experience entirely hands-free and user-friendly.
| Component | Specification |
|---|---|
| OS | Windows 11 Pro |
| Linux Subsystem | WSL2 (Ubuntu 22.04 LTS) |
| Python Env | venv-based isolated environment |
| CUDA Version | 11.8 |
| PyTorch | 2.2+ with CUDA support |
| Transformers | HuggingFace Transformers |
| TRL | `trl` (for `SFTTrainer`) |
| Component | Detail |
|---|---|
| GPU | NVIDIA RTX 4060 (Laptop), 8 GB VRAM |
| CPU | Intel Core i5-12500H (12-Core Hybrid) |
| RAM | 16 GB DDR4 RAM |
- Path: `/model_files/data/all_data.jsonl`
- Size: 4,800 Alpaca-style training samples
- Each sample contains:
  - `instruction`: the user's natural-language request
  - `input`: optional context
  - `output`: a detailed, minimum 1000-word recipe
  - `metadata`: nutritional info, allergens, cuisine type, etc.
```json
{
  "instruction": "Suggest a low-calorie Turkish dinner for a diabetic patient",
  "input": "",
  "output": "To prepare a balanced Turkish meal for someone managing diabetes...",
  "metadata": {
    "calories": "430 kcal",
    "diet": "diabetic-friendly",
    "cuisine": "Turkish",
    "allergens": "nut-free"
  }
}
```

- Base Model: Nous Hermes 2 - Mistral 7B
- Quantization: 4-bit NF4 (via `bitsandbytes`)
- Fine-tuning: LoRA (via `peft`) + `SFTTrainer`
- Load and quantize the model using NF4
- Apply LoRA (PEFT) for efficient training
- Filter and format the dataset (`prompt + metadata + output`)
- Tokenize with `max_length = 1024`
- Fine-tune using `SFTTrainer` for 2 epochs (a setup sketch follows this list)
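A minimal sketch of the load-quantize-LoRA setup described above; the Hugging Face model id, LoRA rank/alpha, and target modules are assumptions for illustration, not values taken from this repository:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

BASE_MODEL = "NousResearch/Nous-Hermes-2-Mistral-7B-DPO"  # assumed model id

# 4-bit NF4 quantization via bitsandbytes
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL,
    quantization_config=bnb_config,
    device_map="auto",
)

# LoRA adapters via peft; rank/alpha/target modules are illustrative
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
```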
To enhance learning dynamics, training was split into 3 progressive phases based on output length:
**Phase 1**

- Goal: Teach the model the task format and cultural variability
- Result: Learned the question-answer pattern effectively

**Phase 2**

- Goal: Improve fluency and semantic consistency
- Result: Better contextual flow and structural awareness

**Phase 3**

- Goal: Handle complex, multi-step recipe generation
- Filter applied: `len(output) >= 3000` (see the sketch below)
- Result: Stable performance across lengthy, dense outputs
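The phase-3 length filter can be expressed in a few lines, assuming the JSONL dataset is loaded with the Hugging Face `datasets` library:

```python
from datasets import load_dataset

dataset = load_dataset(
    "json", data_files="model_files/data/all_data.jsonl", split="train"
)

# Keep only the longest, densest samples for the final phase
phase3 = dataset.filter(lambda ex: len(ex["output"]) >= 3000)
```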
```python
TrainingArguments(
    output_dir="./output_longest",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    num_train_epochs=2,
    learning_rate=2e-4,
    fp16=True,
    save_strategy="steps",
    save_steps=100,
    save_total_limit=2,
    logging_steps=10
)
```

- Effective batch size: 4 (batch size 1 × 4 gradient-accumulation steps)
- Checkpointing: every 100 steps, only the 2 most recent retained
- Precision: mixed (fp16) for reduced memory usage
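A sketch of wiring these arguments into `SFTTrainer`; the keyword set below follows the classic `trl` API and shifts slightly between `trl` versions (newer releases move `dataset_text_field`/`max_seq_length` into `SFTConfig`):

```python
from transformers import TrainingArguments
from trl import SFTTrainer

training_args = TrainingArguments(
    output_dir="./output_longest",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    num_train_epochs=2,
    learning_rate=2e-4,
    fp16=True,
    save_strategy="steps",
    save_steps=100,
    save_total_limit=2,
    logging_steps=10,
)

trainer = SFTTrainer(
    model=model,                # quantized + LoRA model from earlier
    args=training_args,
    train_dataset=phase3,       # the length-filtered split
    tokenizer=tokenizer,
    dataset_text_field="text",  # merged prompt + metadata + output
    max_seq_length=1024,
)
trainer.train()
```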
```
AutoTokenizer + AutoModel (Nous Hermes 2 - Mistral 7B)
        ↓
4-bit quantization (NF4) + LoRA (PEFT)
        ↓
Dataset loaded → long outputs filtered (≥ 3000 words)
        ↓
Prompt + Metadata → `text` field merged
        ↓
Tokenizer applied (max length 1024)
        ↓
Trained with SFTTrainer (2 epochs)
        ↓
Saved to ./output_longest
```
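The `Prompt + Metadata → text` merge step could look like the sketch below; the exact prompt template is an assumption for illustration, not the repository's actual format:

```python
def build_text(example):
    # Flatten the metadata dict into a readable one-liner
    meta = ", ".join(f"{k}: {v}" for k, v in example["metadata"].items())
    prompt = example["instruction"]
    if example.get("input"):
        prompt += "\n" + example["input"]
    example["text"] = (
        f"### Instruction:\n{prompt}\n\n"
        f"### Metadata:\n{meta}\n\n"
        f"### Response:\n{example['output']}"
    )
    return example

phase3 = phase3.map(build_text)  # `phase3` from the filter sketch above
```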
- Training proceeded over 3 stages using progressive dataset splits.
- Each phase resumed training from the previous checkpoint in `output_dir`.
- The model was re-saved after each phase using `.save_model()`.
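The per-phase resume/re-save pattern boils down to two Trainer calls:

```python
# Resume from the latest checkpoint left in output_dir by the previous phase
trainer.train(resume_from_checkpoint=True)

# Re-save the adapter weights at the end of the phase
trainer.save_model("./output_longest")
```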
After completing all fine-tuning stages, the training loss was monitored using `trainer_state.json`.
The plot below visualizes the loss trend across training steps:
- Initial loss was above 1.2, reflecting the difficulty of long-form generation at the start.
- A steady decline is observed throughout training.
- Final loss converged around 0.38–0.42, showing:
- Stable and effective fine-tuning
- No significant signs of overfitting
- Consistent generation quality even with long outputs
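The curve can be reproduced from `trainer_state.json` with a short script; note that the file path here is an assumption (the HF Trainer also writes this file inside each `checkpoint-*` directory):

```python
import json
import matplotlib.pyplot as plt

with open("output_longest/trainer_state.json") as f:
    state = json.load(f)

# log_history holds one entry per logging step; keep those with a loss value
entries = [e for e in state["log_history"] if "loss" in e]
steps = [e["step"] for e in entries]
losses = [e["loss"] for e in entries]

plt.plot(steps, losses)
plt.xlabel("Training step")
plt.ylabel("Loss")
plt.title("Fine-tuning loss")
plt.savefig("loss_curve.png")
```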
✅ After testing the model with various held-out prompts and unseen data, we confirmed that it produces rich, structured, and context-aware recipes, validating the success of the fine-tuning process.
File: `/model_files/gradio_exe.py`

```bash
python model_files/gradio_exe.py
```

- Token-wise streaming via `TextIteratorStreamer`
- Threaded generation with dynamic Markdown preview
- Real-time evaluation for developer convenience
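The streaming in `gradio_exe.py` presumably follows the standard `TextIteratorStreamer` pattern, sketched here assuming `model` and `tokenizer` are loaded as shown earlier:

```python
from threading import Thread
from transformers import TextIteratorStreamer

def stream_reply(prompt: str):
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    streamer = TextIteratorStreamer(
        tokenizer, skip_prompt=True, skip_special_tokens=True
    )

    # generate() blocks, so it runs in a worker thread while tokens are consumed
    Thread(
        target=model.generate,
        kwargs=dict(**inputs, streamer=streamer, max_new_tokens=1024),
    ).start()

    text = ""
    for token in streamer:
        text += token
        yield text  # Gradio re-renders the Markdown preview on each yield
```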
File: `/model_files/app.py`

```bash
cd model_files
python app.py
```

Request:

```json
{ "text": "Suggest a quick and healthy gluten-free Turkish lunch option" }
```

Response:

- Content-Type: `text/plain`
- Token-wise streamed output using `yield`
- Receive the JSON request
- Run tokenizer + streamer in a separate thread
- `generate()` streams output line-by-line via Flask (a minimal sketch follows)

✅ The API runs at `http://localhost:5000/generate`
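A minimal sketch of such a streaming endpoint, assuming the model/tokenizer setup shown earlier (the actual `app.py` may differ in its details):

```python
from threading import Thread

from flask import Flask, Response, request
from transformers import TextIteratorStreamer

app = Flask(__name__)

@app.route("/generate", methods=["POST"])
def generate():
    prompt = request.get_json()["text"]
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    streamer = TextIteratorStreamer(
        tokenizer, skip_prompt=True, skip_special_tokens=True
    )

    # model.generate() blocks, so it runs in a worker thread
    Thread(
        target=model.generate,
        kwargs=dict(**inputs, streamer=streamer, max_new_tokens=1024),
    ).start()

    # Flask streams each chunk yielded by the streamer as plain text
    return Response(streamer, mimetype="text/plain")

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```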
After Flask API deployment, a lightweight Android app was developed using Flutter to provide seamless voice-based interaction.
- User speaks a recipe request
- Speech is converted to text via `speech_to_text`
- Text is POSTed to the Flask API
- The response is streamed back
- It is both rendered on screen and spoken aloud via `flutter_tts` (see the sketch after this list)
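The Flutter client performs this consumption with the `http` package; for reference, the same pattern looks like this in Python (`requests` is used here purely for illustration):

```python
import requests

resp = requests.post(
    "http://localhost:5000/generate",
    json={"text": "Suggest a quick and healthy gluten-free Turkish lunch option"},
    stream=True,
)

for chunk in resp.iter_content(chunk_size=1024):
    # Each chunk is rendered/spoken as it arrives; naive decoding may split
    # multi-byte characters at chunk boundaries, hence errors="replace"
    print(chunk.decode("utf-8", errors="replace"), end="", flush=True)
```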
| Package | Functionality |
|---|---|
| `speech_to_text` | Converts voice to text |
| `http` | Sends requests to the backend API |
| `flutter_tts` | Text-to-speech playback of results |
| `flutter_markdown` | Rich text rendering for model output |
| `uuid` | Unique message/session identification |
```
User speaks into mic → STT (speech_to_text)
        ↓
Text sent to Flask API → HTTP POST
        ↓
Streaming response shown in Markdown
        ↓
Result spoken aloud via TTS (flutter_tts)
```
- `main.dart`: entry point, handles STT/TTS logic
- `chat_home.dart`: UI + HTTP streaming integration
- `ChatMessage`, `ChatSession`: message model structures
MIT License © 2025 Eren Yurtcu

