OleksiyM/ASR-app

ASR-app 🎙️✨

ASR-app is a lightweight, blazing-fast, and aesthetically stunning native macOS menu bar application designed for instant speech-to-text input across any application. Press a global hotkey, dictate your thoughts, and the transcribed text will be automatically typed directly into your active text field.

The interface follows modern macOS LiquidGlass principles, combining deep translucency, vibrant neon gradients, and organic micro-animations.


🌟 Key Features

  • 🎙️ Blazing-Fast Transcription (Whisper Turbo): Audio is sent directly to Groq's Whisper Large V3 Turbo API, which typically returns an accurate transcription in under one second.

  • ✍️ Whisper Prompting & Punctuation Guide: A built-in style template, passed to the model as its prompt, guides it to format the transcribed text with correct grammar, capitalization, and punctuation (commas, periods, dashes, and question marks).

  • 📚 Custom Vocabulary Support: Add custom terms, names, or technical slang, separated by commas, directly in the Settings panel (roughly 30 to 50 words at most). The application appends these terms to the transcription prompt, which makes the model much more likely to recognize them correctly.

  • 📋 Smart Auto-Paste: Once transcription is complete, the text is copied to your clipboard and immediately pasted into your active application (web browser, IDE, chat app) using a simulated Cmd+V keystroke.

  • 🎨 Premium LiquidGlass UI:

    • A beautiful, adaptive translucent Menu Bar Popover that fits perfectly on your screen.
    • An interactive animated real-time volume indicator (Neon Wave) with glowing reflections.
    • A gorgeous borderless "About ASR-app" window featuring dynamic animated liquid glass gradient blobs.
  • ⏱️ Auto-Stop Timer & Warm Warning: Set your preferred maximum recording duration (1, 2, 3, 5, or 10 minutes) in Settings. When a recording gets within 30 seconds of the auto-stop limit, the popover switches to a warm neon-orange color scheme with a live countdown, so you can finish your sentence before the recording is cut off.

  • 🌐 Complete Multilingual Support (En, Ru, Ua):

    • Matches your macOS system language by default.
    • Dynamic on-the-fly UI language switching (English, Russian, Ukrainian) without app restarts.
    • Choice of dictation language (auto-detect or a specific language, including full support for Ukrainian, language code uk).
  • 🌗 Adaptive Themes:

    • Integration with System (inherits macOS settings), Dark, and Light themes.
    • Adapts translucency, materials, and glows instantly.
  • ⌨️ Global Hotkeys: Dictate from anywhere in macOS without opening the popover. Pick your preferred global shortcut (e.g., ⌥ + Space) in Settings to start and stop recording seamlessly.
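
As a rough illustration of the Auto-Stop Timer behavior described above, here is a minimal Swift sketch. The `AutoStopTimer` and `RecordingPhase` names are hypothetical and not taken from the app's source; only the duration options and the 30-second warning window come from the feature list.

```swift
import Foundation

/// Hypothetical model of the auto-stop timer; names are illustrative only.
enum RecordingPhase: Equatable {
    case normal                    // plenty of time left, default color scheme
    case warning(secondsLeft: Int) // inside the warning window, show the countdown
    case stopped                   // limit reached, recording ends
}

struct AutoStopTimer {
    /// Durations offered in Settings, per the feature list (in minutes).
    static let allowedMinutes = [1, 2, 3, 5, 10]
    /// The warm warning starts 30 seconds before the limit.
    static let warningWindow: TimeInterval = 30

    let limit: TimeInterval

    /// Fails for any duration that is not one of the offered options.
    init?(minutes: Int) {
        guard Self.allowedMinutes.contains(minutes) else { return nil }
        limit = TimeInterval(minutes * 60)
    }

    /// Maps elapsed recording time to the phase the UI should display.
    func phase(elapsed: TimeInterval) -> RecordingPhase {
        let remaining = limit - elapsed
        if remaining <= 0 { return .stopped }
        if remaining <= Self.warningWindow {
            return .warning(secondsLeft: Int(remaining.rounded(.up)))
        }
        return .normal
    }
}

let timer = AutoStopTimer(minutes: 2)!
print(timer.phase(elapsed: 60))   // normal
print(timer.phase(elapsed: 95.5)) // warning(secondsLeft: 25)
print(timer.phase(elapsed: 120))  // stopped
```

Keeping the phase calculation a pure function of elapsed time makes it easy to drive from any timer source and to unit-test without recording audio.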


🛠️ Tech Stack

  • Language: Swift 5.10+
  • Frameworks: SwiftUI & AppKit
  • System Integration:
    • AVFoundation: high-fidelity audio capture and real-time decibel level monitoring.
    • Carbon API: low-level global shortcut tracking from anywhere.
    • CoreGraphics (CGEvent): secure keyboard emulation for Smart Auto-Paste.
  • Build System: XcodeGen (dynamic .xcodeproj generation from project.yml for a clean Git repository).
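
To make the CoreGraphics item more concrete, the sketch below shows one common way to implement a clipboard-then-Cmd+V paste with `NSPasteboard` and `CGEvent`. The function name `pasteIntoFrontmostApp` is an assumption for illustration, and the app's actual implementation may differ.

```swift
import AppKit
import CoreGraphics

/// Virtual key code for "V" on ANSI layouts (kVK_ANSI_V in Carbon's Events.h).
let vKeyCode: CGKeyCode = 0x09

/// Copies `text` to the general pasteboard, then posts a synthetic Cmd+V.
/// Returns false if the keyboard events could not be created.
/// Delivering the events to other apps requires the Accessibility permission.
@discardableResult
func pasteIntoFrontmostApp(_ text: String) -> Bool {
    // Step 1: put the transcription on the clipboard.
    let pasteboard = NSPasteboard.general
    pasteboard.clearContents()
    pasteboard.setString(text, forType: .string)

    // Step 2: build key-down and key-up events for "V".
    let source = CGEventSource(stateID: .combinedSessionState)
    guard
        let keyDown = CGEvent(keyboardEventSource: source, virtualKey: vKeyCode, keyDown: true),
        let keyUp = CGEvent(keyboardEventSource: source, virtualKey: vKeyCode, keyDown: false)
    else { return false }

    // Step 3: mark both events as Command-modified and post them.
    keyDown.flags = .maskCommand
    keyUp.flags = .maskCommand
    keyDown.post(tap: .cghidEventTap)
    keyUp.post(tap: .cghidEventTap)
    return true
}
```

Going through the clipboard keeps the approach independent of keyboard layout for the text itself, at the cost of overwriting whatever the user had copied before.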

🚀 Getting Started

📋 Requirements

  • macOS 14.0 or newer.

💿 Installation (Pre-built Release)

You don't need to compile the code yourself. Download a pre-built DMG installer from our GitHub Releases:

  1. Download the DMG corresponding to your architecture:
    • ASR-App-AppleSilicon.dmg (for M1/M2/M3/M4 Apple Silicon Macs)
    • ASR-App-Intel.dmg (for older Intel x86_64 Macs)
  2. Mount the DMG and drag ASRApp.app into your Applications folder.
  3. Bypass Gatekeeper Warning (required due to Ad-Hoc signing): Since the app is ad-hoc signed without a paid Apple Developer Certificate, macOS will prevent it from running initially. You can resolve this instantly using one of two methods:
    • Method 1: Terminal (Fastest ⚡). Open Terminal and run the following command to strip the quarantine attribute:
      xattr -d com.apple.quarantine /Applications/ASRApp.app
    • Method 2: System Settings. Open System Settings ➔ Privacy & Security, scroll down to the Security section, and click "Open Anyway" for ASRApp.

⚙️ How to Build and Run Locally

  1. Install XcodeGen using Homebrew:

    brew install xcodegen
  2. Clone the repository and navigate to its folder:

    git clone https://github.com/YOUR_USERNAME/ASR-app.git
    cd ASR-app
  3. Generate the Xcode project from project.yml:

    xcodegen generate
  4. Open the newly generated project file:

    open ASRApp.xcodeproj
  5. Press Cmd + R in Xcode to compile and run!

Important

On the first launch, the application will request access to the Microphone (for voice recording) and Accessibility permissions (required for keyboard simulation to support the Smart Auto-Paste feature). Please grant these permissions in your macOS System Settings.
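
For readers curious how such a permission check could look in code, here is a small hedged sketch using `AVCaptureDevice` and `AXIsProcessTrusted()`. The `PermissionState` type and both function names are hypothetical and only illustrate the two permissions mentioned above.

```swift
import AVFoundation
import ApplicationServices

/// Hypothetical summary type; not taken from the app's source.
struct PermissionState: Equatable {
    var microphoneGranted: Bool
    var accessibilityGranted: Bool

    /// Names of the permissions the user still has to grant.
    var missing: [String] {
        var names: [String] = []
        if !microphoneGranted { names.append("Microphone") }
        if !accessibilityGranted { names.append("Accessibility") }
        return names
    }
}

/// Reads the current status of both permissions without prompting.
func currentPermissionState() -> PermissionState {
    PermissionState(
        microphoneGranted: AVCaptureDevice.authorizationStatus(for: .audio) == .authorized,
        accessibilityGranted: AXIsProcessTrusted()
    )
}

/// Shows the system microphone prompt only if the user has not decided yet.
func requestMicrophoneIfNeeded(completion: @escaping (Bool) -> Void) {
    switch AVCaptureDevice.authorizationStatus(for: .audio) {
    case .authorized:
        completion(true)
    case .notDetermined:
        AVCaptureDevice.requestAccess(for: .audio) { granted in completion(granted) }
    default:
        // Denied or restricted: the user has to change this in System Settings.
        completion(false)
    }
}
```

An app could show `currentPermissionState().missing` in its settings view so the user knows which switch still needs to be flipped.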


🔑 API Key Configuration

To enable speech-to-text recognition, you need a free API key from Groq:

  1. Log in or sign up at console.groq.com.
  2. Generate a new API key in the API Keys section.
  3. Click the waveform icon in the macOS status bar ➔ Open Settings (gear icon ⚙️) ➔ Paste your API key into the field.
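
For reference, a transcription request that uses this key might be assembled roughly as follows. This is a sketch under stated assumptions: the endpoint URL and the `whisper-large-v3-turbo` model ID are based on Groq's OpenAI-compatible API and should be checked against the Groq documentation, and the `makePrompt` and `makeTranscriptionRequest` helpers are hypothetical names, not the app's own code.

```swift
import Foundation
#if canImport(FoundationNetworking)
import FoundationNetworking
#endif

/// Joins a style template with comma-separated user vocabulary into one prompt.
func makePrompt(style: String, vocabulary: String) -> String {
    let terms = vocabulary
        .split(separator: ",")
        .map { $0.trimmingCharacters(in: .whitespaces) }
        .filter { !$0.isEmpty }
    return terms.isEmpty ? style : style + " " + terms.joined(separator: ", ") + "."
}

/// Builds a multipart/form-data POST request for the transcription endpoint.
/// Assumption: endpoint and field names follow Groq's OpenAI-compatible API.
func makeTranscriptionRequest(apiKey: String, audio: Data, fileName: String,
                              prompt: String, language: String?) -> URLRequest {
    let boundary = "Boundary-\(UUID().uuidString)"
    var request = URLRequest(url: URL(string: "https://api.groq.com/openai/v1/audio/transcriptions")!)
    request.httpMethod = "POST"
    request.setValue("Bearer \(apiKey)", forHTTPHeaderField: "Authorization")
    request.setValue("multipart/form-data; boundary=\(boundary)", forHTTPHeaderField: "Content-Type")

    var body = Data()
    // Appends one plain text form field to the multipart body.
    func addField(_ name: String, _ value: String) {
        body.append(Data("--\(boundary)\r\nContent-Disposition: form-data; name=\"\(name)\"\r\n\r\n\(value)\r\n".utf8))
    }
    addField("model", "whisper-large-v3-turbo")
    addField("prompt", prompt)
    if let language { addField("language", language) } // nil means auto-detect

    // The audio file goes last, followed by the closing boundary.
    body.append(Data("--\(boundary)\r\nContent-Disposition: form-data; name=\"file\"; filename=\"\(fileName)\"\r\nContent-Type: application/octet-stream\r\n\r\n".utf8))
    body.append(audio)
    body.append(Data("\r\n--\(boundary)--\r\n".utf8))
    request.httpBody = body
    return request
}
```

The resulting request can be sent with `URLSession`. In a real app the key would more likely be read from the Keychain than passed around as a plain string.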

⚙️ Automated CI/CD (GitHub Actions)

This project features a fully automated DevOps pipeline:

  • When you push a Git version tag starting with v (e.g., v1.2.3), the GitHub Actions release workflow is triggered.
  • On a high-performance cloud runner (macos-15 with Xcode 16+), the pipeline installs XcodeGen and compiles the application.
  • Matrix Architecture: Builds independent, native binaries for Apple Silicon (arm64) and Intel (x86_64) to keep the application lightweight without code bloat.
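  • Ad-Hoc Signing: Automatically applies ad-hoc codesigning (codesign --force --deep --sign -) so the binaries can launch at all, since Apple Silicon Macs refuse to run unsigned code. Ad-hoc signing does not satisfy Gatekeeper, so downloaded builds still need the quarantine workaround described under Installation.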
  • Premium DMG Packaging: Instead of basic ZIP archives, the pipeline uses create-dmg to package the .app into a gorgeous, customized DMG installer with a custom layout and drag-and-drop support.
  • Release notes (changelog) are auto-generated from the commit log, and the DMG assets are uploaded directly to the GitHub Release.

🫂 Authors & Contributions

  • Alex: Product visionary, co-creator, and lead QA engineer.
  • Eva: Your AI pair-programmer, developer, and visual designer 🫂🍵✨.

Created with love, hot tea, and deep care for every pixel! 🍵✨


🔮 Future Ideas & Roadmap

Here are some exciting ideas for future releases of ASR-app:

  • 🔗 Interactive Links in About: Add dynamic links to GitHub and X (Twitter) in the About window for quick navigation.
  • 🚀 Launch at Login: Add a setting to automatically start ASR-app when macOS boots up.
  • 📚 Transcription History Log: A quick-access history panel in the popover to see, search, and copy past transcriptions.
  • 🧠 Local Offline Transcription (CoreML/whisper.cpp): Allow completely offline speech-to-text processing, for maximum privacy or when no internet connection is available.
  • ✨ AI Smart Rewrite (LLM Integration): Add quick LLM actions to improve grammar, summarize, translate, or rewrite the transcribed text in professional/friendly tones before pasting.
  • 🔄 Multi-language Real-time Translation: Instantly translate dictated speech into a target language before auto-pasting.
