Commit 439f2f5

Update gladia documentation (#757)
* update gladia documentation
* Revise Gladia documentation for clarity and conciseness
* Update gladia details in speech-to-text sections
* update latency metrics for gladia
* gladia: rewrote sections to remove bullet points
1 parent 1be0fd1 commit 439f2f5

5 files changed: +45 -97 lines changed

fern/assistants/examples/multilingual-agent.mdx

Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change
@@ -1155,6 +1155,7 @@ For a more structured approach with explicit language selection, see our compreh
11551155
## Provider Support Summary
11561156

11571157
**Speech-to-Text (Transcription):**
1158+
- **Gladia**: Solaria with automatic language detection and code-switching
11581159
- **Deepgram**: Nova 2, Nova 3 with "Multi" language setting
11591160
- **Google**: Latest models with "Multilingual" language setting
11601161
- **All other providers**: Single language only, no automatic detection

fern/customization/multilingual.mdx

Lines changed: 4 additions & 4 deletions
@@ -29,9 +29,9 @@ Set up your transcriber to automatically detect and process multiple languages.
2929
2. Create a new assistant or edit an existing one
3030
3. In the **Transcriber** section:
3131
- **Provider**: Select `Deepgram` (recommended), `Google`, or `Gladia`
32-
- **Model**: For Deepgram, choose `Nova 2` or `Nova 3`; for Google, choose `Latest`; for Gladia, choose your preferred Gladia model
33-
- **Language / Mode**: Set `Multi` (Deepgram), `Multilingual` (Google), or enable automatic language detection (Gladia)
34-
4. **Other providers**: May require a single language and not auto-detect
32+
- **Model**: For Deepgram, choose `Nova 2` or `Nova 3`; for Google, choose `Latest`; for Gladia, choose `Solaria`
33+
- **Language / Mode**: Set `Multi` (Deepgram), `Multilingual` (Google), or choose the language you want to transcribe (Gladia)
34+
4. **Other providers**: May require a single language and do not auto-detect
3535
5. Click **Save** to apply the configuration
3636
</Tab>
3737
<Tab title="TypeScript (Server SDK)">
@@ -460,10 +460,10 @@ Validate your configuration with different languages and scenarios.
460460
|----------|---------------------|-----------|-------|
461461
| **Deepgram** | ✅ Full auto-detection | 100+ | **Recommended**: Nova 2/Nova 3 with "Multi" language setting |
462462
| **Google STT** | ✅ Full auto-detection | 125+ | Latest models with "Multilingual" language setting |
463+
| **Gladia** | ✅ Full auto-detection | 110+ | Supports automatic language detection and code-switching |
463464
| **Assembly AI** | ❌ English only | English | No multilingual support |
464465
| **Azure STT** | ❌ Single language | 100+ | Many languages, but no auto-detection |
465466
| **OpenAI Whisper** | ❌ Single language | 90+ | Many languages, but no auto-detection |
466-
| **Gladia** | ✅ Full auto-detection | 110+ | Supports automatic language detection and code-switching |
467467
| **Speechmatics** | ❌ Single language | 50+ | Many languages, but no auto-detection |
468468
| **Talkscriber** | ❌ Single language | 40+ | Many languages, but no auto-detection |
469469

fern/debugging.mdx

Lines changed: 1 addition & 0 deletions
@@ -83,6 +83,7 @@ Start with these immediate checks before diving deeper:
8383
- [Anthropic Status](https://status.anthropic.com/) for Anthropic language models
8484
- [ElevenLabs Status](https://status.elevenlabs.io/) for ElevenLabs voice synthesis
8585
- [Deepgram Status](https://status.deepgram.com/) for Deepgram speech-to-text
86+
- [Gladia Status](https://status.gladia.io/) for Gladia speech-to-text
8687
- And other providers' status pages as needed
8788
</Step>
8889
</Steps>
Lines changed: 38 additions & 92 deletions
@@ -1,111 +1,57 @@
11
---
2-
title: Gladia
3-
subtitle: What is Gladia?
4-
slug: providers/transcriber/gladia
2+
title: Gladia
3+
slug: providers/transcriber/gladia
54
---
65

6+
## What is Gladia?
7+
Gladia is a state-of-the-art audio transcription and intelligence platform. It provides **real-time** speech-to-text for audio and video and adds advanced audio-intelligence features so you can turn unstructured audio into actionable insights. It integrates easily and scales so you can focus on building features instead of transcription infrastructure.
8+
<Tip>Try Gladia on their [playground](https://app.gladia.io/?utm_source=vapi) to get a feel for the product!</Tip>
79

8-
**What is Gladia?**
10+
## Why choose Gladia on Vapi for speech-to-text?
911

10-
Gladia is an advanced AI platform specializing in real-time transcription, translation, and audio intelligence. By leveraging state-of-the-art ASR (Automatic Speech Recognition), NLP (Natural Language Processing), and GenAI (Generative AI) models, Gladia helps businesses extract valuable insights from unstructured audio data. Their enterprise-grade API offers scalable, secure, and efficient solutions for various applications, from virtual meetings to customer service.
12+
### Low-latency transcription
13+
Gladia delivers low-latency live transcription for calls and streaming audio, often under 600 ms for final results, with partial results in roughly 300 ms for immediate response processing. It provides word-level timestamps and custom vocabulary support to power downstream workflows.
1114

15+
### Global language coverage
16+
Gladia supports **110+ languages** and dialects and robustly handles multilingual audio, including code-switching within a single natural conversation.
1217

13-
**The Evolution of AI Transcription:**
18+
### Audio intelligence add-ons
19+
Translation to one or more target languages is available in a single API call. Gladia also offers post-call summarization, sentiment analysis, and named-entity recognition in real time, enabling meeting notes, customer-call insights, and content production workflows on top of transcripts.
1420

15-
AI transcription has significantly evolved, moving from basic speech recognition systems to advanced platforms capable of real-time transcription, translation, and audio intelligence. Innovations in machine learning and natural language processing have enhanced accuracy and efficiency. Gladia utilizes these advancements to deliver top-tier transcription services tailored for modern business needs.
21+
### API and integrations
22+
Gladia offers telephony compatibility (SIP/VoIP) and noise resistance for live use cases, and supports real-time streaming with low-latency interfaces for platforms and contact centers. It also provides a developer-friendly playground to test and monitor your transcription workflows.
1623

17-
**Overview of Gladia’s Offerings:**
24+
## Getting started
1825

19-
Gladia provides a comprehensive suite of AI-driven tools:
26+
1. Go to the **Assistants** tab in the left-hand navigation.
27+
2. Create a new assistant, or select the voice assistant you want to configure.
28+
3. Open the **Transcriber** tab in the top navigation (or scroll to the Transcriber module).
29+
4. In the **Provider** dropdown, select **Gladia**.
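
The dashboard steps above could also be expressed as a configuration object when an assistant is created from code. The following is a minimal sketch only: the `TranscriberConfig` shape, the `buildGladiaTranscriber` helper, and the lowercase `"solaria"` model identifier are assumptions made for illustration, not the confirmed Vapi schema. Check the API reference for the exact field names and values.

```typescript
// Hypothetical config shape: field names and values are assumptions,
// not the confirmed Vapi transcriber schema.
interface TranscriberConfig {
  provider: "gladia" | "deepgram" | "google";
  model: string;
  language?: string;
}

// Build a Gladia transcriber config, optionally pinned to one language.
function buildGladiaTranscriber(language?: string): TranscriberConfig {
  const config: TranscriberConfig = {
    provider: "gladia",
    model: "solaria", // assumed identifier for the Solaria model
  };
  if (language !== undefined) {
    config.language = language;
  }
  return config;
}

// Example: a transcriber pinned to French.
const transcriber = buildGladiaTranscriber("fr");
console.log(JSON.stringify(transcriber));
```

Leaving `language` unset in this sketch stands in for relying on the provider's own language handling; whether the real API treats an omitted language that way is also an assumption.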
2030

31+
<Tip>Watch the [Vapi x Gladia demo video](https://youtu.be/7EoYnMOHR5A?si=dIDTTXw2L--DY-QY) to see real-time features in action!</Tip>
2132

22-
**Speech-to-Text:**
33+
## Best practices
2334

24-
Gladia’s core offering is its AI-powered speech-to-text technology, delivering highly accurate and real-time transcription. This service supports automatic language detection (including code‑switching within a conversation) and 90+ languages, and includes speaker diarization.
35+
- **Region selection**: Use the region closest to your users; EU and US options are available for data residency and latency.
36+
- **Custom vocabulary**: Add domain-specific terms (product names, acronyms) to improve accuracy.
37+
- **Timestamps**: Use word-level timestamps when you need precise analytics or subtitles.
38+
- **Translation**: Use built-in translation when you need multilingual outputs from a single stream.
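
As an illustration of the timestamps practice, word-level timings can be grouped into subtitle cues. The sketch below assumes a hypothetical `TimedWord` shape with `start` and `end` in seconds (Gladia's actual payload fields may differ) and formats the cues as SRT. The `wordsToSrt` and `toSrtTime` helpers are illustrative names, not part of any SDK.

```typescript
// Hypothetical word shape; the real transcription payload may use other field names.
interface TimedWord {
  word: string;
  start: number; // seconds from the start of the audio
  end: number;   // seconds from the start of the audio
}

// Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm).
function toSrtTime(seconds: number): string {
  const totalMs = Math.round(seconds * 1000);
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  const pad = (n: number, width: number): string => String(n).padStart(width, "0");
  return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)},${pad(ms, 3)}`;
}

// Group words into fixed-size cues and render them as an SRT document.
function wordsToSrt(words: TimedWord[], maxWordsPerCue: number = 7): string {
  const cues: string[] = [];
  for (let i = 0; i < words.length; i += maxWordsPerCue) {
    const chunk = words.slice(i, i + maxWordsPerCue);
    const start = toSrtTime(chunk[0].start);
    const end = toSrtTime(chunk[chunk.length - 1].end);
    const text = chunk.map((w) => w.word).join(" ");
    cues.push(`${cues.length + 1}\n${start} --> ${end}\n${text}`);
  }
  return cues.length > 0 ? cues.join("\n\n") + "\n" : "";
}
```

Fixed-size grouping keeps the sketch short; a production subtitle pipeline would more likely break cues on pauses or punctuation.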
2539

26-
**Audio Intelligence:**
40+
## Use cases
2741

28-
Gladia’s audio intelligence add-ons offer features like summarization, chapterization, and sentiment analysis, providing deeper insights into audio data.
42+
- **Voice agents**: Real-time transcription, speaker attribution, translation, and post-call summaries.
43+
- **Virtual meetings**: Live transcription, speaker attribution, translation, and meeting notes.
44+
- **Customer service / contact centers**: Live call transcription, sentiment/keyword extraction, multilingual agent assistance.
45+
- **Sales enablement**: Capture names, emails, and details across languages and accents; feed CRMs.
46+
- **Media & content creation**: Transcribe/edit audio/video, generate subtitles (SRT/VTT), and translate for global distribution.
2947

30-
**API:**
48+
## Data protection and compliance
3149

32-
Gladia’s robust API allows seamless integration of speech-to-text capabilities into applications, ensuring low latency and high availability.
50+
Gladia offers enterprise-grade data governance, secure hosting options, and alignment with privacy and compliance frameworks such as GDPR. EU and US regions are available for data residency.
3351

34-
**AI Transcription Technology:**
52+
## Useful links
53+
- **Playground**: [app.gladia.io](https://app.gladia.io/?utm_source=vapi)
54+
- **Website**: [gladia.io](https://gladia.io/?utm_source=vapi)
55+
- **Documentation**: [docs.gladia.io](https://docs.gladia.io/?utm_source=vapi)
3556

36-
Gladia’s AI transcription technology offers several key features and benefits:
37-
38-
**Features:**
39-
40-
- High Accuracy: Industry-leading transcription accuracy.
41-
- Real-time and Async Transcription: Instantaneous and batch processing options.
42-
- Multilingual Support: Supports transcription and translation in 99 languages.
43-
44-
**Benefits:**
45-
46-
- Efficiency: Reduces the time needed for transcription and analysis.
47-
- Scalability: Handles large volumes of data efficiently.
48-
- Cost-Effective: Provides high performance at a competitive cost.
49-
50-
**Real-time Transcription and Translation:**
51-
52-
Gladia excels in providing real-time transcription and translation:
53-
54-
55-
**Multilingual Support:**
56-
57-
- Automatic language recognition: Detects the spoken language automatically and handles code‑switching
58-
- 90+ languages: Supports a wide range of languages and dialects
59-
- Real-time Translation: Near-instantaneous translation for diverse applications
60-
61-
**Use Cases:**
62-
63-
- Virtual Meetings: Provides real-time transcriptions, note-taking, and video captions.
64-
- Content Creation: Transcribes and translates videos and podcasts for global audiences.
65-
66-
**Developer API:**
67-
68-
Gladia offers a comprehensive API for easy integration:
69-
70-
**Integration:**
71-
72-
- SDKs: Available for multiple programming languages.
73-
- Comprehensive Documentation: Detailed guides and support for seamless implementation.
74-
75-
**Use Cases:**
76-
77-
- Application Development: Enhance applications with advanced AI capabilities.
78-
- Business Solutions: Improve operational efficiency and customer service.
79-
80-
**Use Cases for Gladia:**
81-
82-
Gladia supports a wide range of applications:
83-
84-
**Content Creation:**
85-
86-
Enhance content creation with high-quality transcription, translation, and subtitling.
87-
88-
89-
**Customer Service:**
90-
91-
Improve customer service with accurate call transcriptions and emotion detection.
92-
93-
**Market Research:**
94-
95-
Gain valuable insights into market trends and customer preferences through advanced speech analysis.
96-
97-
**Impact on Business Operations:**
98-
99-
Gladia is revolutionizing business operations by providing tools that enhance productivity and insights. By automating transcription and audio intelligence, businesses can focus on innovation and strategy rather than manual processes.
100-
101-
**Innovation and Research:**
102-
103-
Gladia is committed to continuous innovation and research in AI transcription. Their team of experts focuses on advancing the capabilities of ASR and NLP technologies, exploring new applications, and refining existing tools to stay at the forefront of the industry.
104-
105-
**AI Safety and Ethics:**
106-
107-
Ensuring the ethical use of AI is a core principle at Gladia. They implement robust safeguards to prevent misuse of their technology and are actively involved in promoting responsible AI development. Protecting user data and maintaining transparency in AI operations are central to their mission.
108-
109-
**Integrations and Compatibility:**
110-
111-
Gladia’s API allows seamless integration with various platforms and applications. This ensures that users can incorporate Gladia’s AI capabilities into their existing systems effortlessly, enhancing functionality and improving user experience.
57+
---

fern/quickstart/introduction.mdx

Lines changed: 1 addition & 1 deletion
@@ -30,7 +30,7 @@ Every Vapi assistant combines three core technologies:
3030
</Card>
3131
</CardGroup>
3232

33-
You have full control over each component, with dozens of providers and models to choose from; OpenAI, Anthropic, Google, Deepgram, ElevenLabs, and many, many more.
33+
You have full control over each component, with dozens of providers and models to choose from: OpenAI, Anthropic, Google, Gladia, Deepgram, ElevenLabs, and many, many more.
3434

3535
## Two ways to build voice agents
3636
