nvidia-nemo

📄 SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high accuracy subtitles. 🎧💻⚙️

audio python cuda subtitles text-summarization face-recognition srt cv2 whisper transcribe timestamps nvidia-nemo

Updated Feb 15, 2023

Rumeysakeskin / ASR-Quantization

Sponsor

Star

Post-training quantization on Nvidia Nemo ASR model

pytorch speech-recognition quantization model-deployment pytorch-lightning post-training-quantization nvidia-nemo

Updated Aug 23, 2023
Jupyter Notebook

denizariyan / Real-Time-Auto-Transcriber

Star

Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to transcribe speech to text in real-time from any source. Requires CUDA capable GPU to run on the local machine, if setup using virtual audio cables can transcribe the audio that is being played in real-time without any other requirements.

real-time speech-recognition subtitle speech-to-text audio-processing nvidia-cuda transcriber accesibility hearing-impaired nvidia-nemo

Updated Oct 18, 2020
Python

HROlive / Poland-End-To-End-LLM-Bootcamp

Star

This bootcamp is designed to give NLP researchers an end-to-end overview on the fundamentals of NVIDIA NeMo framework, complete solution for building large language models. It will also have hands-on exercises complimented by tutorials, code snippets, and presentations to help researchers kick-start with NeMo LLM Service and Guardrails.

nvidia triton gpt tensorrt nvidia-nemo prompt-tuning p-tuning llm llm-training llm-inference llama2 nemo-guardrails

Updated Mar 7, 2024
Jupyter Notebook

j3soon / LLM-Tutorial

Star

LLM tutorial materials include but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.

nemo nvidia-nemo llm nemo-guardrails tensorrt-llm

Updated Sep 19, 2024
Jupyter Notebook

aaaastark / NeMo-WeightsBiases-TTS

Star

Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases

text-to-speech nemo weights-and-biases nvidia-nemo hifigan fastpitch

Updated Dec 8, 2022
Jupyter Notebook

JINHXu / tutorial-speaker-identification-with-nemo

Star

The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.

machine-learning tutorial neural-network neural-networks classification nemo speaker-recognition nvidia-cuda nvidia-gpu speaker-identification nvidia-nemo

Updated Aug 5, 2021
Python

ssharkov03 / ru-speech-recognition

Star

Module for russian speech recognition using NVIDIA Nemo.

speech-recognition chunking spelling-correction asr russian-language nvidia-nemo

Updated Feb 12, 2023
Python

InfiniteHelios / nemo-audio-profanity-detector-app

Star

Audio profanity detector desktop app developed with PyQt5 using NVidia-Nemo tech.

audio pyqt5 speech-to-text nemo profanity-detection nvidia-nemo

Updated Dec 4, 2021
Python

GameOfPods / PAT

Star

PodcastProject Analytics Toolkit - Project that creates analytics various input data. Exported data is intended to be used in a PodcastProject website

audio books podcast nvidia openai summary transcription speaker-recognition speaker-diarization nvidia-nemo

Updated Aug 11, 2024
Python

transiteration / stt_kz_quartznet15x5

Star

Implementation of a Kazakh Speech-to-Text Model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh speech into text.

pytorch stt pytorch-lightning nvidia-nemo

Updated Jan 22, 2024
Python

Improve this page

Add a description, image, and links to the nvidia-nemo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nvidia-nemo topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nvidia-nemo

Here are 15 public repositories matching this topic...

Rumeysakeskin / Turkish-Text-to-Speech

cr4yfish / nouv

GoogleCloudPlatform / nvidia-nemo-on-gke

Rumeysakeskin / Question-Answering-BERT

KevinGeLe / SmartSRT

Rumeysakeskin / ASR-Quantization

denizariyan / Real-Time-Auto-Transcriber

HROlive / Poland-End-To-End-LLM-Bootcamp

j3soon / LLM-Tutorial

aaaastark / NeMo-WeightsBiases-TTS

JINHXu / tutorial-speaker-identification-with-nemo

ssharkov03 / ru-speech-recognition

InfiniteHelios / nemo-audio-profanity-detector-app

GameOfPods / PAT

transiteration / stt_kz_quartznet15x5

Improve this page

Add this topic to your repo