Turn on Audio Sync for audio extraction process #192

dvh312 · 2024-09-18T06:58:19Z

Please see the original MR here: morpheus65535/bazarr#2648

Problem description

I used Whisper AI and noticed that the generated subtitles didn't have the correct timestamps, especially near the end of the video.

Cause

The video files I'm using have some corrupted frames.
ffmpeg tries to "skip" these frames, so the extracted audio ends up significantly shorter than the original video.

Solution

Turn on audio sync (the aresample filter with async=1) so that the extracted audio matches the video timestamps.

Screenshots

Original command

ffmpeg  -i input.mp4  -acodec pcm_s16le -ac 1 -ar 16000 -f s16le output.wav

ffmpeg-before.log

Imported into Audacity, the extracted audio is only 22m52s532ms long, while the video is 22m59s960ms. The audio is about 7.4 seconds short, so subtitles are shifted ahead by up to about 7.4 seconds by the end of the video.
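
For reference, the gap can be checked with a few lines of Python. This is only a sketch: the helper names `parse_duration` and `pcm_duration` are made up for illustration and are not part of this PR. `pcm_duration` assumes the raw format used in the command above (16-bit samples, mono, 16000 Hz), so the length can also be derived from the output file size without opening Audacity.

```python
import re


def parse_duration(text: str) -> float:
    """Convert a string such as '22m52s532ms' to seconds (hypothetical helper)."""
    match = re.fullmatch(r"(\d+)m(\d+)s(\d+)ms", text)
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    minutes, seconds, millis = (int(group) for group in match.groups())
    return minutes * 60 + seconds + millis / 1000


def pcm_duration(num_bytes: int, sample_rate: int = 16000,
                 channels: int = 1, bytes_per_sample: int = 2) -> float:
    """Duration in seconds of headerless PCM data (hypothetical helper).

    Defaults mirror the ffmpeg options above: s16le, -ac 1, -ar 16000.
    """
    return num_bytes / (sample_rate * channels * bytes_per_sample)


video = parse_duration("22m59s960ms")
audio_before = parse_duration("22m52s532ms")

# Seconds of audio missing from the extraction, rounded to milliseconds.
drift = round(video - audio_before, 3)
print(f"missing audio: {drift} s")
```

For a real file, `pcm_duration(os.path.getsize(path))` would give the extracted length, assuming the output really is headerless s16le data.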

Updated command

ffmpeg  -i input.mp4  -acodec pcm_s16le -ac 1 -ar 16000 -f s16le -af aresample=async=1 output.wav

ffmpeg-after.log

Audacity now shows a duration of 22m59s349ms, which is within about 0.6 seconds of the original video duration of 22m59s960ms.
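
As a rough illustration of the change, the two commands differ only in the extra `-af aresample=async=1` pair. The sketch below builds both argument lists in Python. The function name `build_extract_command` and its `audio_sync` flag are hypothetical and do not reflect how speech_transformers.py is actually written. As I understand the FFmpeg resampler documentation, `async=1` enables audio timestamp matching by filling and trimming, which would explain why the gaps left by skipped frames get padded.

```python
from typing import List


def build_extract_command(input_path: str, output_path: str,
                          audio_sync: bool = True) -> List[str]:
    """Build the ffmpeg argument list for audio extraction (hypothetical helper)."""
    cmd = [
        "ffmpeg",
        "-i", input_path,
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM
        "-ac", "1",              # mono
        "-ar", "16000",          # 16 kHz sample rate
        "-f", "s16le",           # raw, headerless output
    ]
    if audio_sync:
        # The only difference between the original and updated commands.
        cmd += ["-af", "aresample=async=1"]
    cmd.append(output_path)
    return cmd


before = build_extract_command("input.mp4", "output.wav", audio_sync=False)
after = build_extract_command("input.mp4", "output.wav")
print(" ".join(after))
```

The list form can be passed directly to `subprocess.run(after, check=True)` if ffmpeg is on the PATH.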

Update speech_transformers.py

03dc875
