Auto Subtitles

Generate subtitles from speech using AI. Choose from 5 engines: Parakeet (lightning fast), Whisper (99 languages), or Qwen3 (52 languages). GPU-accelerated on server.

Processed on our servers โ€” requires a free account

Have feedback? Let us know

How to Use Auto Subtitles

  1. Upload a video with speech
  2. Select the language (or leave on auto-detect)
  3. Pick an AI engine โ€” Parakeet for speed, Whisper for accuracy, Qwen3 for Asian languages
  4. Click Process and download your SRT subtitle file

Features

  • 5 AI engines: Parakeet TDT (NVIDIA), Whisper Large V3 Turbo, Whisper Large V3, Whisper + Word Timestamps, Qwen3-ASR
  • Up to 99 languages with auto-detection
  • Parakeet Lightning: transcribe 10 minutes of audio in under a second
  • Word-level timestamps for karaoke-style subtitles
  • Exports SRT subtitle files

Frequently Asked Questions

Which AI engine should I use?
Parakeet TDT is the fastest and most accurate for English/European languages. Whisper Large V3 Turbo is a great default for any language. Whisper Large V3 provides the highest accuracy. Qwen3-ASR is best for Asian languages like Chinese, Japanese, and Korean.
What languages are supported?
Whisper supports 99 languages. Parakeet supports 25 European languages. Qwen3-ASR supports 52 languages including many Asian languages. Auto-detect works across all engines.
What are word-level timestamps?
The 'Whisper + Word Timestamps' option gives you precise timing for each individual word, useful for karaoke-style subtitles or precise caption sync. Other modes give sentence-level timing.
How fast is Parakeet?
Parakeet TDT processes at over 3,000x real-time โ€” a 10-minute video takes less than a second of compute. It's the fastest open-source speech recognition model available.