Skip to content

Home

Vid2Txt

A Python package for transcribing videos/audios to text using various speech-to-text services. Currently supports AssemblyAI for high-quality transcription.

Features

  • Download and transcribe from YouTube or any URL (via yt-dlp)
  • Extract audio from video files using FFmpeg
  • Direct support for audio formats (MP3, WAV, M4A, AAC, FLAC, OGG, WMA)
  • Transcribe audio using AssemblyAI API
  • Export transcripts in multiple formats:
    • Plain text (.txt)
    • SubRip subtitles (.srt)
    • Interactive HTML (.html) with embedded video/audio player
  • Language forcing support

Examples