Skip to content

SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.

Notifications You must be signed in to change notification settings

NullMagic2/SoftWhisper

Repository files navigation

What is SoftWhisper?

SoftWhisper simplifies audio and video transcription using the powerful Whisper model. You can easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.

Features

🎯 High-accuracy transcription (using Whisper model)
👥 Speaker identification
🌍 Supports all languages supported by the Whisper model (+30)
🎮 User-friendly GUI interface

Usage

  1. Run SoftWhisper.bat: .\SoftWhisper.bat When the GUI launches, follow these steps for transcription (screenshot credits: Sunwood-ai-labs):

Softwhisper interface – Credits to Sunwood-ai-labs

  1. Select an audio/video file.
  2. Choose a model size (tiny, base, small, medium, large).
  3. Enable speaker diarization if needed.
  4. Click the "Start" button.

Common issues and how to solve them

  1. libvlc.dll not found error

    • Please check if VLC Media Player is installed. Please download it here: https://www.videolan.org/
    • Restart the program after installation
  2. FFmpeg or corresponding library not found

About

SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.

Resources

Stars

Watchers

Forks

Packages

No packages published