v1.5.0
Latest Release, February 2026
Core Functionality & UX (Phase 1)
- Offline Translation — Translate non-English audio to English text directly.
- Setup Wizard — First-run interactive guide for new users to select themes, models, and paths.
- Benchmark Dashboard — Monitor system health (CPU, RAM, Disk Space) in real-time.
Customization & AI (Phase 2)
- Custom Vocabulary — Supply an initial prompt to improve Whisper's accuracy for jargon.
- Noise Reduction — Clean up noisy audio with a dedicated denoise processor.
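As an illustration of the Custom Vocabulary item, the sketch below turns a list of jargon terms into a single prompt string. The helper name `build_initial_prompt` and the `max_chars` limit are hypothetical and not taken from the app. The commented usage line assumes the open-source `whisper` package, whose `transcribe` method accepts an `initial_prompt` argument.

```python
def build_initial_prompt(terms, max_chars=600):
    """Join vocabulary terms into one prompt string.

    Blank entries and case-insensitive duplicates are skipped. Terms that
    would push the prompt past max_chars (an assumed limit) are dropped.
    """
    seen = set()
    kept = []
    length = 0
    for term in terms:
        term = term.strip()
        key = term.lower()
        if not term or key in seen:
            continue
        # Account for the ", " separator before every term except the first.
        added = len(term) + (2 if kept else 0)
        if length + added > max_chars:
            break
        seen.add(key)
        kept.append(term)
        length += added
    return ", ".join(kept)


# Hypothetical usage with the openai-whisper package:
# result = model.transcribe(path, initial_prompt=build_initial_prompt(terms))
```

Keeping the prompt short matters because Whisper only conditions on a limited amount of prompt text, so the most important terms should come first.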
Pro Features (Phase 3)
- Batch Processing — Transcribe multiple files sequentially using a threaded worker queue.
- Real-Time Streaming — Instant "Live Subtitles" during recording with a sliding window context.
- Interactive Transcript — Click segment text blocks to jump directly to those moments in the audio player.
- Speaker Diarization — Experimental support for labeling distinct speakers (Speaker A, Speaker B).
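The Batch Processing item describes sequential transcription on a threaded worker queue. A minimal sketch of that pattern follows, assuming a single background worker. The names `run_batch`, `transcribe`, and `on_done` are hypothetical stand-ins, not the app's actual functions.

```python
import queue
import threading


def run_batch(paths, transcribe, on_done):
    """Transcribe files one at a time on a background worker thread.

    transcribe(path) returns text; on_done(path, text, error) is called
    once per file. Returns the worker thread so callers can join it.
    """
    jobs = queue.Queue()
    for path in paths:
        jobs.put(path)
    jobs.put(None)  # sentinel: tells the worker there is no more work

    def worker():
        while True:
            path = jobs.get()
            if path is None:
                break
            try:
                text, error = transcribe(path), None
            except Exception as exc:  # one bad file should not stop the batch
                text, error = None, exc
            on_done(path, text, error)

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    return thread
```

A single worker keeps files in order and avoids loading the model more than once. In a GUI, `on_done` would normally hand the result back to the UI thread rather than touch widgets directly.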
Bug Fixes & Hardening (Phase 4)
- Audio Path Hardening — Fixed "Invalid data" and frame-detection errors on MP3/MPEG files by adding an explicit FFmpeg pre-conversion pipeline.
- Build Stability & UX — Overhauled landing website with glassmorphism and modern UI updates.
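The Audio Path Hardening item mentions an explicit FFmpeg pre-conversion step. The release notes do not state the target format, so the sketch below assumes 16 kHz mono 16-bit PCM WAV, a common input format for Whisper. The function names are hypothetical; the FFmpeg flags are standard ones.

```python
import subprocess
from pathlib import Path


def build_preconvert_command(src, dst, ffmpeg_exe="ffmpeg"):
    """Return the FFmpeg argument list for converting src to a plain WAV file."""
    return [
        ffmpeg_exe,
        "-y",               # overwrite the output file if it exists
        "-i", str(src),     # input file (MP3, MPEG, ...)
        "-vn",              # ignore any video or cover-art stream
        "-ac", "1",         # mono (assumed)
        "-ar", "16000",     # 16 kHz sample rate (assumed)
        "-c:a", "pcm_s16le",
        str(dst),
    ]


def preconvert(src, out_dir, ffmpeg_exe="ffmpeg"):
    """Convert src into out_dir as WAV; raises CalledProcessError on failure."""
    dst = Path(out_dir) / (Path(src).stem + ".wav")
    subprocess.run(
        build_preconvert_command(src, dst, ffmpeg_exe),
        check=True,
        capture_output=True,
    )
    return dst
```

Decoding to uncompressed WAV up front means later stages never have to parse MP3 frames themselves, which is where "Invalid data" style errors tend to surface.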
v1.3.0
Quality & stability update
- Reset/New Session (Ctrl+N) — cancels active work, clears the UI, assigns a new target filename, and resets the timer.
- Session guard — prevents stale background threads from updating the UI after reset.
- Editable transcript — transcription results are left editable by default.
- Persistent settings — appearance, model, and Online TTS toggle saved in settings.json.
- Online/Offline TTS — toggle in UI; online uses cloud TTS (off by default), offline uses system voice.
- FFmpeg hardening — imageio-ffmpeg resolution with local fallback copy.
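One common way to implement a session guard like the one listed above is a generation counter: each reset bumps the counter, and a background thread's result is applied only if it still carries the current value. The sketch below shows that idea. The `SessionGuard` class is hypothetical and may differ from how the app does it.

```python
import threading


class SessionGuard:
    """Tracks the active session so results from older sessions are dropped."""

    def __init__(self):
        self._lock = threading.Lock()
        self._current = 0

    def new_session(self):
        """Start a new session (e.g. on Ctrl+N) and return its id."""
        with self._lock:
            self._current += 1
            return self._current

    def apply(self, session_id, update):
        """Run update() only if session_id is still the active session.

        Returns True if the update ran, False if it was stale.
        """
        with self._lock:
            if session_id != self._current:
                return False
        update()
        return True
```

A worker captures the id when it starts and passes it to `apply` when it finishes. In a GUI app the `apply` call would typically run on the UI thread, so the check and the widget update cannot be interleaved with a reset.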
v1.2.0
Core integration update
- Added Explorer right-click ("Transcribe with Qevoxyl") for .wav/.mp3.
- Stable PyInstaller spec (bundles whisper/assets; UPX disabled).
- StdIO guard for console-less builds (prevents tqdm crashes).
- Persistent per-user data paths and logs.
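For context on the StdIO guard: in windowed (console-less) PyInstaller builds, `sys.stdout` and `sys.stderr` can be `None`, so libraries that write progress output, such as tqdm, fail when they try to use them. A minimal sketch of such a guard follows. The function name `ensure_stdio` is hypothetical, and the app's own guard may work differently.

```python
import os
import sys


def ensure_stdio():
    """Replace missing stdout/stderr with a writable null sink.

    Returns the list of stream names that were patched.
    """
    patched = []
    for name in ("stdout", "stderr"):
        if getattr(sys, name) is None:
            # os.devnull discards everything written to it on all platforms.
            setattr(sys, name, open(os.devnull, "w", encoding="utf-8"))
            patched.append(name)
    return patched
```

The guard needs to run at startup, before importing anything that writes to the standard streams.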