Roadmap
Actively developed features slated for deployment in an upcoming update.
- More multilingual translation models
- Improved Voice Activity Detection
- Improved context awareness
Scheduled features for the near future.
- OpenVino support for Intel CPU and GPUs
- UI & UX overhaul
- Toast notifications
- Reduced package size
Long-term conceptual features we are researching.
- Speaker diarization
- ARM64 support
Changelog
Multilingual Translation Update
This update focuses on additions and improvements to the UI and user-experience.
-
Added
Voice Activity Detection
Added a VAD pre-pass to the transcription process to filter audio.
-
Added
Multilingual translation models
Translations are now available in multiple languages.
-
Added
Small model
Added small model to the Basic (free) tier.
-
Added
Translation window
A new translation window now shows along with the transcription.
UI & UX Update
This update focuses on additions and improvements to the UI and user-experience.
-
Added
New UI window for models
Added a new UI window to manage downloaded models and changed the flow of model selection when starting a transcription.
-
Added
Model selection for Basic tier
Users are now able to select models in the Basic (free) tier.
-
Added
Feedback buttons
Added buttons to give feedback or report AI hallucinations.
-
Improved
More transcription details
The details panel now has more information about the transcription process, also known as inference.
-
Improved
Better hardware vendor details
The details panel now shows which CPU vendor, such as Intel or AMD, is being used for the transcription process.
Initial Preview Release
The foundational proof-of-concept launch bringing private, fully offline AI transcription & translation capabilities onto the Windows desktop environment. Initially only with live transcription in 95 languages and translation to English.