Whether you need to transcribe audio or video files, microphone input, or application audio in real time, Audio Note transcribes it to text accurately and locally on your device.
1000+ people are using it
Audio Note uses the open-source Whisper model for efficient, accurate speech recognition. It also ships with out-of-the-box GPU acceleration, which significantly improves processing speed and gives users a smooth experience.
Supports real-time transcription of multiple audio and video files to text, including MP3, WAV, FLAC, AAC, M4A, MKV, and more.
Listens to microphone audio, transcribes it to text in real time, and displays it in lyric mode.
Supports real-time transcription of screen and application audio, for scenarios such as international meetings, online classes, and live broadcasts.
The transcribed text can be exported to a variety of subtitle formats, including SRT, VTT, SUB, ASS, SSA, LRC, SBV, SMI, and more.
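To illustrate what a subtitle export involves, here is a minimal sketch of writing timed transcript segments as SRT. It is not Audio Note's own code: the `Segment` class and the `format_timestamp` and `to_srt` helpers are hypothetical names introduced for this example. Only the SRT layout itself (a counter, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` line, then the text) follows the common convention for that format.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Segment:
    """One transcribed segment (hypothetical structure for this sketch)."""
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Join segments into SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{format_timestamp(seg.start)} --> {format_timestamp(seg.end)}\n"
            f"{seg.text.strip()}\n"
        )
    return "\n".join(cues)
```

For example, `to_srt([Segment(0.0, 1.5, "Hello"), Segment(1.5, 3.0, "world")])` yields two numbered cues. Other formats in the list, such as VTT or LRC, would need their own timestamp and header conventions.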
You can choose a microphone or any input device to record audio and then transcribe it.
All transcription is done on your device, with no data leaving your machine, making it ideal for sensitive audio, such as interviews.
GPU-accelerated transcription is supported on Macs with M-series chips, and the CUDA and Vulkan engines are supported on Windows.
Devices without a graphics card fall back to running the model on the CPU.
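The fallback behavior described above can be pictured as a simple preference order. The sketch below is an assumption about how such a selection might look, not Audio Note's actual logic: the function `pick_backend`, the backend labels (`"apple-gpu"`, `"cuda"`, `"vulkan"`, `"cpu"`), and the choice to try CUDA before Vulkan are all hypothetical.

```python
import platform
from typing import Set


def pick_backend(system: str, machine: str, available: Set[str]) -> str:
    """Return the first usable backend for this platform, else "cpu".

    `available` is the set of GPU backends detected on the device;
    how detection happens is outside the scope of this sketch.
    """
    if system == "Darwin" and machine == "arm64":
        # Macs with M-series chips
        preferred = ["apple-gpu"]
    elif system == "Windows":
        # Assumed order: try CUDA first, then Vulkan
        preferred = ["cuda", "vulkan"]
    else:
        preferred = []

    for name in preferred:
        if name in available:
            return name
    # No supported GPU backend found: run the model on the CPU
    return "cpu"


if __name__ == "__main__":
    # Example: a machine where only Vulkan was detected
    print(pick_backend(platform.system(), platform.machine(), {"vulkan"}))
```

Keeping the CPU as the unconditional last resort means transcription still works on any device, only more slowly.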
Supports adjusting the Whisper model's runtime parameters (prompt, offset, greedy/beam search, entropy threshold, etc.).
Supports translating transcribed text.
You can choose an app, record its audio, and transcribe it later.
Speed up your workflow with AI
More features are in the works
Here are some of the most frequently asked questions.
Ready to try out AudioNote?
Quickly convert your audio to text
Contact us