Whether you're transcribing an audio file, recording from a microphone, or transcribing in real time, Audio Note transcribes your audio accurately and entirely on your device.
1000+ people are using it
Audio Note uses the open-source Whisper model for efficient, accurate speech recognition. It also provides out-of-the-box GPU acceleration, which significantly improves processing speed and keeps the experience smooth.
Transcribes multiple audio and video files to text in real time. Supported formats include MP3, WAV, FLAC, AAC, M4A, MKV, and more.
Listens to microphone audio, transcribes it to text in real time, and displays the text in lyric mode.
The transcribed text can be exported to a variety of subtitle formats, including SRT, VTT, SUB, ASS, SSA, LRC, SBV, SMI, and more.
You can choose a microphone or any input device to record audio and then transcribe it.
All transcription is done on your device, with no data leaving your machine, making it ideal for sensitive audio, such as interviews.
GPU-accelerated transcription is available on Macs with M-series chips, and CUDA acceleration is supported on Windows.
Devices without a supported graphics card fall back to running the model on the CPU.
Supports adjusting the Whisper model's parameters (prompt, offset, greedy/beam search, entropy threshold, etc.).
Supports translating transcribed text.
You can choose an app, record its audio, and transcribe it later.
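For readers curious what subtitle export involves, here is a minimal sketch of turning timed transcript segments into SRT text. It is not Audio Note's actual implementation: the `Segment` class and the `srt_timestamp` and `to_srt` functions are hypothetical names made up for this illustration, and the sketch assumes each segment carries a start time, an end time (both in seconds), and its text.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical transcript segment; times are in seconds."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text.strip()}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [Segment(0.0, 2.5, "Hello"), Segment(2.5, 4.0, "world")]
    print(to_srt(demo))
```

Other formats such as VTT or LRC follow the same idea with different timestamp and cue layouts.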
Speed up your workflow with AI
More features are in the works
Here are some of the most frequently asked questions.
Ready to try out Audio Note?
Quickly convert your audio to text
Contact us