Last updated: 2025-05-11

Real-time App Transcription

Real-time app transcription is an Audio Note feature that lets you select a specific application and transcribe its audio as it plays. It works for online meetings, video content, and live streams.

Feature Overview

The feature captures the audio stream of one selected application and can record the microphone at the same time. All processing is done locally, which keeps your audio and transcripts on your device. Key features include:

  • Precise audio capture: Supports selecting specific application audio streams
  • Dual audio input: Can record both application audio and microphone simultaneously
  • Smart voice detection: Automatically filters silent segments
  • Multilingual support: Supports transcription in 98+ languages
  • Real-time subtitle generation: Can generate and display real-time subtitles
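To illustrate what dual audio input involves, here is a minimal sketch that combines an application stream and a microphone stream into one signal before transcription. It is an assumption for illustration only: this page does not say whether Audio Note mixes the two sources or transcribes them separately, and the function name `mix_streams` and its gain parameters are hypothetical.

```python
from typing import List, Sequence


def mix_streams(
    app_audio: Sequence[float],
    mic_audio: Sequence[float],
    app_gain: float = 1.0,
    mic_gain: float = 1.0,
) -> List[float]:
    """Mix two mono streams of samples in [-1.0, 1.0] (hypothetical helper).

    The shorter stream is padded with silence, and the sum is clipped
    so the result stays within [-1.0, 1.0].
    """
    length = max(len(app_audio), len(mic_audio))
    mixed = []
    for i in range(length):
        # Treat missing samples as silence.
        a = app_audio[i] if i < len(app_audio) else 0.0
        m = mic_audio[i] if i < len(mic_audio) else 0.0
        sample = a * app_gain + m * mic_gain
        # Hard-clip to avoid overflow when both sources are loud.
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed


if __name__ == "__main__":
    print(mix_streams([0.5, 0.5], [0.25]))
```

Hard clipping keeps the sketch short. A real mixer would more likely lower the gains or apply a limiter to avoid audible distortion.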

Main Features

  • Model selection: Users can choose between Whisper models or real-time models to balance accuracy and speed according to their needs.
  • Simultaneous recording and transcription: The software supports real-time transcription while recording audio, eliminating the need to wait for the recording to complete.
  • VAD (Voice Activity Detection): Automatically detects voice activity to ensure only meaningful content is transcribed, reducing blanks and noise.
  • Dynamic microphone switching: Users can seamlessly switch between different microphone devices during transcription.
  • Real-time translation: Supports real-time translation of transcribed text into other languages, facilitating multilingual communication.
  • Subtitle mode: Supports real-time display of transcribed text as subtitles.
  • Detailed mode: Lists the transcription of each speech segment separately so that no key information is missed.
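The idea behind VAD can be sketched with a simple energy threshold: split the audio into short frames, measure each frame's loudness, and keep only the runs of frames that are loud enough to contain speech. This is a minimal illustration, not Audio Note's implementation, which this page does not describe. The function names, the 30 ms frame size, and the 0.02 threshold are all assumptions. Production VADs typically use trained models rather than a fixed energy threshold.

```python
import math
from typing import List, Sequence, Tuple


def frame_rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def detect_speech(
    samples: Sequence[float],
    sample_rate: int = 16000,   # assumed sample rate
    frame_ms: int = 30,         # assumed frame length
    threshold: float = 0.02,    # assumed energy threshold
) -> List[Tuple[int, int]]:
    """Return (start, end) sample ranges whose frames exceed the threshold."""
    frame_len = max(1, sample_rate * frame_ms // 1000)
    segments: List[Tuple[int, int]] = []
    start = None
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        if frame_rms(frame) >= threshold:
            if start is None:
                start = i          # a voiced run begins here
        elif start is not None:
            segments.append((start, i))  # the voiced run ended
            start = None
    if start is not None:
        segments.append((start, len(samples)))
    return segments


if __name__ == "__main__":
    silence = [0.0] * 480
    tone = [0.5 * math.sin(2 * math.pi * 440 * n / 16000) for n in range(960)]
    print(detect_speech(silence + tone + silence))
```

Only the returned ranges would be passed to the transcription model, which is how silent segments get skipped.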

Use Cases

Online Meetings

  • Transcribe meetings in Zoom, Teams, and similar apps in real time
  • Generate meeting minutes
  • Save important discussion content

Video Learning

  • Transcribe educational videos from platforms such as YouTube and Bilibili
  • Generate study notes
  • Create bilingual subtitles

Game Streaming

  • Record game commentary
  • Save highlight moments
  • Create subtitles for stream replays

Online Classes

  • Transcribe online course content
  • Generate class notes
  • Create teaching materials

How to Use

  1. Open the software homepage and select "Real-time App Transcription".
  2. Choose your preferred transcription model (Whisper or real-time model) and transcription language.
  3. Select the microphone device to use (or disable the microphone).
  4. Configure transcription options, such as GPU transcription (recommended for a better experience), VAD, and real-time translation.
  5. Click the "Start" button.
  6. The software starts recording and displays the transcribed text in real time.
  7. You can switch microphones, turn the microphone on or off, or stop transcription at any time.

FAQ

Q: Can I transcribe multiple apps simultaneously?

A: Only one app can be selected for transcription at a time, but microphone audio can be recorded simultaneously.

Q: Why is audio from other apps also being transcribed?

A: This is due to system limitations. Try closing other apps and muting system sounds.

Q: Why can't I record when an app is full-screen or moved to another Space on macOS?

A: macOS security restrictions prevent access to app windows that are not in the same Space (desktop). Keep the app being recorded in the same Space and out of full-screen mode.

Q: What is the latency of real-time transcription?

A: With sufficient device performance and GPU acceleration enabled, latency is typically under 500 ms. On less capable devices, use a real-time model instead.

Copyright © 2025. Made by AudioNote, All rights reserved.