--- title: NVIDIA Parakeet TDT 0.6B V2 Real Time Mic Transcription ASR STT emoji: 📊 colorFrom: purple colorTo: blue sdk: gradio sdk_version: 5.31.0 app_file: app.py pinned: false license: apache-2.0 short_description: Real-Time, Speak to Mic, NO MODEL DOWNLOAD NEEDED!! language: en inference: true tags: - audio - speech-recognition - asr - real-time - cpu - nvidia - parakeet - microphone - voice - speech - browser - gradio - nemo - huggingface --- **Real-time English speech-to-text in your browser — no GPU required.** This Space runs the 600 M-parameter [`nvidia/parakeet-tdt-0.6b-v2`](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2) model that fits comfortably on the **CPU Basic (2 vCPU)** tier. 1. Click **“Record”** 2. **Allow microphone** access and start speaking. 3. Watch live text appear in the **Transcription** box. **Stalled UI?** Refresh the browser tab — this fully restarts the Space and clears any stuck threads. | Technique | Why it matters | |-----------|----------------| | **`OMP_NUM_THREADS=2` & `torch.set_num_threads(2)`** | Matches the 2 vCPUs for optimal throughput | | **FBGEMM backend** | Fastest kernels on x86 | | **4-second streaming window** | Low latency & small memory footprint | | **Gradio `stream_every=0.5`** | Updates the transcript twice per second for real-time feel | | Item | Licence | |------|---------| | **Demo code (this repo)** | Apache-2.0 | | **Model weights** – `nvidia/parakeet-tdt-0.6b-v2` | CC-BY-4.0 (© NVIDIA) | **If you redistribute transcripts or fine-tuned weights, please retain the CC-BY-4.0 attribution notice.**