YapYap Documentation
Everything you need to get up and running, pick the right models for your machine, and tune the AI to write exactly the way you want.
🚀
Getting Started
Install, grant permissions, download your first model, and transcribe in under 5 minutes.
🧠
Models & Backends
Pick the right speech and language model for your Mac's RAM, language, and quality needs.
🎛️
Customization
Custom prompts, per-app styles, personal dictionary, and VAD tuning.
❓
FAQ & Troubleshooting
Answers to the most common questions, and fixes for the most common problems.
How it works
Every recording goes through this pipeline — in under 3 seconds on a good machine.
1
Audio Capture
AVAudioEngine captures 16 kHz mono audio from your microphone while you hold the hotkey.
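The capture step can be sketched as follows. This is a minimal illustration, not YapYap's actual implementation: it assumes the mic's native format is resampled to 16 kHz mono with an AVAudioConverter, and omits error handling and buffer accumulation.

```swift
import AVFoundation

let engine = AVAudioEngine()
let input = engine.inputNode
let inputFormat = input.outputFormat(forBus: 0)

// Target format the STT models expect: 16 kHz, mono, 32-bit float PCM.
let targetFormat = AVAudioFormat(commonFormat: .pcmFormatFloat32,
                                 sampleRate: 16_000,
                                 channels: 1,
                                 interleaved: false)!
let converter = AVAudioConverter(from: inputFormat, to: targetFormat)!

input.installTap(onBus: 0, bufferSize: 4096, format: inputFormat) { buffer, _ in
    let capacity = AVAudioFrameCount(
        Double(buffer.frameLength) * targetFormat.sampleRate / inputFormat.sampleRate)
    guard let out = AVAudioPCMBuffer(pcmFormat: targetFormat,
                                     frameCapacity: capacity) else { return }
    var consumed = false
    converter.convert(to: out, error: nil) { _, status in
        // Feed the tap buffer exactly once per conversion call.
        if consumed { status.pointee = .noDataNow; return nil }
        consumed = true
        status.pointee = .haveData
        return buffer
    }
    // `out` now holds 16 kHz mono samples ready for the VAD stage.
}

try engine.start()
```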
2
VAD Filtering
Silero VAD strips silence and background noise before audio reaches the STT model, preventing the hallucinated text Whisper is prone to producing on silent input.
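Conceptually, the VAD gate scores short fixed-size frames and keeps only speech. The sketch below assumes a hypothetical `speechProbability(_:)` wrapper around the Silero model (Silero operates on 512-sample frames at 16 kHz, about 32 ms); the real pipeline also applies smoothing and padding around speech segments.

```swift
// `speechProbability` is a stand-in for the Silero inference call,
// returning a per-frame speech probability in [0, 1].
func filterSpeech(_ samples: [Float],
                  frameSize: Int = 512,
                  threshold: Float = 0.5,
                  speechProbability: ([Float]) -> Float) -> [Float] {
    var kept: [Float] = []
    var start = 0
    while start + frameSize <= samples.count {
        let frame = Array(samples[start ..< start + frameSize])
        if speechProbability(frame) > threshold {
            kept.append(contentsOf: frame)   // keep speech frames
        }
        start += frameSize                   // drop silence / noise frames
    }
    return kept
}
```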
3
Speech-to-Text
Parakeet TDT v3 (Neural Engine) or Whisper (CoreML GPU) converts audio to raw text on-device.
4
LLM Cleanup
A local LLM (Qwen / Llama / Gemma via MLX, llama.cpp, or Ollama) removes fillers, fixes grammar, and formats for your active app.
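For the Ollama backend, the cleanup step amounts to one HTTP call to the local server. A minimal sketch, assuming Ollama's standard `/api/generate` endpoint; the model tag and prompt wording here are illustrative, not YapYap's actual defaults.

```swift
import Foundation

struct OllamaResponse: Decodable { let response: String }

func cleanup(_ raw: String) async throws -> String {
    var req = URLRequest(url: URL(string: "http://localhost:11434/api/generate")!)
    req.httpMethod = "POST"
    req.setValue("application/json", forHTTPHeaderField: "Content-Type")
    req.httpBody = try JSONSerialization.data(withJSONObject: [
        "model": "qwen2.5:3b",   // illustrative model tag
        "prompt": "Remove filler words and fix grammar:\n\(raw)",
        "stream": false          // single JSON response instead of a stream
    ])
    let (data, _) = try await URLSession.shared.data(for: req)
    return try JSONDecoder().decode(OllamaResponse.self, from: data).response
}
```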
5
Paste
Clean text is injected into your active app via clipboard + synthetic Cmd+V — no typing required.
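The paste step can be sketched with standard AppKit and Core Graphics APIs: put the text on the general pasteboard, then synthesize Cmd+V. This is an illustration of the technique, not YapYap's exact code, and posting synthetic key events requires the Accessibility permission.

```swift
import AppKit
import CoreGraphics

func paste(_ text: String) {
    let pb = NSPasteboard.general
    pb.clearContents()
    pb.setString(text, forType: .string)

    let src = CGEventSource(stateID: .hidSystemState)
    let vKey: CGKeyCode = 9   // 'V' on ANSI keyboard layouts
    let down = CGEvent(keyboardEventSource: src, virtualKey: vKey, keyDown: true)
    let up   = CGEvent(keyboardEventSource: src, virtualKey: vKey, keyDown: false)
    down?.flags = .maskCommand   // hold Cmd for both events
    up?.flags = .maskCommand
    down?.post(tap: .cghidEventTap)
    up?.post(tap: .cghidEventTap)
}
```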
Architecture
YapYap is a native Swift + SwiftUI app. No Electron, no web views, no cloud.
STT Layer
- WhisperKit (CoreML)
- FluidAudio / Parakeet (ANE)
- whisper.cpp (GGML)
- Apple SpeechAnalyzer (macOS 26+)
LLM Layer
- MLX Swift (safetensors)
- llama.cpp (GGUF)
- Ollama (HTTP API)
Context Layer
- NSWorkspace app detection
- Accessibility (AX) API window/field reading
- 11 app categories
- Per-category prompt rules
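The context layer's app detection can be sketched as a lookup from the frontmost app's bundle identifier to a prompt category. The bundle IDs and category names below are illustrative examples, not YapYap's actual category table.

```swift
import AppKit

func promptCategory() -> String {
    guard let bundleID = NSWorkspace.shared.frontmostApplication?.bundleIdentifier
    else { return "general" }
    switch bundleID {
    case "com.apple.mail", "com.microsoft.Outlook":
        return "email"        // formal tone, greetings/sign-offs
    case "com.tinyspeck.slackmacgap":
        return "chat"         // casual tone, short messages
    case "com.apple.dt.Xcode", "com.microsoft.VSCode":
        return "code"         // preserve identifiers, no auto-capitalization
    default:
        return "general"
    }
}
```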
Data & UI
- SwiftData (SQLite)
- AVAudioEngine
- Silero VAD
- Sparkle auto-update
- KeyboardShortcuts
- SwiftUI + AppKit hybrid
All models are stored in ~/Library/Application Support/YapYap/models/ — never in ~/Documents (iCloud eviction hazard).
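Resolving that directory with FileManager looks roughly like this sketch; the subpath matches the location stated above, while the directory-creation details are an assumption.

```swift
import Foundation

func modelsDirectory() throws -> URL {
    // ~/Library/Application Support, per-user domain.
    let appSupport = try FileManager.default.url(for: .applicationSupportDirectory,
                                                 in: .userDomainMask,
                                                 appropriateFor: nil,
                                                 create: true)
    let dir = appSupport
        .appendingPathComponent("YapYap")
        .appendingPathComponent("models")
    // Create the nested folders on first launch if they don't exist yet.
    try FileManager.default.createDirectory(at: dir,
                                            withIntermediateDirectories: true)
    return dir
}
```

Keeping models out of ~/Documents avoids iCloud Drive evicting multi-gigabyte files to free local disk space.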