
Work faster
Speech-to-Text, Speech-to-Prompt, Read Aloud, and AI chat with Gemini, GPT, Grok, or offline with Whisper on macOS 15.5+.
Demo video
Features







Setup
Bring your own Google API key from Google AI Studio for Gemini cloud features, or add OpenAI or xAI API keys for GPT and Grok models. Offline Whisper transcription works without any API key.
FAQ
What is the best dictation app for Mac?
WhisperShortcut is a macOS menu bar app for voice-first productivity. Press a shortcut, speak, and your words are transcribed straight to the clipboard — using Google Gemini, GPT, or Grok in the cloud, or Whisper fully offline. No account, no subscription.
Is there a dictation app for Mac that works offline?
Yes. WhisperShortcut runs Whisper locally on your Mac, so transcription works without internet and without any API key. Your audio never leaves the device.
Can I use dictation on Mac without a subscription?
Yes. WhisperShortcut is BYOK (bring your own key): you add your own Google, OpenAI, or xAI API key and pay those providers directly at cost. There is no subscription and no middleman backend.
How is WhisperShortcut different from Apple Dictation?
WhisperShortcut is not locked to one built-in engine. You bring your own API keys and pick whichever model is currently best on the market — Google Gemini, OpenAI GPT, or xAI Grok — and switch the moment a stronger one ships. On top of plain transcription it adds AI on your voice: Speech-to-Prompt rewrites clipboard text from a spoken instruction, Read Aloud speaks answers back, and there is a built-in AI chat. You can also stay fully offline with Whisper.
What is Speech-to-Prompt?
You copy some text, press a shortcut, and speak an instruction like "make this more formal." WhisperShortcut sends both to an AI model and puts the rewritten result on your clipboard — voice-driven editing without typing a prompt.
Can WhisperShortcut read text aloud on Mac?
Yes. Highlight any text — or a chat reply — press the Read Aloud shortcut, and WhisperShortcut speaks it back using Gemini, GPT, or Grok voices. You can pick a voice per provider, adjust the playback speed, and optionally let AI rewrite the text for more natural spoken output.
Does WhisperShortcut support meeting transcription?
Yes. Live Meeting records and transcribes in real time using silence-based chunking, so long meetings are transcribed continuously rather than in one giant upload.
Which AI models does WhisperShortcut support?
Google Gemini, OpenAI GPT, and xAI Grok for cloud transcription, prompting, and chat — plus offline Whisper. You bring your own API keys and switch models anytime.
Is my data private with WhisperShortcut?
There is no WhisperShortcut backend and no telemetry — requests go directly from your Mac to the provider you chose (Google, OpenAI, or xAI), or stay fully local with offline Whisper. No account is required.
Support & Feedback
If you have feedback, if something doesn't work, or if you have suggestions for improvement, feel free to contact me—via WhatsApp or on GitHub.
