Speech goes to the cloud.
Your audio is streamed to a server, transcribed remotely, and stored — often with a login, retention policy, and a per‑minute bill attached.
OpenDicta is a free, open-source speech-to-text app built with Tauri. It runs Freely on your device, keeps your voice private, and gives you optional AI features only when you choose to use them.
A floating voice bar that captures your words, and a full dashboard to search, refine, and act on everything you've said.
Default mode runs entirely on‑device. No upload, no account, no API call. The difference between cloud dictation and OpenDicta, side by side:
Your audio is streamed to a server, transcribed remotely, and stored — often with a login, retention policy, and a per‑minute bill attached.
A local ASR engine handles transcription on your CPU. Nothing leaves the machine. Nothing is logged. No account is created.
Flip on AI mode and your raw transcript can be cleaned, summarised, reformatted or translated — using a model you choose, with a key you own. We never proxy your traffic.
Headings, bullets, code fences, punctuation. The AI tidies the shape without changing your words.
Drop a 30‑minute standup transcript in, get the decisions, blockers and action items out — formatted however you like.
A pass that removes filler, false starts and verbal tics — keeping your voice, losing the hesitation.
Dictate in any of the 25 supported languages, get the transcript in your target — instant cross‑lingual notes.
No copy‑paste dance. Press the shortcut anywhere, browser, IDE, terminal, chat and OpenDicta types straight into the active text field.
Tested on 60+ apps. Custom shortcut per AI tool · push‑to‑talk or hands‑free · respects focused field.
Five rough shapes of how teams put OpenDicta to work. Yours will look different.
OpenDicta sits in the background. Speak the change, get a clean commit message — Freely, so proprietary code never touches a cloud.
Long Zendesk replies become a 30‑second voice memo. AI mode formats them into a polite, structured response.
Record a 90‑minute lecture, get a marked‑up transcript with definitions surfaced. Works in Danish, English, German and 22 others.
Walk‑and‑dictate first drafts. AI cleanup removes filler without rewriting your voice — the cadence stays yours.
Speak the client call summary while it's fresh. AI turns it into a scope doc you can send before the kettle's boiled.
We add real workflows to the docs as testers share them. If yours is novel, we'll feature it (with permission).
Every feature is free — local transcription and AI features alike. No paywall, no Pro tier, no upsell. OpenDicta stays alive because people who can afford to chip in, do. And because a small number of values‑aligned partners help us keep the lights on.
The whole app. No tiers, no gated features, no trial countdown.
no credit card · no account · MIT licensed
Two ways the project stays alive — both optional, both transparent.
OpenDicta started as a weekend script to dump a long voice memo into clean markdown. It grew into a small, opinionated tool: capture audio, transcribe it Freely, optionally polish it with an LLM you control.
No accounts, no telemetry, no subscription. no spam. no advertising.
I dictate every standup, every commit message, half my emails. The voice bar floats over whatever I'm doing and I just press a key. It's the most invisible piece of software I use.
I dictate every PR description now. Low enough latency that I don't break my train of thought.
I used to lose half my ideas between thinking them and finding the keyboard. Now I just talk.
10-minute in Danish. Clean, No upload, no login. I love it!
Yes. Once a language model is downloaded (≈40 MB per language), transcription runs entirely on your CPU. Airplane mode, terminal, train — OpenDicta keeps working. AI mode is the only feature that needs a connection, and only when you actively trigger it.
No. Audio buffers live in RAM for the duration of a transcription, then they're discarded. Nothing is written to disk, nothing is sent over the network. If you enable AI mode, only the resulting text transcript is sent to the provider you chose — never the audio.
No. Download the App, run it, dictate. There's no sign‑up flow, no email gate, no telemetry ping.
Yes — that's how AI mode is designed. Paste an Anthropic, OpenAI, Gemini, or local LLM endpoint into Settings. Traffic flows directly from your machine to the provider; we never proxy, never see your tokens, never mark them up.
25 European languages including Bulgarian (bg), Croatian (hr), Czech (cs), Danish (da), Dutch (nl), English (en), Estonian (et), Finnish (fi), French (fr), German (de), Greek (el), Hungarian (hu), Italian (it), Latvian (lv), Lithuanian (lt), Maltese (mt), Polish (pl), Portuguese (pt), Romanian (ro), Slovak (sk), Slovenian (sl), Spanish (es), Swedish (sv), Russian (ru), Ukrainian (uk).
Windows 10/11, macOS 12+ (Apple Silicon and Intel), and major Linux distros (Ubuntu, Fedora, Arch).
Yes, MIT licensed. The local app, the model‑downloader and the workflow runtime all live in the same public repository. Fork it, audit it, ship a derivative — that's the point.
Get the install link the day v1.0 ships. No marketing, no spam, one email, one download.
By joining you agree to receive a small number of launch announcement emails. No spam, ever. Privacy Policy