The Fastest Way to Capture Voice Notes into Obsidian on iPhone (and Apple Watch) in 2026

I've watched hundreds of Obsidian users describe the same arc.
They love their vault on desktop. Templates dialed in, plugins humming, Daily Notes flowing. Then they pull out their phone.
Mobile Obsidian is brutal. The keyboard is small, the app is slow to load, the input friction kills any thought that wasn't fully formed before they pulled the phone out. So they cobble something together. Quick Capture this month. Drafts next month. A custom Shortcut after that. Maybe Voice Memos with a vague plan to transcribe later, knowing full well they never will.
Three weeks per fix, on average. Then it breaks, or they stop using it, and they're back at the friction wall.
This article is for that person. Not the desktop Obsidian power user. The Obsidian power user trying to make Obsidian work on their phone, who has lost faith in the cobbled workflow but doesn't want to give up on their vault.
There's a simpler answer in 2026. It uses your voice, an Action Button or your Apple Watch, and one app that knows where you want the note to land. Setup takes five minutes. After that, capture is one tap, and the note lands in your vault the next time you open Obsidian.
Here's the workflow.
The three options Obsidian users actually try
Before we get to the workflow, let's name the patterns. If you're reading this, you've probably tried one or two of these.
Native Obsidian Mobile typing
Open Obsidian, tap into Daily Note, type. The keyboard takes half the screen and the thought evaporates mid-sentence.
Voice Memos + manual transcribe
Hit record, save, promise yourself you'll process it later. Apple now transcribes, but the memo still lives in a filing cabinet you never open.
Drafts + Shortcuts + webhook
Custom Shortcut, maybe a webhook, maybe a plugin watcher. Works until iOS updates and breaks the chain.
1. Native typing in Obsidian Mobile
This is where most people start. Open Obsidian, tap into Daily Note, type. The friction wall hits in three places: the app takes a beat to load, the keyboard takes up half the screen, and any thought that wasn't fully formed evaporates while you're still composing the first sentence.
Reddit's r/ObsidianMD has a recurring thread: "How do you get ideas into Obsidian fast when you're away from the keyboard?" The question keeps coming back because the answer hasn't crystallized. People recommend voice memos, plugins, Shortcuts, Drafts. No single solution wins, which is its own kind of answer about the problem.
2. Voice Memos (now with transcription, still broken)
The fallback. You hit record in Apple's Voice Memos, talk for thirty seconds, save. Apple does the transcription for you these days, so you tell yourself it'll be fine.
Except it isn't. The memo and its transcript sit in a separate filing cabinet that you never actually open. By the time you've got eight unprocessed memos, two of them aren't even labeled and you're not sure which idea was in which file. The system never gets to your vault, where the rest of your thinking lives.
3. The DIY power-user workaround: Drafts plus Shortcuts plus webhook
The third option is where most committed Obsidian users land. Drafts captures the voice or text, a custom Shortcut routes it, sometimes through a webhook to Obsidian Sync's iCloud folder, sometimes through Working Copy and git, sometimes through a community plugin that watches for a specific file.
It works. Until iOS updates and the Shortcut quietly breaks. Or Drafts changes the URL scheme. Or the plugin maintainer takes a six-month hiatus. Or you got a new phone and forgot which step in the chain holds the API token.
The DIY workaround is the most respected solution in the Obsidian community precisely because it requires the most maintenance. Wearing the maintenance as a badge of honor is a tell that the underlying capture problem isn't solved.
There are some variants. Voice notes apps like Audiopen, Voicenotes, or v2md (a Reddit-built voice-to-markdown iOS app) take you part of the way. They give you back text. The handoff from "I have clean text" to "the text is in my vault, formatted correctly, in the right folder" is still on you.
Quick Capture and Quick Draft are Obsidian community plugins that get closer. Both require Obsidian Sync to work cleanly, and neither has a full Apple Watch story.
What's missing across all three options is the thing that matters: a single tap from voice to vault, formatted the way you want it, configurable to land where you want it, without you having to maintain the connective tissue.
That's the workflow.
The voice-to-action approach: configure once, capture forever
Epiphany is a voice workflow tool. Not a voice notes app, not a transcription tool, not a journaling app. The product does one thing well: it takes your voice, runs it through an AI prompt you chose, and routes it to a destination you chose. One tap from your iPhone or Apple Watch.
Here's how that lands for Obsidian specifically.
You configure an action one time. The action settings are:
- Destination: where the note lands. Daily Note, append to an existing note, or create a new note.
- AI prompt: what happens to your transcribed words between voice and vault. Strip filler words and clean up. Convert to bullet points. Extract action items. Keep verbatim. Whatever you want.
- Audio file (optional): include a link to the original recording alongside the note.
That setup takes about five minutes. From that point on, capturing is one tap.
Three ways to open Epiphany quickly
- Action Button on iPhone (15 Pro and later). Long-press to launch Epiphany from anywhere.
- Lock-screen widget. Opens Epiphany from your lock screen, no detour to the home screen.
- Apple Watch. Open Epiphany on the Watch and capture there. Notes sync to your phone, then route to your vault.
Once Epiphany is open, you tap your action and speak. The flow looks like this:
- 1ConfigureOne-time setup: destination + AI prompt
- 2Open EpiphanyAction Button, lock-screen, or Watch
- 3Tap + speakYour action, your prompt
- 4Lands in vaultOn the next Obsidian Sync
That's not an Epiphany behavior; it's how Obsidian Mobile sync works. Every tool that writes to your vault on iPhone is bound by the same constraint.
Why this beats the three other options
Native typing has the keyboard friction problem. Voice Memos has the backlog problem. The DIY workaround has the maintenance problem.
The voice-to-action approach has a different shape: the routing is the product. You don't connect five things together. You configure one thing.
You can still build complex workflows. One Obsidian action lands in Daily Note. A second one appends to a specific note. A third creates a new note per capture. They all live in the same app, configurable from the same screen.
The mental model is: once you've taught Epiphany where things go, you stop thinking about where things go. Tap, speak, done. Your vault stays clean because the format is baked into the action. Your folders stay organized because the destination is baked into the action.
This is what we mean by "voice-to-action." Most voice tools hand you text and stop there. Epiphany hands the text to the place it belongs.
A note on what's NOT happening here: Epiphany is not running an AI conversation with you. It's not asking follow-up questions. It's not synthesizing across captures. If that's what you want (it's a real and legitimate workflow), Voicepal does that well. Epiphany is for the moment when you have a thought, you know where it goes, and you want it there. Now.
How to set it up in five minutes
This is the one-time setup. After this you're just tapping and speaking.
1. Install Epiphany. Get it on the App Store. Free to download. The annual plan includes a seven-day trial, and there's a monthly plan if you want to test for a single month.
2. Connect Obsidian. On Epiphany's recording screen, tap Create Action and pick Obsidian. Epiphany walks you through installing the Epiphany Community Plugin inside Obsidian, then verifies the connection. Once verified, the vault you granted access to via the Community Plugin is available as a destination.
3. Create your first Action. An Action is the bundle of (destination + AI prompt + optional audio file link). For your first one, try this:
- Name: "Quick capture to Daily Note"
- Destination: Daily Note (append to today's entry)
- AI prompt: "Clean up the transcription. Remove filler words. Preserve voice and structure. Format as markdown."
4. Test the loop. Trigger the action, say a sentence, stop. Open Obsidian. The note appears in today's Daily Note when Obsidian Sync catches up.
If you don't see the note, the most common issue is Obsidian Sync hasn't pulled the latest changes. Force a sync in Obsidian (pull down to refresh on the Vault tab). The note will be there.
What ends up in your vault
The note that lands is a normal markdown file (or appended block in an existing file, depending on your action). Nothing proprietary, nothing escaped from your control.
A typical capture from a five-second voice trigger:
Idea for the Q3 product brief: instead of leading with the feature list, open with the user's workflow before and after. The before is the painful part. People remember pain better than features.
That's it. Markdown, in your vault, in the format you defined. If you opted for raw transcription without cleanup, you'd see the filler words. If you opted for bullet conversion, you'd see bullets. The AI prompt does the formatting on the way in, so your vault stays exactly the way you set it up.
Your existing system stays your system
If you have templates, they get respected. If you have frontmatter conventions, you can bake them into the action. If you have a tag schema, the AI prompt can apply the right tag based on what you said.
The most common worry I hear from Obsidian users when they first set up Epiphany is "will this mess up my system?" The answer is: only if you configure it to. Out of the box, the default action appends to your Daily Note as plain markdown. You decide everything from there.
Obsidian Sync
Obsidian Mobile syncs to your vault when you open the app. This applies to every tool that writes to your vault on iPhone, not just Epiphany. Drafts has the same constraint. Quick Capture has the same constraint. Native typing in Obsidian Mobile has it too (open a new note while offline, the note exists locally but doesn't propagate until sync runs).
The honest framing: Epiphany hands your captured note to Obsidian Sync immediately. Obsidian Sync writes the file to your vault the next time you open Obsidian.
The Apple-Watch-plus-Epiphany combo: the dumb-phone-alternative for PKM users
A note for the digital minimalists in the room. The dumb phone movement has been growing. Light Phone, Punkt, Mudita, the Boring Phone. The argument: phones are designed for distraction; you'd think better without the dopamine slot machine in your pocket.
The argument is half right. The phone's distractions are real. Trading away the phone entirely is throwing the baby out with the bathwater, because there's a class of phone uses that aren't distractions. Capture is one of them.
The Apple-Watch-plus-Epiphany combo is closer to the dumb-phone position than most setups, and it doesn't ask you to give up your vault or your AI plugins.
Speak into your Watch when an idea hits. The capture lands in Epiphany, routes to your Obsidian vault. You never unlock your phone. The notification doesn't come, the scroll doesn't start, the next two hours of your evening don't disappear into Instagram.
If you've been eyeing a Light Phone but can't quite cut the cord because you actually use your iPhone for things that matter (your knowledge base being one), the Apple Watch is a partial answer. Voice capture is the killer use case for it.
The dumb-phone alternative isn't a dumb phone. It's a phone you use with intention.
How Epiphany compares to the other voice-to-Obsidian options
Honest comparison. All of these have legitimate use cases. The right one depends on what you optimize for.
| Tool | Voice-to-vault time | Markdown native | Apple Watch | Maintenance | Best for |
|---|---|---|---|---|---|
| Native Obsidian Mobile typing | Depends on typing speed | Yes | No | Zero | Quick notes when you're already in the app |
| Voice Memos + manual transcribe | Hours to days (the backlog problem) | After your manual work | No | Zero, but high effort per note | When you don't actually need the note in your vault |
| Drafts + Shortcuts + webhook | 15-30 seconds when working | Yes | Yes (via Drafts complication) | High. Breaks on iOS updates. | Power users who enjoy maintaining their stack |
| Audiopen + Shortcut to Obsidian | 30-60 seconds | Yes if Shortcut handles it | Recent Audiopen update | Medium. Audiopen still requires manual paste in many flows. | Users who want polished text output more than instant routing |
| Quick Draft / Quick Capture plugins | 10-20 seconds | Yes | No | Low (plugin maintained) | Pure Obsidian-ecosystem purists |
| Epiphany | One tap inside Epiphany; lands in vault on next Obsidian open | Yes | Yes (native) | Low (Action Button + lock-screen widget + Watch all work out of box) | The fastest one-tap-to-vault path that scales beyond Obsidian |
When NOT to use Epiphany
If any of the following describe you, Epiphany is the wrong tool:
- You need fully local, on-device transcription. Epiphany transcribes in the cloud. If you need air-gapped capture, look at offline Whisper or similar.
- You're Android-first. Epiphany is iOS-only today.
- You want AI to develop your idea conversationally ("ask me follow-up questions about what I just said, help me draft a longer piece"). That's Voicepal's lane. Epiphany doesn't have a shadow-reader feature.
- You're committed to a fully open-source, plugin-only Obsidian setup. Quick Capture or Quick Draft will fit your ethics better than a SaaS tool.
For everyone else (most Obsidian users on iPhone, frankly), the math works out. Five-minute setup, voice-to-vault in seconds from there.
FAQ
Try the workflow
Epiphany is free to try on iPhone. The five-minute setup is the same for every integration. After that, your tools get fed by voice instead of by typing.
