Write documents without a keyboard or mouse – completely

AgentDoc is a voice-controlled text editor for people who can't, shouldn't, or simply don't want to type. From the first word to the exported PDF, every operation – writing, formatting, page breaks, navigation, table of contents, export – is driven by voice. There is no required keyboard interaction at any point. No installation, no license fee, no training period, no platform restriction. Open the editor in any modern browser and speak your first sentence.

What "completely hands-free" actually means here

Most "voice writing" tools transcribe speech to text and then make you reach for the keyboard for everything else – formatting, fixing the cursor, choosing a font, navigating to page 3. AgentDoc closes that gap. Concretely, all of the following work by voice:

For people with motor disabilities

AgentDoc is designed to remove the keyboard and mouse entirely from the document-writing workflow. People with the following conditions can use it as a primary writing tool:

Important: this page describes capabilities, not a medical recommendation. Whether AgentDoc is the right tool for any individual depends on speech intelligibility, cognitive load tolerance, and clinical context. Caregivers and therapists are welcome to evaluate it directly – see the section below.

For RSI, post-injury, and temporary needs

Not every reason to stop typing is permanent. AgentDoc is also for people who used to type fine and need a competent voice writing tool right now:

A free Dragon NaturallySpeaking alternative – that also formats

If you've used Dragon and been frustrated by the $299–$699 price tag, the Windows-only restriction, the dropped Mac support, the long training period, or the fact that it transcribes but doesn't actually edit your document, AgentDoc is built around exactly that gap. Free. Browser-based. macOS, Windows, Linux, iOS, Android – anywhere you have a microphone and Chrome, Safari, Firefox, or Edge. No training. And formatting works mid-sentence, not as an afterthought.

For caregivers, family members, and therapists

If you're evaluating AgentDoc on behalf of a patient, family member, or client, here is what's relevant:

The questions people actually ask before trying this

Can someone with ALS or tetraplegia write a complete letter with this – without help?

Yes. From an empty document to an exported PDF letter, no keyboard input is required. The only pointer action is starting the voice session by clicking the microphone button, which any assistive switch that simulates a click can trigger. From that point on, everything is done by voice: headings, paragraphs, formatting, page layout, signature placement, and PDF export.

Is this a free alternative to Dragon NaturallySpeaking?

Yes. AgentDoc is free to use, runs in any modern browser, and works on macOS, Windows, Linux, iOS, and Android. No license, no per-user fee, no platform exclusivity, no training period. Unlike Dragon, AgentDoc is not just a transcription engine – it formats, restructures, navigates, and exports documents by voice through an AI agent that understands document context.

I have RSI. Can I still write professional documents (cover letter, report, contract)?

Yes. The use cases AgentDoc is built for include exactly this: dictating a formal letter, applying mid-sentence formatting like "make the recipient's name bold," inserting headings and page breaks, generating a table of contents, and exporting a clean PDF. Nothing about the workflow assumes a working keyboard.

How is this different from iOS Dictation, Windows Voice Access, or Google Voice Typing?

Those tools transcribe speech into text at the current cursor location. They don't format, don't restructure, can't insert page breaks, can't navigate, and can't export. The moment you need to do anything beyond entering text, you are back at the keyboard or mouse. AgentDoc operates at the document level – every editing operation is exposed to the voice agent.

Will Dragon or AgentDoc work better for me?

Dragon builds on decades of work on speaker-adaptive recognition and may produce slightly more accurate transcription for some users with non-standard speech. AgentDoc's advantage is the editor itself: full document control by voice, real-time multi-page formatted output, and PDF export without ever leaving voice mode. If your bottleneck is transcription accuracy on accented or atypical speech, Dragon may still win. If your bottleneck is "I can dictate text but I can't format or restructure the document," AgentDoc is built to solve that.

Are my documents and my voice secure?

Voice audio is streamed to Google Gemini Live for processing. Documents are stored on AgentDoc servers in Germany under the GDPR (DSGVO in German). Per-user document isolation, JWT-based authentication, and HTTPS apply throughout. For especially sensitive documents (medical records, legal correspondence, attorney-client material), the text chat mode is recommended – it uses the same agent and the same tools, but no audio is streamed externally.

Does this work with screen readers?

The editor surface is text-based and works with VoiceOver, NVDA, and JAWS for navigation between controls. The voice mode itself is the recommended way to read and edit documents – say "read me page 2" and the agent reads the content out loud, identifying headings and paragraph boundaries.

Does this also work in German?

Yes. Gemini Live supports German (and roughly 30 other languages) for both speech recognition and the agent's spoken confirmations. Commands such as "Mach die Überschrift fett und blau" ("Make the heading bold and blue") or "Füge einen Seitenumbruch vor 'Schlussfolgerung' ein" ("Insert a page break before 'Conclusion'") work just as they do in English.

Try it now – without commitment

AgentDoc is free, browser-based, and requires no install. Open the editor, press the microphone button, and dictate your first instruction. If you're evaluating it for someone else, the same trial works.

Open the voice editor | Read the research →

Behind the scenes