Real Time AI Audio Transcription

What is Ermine.AI and how does it handle audio transcription?
Ermine.AI is a browser‑based tool that records and transcribes audio entirely on your device (100% local). All transcription processing happens in your browser, so your audio data isn’t sent to external servers. On first use, the transcription model needs to be downloaded and cached (about 50 MB), which may take a few minutes. After that, English transcription is available in future sessions.
How do I start a transcription session in Ermine.AI?
On the tool’s interface, click the “Click to begin transcribing” button. If prompted, grant microphone access. The tool will start recording and transcribing; you can download the audio and transcript afterward as separate files.
Which languages can Ermine.AI transcribe?
Currently, Ermine.AI supports only English transcription.
Is Ermine.AI fully local or does it send data to the cloud?
All processing happens in your browser on your device; no data is sent to external servers.
How can I save or export my transcription and audio?
You can download both the audio recording and the transcript from the interface (the option is labeled “Download Audio + Transcript”).
What are the limitations of Ermine.AI?
- Language support is limited to English.
- Accuracy may vary for proper nouns, technical terms, or strong accents.
- Local processing requires a reasonably capable device to run smoothly.
- The first use may take a few minutes to download and initialize the model.
- You must grant microphone access in your browser.
What do I need to use Ermine.AI?
A web browser, permission to access your microphone, and time for the initial model download/initialization (about 50 MB) when using the tool for the first time.
How long does the initial model download take?
The model files are about 50 MB and may take a few minutes to download and cache during the first use. Subsequent sessions load faster.



























