How to add native-speaker audio to Anki cards automatically

Audio is one of the highest-value fields you can put on a flashcard. Hearing a native speaker say a word fixes its pronunciation, rhythm and tone far better than a written transliteration ever will. The problem is that adding audio by hand is tedious — which is why most decks never have it. This guide shows you how to add native-speaker audio to your Anki cards automatically, in 40+ languages, without recording anything yourself.

Why audio matters on a flashcard

Reading a word and saying it correctly are two different skills. If you only ever see a word written down, you build a silent, internal pronunciation that is often wrong — and wrong pronunciations are hard to unlearn. A native-speaker clip on every card trains your ear and your mouth at the same time, and it is essential for tonal languages like Mandarin Chinese and for scripts where spelling does not map cleanly to sound.

The manual way (and why it falls apart)

To add audio by hand you would find a pronunciation source, download an audio file for each word, rename it, drop it into Anki's media folder, and then reference it in the card's field with the correct [sound:...] tag. Multiply that by a few hundred words and the project collapses. Almost everyone gives up and studies without audio.

The automatic way with AnkiExpress

AnkiExpress generates a native-speaker audio clip for every card and wires it into your import file for you. Here is the full workflow:

Capture your words. Use the Chrome extension to capture words and phrases while you read. The capture dialog already lets you preview the native-speaker pronunciation. See how to make Anki cards from web articles for the capture step in detail.
Install the Windows companion app. The AnkiExpress companion app is what actually generates the audio files and places them in your Anki media folder. This step is what turns translation-only cards into cards with sound.
Set languages and folders. In the app, choose your source and target language, point it at your Anki media folder (it auto-detects a standard Anki install), and pick where to save the generated import file.
Generate the file with audio. Click generate. The app creates a native-speaker clip for each captured card and references it in the file automatically — no renaming, no [sound:] tags by hand.
Import into Anki. In Anki, click Import file, select the generated file, confirm the field separator is Tab, and import. Every card now plays native-speaker audio.

For the screenshot walkthrough of the companion app, see the tutorial's "With audio" section.

Tips for better audio cards

Put the audio on the answer side. Try to recall and say the word first, then reveal the card and check your pronunciation against the native clip.
Capture short phrases. Audio for a two- or three-word phrase teaches natural rhythm and liaison that single words miss.
Pick the right target language voice. Setting the correct target language in the companion app ensures the accent and pronunciation match what you are learning.

Frequently asked questions

Do I need to record audio myself?

No. AnkiExpress generates native-speaker pronunciation automatically for every card — you never record your own voice or hunt for sound files.

Which languages have audio?

AnkiExpress generates native-speaker audio in 40+ languages, including French, Spanish, Mandarin Chinese, Japanese, German, Italian, Korean, Russian, Arabic and Hindi.

Why do I need the Windows companion app for audio?

The Chrome extension captures words and previews pronunciation, but the companion app writes the audio files into your Anki media folder and builds an import file that links each card to its sound. Without it you still get translation-only cards.

Give your deck a voice

Cards with audio are the ones that actually improve how you speak. Create a free account, add the Chrome extension, and install the companion app to start generating audio cards automatically.