Open Speech and Language Resources


Identifier: SLR8

Summary: Danish pronunciation dictionary generated using eSpeak

Category: Text

License: Unrestricted

Downloads (use a mirror closer to you):
lexicon-da.tgz [550K]   (Danish pronunciation dictionary )   Mirrors: [US]   [EU]   [CN]  
lexicon-da-nonorm.tgz [523K]   (Danish pronunciation dictionary (case retained) )   Mirrors: [US]   [EU]   [CN]  

About this resource:

This data is a pronunciation dictionary for Danish.

There is a shortage of open domain language resources for Danish, especially for speech recognition. The dictionary is automatically generated using eSpeak and is therefore not subject to license restrictions. It contains nearly 66000 entries, which is not a lot for a compounding language, but the transcriptions are adequate for ASR experiments.

This data was generated for a workshop at the 2013 Copenhagen Speech event at Copenhagen Business School.