What is a Language Model (LM) file?
A language model file (.lm extension) contains a language model. The language model describes the likelihood, probability, or penalty assigned when a sequence or collection of words is seen. Sphinx2 uses N-gram models, usually with N = 3, so they are trigram models built on sequences of three words. The sequences of three words, two words, and one word are combined using back-off weights in order to assign probabilities to sequences of words. Many of the advances in speech recognition accuracy have come from language modeling. A language model that is tuned to a particular application, especially one with a small language, gives much better results than a language model that is mismatched to the speech it is given. You can see this if you run the “turtle” demo, which is built from sentences like “rotate right forty five degrees” and “go forward ten meters,” and then start reading Alice in Wonderland. The system will do the best it can to fit Alice into the toy vocabulary and language model.
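The way back-off weights combine three-word, two-word, and one-word sequences can be illustrated with a minimal sketch. It is not Sphinx2's code: the tables, the function names, and every number are hypothetical, and it assumes that scores are stored as log10 probabilities and that back-off weights are added in the log domain.

```python
# Toy back-off trigram lookup. All words and numbers below are made up
# for illustration; they do not come from any real .lm file.
# Assumption: values are log10 probabilities, so "multiplying" by a
# back-off weight becomes an addition.

# word -> (log10 probability, log10 back-off weight)
UNIGRAMS = {
    "go": (-1.0, -0.5),
    "forward": (-1.2, -0.4),
    "ten": (-1.5, -0.3),
    "meters": (-1.3, -0.2),
}
# (w1, w2) -> (log10 probability, log10 back-off weight)
BIGRAMS = {
    ("go", "forward"): (-0.3, -0.2),
    ("forward", "ten"): (-0.6, -0.1),
}
# (w1, w2, w3) -> log10 probability
TRIGRAMS = {
    ("go", "forward", "ten"): -0.4,
}
UNKNOWN_LOGPROB = -99.0  # hypothetical score for out-of-vocabulary words


def unigram_logprob(w):
    """Score a single word, or a large penalty if it is unknown."""
    return UNIGRAMS[w][0] if w in UNIGRAMS else UNKNOWN_LOGPROB


def bigram_logprob(w1, w2):
    """Use the bigram if listed, else back off to the unigram."""
    if (w1, w2) in BIGRAMS:
        return BIGRAMS[(w1, w2)][0]
    backoff = UNIGRAMS[w1][1] if w1 in UNIGRAMS else 0.0
    return backoff + unigram_logprob(w2)


def trigram_logprob(w1, w2, w3):
    """Use the trigram if listed, else back off to the bigram."""
    if (w1, w2, w3) in TRIGRAMS:
        return TRIGRAMS[(w1, w2, w3)]
    backoff = BIGRAMS[(w1, w2)][1] if (w1, w2) in BIGRAMS else 0.0
    return backoff + bigram_logprob(w2, w3)
```

In this sketch, `trigram_logprob("go", "forward", "ten")` is read directly from the trigram table, while `trigram_logprob("go", "forward", "meters")` falls through two levels: it adds the back-off weight of "go forward", then the back-off weight of "forward", then the unigram score of "meters". A word outside the toy vocabulary receives the large penalty, which mirrors what happens when the turtle model is asked to score text it was never built for.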