An overview of the SPHINX speech recognition system

K.-F. Lee; Hsiao-Wuen Hon; R. Reddy

Страница публикации Публикация в OpenAlex

Аннотация: A description is given of SPHINX, a system that demonstrates the feasibility of accurate, large-vocabulary, speaker-independent, continuous speech recognition. SPHINX is based on discrete hidden Markov models (HMMs) with LPC- (linear-predictive-coding) derived parameters. To provide speaker independence, knowledge was added to these HMMs in several ways: multiple codebooks of fixed-width parameters, and an enhanced recognizer with carefully designed models and word-duration modeling. To deal with coarticulation in continuous speech, yet still adequately represent a large vocabulary, two new subword speech units are introduced: function-word-dependent phone models and generalized triphone models. With grammars of perplexity 997, 60, and 20, SPHINX attained word accuracies of 71, 94, and 96%, respectively, on a 997-word task.< >

Год издания: 1990

Авторы: K.-F. Lee, Hsiao-Wuen Hon, R. Reddy

Издательство: Institute of Electrical and Electronics Engineers

Источник: IEEE Transactions on Acoustics Speech and Signal Processing

Ключевые слова: Speech Recognition and Synthesis, Music and Audio Processing, Speech and Audio Processing

Показать дополнительные сведения

Будние дни	9:00–19:00
Суббота	9:00–17:00
Воскресенье	выходной день

Подразделения:

8:30–17:00 (обед 12:30–13:00), пн-пт

Контакты

Единый телефон	+7 (391) 291-25-74
Библиотека	+7 (391) 206-21-06
Издательство	+7 (391) 206-25-88
E-mail	bik@sfu-kras.ru
Адрес	пр. Свободный, 79/10

Библиотечно-издательский комплекс СФУ

An overview of the SPHINX speech recognition system
статья из журнала

An overview of the SPHINX speech recognition systemстатья из журнала

An overview of the SPHINX speech recognition system
статья из журнала