IMPROVEMENT OF MAP-VFS ADAPTATION PERFORMANCE BY FUZZY CONTROL

doi:10.2316/Journal.202.2011.2.202-2733

Ing-Jr Ding

References

[1] J.L. Gauvain & C.H. Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, 2(2), 1994, 291–298.
[2] K. Shinoda & C.-H. Lee, Structural MAP speaker adaptation using hierarchical priors, Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, USA, 1997, 381–388.
[3] K. Ohkura, M. Sugiyama, & S. Sagayama, Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs, Proc. Int. Conf. on Spoken Language Processing, Banff, Canada, 1992, 369–372.
[4] H. Hattori & S. Sagayama, Vector field smoothing principle for speaker adaptation, Proc. Int. Conf. on Spoken Language Processing, Banff, Canada, 1992, 381–384.
[5] J. Ishii, M. Tonomura, & S. Matsunaga, Speaker adaptation using tree structured shared-state HMMs, Proc. Int. Conf. on Spoken Language Processing, Philadelphia, PA, USA, 1996, 1149–1152.
[6] J.-I. Takahashi & S. Sagayama, Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation, Computer Speech and Language, 11, 1997, 127–146.
[7] C.J. Leggetter & P.C. Woodland, Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech and Language, 9, 1995, 171–185.
[8] J.T. Chien & H.C. Wang, Telephone speech recognition based on Bayesian adaptation of hidden Markov models, Speech Communication, 22, 1997, 369–384.
[9] C. Chesta, O. Siohan, & C.H. Lee, Maximum a posteriori linear regression for hidden Markov model adaptation, Proc. European Conf. on Speech Communication and Technology, Budapest, Hungary, 1999, 211–214.
[10] O. Siohan, T.A. Myrvoll, & C.-H. Lee, Structural maximum a posteriori linear regression for fast HMM adaptation, Proc. ISCA Workshop on Automatic Speech Recognition, 2000, 120–127.
[11] R. Kuhn, J.-C. Junqua, P. Nguyen, & N. Niedzielski, Rapid speaker adaptation in eigenvoice space, IEEE Transactions on Speech and Audio Processing, 8(6), 2000, 695–707.
[12] K.T. Chen, W.W. Liau, H.M. Wang, & L.S. Lee, Fast speaker adaptation using eigenspace-based maximum likelihood linear regression, Proc. Int. Conf. on Spoken Language Processing, Beijing, China, 2000, 742–745.
[13] B. Mak, J.T. Kwok, & S. Ho, Kernel eigenvoice speaker adaptation, IEEE Transactions on Speech and Audio Processing, 13(5), 2005, 984–992.
[14] B. Zhou & J. Hansen, Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation, IEEE Transactions on Speech and Audio Processing, 13(4), 2005, 554–564.
[15] B. Mak, R. Hsiao, S. Ho, & J.T. Kwok, Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting, IEEE Transactions on Audio, Speech, and Language Processing, 14(4), 2006, 1267–1280.
[16] B. Mak & R. Hsiao, Kernel eigenspace-based MLLR adaptation, IEEE Transactions on Audio, Speech, and Language Processing, 15(3), 2007, 784–795.
[17] R. Yager & D. Filev, Essentials of fuzzy modeling and control (New York: Wiley, 1994).
[18] T. Takagi & M. Sugeno, Fuzzy identification of systems and its applications to modeling and control, IEEE Transactions on Systems, Man and Cybernetics, 15, 1985, 116–132.
[19] J. Yen, R. Langari, & L.A. Zadeh (Eds.), Industrial applications of fuzzy logic and intelligent systems (New York: IEEE Press, 1995).
[20] S. Kermiche, M.L. Saidi, H.A. Abbassi, & H. Ghodbane, Takagi–Sugeno based controller for mobile robot navigation, Journal of Applied Science, 6(8), 2006, 1838–1844.
[21] C.T. Lin, H.W. Nein, & W.F. Lin, Speaker adaptation of fuzzy-perceptron-based speech recognition, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 7(1), 1999, 1–30.
[22] P. Melin, J. Urias, D. Solano, M. Soto, M. Lopez, & O. Castillo, Voice recognition with neural networks, fuzzy logic and genetic algorithms, Engineering Letters, 13(2), 2006, 108–116.
[23] Y.T. Juang, K.C. Huang, & I.J. Ding, Speaker adaptation based on MAP estimation using fuzzy controller, Pattern Recognition Letters, 24(15), 2003, 2807–2813.
[24] C.H. Lee, C.H. Lin, & B.H. Juang, A study on speaker adaptation of the parameters of continuous density hidden Markov models, IEEE Transactions on Acoustics, Speech and Signal Processing, 39(4), 1991, 806–814.
[25] B.H. Juang & L.R. Rabiner, The segmental k-means algorithm for estimating parameters of hidden Markov models, IEEE Transactions on Signal Processing, 38(9), 1990, 1639–1641.
[26] H.C. Wang, F. Seide, C.Y. Tseng, & L.S. Lee, MAT-2000 – Design, collection, and validation of a Mandarin 2000-speaker telephone speech database, Proc. Int. Conf. on Spoken Language Processing, Beijing, China, 2000, 460–463.
[27] C.H. Lin, C.H. Wu, P.Y. Ting, & H.M. Wang, Frameworks for recognition of Mandarin syllables with tones using sub-syllabic units, Speech Communication, 18(2), 1996, 175–190.

From Journal (202) International Journal of Computers and Applications - 2011