A syllable lattice approach to speaker verification

Cited 8 times in Web of Science; cited 0 times in Scopus
DC Field Value Language
dc.contributor.authorJin, MHko
dc.contributor.authorSoong, FKko
dc.contributor.authorYoo, Chang Dongko
dc.date.accessioned2013-03-06T22:01:00Z-
dc.date.available2013-03-06T22:01:00Z-
dc.date.created2012-02-06-
dc.date.issued2007-11-
dc.identifier.citationIEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.15, no.8, pp.2476 - 2484-
dc.identifier.issn1558-7916-
dc.identifier.urihttp://hdl.handle.net/10203/88611-
dc.description.abstractThis paper proposes a syllable-lattice-based speaker verification algorithm for Mandarin Chinese input. For each speech utterance, a syllable lattice is generated with a speaker-independent large-vocabulary continuous speech recognition system in free syllable decoding. The verification decision is based on the likelihood ratio between a target-speaker model and a speaker-independent background model, computed on the decoded syllable lattice. The likelihood function is calculated efficiently with a forward algorithm that considers all paths in the lattice. The proposed algorithm was evaluated using a Mandarin Chinese database, where 1832 true and 26 250 impostor trials were recorded by 19 target speakers and 180 impostors. Each trial averages 2 s in duration, excluding silence. The target-speaker model was adapted from the speaker-independent background model using two minutes of enrollment data, including silence. The proposed algorithm achieved an equal-error rate of 0.857%, which is better than the 1.21% of the hidden Markov model-based speaker verification algorithm that does not use syllable lattices. The equal-error rate was further reduced to 0.617% by incorporating the Gaussian mixture model-universal background model algorithm with 2048 Gaussian kernels, whose own equal-error rate is 0.990%.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.subjectHIDDEN MARKOV-MODELS-
dc.subjectMANDARIN CHINESE-
dc.subjectRECOGNITION-
dc.subjectADAPTATION-
dc.subjectSPEECH-
dc.titleA syllable lattice approach to speaker verification-
dc.typeArticle-
dc.identifier.wosid000250282800026-
dc.identifier.scopusid2-s2.0-57349174331-
dc.type.rimsART-
dc.citation.volume15-
dc.citation.issue8-
dc.citation.beginningpage2476-
dc.citation.endingpage2484-
dc.citation.publicationnameIEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING-
dc.identifier.doi10.1109/TASL.2007.906181-
dc.contributor.localauthorYoo, Chang Dong-
dc.contributor.nonIdAuthorJin, MH-
dc.contributor.nonIdAuthorSoong, FK-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorlattice-based speaker adaptation-
dc.subject.keywordAuthorlattice rescoring-
dc.subject.keywordAuthorMandarin Chinese-
dc.subject.keywordAuthorspeaker recognition.-
dc.subject.keywordPlusHIDDEN MARKOV-MODELS-
dc.subject.keywordPlusMANDARIN CHINESE-
dc.subject.keywordPlusRECOGNITION-
dc.subject.keywordPlusADAPTATION-
dc.subject.keywordPlusSPEECH-
Appears in Collection
EE-Journal Papers(저널논문)
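
Illustrative note on the abstract: the scoring it describes, a likelihood summed over all lattice paths with a forward algorithm and then compared between a target-speaker model and a background model, can be sketched roughly as below. This is not the authors' implementation. It assumes a toy lattice whose nodes are numbered in topological order, and that each edge already carries a log score for one syllable hypothesis under a given model. The function names and the edge layout `(src, dst, log_score)` are hypothetical.

```python
import math
from collections import defaultdict


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a non-empty list."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def lattice_log_likelihood(num_nodes, edges):
    """Forward pass over a lattice (a DAG).

    edges: list of (src, dst, log_score) with src < dst, i.e. nodes are
    assumed to be numbered in topological order (an assumption of this sketch).
    Returns the log of the sum, over all paths from node 0 to the last node,
    of the product of edge scores along the path.
    """
    incoming = defaultdict(list)
    for src, dst, log_score in edges:
        if not 0 <= src < dst < num_nodes:
            raise ValueError("edges must satisfy 0 <= src < dst < num_nodes")
        incoming[dst].append((src, log_score))

    alpha = [-math.inf] * num_nodes  # alpha[n]: log score of all partial paths reaching n
    alpha[0] = 0.0
    for node in range(1, num_nodes):
        terms = [alpha[src] + score for src, score in incoming[node]]
        if terms:
            alpha[node] = logsumexp(terms)
    return alpha[-1]


def log_likelihood_ratio(num_nodes, target_edges, background_edges):
    """Log likelihood ratio of target-speaker vs. background scores on one lattice."""
    return (lattice_log_likelihood(num_nodes, target_edges)
            - lattice_log_likelihood(num_nodes, background_edges))


# Toy lattice: two competing syllable hypotheses on arc 0->1, one on arc 1->2.
target = [(0, 1, math.log(0.5)), (0, 1, math.log(0.25)), (1, 2, math.log(0.5))]
background = [(0, 1, math.log(0.25)), (0, 1, math.log(0.25)), (1, 2, math.log(0.25))]
llr = log_likelihood_ratio(3, target, background)
```

In a verification setting the resulting ratio would be compared against a decision threshold; how that threshold is chosen is not stated in the abstract and is left out here.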