DSpace at KOASAS

SPELL MY NAME: KEYWORD BOOSTED SPEECH RECOGNITION

DC Field	Value	Language
dc.contributor.author	Jung, Namkyu	ko
dc.contributor.author	Kim, Geonmin	ko
dc.contributor.author	Chung, Joon Son	ko
dc.date.accessioned	2022-11-15T08:00:57Z	-
dc.date.available	2022-11-15T08:00:57Z	-
dc.date.created	2022-09-27	-
dc.date.issued	2022-05	-
dc.identifier.citation	47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, pp.2385 - 2389	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	http://hdl.handle.net/10203/299660	-
dc.description.abstract	Recognition of uncommon words such as names and technical terminology is important to understanding conversations in context. However, the ability to recognise such words remains a challenge in modern automatic speech recognition (ASR) systems. In this paper, we propose a simple but powerful ASR decoding method that can better recognise these uncommon keywords, which in turn enables better readability of the results. The method boosts the probabilities of given keywords in a beam search based on acoustic model predictions. The method does not require any training in advance. We demonstrate the effectiveness of our method on the LibriSpeech test sets and also on internal data of real-world conversations. Our method significantly boosts keyword accuracy on the test sets while maintaining the accuracy of the other words, as well as providing significant qualitative improvements. This method is applicable to other tasks such as machine translation, or wherever unseen and difficult keywords need to be recognised in beam search. © 2022 IEEE	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	SPELL MY NAME: KEYWORD BOOSTED SPEECH RECOGNITION	-
dc.type	Conference	-
dc.identifier.wosid	000864187906186	-
dc.identifier.scopusid	2-s2.0-85134046125	-
dc.type.rims	CONF	-
dc.citation.beginningpage	2385	-
dc.citation.endingpage	2389	-
dc.citation.publicationname	47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022	-
dc.identifier.conferencecountry	US	-
dc.identifier.conferencelocation	Virtual, Online	-
dc.identifier.doi	10.1109/ICASSP43922.2022.9747714	-
dc.contributor.localauthor	Chung, Joon Son	-
dc.contributor.nonIdAuthor	Jung, Namkyu	-
dc.contributor.nonIdAuthor	Kim, Geonmin	-
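
The abstract describes boosting the probabilities of given keywords during beam search over acoustic model predictions, with no training required. The record does not include the algorithm itself, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes one character per decoding step (no CTC blank handling), and the names keyword_bonus, boosted_beam_search, beam_width, bonus, and the toy steps data are all hypothetical. A hypothesis receives a tentative bonus while its ending matches the start of a keyword; because the bonus is recomputed from the text at every step, it disappears again if the match breaks.

```python
import math


def keyword_bonus(text, keywords, bonus):
    """Bonus for completed keywords plus a tentative bonus for a partial match at the end."""
    # Characters that belong to fully spelled keywords.
    completed = sum(text.count(kw) * len(kw) for kw in keywords)
    # Longest ending of `text` that is a proper prefix of some keyword.
    partial = 0
    for kw in keywords:
        for n in range(min(len(kw) - 1, len(text)), 0, -1):
            if text.endswith(kw[:n]):
                partial = max(partial, n)
                break
    return bonus * (completed + partial)


def boosted_beam_search(step_log_probs, keywords, beam_width=3, bonus=1.0):
    """step_log_probs: one dict per step mapping a character to its log-probability."""
    beams = [("", 0.0)]  # (text so far, acoustic log-probability)
    for dist in step_log_probs:
        candidates = [
            (text + tok, score + lp)
            for text, score in beams
            for tok, lp in dist.items()
        ]
        # Rank by acoustic score plus keyword bonus; only the acoustic score is stored.
        candidates.sort(
            key=lambda c: c[1] + keyword_bonus(c[0], keywords, bonus),
            reverse=True,
        )
        beams = candidates[:beam_width]
    return beams[0][0]


# Toy acoustic predictions (made up for illustration) that favour "june" over "joon".
steps = [
    {"j": math.log(0.9), "c": math.log(0.1)},
    {"u": math.log(0.6), "o": math.log(0.4)},
    {"n": math.log(0.6), "o": math.log(0.4)},
    {"e": math.log(0.6), "n": math.log(0.4)},
]

print(boosted_beam_search(steps, keywords=[]))        # june
print(boosted_beam_search(steps, keywords=["joon"]))  # joon
```

With an empty keyword list the search returns the acoustically most likely string; supplying the keyword keeps its partial spellings inside the beam long enough for the full word to win. The bonus value here is arbitrary and would need tuning in any real system.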

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Cited 4 items in WoS

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
