DC Field | Value | Language |
---|---|---|
dc.contributor.author | Park, Jihoon | ko |
dc.contributor.author | Hahn, Minsoo | ko |
dc.date.accessioned | 2016-04-20T06:49:13Z | - |
dc.date.available | 2016-04-20T06:49:13Z | - |
dc.date.created | 2016-01-04 | - |
dc.date.issued | 2015-12 | - |
dc.identifier.citation | ETRI JOURNAL, v.37, no.6, pp.1211 - 1219 | - |
dc.identifier.issn | 1225-6463 | - |
dc.identifier.uri | http://hdl.handle.net/10203/205504 | - |
dc.description.abstract | In a hidden Markov model-based speech synthesis system using a two-band excitation model, the maximum voiced frequency (MVF) is the most important excitation parameter because the synthetic speech quality depends on it. This paper proposes an enhanced MVF estimation scheme based on a peak-picking method. In the proposed scheme, both local peaks and peak lobes are picked from the spectrum of a linear predictive residual signal. The average of the normalized distances of the local peaks and peak lobes is calculated and used as a feature to estimate the MVF. Results of both objective and subjective tests show that the proposed scheme improves synthetic speech quality over a conventional scheme in both mobile-device and PC environments. | - |
dc.language | English | - |
dc.publisher | ELECTRONICS TELECOMMUNICATIONS RESEARCH INST | - |
dc.subject | SPEECH SYNTHESIS SYSTEM | - |
dc.subject | PARAMETER GENERATION | - |
dc.title | Enhanced Maximum Voiced Frequency Estimation Scheme for HTS Using Two-Band Excitation Model | - |
dc.type | Article | - |
dc.identifier.wosid | 000366151900016 | - |
dc.identifier.scopusid | 2-s2.0-84955463490 | - |
dc.type.rims | ART | - |
dc.citation.volume | 37 | - |
dc.citation.issue | 6 | - |
dc.citation.beginningpage | 1211 | - |
dc.citation.endingpage | 1219 | - |
dc.citation.publicationname | ETRI JOURNAL | - |
dc.identifier.doi | 10.4218/etrij.15.0115.0124 | - |
dc.contributor.localauthor | Hahn, Minsoo | - |
dc.contributor.nonIdAuthor | Park, Jihoon | - |
dc.description.isOpenAccess | Y | - |
dc.type.journalArticle | Article | - |
dc.subject.keywordAuthor | Speech synthesis | - |
dc.subject.keywordAuthor | HTS | - |
dc.subject.keywordAuthor | two-band excitation | - |
dc.subject.keywordAuthor | maximum voiced frequency | - |
dc.subject.keywordAuthor | harmonic peak | - |
dc.subject.keywordPlus | SPEECH SYNTHESIS SYSTEM | - |
dc.subject.keywordPlus | PARAMETER GENERATION | - |