Classifying vocalization from parent-child conversations with a single sensor단일 측정기를 통한 모자 음성 분류

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 344
  • Download : 0
Recent rapid advances in Machine Learning (ML) have led to many novel applications previously thought infeasible. The proliferation of various speech recognition solutions in various languages shows ML’s promise as the solution for a natural way to interact with devices. Speaker classification has been a subject with less than stellar results, using expensive multi-sensor, multi-device solutions, as well as sacrificing ease of use to achieve acceptable accuracy. We believe utilizing only a single device will greatly enhance the usability, and therefore the utility, of the technology, enabling novel applications such as preventing child abduction and facilitating at-home speech therapy, by automating previously labor-intensive tasks. This study looks at the feasibility of the idea by proposing and evaluating various methods to classify voice as to whether they are from an adult or a small child. This paper demonstrates that specific combinations of existing technologies can now deliver acceptable accuracy in child-adult voice classification without the addition of expensive equipment or cumbersome user interaction, such as training a custom ML model. We believe it is feasible that further research could further improve the technology, facilitating the realization of aforementioned use-cases, even without the aid of a vast and regularized corpus of data.
Advisors
Kim, Dong Junresearcher김동준researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2018
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2018.2,[v, 39 p. :]

Keywords

Machine Learning▼aSpeech classification▼aDeep learning▼aClustering▼aSingle source; 기계 학습▼a음성 분류▼a딥 러닝▼a클러스터 분석▼a단일 측정기

URI
http://hdl.handle.net/10203/267014
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=734091&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0