Scene text recognition using part-based character models문자의 지역적 특성 모델을 이용한 자연영상 내 문자인식 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 635
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Jin-Hyung-
dc.contributor.advisor김진형-
dc.contributor.authorSeok, Jae-Hyun-
dc.contributor.author석재현-
dc.date.accessioned2015-04-23T08:30:35Z-
dc.date.available2015-04-23T08:30:35Z-
dc.date.issued2014-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=568607&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/197819-
dc.description학위논문(박사) - 한국과학기술원 : 전산학과, 2014.2, [ vi, 70 p. ]-
dc.description.abstractUnderstanding scene images has attracted considerable attentions, and there have been many researches to solve the problem in the form of subproblems such as object detection, object recognition, and scene segmentation. Text in scene images is one of the most informative contents to understand the images. Scene text recognition is the problem of recognizing text in scene images taken in unconstrained manner. Many researches on scene text recognition have been proposed, but most of them utilize character models only in character recognition phase, the last stage of the process. In former phases such as text detection and text extraction, only abstracted features of text regions are used, which might cause loss of information. In this thesis, we propose a novel scene text recognition method which fully utilizes concrete models of target characters from the beginning to the end of the recognition process. Each of the target character set is modeled with a part-based object model called implicit shape model (ISM) to achieve robustness for partial degradation of characters. Towards this end, we trained a Hough forest which localizes character parts and casts probabilistic votes on possible positions of characters. The votes are aggregated in voting spaces via generalized Hough transform, and then character candidates are detected at the local maxima of the voting space. The detected character candidates are verified by organizing the most plausible text lines in a semi-Markov conditional random field (semi-CRF) framework where the optimal configuration can be efficiently found using dynamic programming. As concrete character models are utilized throughout the process, even extremely deformed text are detected and recognized, which are hardly detected with previous approaches.eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectText recognition-
dc.subject부분 기반 문자 모델-
dc.subject허프 포레스트-
dc.subject내포 형태 모델-
dc.subject문자인식-
dc.subjectpart-based character-
dc.subjectImplicit shape model-
dc.subjectHough forest-
dc.titleScene text recognition using part-based character models-
dc.title.alternative문자의 지역적 특성 모델을 이용한 자연영상 내 문자인식 연구-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN568607/325007 -
dc.description.department한국과학기술원 : 전산학과, -
dc.identifier.uid020085275-
dc.contributor.localauthorKim, Jin-Hyung-
dc.contributor.localauthor김진형-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0