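A probabilistic framework integrating multiple proposals of text regions for scene text extraction
자연 영상 내 글자 추출을 위한 확률 모델 기반의 글자 후보 통합 시스템에 관한 연구 (A study on a probabilistic-model-based system for integrating character candidates for text extraction in natural scene images)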

DC Field | Value | Language
dc.contributor.advisor | Kim, Jin-Hyung | -
dc.contributor.advisor | 김진형 | -
dc.contributor.author | Lee, Seong-Hun | -
dc.contributor.author | 이성훈 | -
dc.date.accessioned | 2013-09-12T01:46:17Z | -
dc.date.available | 2013-09-12T01:46:17Z | -
dc.date.issued | 2013 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=515420&flag=dissertation | -
dc.identifier.uri | http://hdl.handle.net/10203/180354 | -
dc.description | Doctoral thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST): Department of Computer Science, 2013.2, [ vi, 78 p. ] | -
dc.description.abstract | Text contained in scene images provides the semantic context of the images. For that reason, robust extraction of text regions is essential for successful scene text understanding. However, separating text pixels from images remains a challenging problem because of uncontrolled lighting conditions and complex backgrounds. In addition, prior knowledge about text regions is usually unavailable in the scene image. To robustly extract text regions in the scene image, we propose a two-stage probabilistic framework that combines top-down knowledge of the text and bottom-up image processing. To deal with the various conditions of scene images, bottom-up image processing produces multiple image segmentations, which represent different interpretations of the scene images. Our image segmentation algorithm seamlessly combines color, texture, and edge to isolate text regions from backgrounds without losing the small details of text regions. Even though a single segmentation cannot find all text regions, the set of all segmented regions obtained by multiple segmentations could contain all text regions. The proposed two-stage conditional random field (CRF) approach generates multiple proposals of text regions and integrates them into textlines by utilizing the properties and hierarchical structures of the scene text. A region-oriented representation of the image is used to build a random field in each stage of the CRF model for identifying the possibilities of the text regions at local and global levels. In the first stage, proposals of text regions are generated by removing apparent non-text regions in each segmentation by using a local CRF model. The local CRF model is coupled to local image features such as color, edge, and texture, as well as global character contexts such as compactness, aspect ratio, and compatibility between characters. In the second stage, the proposed system selectively integrates the multiple proposals to find plausible combinations of text ... | eng
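dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology (한국과학기술원) | -
dc.subject | Scene Text Extraction | -
dc.subject | Two-Stage CRF Models | -
dc.subject | Multiple Image Segmentations | -
dc.subject | Component | -
dc.subject | 자연 영상 내 글자 추출 (text extraction in natural scene images) | -
dc.subject | 2단계 CRF 모델 (two-stage CRF model) | -
dc.subject | 다중 영상 분할 (multiple image segmentation) | -
dc.subject | 컴포넌트 (component) | -
dc.subject | 글자 후보 통합 (character candidate integration) | -
dc.subject | Character Proposal | -
dc.title | A probabilistic framework integrating multiple proposals of text regions for scene text extraction | -
dc.title.alternative | 자연 영상 내 글자 추출을 위한 확률 모델 기반의 글자 후보 통합 시스템에 관한 연구 | -
dc.type | Thesis(Ph.D) | -
dc.identifier.CNRN | 515420/325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST): Department of Computer Science | -
dc.identifier.uid | 020065120 | -
dc.contributor.localauthor | Kim, Jin-Hyung | -
dc.contributor.localauthor | 김진형 | -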
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.
