(A) new compression algorithm of DNA sequencesDNA 염기열에 대한 새로운 압축 알고리즘

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 403
  • Download : 0
Universal data compression algorithms fail to compress genetic sequences. It is due to the specificity of this particular kind of “text”. We analyze in some details the properties of the sequences, which cause the failure of classical algorithms. We then present a lossless algorithm, DNAcompress, to compress the information contained in DNA and RNA sequences, based on the detection of regularities, such as the presence of palindromes. The algorithm combines substitutional and statistical methods, and to the best of our knowledge, lead to the highest compression of DNA. The results, although not satisfactory, gives insight to the necessary correlation between compression and comprehension of DNA sequences.
Advisors
Hahn, Sang-Geunresearcher한상근researcher
Description
한국과학기술원 : 수학전공,
Publisher
한국과학기술원
Issue Date
2003
Identifier
230871/325007  / 020013916
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 수학전공, 2003.8, [ v, 23 p. ]

Keywords

DNA sequence; Compression Algorithm; 압축 알고리즘; 염기열

URI
http://hdl.handle.net/10203/42081
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=230871&flag=dissertation
Appears in Collection
MA-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0