Analyzing Disagreements among ICD-9-CM Coders

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 366
  • Download : 0
NLP researchers find it difficult to acquire and interpret clinical free text directly, most likely because of the unfamilarity with medical practices. This is why publicly available annotated corpora would be of much help, but there are still very few in the clinical domain due to patient confidentiality. In this regard, it is encouraging to see that Computational Medicine Center’s 2007 Challenge provides a publicly available corpus consisting of radiology reports with ICD-9-CM codes as independently assigned by three different coders.However, the corpus shows many disagreements among the coders, making it imperative to set the standard correctly for their proper interpretation. A proposal for such a standard as implicitly advanced by its developers is to take the majority annotation. In this paper, we propose an alternative method to address such disagreements. We believe our work not only makes a meaningful improvement on the utility of this corpus but also has good implications for similar tasks, such as ICD-10-CM coding.
Publisher
International Symposium on Languages in Biology and Medicine (LBM 2011)
Issue Date
2011-12
Language
English
Citation

4th International Symposium on Languages in Biology and Medicine (LBM 2011)

URI
http://hdl.handle.net/10203/170479
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0