Use of Clue Word Annotations as the Silver-standard in Training Models for Biological Event Extraction

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 277
  • Download : 6
Current state-of-the-art approaches to biological event extraction train models by reconstructing relevant graphs from training sentences, where labeled nodes correspond to tokens that indicate the presence of events and the relations between nodes correspond to the relations between these events and their participants. Since multi-word expressions may also indicate events, these approaches use heuristic rules to define target graphs to reconstruct by mapping various clue words into single tokens. Since training instances define actual problems to solve, the method of deriving graphs must affect the system performance, but there has not been any related study on this aspect, to the best of our knowledge. In this study, we propose an incorporation of an EM algorithm into supervised learning to look for training graphs that are more favorable for model construction. We evaluate our algorithm on the development dataset in the 2009 BioNLP shared task and show that this algorithm makes a statistically meaningful improvement on the performance of trained models over a supervised learning algorithm on a fixed set of training graphs. The models and graphs are available at http://biopathway.org/EventExtraction/.
Publisher
University of Zurich
Issue Date
2012-09-03
Language
English
Citation

5th International Symposium on Semantic Mining in Biomedicine (SMBM 2012), pp.34 - 41

URI
http://hdl.handle.net/10203/187688
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0