Calendario
Mark Stevenson: Disambiguation of Biomedical Texts |
Ponente: Mark Stevenson (NLP Group, University of Sheffield) Fecha: lunes 17 de mayo 2010 Hora: 11h00 Lugar de celebración: Sala de Grados, Facultad de Informática, UCM (entrada libre hasta completar el aforo)
ResumenLike text in other domains, biomedical documents contain a range of terms with more than one possible meaning. These ambiguities form a significant obstacle to the automatic processing of these texts. Previous approaches to resolving this problem have made use of a variety of knowledge sources including the context in which the ambiguous term is used and domain-specific resources (such as UMLS). We compare a range of knowledge sources which have been previously used and introduce a novel one: MeSH terms. The best performance is obtained using linguistic features in combination with MeSH terms. Performance exceeds previously reported results on a standard test set. Our approach is supervised and therefore relies on annotated training examples. A novel approach to automatically acquiring additional training data, based on the relevance feedback technique from Information Retrieval, is presented. Applying this method to generate additional training examples is shown to lead to a further increase in performance.
BioDr. Mark Stevenson is a lecturer and EPSRC Advanced Research Fellow (2006-2011) at the Natural Language Processing group of Sheffield University. His research interests include lexical semantics, word sense disambiguation, semantic similarity, information extraction and text retrieval. His PhD explored the application of a diverse set of knowledge sources to the word sense disambiguation problem and his thesis was published as a monograph by CSLI Publications. Other publications include two edited volumes and over sixty papers in journals, collected volumes and international conferences. He is a member of the EPSRC Peer Review College, the editorial board of the journal Computational Linguistics and regularly reviews for the leading
|