Enrique Amigó: "The Heterogeneity Principle: Optimizing Systems without Human Assessments in NLP Tasks"

Calendario

Ponente: Enrique Amigó (NLP&IR-UNED)

Fecha: martes 12 de junio de 2012, a las 11h00

Lugar de celebración: Sala 6.02, ETSI Informática, UNED (mapa)

Abstract

The heterogeneity property of text evaluation measures states that the probability of a real (i.e. human assessed) similarity increase is directly related to the heterogeneity of the set of automatic similarity measures that corroborate such increase. In this talk we i) generalize this principle to all Natural Language Processing tasks that involve computing similarity between texts; ii) we present empirical evidence that it holds in a wide range of tasks: Text Entailment, Clustering, Document Retrieval, Machine Translation evaluation and Text Summarization evaluation; and iii) we introduce a combination method for similarity measures that is based on the heterogeneity principle. The method is completely unsupervised (it does not use any kind of human assessments on the quality of the measures to be combined) and leads to top performing combined similarity measures in all the tasks considered.

Bio

Enrique Amigó is researcher and lecturer at UNED's NLP&IR Group. His main research interest are focused on evaluation metrics applied to Natural Language Processing tasks.

Lugar de Celebración

Sala 6.02 (sexta planta)

ETSI Informática, UNED

c/ Juan del Rosal, 16

Ciudad Universitaria

28040 Madrid

Materiales

Próximamente...