Answering definition questions: Dealing with data sparseness in lexicalised dependency trees-based language models

Alejandro Figueroa, John Atkinson

Resultado de la investigación: Contribución a los tipos de informe/libroContribución a la conferencia

Resumen

A crucial step in the answering process of definition questions, such as "Who is Gordon Brown?", is the ranking of answer candidates. In definition Question Answering (QA), sentences are normally interpreted as potential answers, and one of the most promising ranking strategies predicates upon Language Models (LMs). However, one of the factors that makes LMs less attractive is the fact that they can suffer from data sparseness, when the training material is insufficient or candidate sentences are too long. This paper analyses two methods, different in nature, for tackling data sparseness head-on: (1) combining LMs learnt from different, but overlapping, training corpora, and (2) selective substitutions grounded upon part-of-speech (POS) taggings. Results show that the first method improves the Mean Average Precision (MAP) of the top-ranked answers, while at the same time, it diminishes the average F-score of the final output. Conversely, the impact of the second approach depends on the test corpus.

Idioma originalInglés
Título de la publicación alojadaWeb Information Systems and Technologies - 5th International Conference, WEBIST 2009, Revised Selected Papers
EditorialSpringer Verlag
Páginas297-310
Número de páginas14
ISBN (versión impresa)3642124356, 9783642124358
DOI
EstadoPublicada - 1 ene 2010
Evento5th International Conference on Web Information Systems and Technologies, WEBIST 2009 - Lisbon, Portugal
Duración: 23 mar 200926 mar 2009

Serie de la publicación

NombreLecture Notes in Business Information Processing
Volumen45 LNBIP
ISSN (versión impresa)1865-1348

Otros

Otros5th International Conference on Web Information Systems and Technologies, WEBIST 2009
PaísPortugal
CiudadLisbon
Período23/03/0926/03/09

Áreas temáticas de ASJC Scopus

  • Sistemas de gestión de la información
  • Ingeniería de control y sistemas
  • Gestión internacional y de empresa
  • Sistemas de información
  • Modelización y simulación
  • Gestión y sistemas de información

Huella Profundice en los temas de investigación de 'Answering definition questions: Dealing with data sparseness in lexicalised dependency trees-based language models'. En conjunto forman una huella única.

  • Citar esto

    Figueroa, A., & Atkinson, J. (2010). Answering definition questions: Dealing with data sparseness in lexicalised dependency trees-based language models. En Web Information Systems and Technologies - 5th International Conference, WEBIST 2009, Revised Selected Papers (pp. 297-310). (Lecture Notes in Business Information Processing; Vol. 45 LNBIP). Springer Verlag. https://doi.org/10.1007/978-3-642-12436-5-22