Category-specific models for ranking effective paraphrases in community Question Answering

Alejandro Figueroa, Günter Neumann

Resultado de la investigación: Article

15 Citas (Scopus)

Resumen

Platforms for community-based Question Answering (cQA) are playing an increasing role in the synergy of information-seeking and social networks. Being able to categorize user questions is very important, since these categories are good predictors for the underlying question goal, viz. informational or subjective. Furthermore, an effective cQA platform should be capable of detecting similar past questions and relevant answers, because it is known that a high number of best answers are reusable. Therefore, question paraphrasing is not only a useful but also an essential ingredient for effective search in cQA. However, the generated paraphrases do not necessarily lead to the same answer set, and might differ in their expected quality of retrieval, for example, in their power of identifying and ranking best answers higher. We propose a novel category-specific learning to rank approach for effectively ranking paraphrases for cQA. We describe a number of different large-scale experiments using logs from Yahoo! Search and Yahoo! Answers, and demonstrate that the subjective and objective nature of cQA questions dramatically affect the recall and ranking of past answers, when fine-grained category information is put into its place. Then, category-specific models are able to adapt well to the different degree of objectivity and subjectivity of each category, and the more specific the models are, the better the results, especially when benefiting from effective semantic and syntactic features.

Idioma originalEnglish
Páginas (desde-hasta)4730-4742
Número de páginas13
PublicaciónExpert Systems with Applications
Volumen41
N.º10
DOI
EstadoPublished - 1 ago 2014

Huella dactilar

Syntactics
Semantics
Experiments

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Citar esto

@article{e75d009aca38440ba1691399fbb3042d,
title = "Category-specific models for ranking effective paraphrases in community Question Answering",
abstract = "Platforms for community-based Question Answering (cQA) are playing an increasing role in the synergy of information-seeking and social networks. Being able to categorize user questions is very important, since these categories are good predictors for the underlying question goal, viz. informational or subjective. Furthermore, an effective cQA platform should be capable of detecting similar past questions and relevant answers, because it is known that a high number of best answers are reusable. Therefore, question paraphrasing is not only a useful but also an essential ingredient for effective search in cQA. However, the generated paraphrases do not necessarily lead to the same answer set, and might differ in their expected quality of retrieval, for example, in their power of identifying and ranking best answers higher. We propose a novel category-specific learning to rank approach for effectively ranking paraphrases for cQA. We describe a number of different large-scale experiments using logs from Yahoo! Search and Yahoo! Answers, and demonstrate that the subjective and objective nature of cQA questions dramatically affect the recall and ranking of past answers, when fine-grained category information is put into its place. Then, category-specific models are able to adapt well to the different degree of objectivity and subjectivity of each category, and the more specific the models are, the better the results, especially when benefiting from effective semantic and syntactic features.",
keywords = "Community-based Question Answering, Learning to rank, Question categories, Question paraphrases",
author = "Alejandro Figueroa and G{\"u}nter Neumann",
year = "2014",
month = "8",
day = "1",
doi = "10.1016/j.eswa.2014.02.004",
language = "English",
volume = "41",
pages = "4730--4742",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",
number = "10",

}

Category-specific models for ranking effective paraphrases in community Question Answering. / Figueroa, Alejandro; Neumann, Günter.

En: Expert Systems with Applications, Vol. 41, N.º 10, 01.08.2014, p. 4730-4742.

Resultado de la investigación: Article

TY - JOUR

T1 - Category-specific models for ranking effective paraphrases in community Question Answering

AU - Figueroa, Alejandro

AU - Neumann, Günter

PY - 2014/8/1

Y1 - 2014/8/1

N2 - Platforms for community-based Question Answering (cQA) are playing an increasing role in the synergy of information-seeking and social networks. Being able to categorize user questions is very important, since these categories are good predictors for the underlying question goal, viz. informational or subjective. Furthermore, an effective cQA platform should be capable of detecting similar past questions and relevant answers, because it is known that a high number of best answers are reusable. Therefore, question paraphrasing is not only a useful but also an essential ingredient for effective search in cQA. However, the generated paraphrases do not necessarily lead to the same answer set, and might differ in their expected quality of retrieval, for example, in their power of identifying and ranking best answers higher. We propose a novel category-specific learning to rank approach for effectively ranking paraphrases for cQA. We describe a number of different large-scale experiments using logs from Yahoo! Search and Yahoo! Answers, and demonstrate that the subjective and objective nature of cQA questions dramatically affect the recall and ranking of past answers, when fine-grained category information is put into its place. Then, category-specific models are able to adapt well to the different degree of objectivity and subjectivity of each category, and the more specific the models are, the better the results, especially when benefiting from effective semantic and syntactic features.

AB - Platforms for community-based Question Answering (cQA) are playing an increasing role in the synergy of information-seeking and social networks. Being able to categorize user questions is very important, since these categories are good predictors for the underlying question goal, viz. informational or subjective. Furthermore, an effective cQA platform should be capable of detecting similar past questions and relevant answers, because it is known that a high number of best answers are reusable. Therefore, question paraphrasing is not only a useful but also an essential ingredient for effective search in cQA. However, the generated paraphrases do not necessarily lead to the same answer set, and might differ in their expected quality of retrieval, for example, in their power of identifying and ranking best answers higher. We propose a novel category-specific learning to rank approach for effectively ranking paraphrases for cQA. We describe a number of different large-scale experiments using logs from Yahoo! Search and Yahoo! Answers, and demonstrate that the subjective and objective nature of cQA questions dramatically affect the recall and ranking of past answers, when fine-grained category information is put into its place. Then, category-specific models are able to adapt well to the different degree of objectivity and subjectivity of each category, and the more specific the models are, the better the results, especially when benefiting from effective semantic and syntactic features.

KW - Community-based Question Answering

KW - Learning to rank

KW - Question categories

KW - Question paraphrases

UR - http://www.scopus.com/inward/record.url?scp=84897775650&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2014.02.004

DO - 10.1016/j.eswa.2014.02.004

M3 - Article

AN - SCOPUS:84897775650

VL - 41

SP - 4730

EP - 4742

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

IS - 10

ER -