TY - JOUR
T1 - Search clicks analysis for discovering temporally anchored questions in community Question Answering
AU - Figueroa, Alejandro
AU - Gómez-Pantoja, Carlos
AU - Herrera, Ignacio
N1 - Funding Information:
This work was partially supported by the project Fondecyt “Bridging the Gap between Askers and Answers in Community Question Answering Services” (11130094) funded by the Chilean Government. This research was partially conducted during the internship of Ignacio Herrera at Yahoo! Labs in Santiago, Chile.
Publisher Copyright:
© 2015 Elsevier Ltd. All rights reserved.
PY - 2016/5/15
Y1 - 2016/5/15
N2 - Nowadays, community Question-Answering (cQA) sites are massive repositories for user-generated content, where members prompt questions expecting satisfactory answers from other members. However, in this dynamic, there is an intrinsic delay between the moment questions are posted to the arrival of acceptable responses. Therefore, cQA platforms have the pressing need for promoting unresolved questions to potential answerers and for taking advantage of resolved questions contained in their archives, whenever possible. This paper studies cQA services from the viewpoint of the time frame where their questions attract the interest of their community members. By drawing a parallel with temporal patterns of user interests in web search activity, we are able to define three main types of temporally anchored questions: trend or bursty, periodic and permanent. Then, by analyzing user click distributions to Yahoo! Answers pages across Yahoo! Search logs, we automatically acquired a set of 35,000 cQA questions labeled with one of these three temporal anchors. Accordingly, we show the practicality of this approach by means of human assessments; and by using this automatically acquired corpus for studying several classification models. Essentially, the proposed method was found to correlate well with these human judgements, and proven to be effective in building systems that automatically identify the temporal anchor of unseen cQA questions. In substance, our outcomes indicate that some contexts are strongly related to a particular temporal anchor. We believe that these anchors will contribute to the discrimination of resolved questions that are capable of being revitalized, as well as to foster the opportune participation in questions that generate enthusiasm only for a short time.
AB - Nowadays, community Question-Answering (cQA) sites are massive repositories for user-generated content, where members prompt questions expecting satisfactory answers from other members. However, in this dynamic, there is an intrinsic delay between the moment questions are posted to the arrival of acceptable responses. Therefore, cQA platforms have the pressing need for promoting unresolved questions to potential answerers and for taking advantage of resolved questions contained in their archives, whenever possible. This paper studies cQA services from the viewpoint of the time frame where their questions attract the interest of their community members. By drawing a parallel with temporal patterns of user interests in web search activity, we are able to define three main types of temporally anchored questions: trend or bursty, periodic and permanent. Then, by analyzing user click distributions to Yahoo! Answers pages across Yahoo! Search logs, we automatically acquired a set of 35,000 cQA questions labeled with one of these three temporal anchors. Accordingly, we show the practicality of this approach by means of human assessments; and by using this automatically acquired corpus for studying several classification models. Essentially, the proposed method was found to correlate well with these human judgements, and proven to be effective in building systems that automatically identify the temporal anchor of unseen cQA questions. In substance, our outcomes indicate that some contexts are strongly related to a particular temporal anchor. We believe that these anchors will contribute to the discrimination of resolved questions that are capable of being revitalized, as well as to foster the opportune participation in questions that generate enthusiasm only for a short time.
KW - Community Question-Answering
KW - Question analysis
KW - Question classification
KW - Temporality
KW - User generated content
KW - User search activity
UR - http://www.scopus.com/inward/record.url?scp=84961359021&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2015.12.016
DO - 10.1016/j.eswa.2015.12.016
M3 - Article
AN - SCOPUS:84961359021
SN - 0957-4174
VL - 50
SP - 89
EP - 99
JO - Expert Systems with Applications
JF - Expert Systems with Applications
ER -