top of page

From Question to Text: Question-Oriented Feature Attention for Answer Selection

Abstract

Understanding unstructured texts is an essential skill for human beings as it enables knowledge acquisition. Although understanding unstructured texts is easy for we human beings with good education, it is a great challenge for machines. Recently, with the rapid development of artificial intelligence techniques, researchers put efforts to teach machines to understand texts, and justify the educated machines by letting them solve the questions upon the given unstructured texts, inspired by the reading comprehension test as we human do. However, feature effectiveness with respect to different questions significantly hinders the performance of answer selection, because different questions may focus on various aspects of the given text and answer candidates. In order to solve this problem, we propose a question-oriented feature attention (QFA) mechanism, which learns to weight different engineering features according to the given question, so that important features with respect to the specific question is emphasized accordingly. Experiments on MCTest dataset have well-validated the effectiveness of the proposed method. Additionally, the proposed QFA is applicable to various IR tasks, such as question answering and answer selection. We have verified the applicability on a crawled community-based question answering dataset.

Data Description

We have two dataset in our experiment, i.e., StackExchange and MCTest.

 

1. StackExchange is a cQA dataset. It contains 20,278 questions and 82,260 answers. We extracted 212-dimensional features, the feature label is obtained here. The dataset is available here.

2. MCTest is a reading comprehension dataset with a set of texts and associated questions. In this dataset, each text has four multiple-choice questions, each with four answer candidates, and only one answer is correct. The original dataset and reconstructed statements are available at here.

Data Description
Code Download

This code in implemented with TensorFlow, a deep learning library. Please install Tensorflow referring official guides at first. 

1. The code for StackExchange data is available here.

2. The feature extraction code is available here.

3. The code for MCTest data is available here .

bottom of page