It is not easy to find useful information in huge volume of documents. In this research, we focus on deep natural language processing methods for efficient information access. The research topics include NLP technologies such as context/topic-aware sentence compression, automatic question generation with a knowledge base.
Text Compression and Summarization
Text compression and summarization systems aim to produce a shorter version of a source text by preserving the key contents of the original. However, yielding an informative and grammatical compression (summary) is still a challenge. In this project, we tackle this issue by considering two aspects – the word (local) features such as part-of-speech tag of word and sentence (global) features such as readability of a whole sentence. Our experimental results demonstrate that these features coupled with techniques like deep learning and reinforcement learning can lead to compressions (summaries) with better quality (Yang et al: NLDB-2017 [1]; ACL-2018 short, accepted)