Paper ID:
First author's department:
Paper title: Comparison Study on Critical Components in Composition Model for Phrase Representation
Author: Wang, Shaonan
Source:
Journal: ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING
Year: 2017
Volume: 16
Issue: 3
Pages: 25
Corresponding author:
Indexing category:
Impact factor:
Abstract: Phrase representation, an important step in many NLP tasks, involves representing phrases as continuous-valued vectors. This article presents detailed comparisons concerning the effects of word vectors, training data, and the composition and objective functions used in a composition model for phrase representation. Specifically, we first discuss how augmented word representations affect the performance of the composition model. Then, we investigate whether different types of training data influence the performance of the composition model and, if so, how they influence it. Finally, we evaluate combinations of different composition and objective functions and discuss the factors related to composition model performance. All evaluations were conducted in both English and Chinese. Our main findings are as follows: (1) The Additive model with semantic enhanced word vectors performs comparably to the state-of-the-art model; (2) The Additive model that updates augmented word vectors and the Matrix model with semantic enhanced word vectors systematically outperform the state-of-the-art model in bigram and multi-word phrase similarity tasks, respectively; (3) Representing high-frequency phrases by estimating their surrounding contexts is a good training objective for bigram phrase similarity tasks; and (4) The performance gain of the composition model with semantic enhanced word vectors is due to the composition function and the greater weight attached to important words. Previous work focuses on the composition function; however, our findings indicate that other components in the composition model (especially word representation) make a critical difference in phrase representation.
English abstract:
Affiliations of external authors:
Remarks:
