- 1、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 4、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 5、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 6、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 7、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
利用双语翻译对社区问答进行问题检索
Exploiting Bilingual Translation for Question Retrieval in
Community-Based Question Answering
Guangyou Zhou, Kang Liu and Jun Zhao
National Laboratory of Pattern Recognition
Institute of Automation, Chinese Academy of Sciences
95 Zhongguancun East Road, Beijing 100190, China
{gyzhou, kliu, jzhao@
ABSTRACT
Community-based question answering (CQA) has become an important issue due to the pop-
ularity of CQA archives on the web. This paper is concerned with the problem of question
retrieval. Question retrieval in CQA archives aims to find historical questions that are seman-
tically equivalent or relevant to the queried questions. However, question retrieval is chal-
lenging partly due to the word ambiguity and lexical gap between the queried questions and
the historical questions in the archives. To deal with these problems, we propose the use of
translated words to enrich the question representation, going beyond the words in the original
language to represent a question. In this paper, each original language question (e.g., English)
is automatically translated into an foreign language (e.g., Chinese) by machine translation ser-
vices, and the resulting translated questions serves as a semantically enhanced representation
for supplementing the original bag of words. Experiments conducted on real CQA data set
demonstrate that our proposed approach significantly outperforms several baseline methods
and achieves the state-of-the-art performance.
TITLE AND ABSTRACT IN ANOTHER LANGUAGE, L2 (OPTIONAL, AND ON SAME PAGE)
利利利用用用双双双语语语翻翻翻译译译对对对社社社区区区问问问答答答进进进行行行问问问题题题检检检索索索
由于互联网上社区问答数据集的流行,使得社区问答的研究变得越来越流行。本文
关注的是问题检索。问题检索的目的是从历史问题数据集中查找与查询问题语义等价或相
关的历史问题。然而,问题检索的挑战主要是词汇歧义和查询问题与历史问题之间的词汇
鸿沟。为了解决这些问题,我们提出利用翻译词来丰富问题的表示,而不单纯利用原始语
言的词来表示问题。在本文中,通过机器翻译,每个原始语言(例如
文档评论(0)