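Title: Refine Software QA Document Search Results Based on Code Pattern (基于代码模式的软件问答文档检索优化方法)
Authors: HUA Chenyan (华晨彦); ZOU Yanzhen (邹艳珍); ZHU Zixiao (朱子骁); XIE Bing (谢冰)
Journal: Journal of Frontiers of Computer Science and Technology (计算机科学与探索)
Year: 2017
Keywords: code pattern; software Q&A document; document search
DOI: 10.3778/j.issn.1673-9418.1609028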
Funding: The National Key Research and Development Program of China under Grant No. 2016YFB1000804; the Na-
Issue 10; pages 1591-1598; volume 11
Abstract: Developers often search for relevant software Q&A documents through the search engines of Q&A websites. Among the results, documents that contain good code snippets (usage examples) tend to be preferred, but measuring the quality of those snippets remains a major challenge. To address this, the paper proposes an approach that refines software Q&A document search results based on code patterns. Starting from the current search results, the approach first extracts the code snippets from each document. It then mines the code patterns common to those snippets and uses them to measure snippet quality. Finally, it recommends the high-quality documents by ranking them at the top of the original search results. The evaluation uses 10 real problems that software developers encountered in practice. Compared with the search results of StackOverflow, the proposed approach improves NDCG@5 by 40%.
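The abstract reports its result as NDCG@5, which rewards rankings that place the most relevant documents in the first five positions. The sketch below shows one common way to compute it. It assumes graded relevance labels and a linear gain; the abstract does not say which gain function or label scale the authors used, and the two example rankings are hypothetical, not data from the paper.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the first k results (linear gain, an assumption)."""
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


def ndcg_at_k(relevances: Sequence[float], k: int = 5) -> float:
    """DCG@k normalized by the DCG@k of the ideal (descending) ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical relevance labels (0 = irrelevant, 3 = best) for the same
# seven documents, listed in the order each ranking returns them.
baseline = [1, 0, 2, 0, 3, 2, 1]   # e.g. the original search order
reranked = [3, 2, 2, 1, 0, 1, 0]   # e.g. after quality-based re-ranking

print(f"baseline NDCG@5 = {ndcg_at_k(baseline):.3f}")
print(f"reranked NDCG@5 = {ndcg_at_k(reranked):.3f}")
```

Because both lists contain the same labels, they share the same ideal ordering, so the difference in score comes only from where the relevant documents are placed.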
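Language: English
Content type: Journal article
Source URL: http://ir.pku.edu.cn/handle/20.500.11897/483502
Collection: School of Electronics Engineering and Computer Science (信息科学技术学院), Peking University
Recommended citation
GB/T 7714: HUA Chenyan, ZOU Yanzhen, ZHU Zixiao, et al. Refine Software QA Document Search Results Based on Code Pattern[J]. Journal of Frontiers of Computer Science and Technology, 2017.
APA: Hua, C., Zou, Y., Zhu, Z., & Xie, B. (2017). Refine Software QA Document Search Results Based on Code Pattern. Journal of Frontiers of Computer Science and Technology.
MLA: Hua, Chenyan, et al. "Refine Software QA Document Search Results Based on Code Pattern." Journal of Frontiers of Computer Science and Technology (2017).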