您的位置:网站首页 > 《中文科技期刊数据库》 > 工程技术 > 自动化计算机 > 计算机网络 > 摘要

Summarization based on physical features and logical structure of multi documents

《高技术通讯:英文版》2005年 第2期 | 秦兵 LiuTing LiSheng   SchoolofComputerScienceandTechnology HarbinInstituteofTechnology Harbin150001 P.R.China
购物车 | ★ 收藏 | 分享
摘 要:With the rapid development of the Internet, multi documents summarization is becoming a very hot research topic. In order to generate a summarization that can effectively characterize the original information from documents, this paper proposes a multi documents summarization approach based on the physical features and logical structure of the document set. This method firstly clusterssimilar sentences into several Logical Topics (LTs), and then orders these topics according to their physical features of multi documents. After that, sentences used for the summarization are extracted from these LTs, and finally the summarization is generated via certain sorting algorithms. Our experiments show that the information coverage rate of our method is 8.83% higher than those methods based solely on logical structures, and 14.31% higher than Top-N method.
【分 类】【工业技术】 > 自动化技术、计算机技术 > 计算技术、计算机技术 > 计算机的应用 > 计算机网络
【关键词】 因特网 多文件摘要 逻辑结构 拓扑结构 物理特征
【出 处】 《高技术通讯:英文版》2005年 第2期 133-136页 共4页
【收 录】 中文科技期刊数据库