A Table Detection Method for Multipage PDF Documents via Visual Seperators and Tabular Structures | |
Fang, Jing ; Gaoa, Liangcai ; Bai, Kun ; Qiu, Ruiheng ; Tao, Xin ; Tang, Zhi | |
2011 | |
Keywords | table detection; table spotting; PDF documents; separators; ruling lines |
Abstract | Table detection has long been an important task in document analysis and recognition. In this paper, we propose a novel and effective table detection method for PDF documents that uses visual separators and geometric content layout information. The visual separators include not only graphic ruling lines but also white spaces, so that tables with or without ruling lines can be handled. Furthermore, we detect page columns to assist table region delimitation in pages with complex layouts. Evaluations of our algorithm on an e-Book dataset and a scientific document dataset show competitive performance. Notably, the proposed method has been successfully incorporated into a commercial software package for large-scale Chinese e-Book production.; http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000343450700153&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701 ; Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; CPCI-S(ISTP); 3 |
Language | English |
DOI | 10.1109/ICDAR.2011.304 |
Content type | Other |
Source URL | [http://ir.pku.edu.cn/handle/20.500.11897/321245] |
Collection | School of Electronics Engineering and Computer Science (信息科学技术学院) |
Recommended citation (GB/T 7714) | Fang, Jing, Gaoa, Liangcai, Bai, Kun, et al. A Table Detection Method for Multipage PDF Documents via Visual Seperators and Tabular Structures. 2011-01-01. |