Comprehensive global typography extraction system for electronic book documents | |
Gao, Liangcai ; Tang, Zhi ; Lin, Xiaofan ; Qiu, Ruiheng | |
2008 | |
英文摘要 | Book documents usually have consistent typographies throughout the whole book including headers, footers, columns, text line directions, and fonts used in the each level of headings. Such document-level typography information is of great value for downstream document processing applications. This paper presents a document analysis System that can extract a comprehensive set Of typographies used in book documents, The system consists of several components: recognition of fonts used in the body text and chapter headings; detection of page body area, headers and footers; detection of columns, text line direction and line spacing of body text. Page-association is employed in the system. The preliminary experimental results demonstrate the effectiveness of the system.; http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000263679200074&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701 ; Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; Imaging Science & Photographic Technology; EI; CPCI-S(ISTP); 3 |
语种 | 英语 |
DOI标识 | 10.1109/DAS.2008.30 |
内容类型 | 其他 |
源URL | [http://ir.pku.edu.cn/handle/20.500.11897/162019] |
专题 | 信息科学技术学院 |
推荐引用方式 GB/T 7714 | Gao, Liangcai,Tang, Zhi,Lin, Xiaofan,et al. Comprehensive global typography extraction system for electronic book documents. 2008-01-01. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论