Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization
Zhang, Lihua [1,2]; Zhang, Shihua [1,2,3]
Journal: NUCLEIC ACIDS RESEARCH
2019-07-26
Volume: 47; Issue: 13; Pages: 6606-6617
ISSN: 0305-1048
DOI: 10.1093/nar/gkz488
Abstract: High-throughput biological technologies (e.g. ChIP-seq, RNA-seq and single-cell RNA-seq) rapidly accelerate the accumulation of genome-wide omics data in diverse interrelated biological scenarios (e.g. cells, tissues and conditions). Integration and differential analysis are two common paradigms for exploring and analyzing such data. However, current integrative methods usually ignore the differential part, and typical differential analysis methods either fail to identify combinatorial patterns of difference or require matched dimensions of the data. Here, we propose a flexible framework CSMF to combine them into one paradigm to simultaneously reveal Common and Specific patterns via Matrix Factorization from data generated under interrelated biological scenarios. We demonstrate the effectiveness of CSMF with four representative applications including pairwise ChIP-seq data describing the chromatin modification map between K562 and Huvec cell lines; pairwise RNA-seq data representing the expression profiles of two different cancers; RNA-seq data of three breast cancer subtypes; and single-cell RNA-seq data of human embryonic stem cell differentiation at six time points. Extensive analysis yields novel insights into hidden combinatorial patterns in these multi-modal data. Results demonstrate that CSMF is a powerful tool to uncover common and specific patterns with significant biological implications from data of interrelated biological scenarios.
Funding: National Natural Science Foundation of China [11661141019]; National Natural Science Foundation of China [61621003]; National Natural Science Foundation of China [61422309]; National Natural Science Foundation of China [61379092]; Strategic Priority Research Program of the Chinese Academy of Sciences (CAS) [XDB13040600]; National Ten Thousand Talent Program for Young Top-notch Talents; Key Research Program of the Chinese Academy of Sciences [KFZD-SW-219]; National Key Research and Development Program of China [2017YFC0908405]; CAS Frontier Science Research Key Project for Top Young Scientist [QYZDB-SSW-SYS008]
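WOS research area: Biochemistry & Molecular Biology
Language: English
Publisher: OXFORD UNIV PRESS
WOS accession number: WOS:000490556600010
Content type: Journal article
Source URL: http://ir.amss.ac.cn/handle/2S8OKBNM/35774
Collection: Institute of Applied Mathematics
Corresponding author: Zhang, Shihua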
Author affiliations:
1. Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
2. Chinese Acad Sci, Acad Math & Syst Sci, NCMIS, CEMS, RCSDS, Beijing 100190, Peoples R China
3. Chinese Acad Sci, Ctr Excellence Anim Evolut & Genet, Kunming 650223, Yunnan, Peoples R China
Recommended citation:
GB/T 7714: Zhang, Lihua, Zhang, Shihua. Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization[J]. NUCLEIC ACIDS RESEARCH, 2019, 47(13): 6606-6617.
APA: Zhang, Lihua, & Zhang, Shihua. (2019). Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization. NUCLEIC ACIDS RESEARCH, 47(13), 6606-6617.
MLA: Zhang, Lihua, et al. "Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization." NUCLEIC ACIDS RESEARCH 47.13 (2019): 6606-6617.