CORC  > 上海财经大学  > 上海财经大学
Overlapping clustering of gene expression data using penalized weighted normalized cut
Hidalgo, Sebastian J. Teran1; Zhu, Tingyu2; Wu, Mengyun1,3; Ma, Shuangge1
刊名GENETIC EPIDEMIOLOGY
2018-12
卷号42期号:8页码:796-811
关键词gene expression data NCut overlapping clustering penalization
ISSN号0741-0395
DOI10.1002/gepi.22164
英文摘要Clustering has been widely conducted in the analysis of gene expression data. For complex diseases, it has played an important role in identifying unknown functions of genes, serving as the basis of other analysis, and others. A common limitation of most existing clustering approaches is to assume that genes are separated into disjoint clusters. As genes often have multiple functions and thus can belong to more than one functional cluster, the disjoint clustering results can be unsatisfactory. In addition, due to the small sample sizes of genetic profiling studies and other factors, there may not be sufficient evidence to confirm the specific functions of some genes and cluster them definitively into disjoint clusters. In this study, we develop an effective overlapping clustering approach, which takes account into the multiplicity of gene functions and lack of certainty in practical analysis. A penalized weighted normalized cut (PWNCut) criterion is proposed based on the NCut technique and an L2 norm constraint. It outperforms multiple competitors in simulation. The analysis of the cancer genome atlas (TCGA) data on breast cancer and cervical cancer leads to biologically sensible findings which differ from those using the alternatives. To facilitate implementation, we develop the function pwncut in the R package NCutYX.
WOS研究方向Genetics & Heredity ; Mathematical & Computational Biology
语种英语
出版者WILEY
WOS记录号WOS:000450354800004
内容类型期刊论文
源URL[http://10.2.47.112/handle/2XS4QKH4/465]  
专题上海财经大学
作者单位1.Yale Univ, Dept Biostat, New Haven, CT 06520 USA;
2.Xiamen Univ, Dept Stat, Xiamen, Peoples R China;
3.Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
推荐引用方式
GB/T 7714
Hidalgo, Sebastian J. Teran,Zhu, Tingyu,Wu, Mengyun,et al. Overlapping clustering of gene expression data using penalized weighted normalized cut[J]. GENETIC EPIDEMIOLOGY,2018,42(8):796-811.
APA Hidalgo, Sebastian J. Teran,Zhu, Tingyu,Wu, Mengyun,&Ma, Shuangge.(2018).Overlapping clustering of gene expression data using penalized weighted normalized cut.GENETIC EPIDEMIOLOGY,42(8),796-811.
MLA Hidalgo, Sebastian J. Teran,et al."Overlapping clustering of gene expression data using penalized weighted normalized cut".GENETIC EPIDEMIOLOGY 42.8(2018):796-811.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace