Journal of Southern Medical University ›› 2015, Vol. 35 ›› Issue (06): 777-.

Detection and analysis of copy number variation from 1000 Genomes trio data

Zhao Hui (赵辉), Zhao Fangqing (赵方庆)

  • Published: 2015-06-20  Online: 2015-06-20

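Abstract (Chinese version): Copy number variation (CNV) is an important type of genomic structural variation and plays a
major role in the onset and progression of many complex human diseases. Current CNV detection studies focus mainly on
identifying CNVs in a single sample relative to a reference sequence, or on identifying CNVs between paired samples. Such
analysis at the level of individuals, however, is limited to comparisons between individuals and cannot support genetic
analysis of transmission from parents to offspring. Using father-mother-child trio data from the 1000 Genomes Project, this
study searches for regions in which the offspring differs from the father and the mother. It identifies CNVs that the
children inherited from their parents, infers through hierarchical clustering analysis how these CNVs were generated, and
also detects a small number of putative CNVs that are homozygous in the offspring relative to the parents.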

Abstract: Copy number variation (CNV) is an important type of genomic structural variation and plays a crucial role in
the onset and progression of many complex human diseases. Most current bioinformatics research focuses on developing
algorithms and tools for detecting CNVs in a single sample or in paired samples, but such individual-level analysis cannot
address the inheritance of CNVs from parents to offspring. We performed a family-based parent-offspring CNV analysis using
father-mother-child trio data from the 1000 Genomes Project (1000G). We identified a number of CNVs that the offspring
inherited from their parents and inferred through hierarchical clustering analysis how they were generated. In addition,
we discovered several de novo CNV candidates.
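
The abstract does not say how offspring CNVs were matched to parental CNVs. As a rough illustration of the general idea only, the sketch below labels each CNV call in a child as paternal, maternal, shared by both parents, or a de novo candidate, using a 50% reciprocal-overlap rule. The `CNV` record, the function names, the threshold, and the example coordinates are all assumptions made for this illustration and are not taken from the paper.

```python
from typing import List, NamedTuple


class CNV(NamedTuple):
    """Hypothetical CNV call record (not the paper's data format)."""
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive
    kind: str   # "DEL" or "DUP"


def reciprocal_overlap(a: CNV, b: CNV) -> float:
    """Return the smaller of the two overlap fractions, or 0.0 if the
    calls are on different chromosomes, differ in type, or do not overlap."""
    if a.chrom != b.chrom or a.kind != b.kind:
        return 0.0
    shared = min(a.end, b.end) - max(a.start, b.start)
    if shared <= 0:
        return 0.0
    return min(shared / (a.end - a.start), shared / (b.end - b.start))


def classify_child_cnv(child: CNV, father: List[CNV], mother: List[CNV],
                       min_overlap: float = 0.5) -> str:
    """Label one child CNV by whether a matching call exists in each parent.
    The 0.5 threshold is an assumed, commonly used convention."""
    in_father = any(reciprocal_overlap(child, c) >= min_overlap for c in father)
    in_mother = any(reciprocal_overlap(child, c) >= min_overlap for c in mother)
    if in_father and in_mother:
        return "shared_by_both_parents"
    if in_father:
        return "paternal"
    if in_mother:
        return "maternal"
    return "de_novo_candidate"


# Made-up example calls for one trio.
father_calls = [CNV("chr1", 1000, 5000, "DEL")]
mother_calls = [CNV("chr2", 200, 900, "DUP")]
child_calls = [
    CNV("chr1", 1200, 5200, "DEL"),  # overlaps the father's deletion
    CNV("chr2", 200, 900, "DUP"),    # identical to the mother's duplication
    CNV("chr3", 100, 400, "DEL"),    # seen in neither parent
]
labels = [classify_child_cnv(c, father_calls, mother_calls) for c in child_calls]
print(labels)
```

A call labelled `de_novo_candidate` here only means that no matching parental call was found; in real data it could equally reflect a missed call in a parent, so such candidates would need further validation.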