南方医科大学学报 ›› 2016, Vol. 36 ›› Issue (04): 552-.

• • 上一篇    下一篇

LongMan:一个哺乳类直系同源长链非编码RNA数据库

杨小雪,张海,贺莎,林杰,朱浩   

  • 出版日期:2016-04-20 发布日期:2016-04-20

The beta version of LongMan: a large-scale mammalian lncRNA database orthologous to human lncRNAs

  • Online:2016-04-20 Published:2016-04-20

摘要: 目的计算并预测13 562个GENCODE项目首期鉴定的人类长链非编码RNA在16个哺乳动物的直系同源基因,并建立 数据库LongMan,为长链非编码RNA研究提供重要数据。方法使用RNAfold预测13 562个人类长链非编码RNA每个外显 子的结构;使用Infernal对每个外显子进行基因组搜索,分析其在16个哺乳动物可能的同源外显子;分析每个人类长链非编码 RNA是否有同源基因;分析同源长链非编码RNA中的转座子和剪切信号;构造数据库的搜索引擎和输出界面;实现数据库维护 更新机制。结果LongMan目前收录133 646个直系同源长链非编码RNA;提供序列、比对、转座子和种系特异性插缺(indel) 等信息;提供多条件组合查询;提供显示与下载功能。结论LongMan是首个大规模多种系同源长链非编码RNA数据库,对长 链非编码RNA比较与功能研究具有重要价值。

Abstract: Objective To predict orthologous sequences of the GENCODE-identified 13 562 human long non-coding RNAs (lncRNA) in 16 mammalian genomes and construct a lncRNA database LongMan for lncRNA studies. Methods The exon structures of a total of 13 562 human lncRNAs were analyzed using RNAfold, and their orthologous sequences were searched against 16 mammalian genomes using Infernal. The potential orthologous genes, transposons and splicing signals of human lncRNAs were predicted to construct a lncRNA database with a updating mechanism. Results and Conclusion The lncRNA database LongMan we constructed, which currently contains 133 646 orthologous lncRNAs, provides information of the sequences, alignments, transposons, and species-specific insertions and deletions and allows database search on combinatorial conditions, graphic display and data download. As the first large-scale mammalian orthologous lncRNA database, LongMan has important values in future comparative and functional studies of lncRNAs.