欢迎访问《渔业研究》官方网站,今天是 分享到:

渔业研究 ›› 2023, Vol. 45 ›› Issue (3): 213-221.DOI: 10.14012/j.cnki.fjsc.2023.03.001

• 论文与报告 •    下一篇

西藏山溪鲵转录组组装与分析

朱文文1,2(), 郁川1,2, 熊建利3, 黄勇4,*()   

  1. 1.洛阳职业技术学院食品与药品学院,河南 洛阳 471000
    2.河南省动物疾病与公共卫生工程研究中心,河南 洛阳 471000
    3.绵阳师范学院,生态安全与保护四川省重点实验室,四川 绵阳 621000
    4.河南科技大学动物科学院水生动物适应与进化实验室,河南 洛阳 471000
  • 收稿日期:2022-09-19 出版日期:2023-06-25 发布日期:2023-06-16
  • 通讯作者: 黄勇
  • 作者简介:朱文文(1979—),女,副教授,博士,主要研究方向为动物学。E-mail:anglezww@126.com
  • 基金资助:
    国家自然科学基金项目(31471971);河南省高等学校青年骨干教师资助计划项目(2015GGJS-054)

Transcriptome data assembly and analysis of Batrachuperus tibetanus

ZHU Wenwen1,2(), YU Chuan1,2, XIONG Jianli3, HUANG Yong4,*()   

  1. 1. Food and Drug College,Luoyang Polytechnic, Luoyang 471000,China
    2. Animal Diseases and Public Health Engineering Research Center of Henan Province,Luoyang 471000,China
    3. Ecological Security & Protection,Mianyang Teachers’ College,Mianyang 621000,China
    4. Laboratory of Adaptation and Evolution of Aquatic Animals,College of Animal Science and Technology, Henan University of Science and Technology,Luoyang 471000,Henan
  • Received:2022-09-19 Online:2023-06-25 Published:2023-06-16
  • Contact: HUANG Yong

摘要:

目前有关小鲵属物种的基因组信息和转录组序列报道尚少。本研究利用Illumina HiSeq高通量测序技术对西藏山溪鲵(Batrachuperus tibetanus)进行了转录组测序与数据分析,再经拼接和组装,最终获得了36 252条unigenes,序列的平均长度为937 bp,N50为1 549 bp。此外,对unigenes进行GC含量、长度分布和基因表达量等分析,结果显示测序数据质量较好。使用Swiss-prot、Nr、Pfam、KEGG、KOG和GO数据库注释西藏山溪鲵转录组unigenes,分别有16 465、18 749、13 983、10 607、15 704和14 242条unigenes 获得注释。根据注释结果,共检测出17 472条CDS,鉴定出2 779个SSR标记和3 100个SNP 位点。KEGG通路分析结果表明:获得注释的10 607条unigenes又被划分到257个代谢通路中,其中涉及信号转导通路的unigenes比例最大。本研究通过对西藏山溪鲵进行转录组测序,得到的大量转录本信息,为进一步了解西藏山溪鲵遗传多样性分析、分子标记开发、种群资源评价与保护和功能基因的克隆等研究提供了丰富的数据资源。

关键词: 西藏山溪鲵, 转录组, unigenes, 基因注释

Abstract:

The genomic information and transcriptome sequences of Batrachuperus tibetanus are rarely reported at present.In this study,the transcripts of this species were sequenced and assembled using Illumina HiSeq platform.A total of 36 252 unigenes were obtained through sequencing and assembling,with an average length of 937 bp and a N50 of 1 549 bp.Additionally,GC content,length distribution and expression level of unigenes were assessed.The result showed that the sequencing data was of high quality.Blasted against the Swiss-prot,Nr,Pfam,KEGG,KOG and GO,a total of 16 465,18 749,13 983,10 607,15 704 and 14 242 unigenes were annotated,respectively.According to the functional annotation results,17 472 coding sequence (CDS) were detected.A total of 2 779 microsatellite markers (SSR) and 3 100 single nucleotide polymorphisms (SNP) were identified.KEGG pathway analysis showed that 10 607 unigenes were annotated,and it was divided into 257 categories according to the different pathways.Among all the pathways,the number of unigenes involved in signal transduction pathway exhibited the largest proportion.In this study,the obtained transcriptome sequences provided rich data resources for further understanding of genetic diversity analysis,molecular marker development,population resource evaluation and conservation,as well as functional gene cloning of the B.tibetanus.

Key words: Batrachuperus tibetanus, transcriptome, unigenes, gene annotation

中图分类号: