Applications of Bioinformatics in Entomological Research

Zhang Zan (张赞) 1, Liu Jinding (刘金定) 2, Huang Shuiqing (黄水清) 3, Li Fei (李飞) 1

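Author affiliations

1 College of Plant Protection, Nanjing Agricultural University, Nanjing 210095

2 College of Plant Protection, Nanjing Agricultural University, Nanjing 210095; College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095

3 College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095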


Abstract
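With the continued development of deep sequencing and gene chip (microarray) technologies, genome, transcriptome, and expression profile data have accumulated in large quantities. To date, the genomes of more than ten insect species have been sequenced, and transcriptome data have been reported for more than 30 insect species. Traditional biostatistical methods cannot handle biological data on this scale. Quantitative change has led to qualitative change: the massive accumulation of biological data has given rise to a new discipline, bioinformatics. Bioinformatics integrates the theory and research content of statistics, information science, and biology, and has been widely applied in medicine, basic biology, agricultural science, and entomology. Its goals are to store, manage, and mine data. Its main research tasks are therefore to build and maintain biological databases, to design and develop biological software based on pattern recognition, machine learning, data mining, and related methods, and to use these tools for in-depth data mining. This paper first briefly introduces the history and current state of bioinformatics and its applications in entomology, then reviews research progress in insect genomics and transcriptomics, and finally discusses the prospects for applying bioinformatics in entomological research.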



Language: Chinese

Funding: Fundamental Research Funds for the Central Universities (KYJ200908; 1806J0063)

Keywords: bioinformatics; entomology; genomics; transcriptomics

