|
|
||||||||
|
Plant Physiology 144:588-593 (2007) © 2007 American Society of Plant Biologists Genome Sequencing and Genome Resources in Model LegumesKazusa DNA Research Institute, Kisarazu, Chiba 2920818 Japan
Since the completion of the sequencing of the Arabidopsis (Arabidopsis thaliana) genome in 2000, it has become clear that information about the genome of a particular plant species can have dramatic benefits in promoting plant molecular genetics in general. To create a similar situation in legumes, two species with small genomes, Lotus japonicus (Japanese trefoil) and Medicago truncatula (barrel medic), with diploid genomes of 470 to 500 Mb in size, were chosen as references, and genome sequencing was launched at the beginning of this decade (Young et al., 2005
Details of the genome sequencing projects in L. japonicus and M. truncatula have been described previously (Young et al., 2005
Gene assignment and gene modeling in L. japonicus were performed as follows. Similarity searches against several DNA and protein libraries were executed to find homologous regions to known gene sequences. Concurrently, ab initio gene-finding predictions were done, including those for splice sites (NetGene2 and splicepredictor), gene structure (GeneMark.hmm and Genescan), and protein-coding exons (Grail; Mural et al., 1992
The general features of the deduced protein-encoding genes in the model legumes are summarized as follows. The average length of a coding exon is 304 bp and the average number of introns per gene is 3.7 in L. japonicus. These features are quite similar to those of M. truncatula and Arabidopsis However, the average length of a gene, including introns, is longer in L. japonicus (2,883 bp) than in Arabidopsis (1,918 bp) due to the longer average length of an intron (Kato et al., 2003
Information about genome sequences and their positions in the genomes of both L. japonicus and M. truncatula have made it possible to explore genome-scale synteny, which provides a basis for comparative genomic studies. Using more than 100 Mb of genome sequences anchored on the genetic linkage maps, a preliminary detailed comparison of sequence similarity and commonalities in genome position between the two genomes has been carried out (Cannon et al., 2006
Comparison at the chromosomal level revealed the presence of large-scale synteny blocks (Choi et al., 2004b
During the course of the synteny analysis, segmental duplications within the genomes of L. japonicus and M. truncatula were often detected as synteny blocks (Cannon et al., 2006
Significant microsyntenic but not macrosyntenic relationships have been detected between model legumes and other plant species, including Arabidopsis, poplar (Populus trichocarpa), and rice (Oryza sativa; Stracke et al., 2003
Selected information resources for the two model legumes are listed in Table I . Kazusa DNA Research Institute provides a Web database for the L. japonicus genome sequence project (http://www.kazusa.or.jp/lotus). Annotated genome and EST sequences as well as a high-density genetic map with marker information are available. The database provides visualized baseline annotations, including ab initio gene predictions and alignment to EST/cDNA and known protein sequences. A central online database for the M. truncatula genome project (http://medicago.org/genome/) is maintained by the University of Minnesota and the Center for Computational Genomics and Bioinformatics (CCGB). This Web site offers a Java visualization interface for overlap information and status of sequence assembly of the BAC clones, and furnishes links to other M. truncatula sequencing or information centers, which in turn provide access to sequence-related information, such as genetic markers and BAC contigs (Cannon et al., 2005
Genetic mapping of determined sequences has been carried out to enhance the value of such information for molecular genetics and comparative genomics. In L. japonicus, an intraspecies genetic linkage map created on the basis of a cross between two accessions, Gifu B-129 and Miyakojima MG-20, has been used as a reference (Hayashi et al., 2001
EST information is crucial for accurate gene annotation of the genome sequence; it gives information about gene structure, alternative splicing, expression patterns, and transcript abundance. The L. japonicus EST information and transcriptome data are provided by the Lotus EST database at http://www.kazusa.or.jp/en/plant/lotus/EST/. The EST sequences were clustered into groups to reduce redundancy and were annotated with the BLASTX search results against a protein database UniRef100 (Apweiler et al., 2004
As the genome projects of the two model legumes have advanced, material resources, such as a large number of cDNA clones and genomic libraries, have been developed. These resources constitute extremely valuable tools for genetic and physiological studies on individual biological phenomena when they are made available to the research community. Miyazaki University has established a resource center for L. japonicus and soybean (Glycine max) that is financially supported by the National Bioresource Project of Japan. They distribute TAC/BAC genomic and cDNA clones generated during the course of Lotus genome analysis, as well as seeds of major experimental accessions and wild accessions collected from all over Japan. LegumeBase (http://www.legumebase.agr.miyazaki-u.ac.jp) provides a list of resources and detailed information. Recombinant inbred lines between accessions Miyakojima MG-20 and Gifu B-129 are also available, and phenotypic and genotyping data on 205 established lines can be accessed online through LegumeBase. John Innes Center provides a screening service of a TILLING population of approximately 5,000 M2 lines and a database (http://www.lotusjaponicus.org) of phenotypic descriptions and photographs of various mutants (Perry et al., 2003
Most material resources for M. truncatula are distributed from the institutes where the resources were developed. cDNA and BAC libraries generated by European groups are collected and distributed to the research community from the Centre National de Ressources Génomiques Végétales in France (http://toulouse.inra.fr/cnrgv/). In the United States, mth1 and mth2 BAC libraries and derived individual clones are provided by the Clemson University Genomics Institute (http://www.genome.clemson.edu/), and EST clones from 14 distinct cDNA libraries are available from the Samuel Roberts Noble Foundation (http://www.noble.org/PlantBio/). Germlines and other genetic resources, such as recombinant inbred lines and mutant lines, are maintained at four resource centers, the South Australian Research and Development Institute, the U.S. Department of Agriculture National Plant Germplasm System, INRA, and the Samuel Roberts Noble Foundation (United States). Detailed information about the above resources has been summarized in an online-accessible article (Nair et al., 2006
An essential role for model plants is to serve as a source of the accumulated knowledge to transfer to crop plants by means of common gene sequences and DNA markers. Though reliable and efficient procedures for knowledge transfer have yet to be established, intensive trials to utilize the information of model legumes in crops have been undertaken for a variety of purposes. One straightforward and promising approachdirect gene transfer from a model plant to a cropis exemplified by a report using the isoflavone synthase gene of M. truncatula to engineer genistein glucoside production in alfalfa (Medicago sativa; Deavours and Dixon, 2005
DNA markers that allow cross-species mapping, referred to as "anchor markers," are essential for comparative genome analysis. The anchor markers are usually developed from the protein-encoding regions of the genome to define unique loci in the genetic linkage maps of multiple species. An intensive study in which anchor markers successfully detected orthologous loci in multiple legume genomes has been reported (Choi et al., 2006
By utilizing the anchor markers as well as orthologous gene sequences, a growing number of studies have demonstrated extensive synteny among model and crop legumes, as summarized previously (Zhu et al., 2005
To support comparative genome analysis among legumes, databases that integrate genetic and genomic data from multiple legume species have been developed. LIS (http://www.comparative-legumes.org/lis/) and LegumeDB (http://ccg.murdoch.edu.au/index.php/LegumeDB; Gonzales et al., 2005
The increasing rate of accumulation of genomic sequence information allied to the increasing availability of associated material resources is drastically accelerating the pace of investigation of the genetic backgrounds of individual biological phenomena in model legumes. Furthermore, the development of these resources has facilitated the introduction of "omics" approaches that are crucial for our comprehensive understanding of whole genetic systems in legume plants. Comparison of the knowledge obtained by performing comparative genomics between legumes and other model plants, such as Arabidopsis and rice, would not only allow the identification of legume-specific systems but also would provide insights into plant genome evolution.
Model plants hold the promise that their genomic and genetic information will be used for crop breeding. A typical example are the Poaceae, where the sequencing of the rice genome is encouraging the development of molecular genetics approaches to other Poaceae crops, such as maize (Zea mays), sorghum (Sorghum bicolor), wheat (Triticum aestivum), and barley (Hordeum vulgare; Jaiswal et al., 2006 The transfer of knowledge acquired from model plants, mainly through orthologous gene sequences and DNA markers, results in identification and isolation of the corresponding genes using genomic and/or cDNA libraries and common DNA markers on genetic linkage maps in crop plants. Such resources have been vigorously developed for model legumes but not intensively for crop legumes so far. A remarkable exception is sequencing of the soybean genome (http://www.energy.gov/news/2979.htm). The creation of a larger number of "common words" will be crucial for the more efficient transfer of knowledge and for facilitating the exchange of knowledge between researchers working on legumes. In addition, organization of a public system to share the information and material resources within the legume community, e.g. genome databases, bioinformatics tools, and resource centers, is urgently required.
We thank Drs. Giles Oldroyd and Nevin Young for updated information on the M. truncatula genome sequencing project. Received February 3, 2007; accepted March 10, 2007; published June 6, 2007.
The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Satoshi Tabata (tabata{at}kazusa.or.jp). www.plantphysiol.org/cgi/doi/10.1104/pp.107.097493 * Corresponding author; e-mail tabata{at}kazusa.or.jp; fax 81438523934.
Allen JE, Pertea M, Salzberg SL (2004) Computational gene prediction using multiple sources of evidence. Genome Res 14: 142148 Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, et al (2004) UniProt: the Universal Protein knowledgebase. Nucleic Acids Res 32: D115D119 Asamizu E, Kato T, Sato S, Nakamura Y, Kaneko T, Tabata S (2003) Structural analysis of a Lotus japonicus genome. IV. Sequence features and mapping of seventy-three TAC clones which cover the 7.5 mb regions of the genome. DNA Res 10: 115122[Abstract] Asamizu E, Nakamura Y, Sato S, Tabata S (2005) Comparison of the transcript profiles from the root and the nodulating root of the model legume Lotus japonicus by serial analysis of gene expression. Mol Plant Microbe Interact 18: 487498[ISI][Medline] Ashikari M, Matsuoka M (2006) Identification, isolation and pyramiding of quantitative trait loci for rice breeding. Trends Plant Sci 11: 344350[CrossRef][ISI][Medline] Aubert G, Morin J, Jacquin F, Loridon K, Quillet MC, Petit A, Rameau C, Lejeune-Hénaut I, Huguet T, Burstin J (2006) Functional mapping in pea, as an aid to the candidate gene selection and for investigating synteny with the model legume Medicago truncatula. Theor Appl Genet 112: 10241041[CrossRef][ISI][Medline] Brendel V, Kleffe J (1998) Prediction of locally optimal splice sites in plant pre-mRNA with applications to gene identification in Arabidopsis thaliana genomic DNA. Nucleic Acids Res 26: 47484757 Burge C, Karlin S (1997) Prediction of complete gene structures in human genomic DNA. J Mol Biol 268: 7894[CrossRef][ISI][Medline] Cannon SB, Crow JA, Heuer ML, Wang X, Cannon EK, Dwan C, Lamblin AF, Vasdewani J, Mudge J, Cook A, et al (2005) Databases and information integration for the Medicago truncatula genome and transcriptome. Plant Physiol 138: 13 Cannon SB, Sterck L, Rombauts S, Sato S, Cheung F, Gouzy J, Wang X, Mudge J, Vasdewani J, Schiex T, et al (2006) Legume genome evolution viewed through the Medicago truncatula and Lotus japonicus genomes. Proc Natl Acad Sci USA 103: 1495914964 Choi HK, Kim D, Uhm T, Limpens E, Lim H, Mun JH, Kalo P, Penmetsa RV, Seres A, Kulikova O, et al (2004a) A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa. Genetics 166: 14631502 Choi HK, Luckow MA, Doyle J, Cook DR (2006) Development of nuclear gene-derived molecular markers linked to legume genetic maps. Mol Genet Genomics 276: 5670[CrossRef][ISI][Medline] Choi HK, Mun JH, Kim DJ, Zhu H, Baek JM, Mudge J, Roe B, Ellis N, Doyle J, Kiss GB, et al (2004b) Estimating genome conservation between crop and model legume species. Proc Natl Acad Sci USA 101: 1528915294 Deavours BE, Dixon RA (2005) Metabolic engineering of isoflavonoid biosynthesis in alfalfa. Plant Physiol 138: 22452259 Foissac S, Bardou P, Moisan A, Cros MJ, Schiex T (2003) EuGene'Hom: a generic similarity-based gene finder using multiple homologous sequences. Nucleic Acids Res 31: 37423745 Fredslund J, Madsen LH, Hougaard BK, Sandal N, Stougaard J, Bertioli D, Schauser L (2006) GeMprospectoronline design of cross-species genetic marker candidates in legumes and grasses. Nucleic Acids Res 34: W670W675 Graham PH, Vance CP (2003) Legumes: importance and constraints to greater use. Plant Physiol 131: 872877 Gonzales MD, Archuleta E, Farmer A, Gajendran K, Grant D, Shoemaker R, Beavis WD, Waugh ME (2005) The Legume Information System (LIS): an integrated information resource for comparative legume biology. Nucleic Acids Res 33: D660D665 Haas BJ, Delcher AL, Mount SM, Wortman JR, Smith RK Jr, Hannick LI, Maiti R, Ronning CM, Rusch DB, Town CD, et al (2003) Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res 31: 56545666 Hartmann S, Lu D, Phillips J, Vision TJ (2006) Phytome: a platform for plant comparative genomics. Nucleic Acids Res 34: D724D730 Hayashi M, Miyahara A, Sato S, Kato T, Yoshikawa M, Taketa M, Hayashi M, Pedrosa A, Onda R, Imaizumi-Anraku H, et al (2001) Construction of a genetic linkage map of the model legume Lotus japonicus using an intraspecific F2 population. DNA Res 8: 301310[Abstract] Hebsgaard SM, Korning PG, Tolstrup N, Engelbrecht J, Rouze P, Brunak S (1996) Splice site prediction in Arabidopsis thaliana DNA by combining local and global sequence information. Nucleic Acids Res 24: 34393452 Huang X, Adams MD, Zhou H, Kerlavage AR (1997) A tool for analyzing and annotating genomic sequences. Genomics 46: 3745[CrossRef][ISI][Medline] Hwang TY, Moon JK, Yu S, Yang K, Mohankumar S, Yu YH, Lee YH, Kim HS, Kim HM, Maroof MAS, et al (2006) Application of comparative genomics in developing molecular markers tightly linked to the virus resistance gene Rsv4 in soybean. Genome 49: 380388[Medline] Jaiswal P, Ni J, Yap I, Ware D, Spooner W, Youens-Clark K, Ren L, Liang C, Zhao W, Ratnapu K, et al (2006) Gramene: a bird's eye view of cereal genomes. Nucleic Acids Res 34: D717D723 Journet EP, van Tuinen D, Gouzy J, Crespeau H, Carreau V, Farmer MJ, Niebel A, Schiex T, Jaillon O, Chatagnier O, et al (2002) Exploring root symbiotic programs in the model legume Medicago truncatula using EST analysis. Nucleic Acids Res 30: 55795592 Kaneko T, Asamizu E, Kato T, Sato S, Nakamura Y, Tabata S (2003) Structural analysis of a Lotus japonicus genome. III. Sequence features and mapping of sixty-two TAC clones which cover the 6.7 Mb regions of the genome. DNA Res 10: 2733[Abstract] Kato T, Sato S, Nakamura Y, Kaneko T, Asamizu E, Tabata S (2003) Structural analysis of a Lotus japonicus genome. V. Sequence features and mapping of sixty-four TAC clones which cover the 6.4 mb regions of the genome. DNA Res 10: 277285[Abstract] Kevei Z, Seres A, Kereszt A, Kalo P, Kiss P, Toth G, Endre G, Kiss GB (2005) Significant microsynteny with new evolutionary highlights is detected between Arabidopsis and legume model plants despite the lack of macrosynteny. Mol Genet Genomics 274: 644657[CrossRef][ISI][Medline] Koebner RM, Summers RW (2003) 21st century wheat breeding: plot selection or plant detection? Trends Biotechnol 21: 5963[CrossRef][ISI][Medline] Kouchi H, Shimomura K, Hata S, Hirota A, Wu GJ, Kumagai H, Tajima S, Suganuma N, Suzuki A, Aoki T, et al (2004) Large-scale analysis of gene expression profile during early stages of root nodule formation in a model legume, Lotus japonicus. DNA Res 11: 263274[Abstract] Lee Y, Tsai J, Sunkara S, Karamycheva S, Pertea G, Sultana R, Antonescu V, Chan A, Cheung F, Quackenbush J (2005) The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes. Nucleic Acids Res 33: D71D74 Liu YG, Shirano Y, Fukaki H, Yanai Y, Tasaka M, Tabata S, Shibata D (1999) Complementation of plant mutants with large genomic DNA fragments by a transformation-competent artificial chromosome vector accelerates positional cloning. Proc Natl Acad Sci USA 96: 65356540 Lukashin AV, Borodovsky M (1998) GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res 26: 11071115 Moolhuijzen P, Cakir M, Hunter A, Schibeci D, Macgregor A, Smith C, Francki M, Jones MG, Appels R, Bellgard M (2006) LegumeDB bioinformatics resource: comparative genomic analysis and novel cross-genera marker identification in lupin and pasture legume species. Genome 49: 689699[Medline] Mural RJ, Einstein JR, Guan X, Mann RC, Uberbacher EC (1992) An artificial intelligence approach to DNA sequence feature recognition. Trends Biotechnol 10: 6669[CrossRef][ISI][Medline] Nair R, Hughes SJ, Ellwood S, Oliver R, Green SL, Delalande M, Wen J, Oldroyd G (2006) M. truncatula stock centre. In The Medicago truncatula Handbook. Samuel Roberts Noble Foundation. http://www.noble.org/MedicagoHandbook/ (December 1, 2006) Nakamura Y, Kaneko T, Asamizu E, Kato T, Sato S, Tabata S (2002) Structural analysis of a Lotus japonicus genome. II. Sequence features and mapping of sixty-five TAC clones which cover the 6.5-mb regions of the genome. DNA Res 9: 6370[Abstract] Nelson MN, Phan HT, Ellwood SR, Moolhuijzen PM, Hane J, Williams A, O'Lone CE, Fosu-Nyarko J, Scobie M, Cakir M, et al (2006) The first gene-based map of Lupinus angustifolius L.-location of domestication genes and conserved synteny with Medicago truncatula. Theor Appl Genet 113: 225238[CrossRef][ISI][Medline] Perry JA, Wang TL, Welham TJ, Gardner S, Pike JM, Yoshida S, Parniske M (2003) A TILLING reverse genetics tool and a web accessible collection of mutants of the legume Lotus japonicus. Plant Physiol 131: 866871 Phan HT, Ellwood SR, Hane JK, Ford R, Materne M, Oliver RP (2007) Extensive macrosynteny between Medicago truncatula and Lens culinaris ssp. culinaris. Theor Appl Genet 114: 549558[CrossRef][ISI][Medline] Salamov A, Solovyev V (2000) Ab initio gene finding in Drosophila genomic DNA. Genome Res 10: 516522 Sato S, Isobe S, Asamizu E, Ohmido N, Kataoka R, Nakamura Y, Kaneko T, Sakurai N, Okumura K, Klimenko I, et al (2005) Comprehensive structural analysis of the genome of red clover (Trifolium pratense L.). DNA Res 12: 301364 Sato S, Kaneko T, Nakamura Y, Asamizu E, Kato T, Tabata S (2001) Structural analysis of a Lotus japonicus genome. I. Sequence features and mapping of fifty-six TAC clones which cover the 5.4 Mb regions of the genome. DNA Res 8: 311318[Abstract] Stacey G, Libault M, Brechenmacher L, Wan J, May GD (2006) Genetics and functional genomics of legume nodulation. Curr Opin Plant Biol 9: 110121[CrossRef][ISI][Medline] Stracke S, Sato S, Sandal N, Koyama M, Kaneko T, Tabata S, Parniske M (2003) Exploitation of colinear relationships between the genomes of Lotus japonicus, Pisum sativum and Arabidopsis thaliana, for positional cloning of a legume symbiosis gene. Theor Appl Genet 108: 442444[CrossRef][ISI][Medline] Tamasaki M, Tenaillon MI, Bi IV, Schroeder SG, Sanchez-Villeda H, Doebley JF, Gaut BS, McMullenn MD (2005) A large-scale screen for artificial selection in maize identified candidate agronomic loci for domestication and crop improvement. Plant Cell 7: 28592872 Young ND, Cannon SB, Sato S, Kim D, Cook DR, Town CD, Roe BA, Tabata S (2005) Sequencing the genespaces of Medicago truncatula and Lotus japonicus. Plant Physiol 137: 11741181 Zhu H, Choi HK, Cook DR, Shoemaker RC (2005) Bridging model and crop legumes through comparative genomics. Plant Physiol 137: 11891196 Zhu H, Riely BK, Burns NJ, Ane JM (2006) Tracing nonlegume orthologs of legume genes required for nodulation and arbuscular mycorrhizal symbioses. Genetics 172: 24912499 This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ASPB Publications | PLANT PHYSIOLOGY | THE PLANT CELL | |
|---|---|---|---|