|
|
||||||||
|
Plant Physiol, March 2003, Vol. 131, pp. 1104-1123 Mapping the Proteome of Barrel Medic (Medicago truncatula)1,[w]Plant Biology Division, The Samuel Roberts Noble Foundation, P.O. Box 2180, Ardmore, Oklahoma 73402
A survey of six organ-/tissue-specific proteomes of the model legume barrel medic (Medicago truncatula) was performed. Two-dimensional polyacrylamide gel electrophoresis reference maps of protein extracts from leaves, stems, roots, flowers, seed pods, and cell suspension cultures were obtained. Five hundred fifty-one proteins were excised and 304 proteins identified using peptide mass fingerprinting and matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Nanoscale high-performance liquid chromatography coupled with tandem quadrupole time-of-flight mass spectrometry was used to validate marginal matrix-assisted laser desorption ionization time-of-flight mass spectrometry protein identifications. This dataset represents one of the most comprehensive plant proteome projects to date and provides a basis for future proteome comparison of genetic mutants, biotically and abiotically challenged plants, and/or environmentally challenged plants. Technical details concerning peptide mass fingerprinting, database queries, and protein identification success rates in the absence of a sequenced genome are reported and discussed. A summary of the identified proteins and their putative functions are presented. The tissue-specific expression of proteins and the levels of identified proteins are compared with their related transcript abundance as quantified through EST counting. It is estimated that approximately 50% of the proteins appear to be correlated with their corresponding mRNA levels.
Legumes are valuable agricultural
and commercial crops that serve as important nutrient sources for both
humans and animals. For example, alfalfa (Medicago sativa)
is an important forage crop with over 24 million acres planted annually
with an annual U.S. value approaching 6 billion dollars
(U.S. Department of Agriculture-National Agricultural
Statistics Service, 2002 The study of legume biology using many of the agriculturally important
legumes such as soybean (Glycine max) and alfalfa is complicated by the large genome size and complex ploidy of these species. Fortunately, barrel medic (Medicago truncatula) has
a smaller diploid genome that yields more manageable genetics. These traits, along with its autogamous nature, short generation time, and
prolific seed production have made barrel medic a useful model legume
(Barker et al., 1990 The impressive achievements in genome and expressed sequence tag (EST)
sequencing have yielded a wealth of information for many model
organisms, including the plants Arabidopsis and barrel medic.
Unfortunately, sequence information alone is insufficient to answer
questions concerning gene function, developmental/regulatory biology,
and the biochemical kinetics of life. To address these questions, more
comprehensive approaches that include quantitative and qualitative
analyses of gene expression products are necessary at the
transcriptome, proteome, and metabolome levels. Transcriptome approaches using microarray and serial analysis of gene expression technologies are powerful tools; however, mRNA abundances may only
represent putative function because there is still a questionable correlation between mRNA and protein levels (Futcher et al.,
1999 Although there is a substantial amount of work in the literature on
bacterial (Guerreiro et al., 1999 The objective of the present work was to survey the
organ-/tissue-specific proteomes of the model legume barrel medic, to provide an overview of the barrel medic proteome, and to serve as a
basis for future proteome comparisons of genetic mutants, biotically,
abiotically, and/or environmentally challenged plants. The survey was
accomplished using 2-DE to produce reference maps of protein extracts
from leaves, stems, roots, flowers, seed pods, and cell suspension
cultures. MALDI-TOFMS peptide mass fingerprinting was used to identify
304 proteins. HPLC coupled with quadrupole time-of-flight tandem mass
spectrometry (LC/MS/MS) was used to validate marginal MALDI-TOFMS
protein identifications. The identified proteins are discussed and
classified based on putative functions determined through similarity
(Bevan et al., 1998
2-DE Reference Maps and Protein Identifications of Barrel Medic Tissues 2-DE reference maps were obtained for barrel medic leaves, stems, roots, flowers, seed pods, and cell suspension cultures and are provided in Figure 1. To qualitatively survey the proteins visualized by 2-DE, a total of 551 proteins (i.e. approximately 96 arbitrary protein spots per gel including positive molecular mass marker controls and negative gel blank controls) were excised from each of the organ-/tissue-specific Coomassie-stained 2-DE gels and analyzed by mass spectrometry. Typically, high-quality MALDI-TOFMS peptide mass maps were obtained, and representative spectra are provided in Figure 2. Of the 551 protein spots processed, 304 proteins were successfully identified and are listed in Table I.
Supplemental Table I (see www.plantphysiol.org) contains extensive data
that document the analytical rigor of the protein identifications.
These data include an assigned protein spot number (see Fig. 1), an
arbitrary peptide mass fingerprint data quality (PMFQ) score of 1 to 5 (with 5 being best, see "Materials and Methods") to allow
assessment of data quality, the number of peptides matched,
m/z accuracy and SD of
peptides matched, percent protein coverage, theoretical molecular mass
and pI, experimental molecular mass and pI, the database accession
number of the best match and the databases that yielded concurrent
identifications, LC/MS/MS data for select proteins, and the organism to
which the matching protein was identified through similarity. For
protein identifications determined using the SwissProt and National
Center for Biotechnology Information (NCBI) databases, the organism
reported in supplemental Table I is that from which the protein or gene
was directly sequenced. In the case of most ESTs, protein
identifications were first made to barrel medic ESTs that were not
annotated. These ESTs were annotated by comparison with The Institute
for Genomic Research (TIGR) gene indices or through similarity to other
organisms via BLAST. The organism yielding the highest similarity score
is the organism reported for EST database identifications in
Supplemental Table I. Protein function is also classified and recorded
in Supplemental Table I. A minimum of four peptides is statistically
necessary to qualify as a confident match (Pappin et al.,
1993
Database Query Strategies and Success Rates In an attempt to maximize our protein identification success rate
for barrel medic proteins, we have used protein (SwissProt), nucleotide
(NCBI), and EST databases (dbESTothers, and barrel medic-only ESTs from
NCBI) for queries of experimental peptide mass maps (Mann and
Wilm, 1994
The average protein identification success rate for all tissues using only the protein databases (SwissProt and NCBInr) was 25%, whereas the average protein identification success rate for all tissues using the EST database was 46% (see Table II). Interestingly, the average overlap in the number of proteins identified in both databases was only 15%; thus, searching both databases was complementary and not necessarily redundant. For example, the peptide maps provided in Figure 2 are of similar high quality; however, spectra 2b could not be identified successfully in the SwissProt or NCBI databases and could only be identified successfully through EST database queries. This complementary searching strategy yielded a final protein identification success rate of 55% for our representative protein set. Strategies using multiple database queries have enhanced our ability to
identify proteins even in the absence of a genomic sequence. Our
overall success rate of 55% is good when compared with other reports
focused on organisms without sequenced genomes. For example, a recent
publication concerning pea (Pisum sativum) chloroplast proteins reported a success rate of 15% using mass spectrometry and Edman sequencing (Peltier et al.,
2000 The average length of barrel medic ESTs used to successfully identify
proteins in all organ/tissues was 597 ± 177 nucleotides (or
199 ± 59 amino acids). For proteins in the 30-kD range or less,
this represents complete or almost complete sequence coverage by the
EST; thus, our confidence in these identifications is very high. For
larger proteins this only represents partial protein sequence; however,
our data demonstrate that the current EST information is sufficient to
allow confident identifications. Additional experimental data such as
number of peptides matched, m/z accuracy,
molecular mass, and pI provide additional confirmation of
identification. It is logical that a strategy including both protein
and nucleotide databases would yield greater protein identification
rates as some mRNAs, such as mitochondrial and chloroplast-encoded
mRNAs (i.e. Rubisco large subunit), do not contain
poly(A+) tails (Sugiura and Takeda,
2000 Protein Identifications and Functional Classifications Putative protein functional classifications were assigned based on
similarity to better understand the biological processes encompassed by
the proteins identified using a 2-DE proteomics approach. Summaries of
protein functions observed in the barrel medic proteome are provided in
Figure 4. Protein functions were assigned using the protein function database Pfam
(http://www.sanger.ac.uk/Software/Pfam/; Bateman et al.,
2002
Leaves Photosynthetic enzymes dominated the 2-DE profiles of leaf tissue. Approximately 40% of the leaf protein mass visualized with Coomassie staining can be attributed to a small number of enzymes including the large subunit of Rubisco (26.1%), Rubisco small subunit (2.8%), Rubisco activase (3.2%), and oxygen-evolving protein (6.4%). Most of these proteins appear as multiple spots, and the reported percentages are estimates including all identified spots. The relatively high concentrations of the abundant photosynthetic enzymes demonstrate the importance of these enzymes; however, the prominence of these proteins, specifically Rubisco, in specific regions of the gel, generally contributes to lower quality 2-DE gels and prevents the observation of moderate or lower abundance proteins due to their relatively lower concentrations and the limited dynamic range of common 2-DE staining techniques including Coomassie. Other proteins involved in photosynthesis and carbon fixation were observed in leaf, including: PS1 iron-sulfur protein, ATP synthase, glyceraldehyde 3-phosphate dehydrogenase, malate dehydrogenase, triose phosphate isomerase, tartrate dehydrogenase, and Fru biphosphate aldolase. Many of these photosynthetic enzymes were also observed at lower levels in other green tissues such as stems and immature seed pods. Several signal transduction proteins were observed in leaves, including
the multiple domain protein remorin. Remorin binds simple and complex
galacturonide and its C-terminal region has functional similarities to
viral intercellular communication proteins (Reymond et al.,
1996 Stems The 2-DE reference map of barrel medic stem proteins was of better quality than that of leaves, primarily due to a lower abundance of Rubisco. Many of the same photosynthetic and carbon metabolism enzymes reported above for leaf were also identified in stems. In addition, several members of the ATP complex associated with energy metabolism were observed. Proteins involved in protein destination and storage were also identified and included the 26S proteasome AAA-ATPase subunit and a 20S proteasome subunit alpha type 7 protein. The 26S proteasome is responsible for protein degradation of endogenous proteins. Proteins involved in secondary metabolism are of specific interest to our functional genomics project focused on natural products (National Science Foundation Plant Genome Research Project no. 0109732). Several secondary metabolic enzymes were identified in stems and included cinnamoyl-CoA reductase, which plays a role in lignin biosynthesis, and isoflavone reductase-like oxidoreductase, an enzyme involved in phytoalexin production. Stems also revealed several kinases including adenosine kinase, fructokinase, Rib-phosphate pyrophosphokinase, uridylate monophosphate kinase, and nucleoside diphosphate kinase1. A number of RNA binding proteins thought to be important in transcription were also observed. Multiple ribosomal proteins including 40S and 60S ribosomal proteins were identified and function in protein synthesis. Roots The roots of legumes are of special interest because of their role
in the characteristic symbiotic relationships formed with microorganisms. Although recent articles have been published on the
proteomes of barrel medic nodulated root (Bestel-Corre et al.,
2002 Relative to other tissues, a larger percentage (i.e. 15%) of the
barrel medic root proteins were identified as putative proteins or
unannotated proteins. These proteins could be confidently linked to
specific ESTs or predicted open reading frames whose functions are
still unknown. The observation of unannotated proteins provides experimental evidence of putative/predicted proteins that offer exceptional opportunities in gene annotation (Mann and Pandey, 2001 The root 2-DE reference map and protein identifications reported here
are consistent with the previous studies by Mathesius et al.
(2001) Proteins identified in all three investigations include a peroxidase
precursor, cytochrome c oxidase subunit 6, VcCyp (cyclophilin), a
superoxide dismutase, and ABA-responsive protein. Only the
ABA-responsive protein and VcCyp were reported to be constitutively
expressed by Bestel-Corre et al. (2002) Interestingly, two proteins identified in this investigation were not
found in either of the other two studies. Acidic glucanase was observed
as a relatively abundant protein in the present report (rts#39), but
due to its pI of 8.4 and the fact that Mathesius and coworkers' first
dimension immobilized pH gradient (IPG) pH range was 4 to 7, it was not
present on their gels. We also identified three isoforms of chitinase,
all with a pI above 7, that are missing in the Mathesius et al. work.
Bestel-Corre et al. (2002) Overall, these three reports (this report; Mathesius et al.,
2001 Flowers The proteome of flowers contained proteins from almost every functional category. The major portion (38%) of the identified proteins was associated with energy production including glycolysis, pyruvate metabolism, and the tricarbonylic acid (TCA) cycle. Another 21% of the identified proteins were involved with protein synthesis or protein destination. For example, peptidyl prolyl isomerase accelerates protein folding by catalyzing cis-trans isomerization in oligopeptides. Several proteins identified were related to disease/defense or involved in secondary metabolism, such as chalcone isomerase. These enzymes are commonly associated with flower pigmentation or UV protection and serve as important defense proteins in developing seeds. One of the proteins identified specifically in the flower proteome was profilin. Profilin normally binds to monomeric actin to prevent polymerization, although under certain conditions it can promote the polymerization of actin. It occurs in all organs, but is most abundant in mature pollen, making it more likely to be identified in flowers. Many proteins associated with oxidative responses were also identified in flowers. Low levels of a few photosynthetic enzymes were observed due to collection of green sepals with the flowers. Seed Pods The intact seed pod proteome was generated from tissue containing
both seed and pod tissue. The proteins visualized and identified in the
barrel medic seed pod proteome consisted primarily of globulins or seed
storage proteins that serve as a nitrogen/nutritional source for
developing plants. Several members of the superfamily of "cupins"
were identified in barrel medic seed and included 7S and 11S globulins
(Dunwell, 1998 A significant number of disease-/defense-related proteins were observed in seed pods including peroxidases, osmotin, and ABA-responsive protein. These proteins help defend the plant in early stages of development. Other proteins associated with carbon metabolism, nutrient acquisition, and protein syntheses were also observed. These proteins supply necessary nutrients to the developing plant. Several photosynthetic proteins were observed and are attributed to the collection of immature green seed pods. Cell Suspension Cultures Cell suspension cultures were initiated from barrel medic root
calli (Dixon, 1980 In some instances, more than one protein was identified with high
confidence in each protein spot. For example, spot cls#82 contained
peptides that could be associated with both ABA-responsive protein and
leghemoglobin. Interestingly, leghemoglobin was identified as a root
nodule-specific isoform (Gallusci et al., 1991 The proteome of suspension cell cultures is of special interest because the tissue is relatively homogeneous and, therefore, provides a good model tissue system for experiments directed toward integrated functional genomic studies of natural products (https://www.fastlane.nsf.gov/servlet/showaward?award=0109732). Future work will focus on generation of an extensive 2-DE proteome reference map of suspension cell cultures and the changes in the proteome after biotic and abiotic elicitation. Tissue-/Organ-Specific Expression of Proteins Many of the proteins identified were redundant as an average of 61% were identified in one or more tissues of barrel medic. The remaining 39% were identified in only one tissue and have the potential of being uniquely expressed in specific tissues/organs based on our limited dataset. The quantities of redundant and potentially unique proteins identified in each specific tissue are summarized in Figure 5. Many of the putative unique proteins are related to the primary function of the specific tissue. For example, photosynthetic enzymes such as PSI iron-sulfur protein and plastid specific ribosomal proteins were only identified in leaves. Other proteins identified only in a specific tissue include the seed storage proteins glycinin, convicilin, and legumin in seed pods. Profilin, a known pollen allergen, was also identified in flowers. These are limited examples illustrating the unique nature of the proteome, but we are hopeful that continued evaluation of the tissue- and organelle-specific proteomes of barrel medic will yield further insight into the specialized functionality of these tissues.
Comparison of Barrel Medic Proteome and Transcriptome A better understanding of the relationship between mRNA and
protein abundances is needed to elucidate the processes and regulation of transcription and translation. Several recent publications present conflicting views concerning the correlation of mRNA and protein levels. Gygi et al. (1999) Given the large abundance of EST information for barrel medic
(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi/), a simple
comparison of identified protein levels with their corresponding mRNA
levels was performed. Currently, over 145,000 EST sequences from
approximately 20 different non-subtractive, non-normalized (J. White,
TIGR, personal communication) cDNA libraries are available
(Covitz et al., 1998 Although the proteins were arbitrarily chosen across pI and molecular mass ranges, most represent relatively abundant proteins typical of 2-DE and CBB 250 staining. Based on the 2-DE protein quantification results presented here, 67% of the identified proteins were in the top 100 most abundant proteins visualized with Coomassie, whereas 97% of the proteins identified were in the top 200 most abundant proteins. Thus, identified proteins were compared with the top 200 most abundant tissue-specific ESTs in related cDNA libraries. The percentages of the identified proteins observed by 2-DE that were also observed in the top tissue specific ESTs are summarized in Table III. This summary reveals that an average of 50% of the identified proteins were observed in the top 200 tissue-specific ESTs. An evaluation of the top 100 tissue-specific ESTs shows that 40% of proteins identified in 2-DE experiments were also observed in the 100 most abundant tissue-specific ESTs. These results suggest a moderate level of correlation between mRNA and protein. For example, leaf proteins such as the photosynthetic enzymes Rubisco small subunit and oxygen-evolving protein appear to be highly correlated with their respective mRNA levels.
Interestingly, some highly expressed proteins such as Rubisco large
subunit were not observed in the EST libraries. As mentioned earlier,
we believe that this is due to the chloroplast-encoded nature of
certain mRNAs, such as Rubisco large subunit, which do not contain
poly(A+) tails necessary for purification and
cDNA library preparation (Sambrook et al.,
1989 Highly abundant leaf ESTs not represented in the protein data to
date included aquaporins, chlorophyll-binding proteins, and cytochrome
B6. This apparent lack of correlation can be explained by the integral
thylakoid membrane nature of these proteins. It is commonly accepted
that integral membrane proteins are underrepresented in 2-DE due to
poor solubilization. Lipoxygenase also appeared in the top 100 clones
of five tissue-specific EST libraries; however, it was never identified
in the protein dataset. Plants express both cytosolic and chloroplast
isoforms of lipoxygenase, most of which have a molecular mass of
approximately 100 kD. A possible explanation for the absence of this
protein from the protein data could be the inherent discrimination
against high-molecular mass proteins encountered during isoelectric
focusing using IPG strips of fixed gel composition
(Candiano et al., 2002 The lack of correlation between mRNA and protein could not always be
explained. For example, identified stem proteins included acid
phosphatase, actin, and osmotin; however, these proteins were absent or
of very low abundance in the stem-specific EST library. Other proteins
identified but not represented in the EST libraries included:
RNA-binding protein and ankyrin repeat protein in flowers and
hydroxyacyl glutathione hydrolase in roots. Interestingly, elongation
factor 1-alpha was observed as a highly expressed EST (top 50) in all
tissues but was not observed in the protein set. The lack of
correlation may be due to the relative turnover rates of both
transcripts and proteins, or translational controls such as codon bias
(Gygi et al., 1999 Based on the limited comparison above, we estimate a moderate 50%
correlation between protein and mRNA levels. This value suggests a
correlation that is higher than that reported by Gygi et al.
(1999)
To date, we have identified over 300 proteins in specific tissues
of barrel medic. Protein identifications using only protein databases
were 25% successful even with good peptide mass fingerprints. Significant increases in protein identification success rates were
achieved by using EST sequence databases. Using complementary protein,
nucleotide, and EST sequence libraries, we were able to achieve a
protein identification success rate of 55% for our representative
protein dataset. We consider this a relatively high success rate in the
absence of a genomic sequence and in comparison with other plant
proteomic projects. Tentative consensus searches currently are being
performed and confirm many of the proposed identifications in this
study (Asirvatham et al., 2002b The 2-DE profiles of various barrel medic tissues provide reference maps for future proteomic comparisons of genetic mutants, biotically and abiotically challenged plants, and/or environmentally challenged plants. The identified proteins provide a survey of those proteins observable using current technology and also serve to define the limitations of the reported proteomics approach. For example, it will be difficult to study other physiological processes besides photosynthesis and carbon metabolism in leaves using current proteomic technologies due to the very high level of these proteins in leaves. Further, the proteins identified serve as physiological markers of tissue-specific protein expression. Based on the limited dataset, 39% of all the identified proteins were only identified in a single tissue. These putative unique proteins provide valuable insight into the specialized physiological function of each of the tissues. For example, a comparison of roots and root-derived cell cultures can yield insights into the physiological phenomena associated with the dedifferentiation of root tissue during establishment of a suspension cell culture. A comparison between the levels of the identified proteins and mRNA levels quantified through EST counting was performed. It is estimated that on average 50% of the proteins appear to be correlated with their corresponding mRNA levels; conversely, 50% are not. Information on both transcript and protein levels can be utilized for targeting potential regulatory genes that are characterized by high transcript but low protein levels. The proteins identified in this study as unclear or putative represent unique opportunities to probe molecular function. Systematic perturbations and monitoring of these proteins would be expected to yield insight into function. These abundant but unclassified proteins have been linked to specific ESTs and, thus, establish the feasibility to experimentally monitor both the protein and mRNA. The relatively high abundance of these proteins further stresses the biological but unknown importance of these proteins in barrel medic. This report provides a comprehensive overview of the barrel medic proteome and provides a good foundation for future comparative proteomic efforts associated with this important model plant. The importance of barrel medic is further emphasized by the recent recommendation from the National Academy of Sciences that the goals of the National Plant Genome Initiative for 2003 through 2008 should focus on a small number of key species including barrel medic (http://books.nap.edu/books/0309085292/html/index.html). This work serves as a major step in this direction for a key plant species. As we seek to better understand gene function and to study the holistic biology of systems, it is inevitable that we study the proteome.
Plant Material and Protein Extraction Differentiated plant tissues were collected from barrel medic
(Medicago truncatula cv Jemalong A17) grown in an
environmentally controlled growth chamber and maintained under standard
conditions (Asirvatham et al., 2002a Cell cultures derived from barrel medic cv Jemalong A17 roots were
grown in the dark in shaker flasks and suspended in Schenk and
Hildebrandt (SH) medium with transfer to fresh medium every 2 weeks.
Cells were harvested 4 d after transfer, washed once with fresh SH
medium and once with SH:water (1:1 [v/v]), ground in liquid
N2, and extracted with 40 mM Tris (pH 9.5), 50 mM MgCl2, 2% (w/v)
polyvinylpolypyrrolidone, 1 mM phenylmethylsulfonyl
fluoride, and 120 units mL Protein Quantification and Electrophoresis Protein concentrations of all tissue extracts were
quantified using the Bradford method (Bradford, 1976 Digestions and MALDI-TOFMS Protein spots were excised from the gel, washed twice with water
for 15 min, and destained with a 1:1 (v/v) solution of
acetonitrile and 50 mM ammonium bicarbonate while changing
solutions every 30 min until the blue color of Coomassie was removed.
2-DE gel spots were then dehydrated by washing twice with 100%
acetonitrile and dried by vacuum centrifugation. Gel plugs were
rehydrated with a solution of 10 ng µL Database Queries and Protein Identifications The peptide mass fingerprints were compared with sequences in: (a) NCBInr database (release January 1, 2002), (b) SwissProt database (release January 1, 2002), and/or (c) dbESTothers (NCBI; release January 1, 2002), (d) and/or a subset of dbESTothers (NCBI) consisting of approximately 145,000 barrel medic EST sequences, dated November 15, 2001, and queried using MS-Fit (http://prospector.ucsf.edu) in an automated mode using Proteomic Solutions 1 software from Applied Biosystems (Foster City, CA). Mass spectra were de-isotoped, baseline corrected, and threshold adjusted before database searching. Database searches were performed using a 100-ppm mass accuracy with a minimum requirement of four peptide matches from a submission list of typically 30 peptides. The maximum number of missed cleavages was set at one. The only user-defined modification specified was carbamidomethylation of Cys; however, the software default considered possible modifications of N-terminal Gln to pyro-Glu, oxidation of Met, and protein N terminus acetylation. When peptide mass fingerprints were matched to sequences in the EST databases, functional information was obtained by BLASTX (NCBI; http://www.ncbi.nlm.nih.gov/BLAST/) of the sequence or reference of the clone identifier to the barrel medic gene index (MtGI; http://www.tigr.org/tdb/mtgi/). The theoretical molecular mass and pI of the identified protein were then calculated using GPMAW (Lighthouse data) and compared with the experimental molecular mass calculated from the digitized 2-DE images. Protein identifications were evaluated on the basis of multiple variables including the number of peptides matched, mass error (m/z accuracy), percent coverage of the matched protein with 10% of the full-length protein set as the minimum value, quality of the peptide maps, intensity of the matched peaks (18%-20% minimum), similarity of experimental and theoretical protein molecular masses and pIs, and species from which the sequence was matched. For EST matches, the percent coverage was calculated by dividing the number of matched amino acids by the total number of amino acids in the protein sequence returned from the BLASTX or MtGI searches. LC/MS/MS Select digest mixtures were analyzed by nanoscale HPLC coupled with LC/MS/MS. Data were obtained using an ABI QSTAR Pulsar (Applied Biosystems) hybrid quadrupole time-of-flight mass spectrometer. The instrument m/z was calibrated with standards supplied by the manufacturer. Separated peptides were introduced into the mass spectrometer from an HPLC system equipped with an autosampler (LC Packings, San Francisco). Separations were achieved using an LC Packings nanoscale pepmap column (15 cm × 75 µm i.d., 3 µm, 100 Å, C18) and a linear binary gradient (solvent A was 1% [v/v] formic acid in 95%:5% [v/v] water:acetonitrile, whereas solvent B was a 0.8% [v/v] formic acid in 5%:95% [v/v] water:acetonitrile). The linear gradient was 95% (w/v) A:5% (w/v) B (0 min) to 60% (w/v) A:40% (w/v) B over 33 min, then ramped to 5% (w/v) A:95% (w/v) B at 37 min and held at 5% (w/v) A:95% (w/v) B until 42 min, where it was returned to 95% (w/v) A:5% (w/v) B 48 min and allowed to reequilibrate to 95% (w/v) A:5% (w/v) B 60 min. Nanoscale-ESI was performed using a Protona interface and nanoelectrospray needles (silver-coated glass capillary, New Objective, Woburn, MA). Mass spectra datasets were searched against NCBInr, SwissProt, dbESTothers, and mtEST databases using Mascot (http://www.matrixscience.com). The search results were validated as described for the peptide mass fingerprint results. EST Counting and Protein Relative Abundance Estimates Barrel medic ESTs were extracted from dbEST (http://www.ncbi.nlm.nih.gov/, accessed November 4, 2001). ESTs were assembled into tentative consensus sequences by TIGR to generate the barrel medic gene index (MtGI, http://www.tigr.org/tdb/tgi.shtml). The MtGI release of September 7, 2001 was used to count the occurrence of barrel medic genes in six different EST datasets including leaf (one cDNA library of developing leaf, 7,831 ESTs), stem (one library of developing stem, 10,314 ESTs), root (three libraries of uninoculated root, 6,593 ESTs), flower (one library of developing flower, 3,404 ESTs), seed pod (one library of developing seed and one library of developing pod, 4,587 ESTs), and cell suspensions (one library of elicited cell suspensions, 8,926 ESTs). The barrel medic genes were then sorted in the descending order on their EST counts for each dataset and used in the comparison with proteomic data. Protein abundances were calculated using the normalized spot volume of
each protein determined with HT Analyzer software (Genomic Solutions)
as previously reported (Asirvatham et al.,
2002a
We thank Dr. Richard Dixon for scientific discussion and editorial comments. We thank Drs. Zhentian Lei and Aaron Elmer for their assistance in performing LC/MS/MS analyses.
Received December 11, 2002; returned for revision December 24, 2002; accepted January 3, 2003. 1 This work was supported by the Samuel Roberts Noble Foundation and by the National Science Foundation (Plant Genome Research Project no. 0109732).
[w] The online version of this article contains Web-only data. The supplemental material is available at www.plantphysiol.org.
* Corresponding author; e-mail lwsumner{at}noble.org; fax 580-224-6692.
Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.102.019034.
|