|
|
||||||||
|
Plant Physiology 139:5-17 (2005) © 2005 American Society of Plant Biologists Genome-Wide Identification and Testing of Superior Reference Genes for Transcript Normalization in Arabidopsis1,[w]Max-Planck Institute of Molecular Plant Physiology, 14476 Potsdam, Germany
Gene transcripts with invariant abundance during development and in the face of environmental stimuli are essential reference points for accurate gene expression analyses, such as RNA gel-blot analysis or quantitative reverse transcription-polymerase chain reaction (PCR). An exceptionally large set of data from Affymetrix ATH1 whole-genome GeneChip studies provided the means to identify a new generation of reference genes with very stable expression levels in the model plant species Arabidopsis (Arabidopsis thaliana). Hundreds of Arabidopsis genes were found that outperform traditional reference genes in terms of expression stability throughout development and under a range of environmental conditions. Most of these were expressed at much lower levels than traditional reference genes, making them very suitable for normalization of gene expression over a wide range of transcript levels. Specific and efficient primers were developed for 22 genes and tested on a diverse set of 20 cDNA samples. Quantitative reverse transcription-PCR confirmed superior expression stability and lower absolute expression levels for many of these genes, including genes encoding a protein phosphatase 2A subunit, a coatomer subunit, and an ubiquitin-conjugating enzyme. The developed PCR primers or hybridization probes for the novel reference genes will enable better normalization and quantification of transcript levels in Arabidopsis in the future.
Transcripts of stably expressed genes are crucial internal references for normalization of gene expression data. This is especially the case for quantitative reverse transcription (qRT)-PCR studies, which are growing in importance as a means to validate data from whole-genome oligonucleotide arrays and as a primary source of expression data for smaller sets of genes (Czechowski et al., 2004 (EF-1 ), polyubiquitin (UBQ), actin (ACT), and -tubulin and -tubulin (TUA and TUB, respectively) genes (Goidin et al., 2001 et al., 2004
To determine which reference gene(s) is best suited for transcript normalization in a given subset of biological samples, statistical algorithms like geNORM or BestKeeper have been developed (Vandesompele et al., 2002
To investigate the stability of expression of commonly used reference genes in more depth and to identify novel and superior reference genes, it is necessary to have gene expression data for many (ideally all) expressed genes of an organism from as many different organs and experimental conditions as possible. Tomato (Lycopersicon esculentum) expressed sequence tag (EST) libraries have previously been used for this purpose (Coker and Davies, 2003
In this study, we complemented the publicly available AtGenExpress data (http://web.uni-frankfurt.de/fb15/botanik/mcb/AFGN/atgenex.htm) with our own ATH1 data from a nutrient stress series (nitrogen, carbon, sulfur, and phosphorus deprivation of wild-type seedlings and time courses after readdition of the respective nutrient; Scheible et al., 2004
Identification of Very Stably Expressed Arabidopsis Genes from ATH1 Array Data ATH1 gene probe sets showing very stable hybridization signals were extracted from the different experimental series represented in the AtGenExpress database (http://web.uni-frankfurt.de/fb15/botanik/mcb/AFGN/atgenex.htm; http://arabidopsis.org/info/expression/ATGenExpress.jsp) and from our own nutrient stress and diurnal cycle series in the following way. First, the mean expression value (MV), SD, and the ratio SD/MV (i.e. coefficient of variation, CV) were calculated for each gene in each given experimental series (e.g. developmental series or shoot abiotic stress series; see Supplemental Table I). Second, gene probe sets with at least 80% of "Present" calls, as attributed by Affymetrix Microarray Suite 5 (MAS5) software, within an experimental series were sorted according to their CV value. The 100 genes with the lowest CV values in each series are presented in Supplemental Table I, while Table I displays a selection of five stably expressed genes for the different series.
Within the diurnal series, 96 of the 100 genes with the lowest CV values had MVs <30, putting them among the 20% of lowest-expressed genes. Since the CV was very low (<0.060) for >700 genes in this particular experimental series, 100 of these genes with the highest MVs were chosen to generate a list of both high- and low-expressed, diurnally invariable genes (Supplemental Table I). MAS5-processed files with Present/Absent calls were not available to us for the light, hormone, and biotic stress arrays at the time of the analysis (but they are now available from the Nottingham Arabidopsis Stock Centre; http://affymetrix.arabidopsis.info). To avoid identifying genes with Absent calls as potential reference genes, we considered only the 14,000 genes (approximately 60%) with the highest MVs. This percentage is slightly lower than the percentage of genes typically expressed (called Present by MAS5 software) in an entire Arabidopsis seedling or its major organs (F. Wagner, RZPD Berlin, personal communication; Czechowski et al., 2004 Comparison of the CV values for the chosen Top100, i.e. the most stably expressed genes from the different experimental series (Table I; Supplemental Table I), revealed that they were highest for the complete developmental series (0.1410.200; 237 ATH1 arrays), which presumably reflects the diversity of plant organs, ages, and genotypes included (see http://www.weigelworld.org/resources/microarray/AtGenExpress). In contrast, all other experimental series were obtained with plant material that was more homogenous with respect to tissue type, developmental stage, and genetic background (see Supplemental Fig. 6). The developmental series was analyzed further to identify whether specific types of samples were responsible for the higher CV (see supplemental text and Supplemental Figs. 24). Pollen samples and, to a lesser extent, seed samples had a marked influence on the CV, whereas root and flower samples did not. This indicates that pollen and seeds have transcriptomes that are very different from other tissues.
Some of the Affymetrix probe sets (labeled s_at in Table I) are not gene specific, but rather have perfect matches to two or more related genes (Redman et al., 2004
After the identification of stably expressed genes from the ATH1 data, an important aim was to compare the traditional and novel reference genes. Figure 1A shows the developmental expression patterns of five traditional reference genes: ACT2, TUB6, EF-1
The developmental expression patterns and CV values of additional TUB, ACT, and UBQ genes (with >80% Present calls) are shown in Supplemental Figure 5. Whereas UBQ10 appears to be the most constitutively expressed polyubiquitin gene, neither ACT2 nor TUB6 appeared to be the most stably expressed gene in its family. Other proposed reference genes and their homologs were also investigated with respect to their CV values, including TUA, histone, phosphoglyceratekinase, cyclophilin, and EF-Tu genes, and eIF4a (AT1G54270). Only eIF4a and two of the 30 cyclophilin genes (At2g16600, At4g33060) had developmental CVs (0.268, 0.266, and 0.233, respectively) that were close to those of EF-1 , UBQ10, or GAPDH. Figure 1B depicts the expression patterns of the five "novel" genes with the lowest CVs in the developmental series (compare Supplemental Table I). All five genes are clearly more stably expressed than the traditional reference genes in Figure 1A. The TIP41-like gene At4g34270 (black line) had the lowest CV (0.141) among the five genes in Figure 1B, and the value further decreased to 0.116 after omission of the pollen sample (no. 73; compare Supplemental Table I). Interestingly, two of the five genes shown in Figure 1B encode subunits of Ser/Thr protein phosphatase 2A (PP2A). At1g13320 encodes a 65-kD regulatory subunit, and At1g59830 encodes a catalytic subunit for which two splice variants are annotated at The Arabidopsis Information Resource (TAIR; www.arabidopsis.org) yielding 30- and 35-kD proteins. Moreover, At1g10430 (encoding another catalytic subunit of PP2A) and At2g39840 (encoding a catalytic subunit of protein phosphatase 1) are also represented among the Top100 genes of the developmental series, and At3g25800 (encoding another regulatory PP2A subunit) is represented among the Top100 genes of the root abiotic stress and hormone series (Supplemental Table I). Besides its appearance in the developmental Top100 list(s), At1g13320 is also represented in the Top100 lists of the shoot and the root abiotic stress series, as well as the light series (Table I; Supplemental Tables I and II), and was placed within the Top500 in the hormone, biotic, and nutrient stress series (see Supplemental Table II). In the diurnal series, the gene was not placed among the Top500, although the CV was reasonably low (0.134). In this case, it should be noted that many genes had very low CV values in this series. Altogether, At1g13320 appears to be a very stably expressed gene in the Arabidopsis genome (see Supplemental Fig. 6).
Another important difference between the novel reference genes identified by our analysis and more traditional controls are their absolute expression levels. While EF-1 Complete expression profiles, including all available experimental series, for the genes shown in Figure 1, for At5g09810 (ACT7, formerly called ACT2), two other reference genes (i.e. At5g44200/CBP20; At5g25760/UBC) qPCR primers that are marketed by Sigma-Aldrich, and for several other genes selected from Supplemental Table I are presented in Supplemental Figure 6. These genes were selected because they appeared on Top100 lists or extended Top500 lists of two or more experimental series (see selection criteria in Supplemental Table II) and because most displayed much lower absolute expression on ATH1 arrays than traditional reference genes (Table II). Overall selection was biased toward genes that are stably expressed in the developmental series because this series is the most complex one, as mentioned above.
Gene-Stability Measure and Ranking of Reference Genes Using geNORM
Vandesompele et al. (2002) We used this method as an alternative approach to validate and rank the traditional and novel reference genes, using ATH1 data from the developmental series (Fig. 2; Supplemental Fig. 6). The average expression stability value (M) for most traditional genes (black bars in Fig. 2) was considerably higher (i.e. expression is less stable) than for most of the novel reference genes identified by our approach (white bars). Once again, the polyubiquitin genes At5g25760 (UBC) and At4g05320 (UBQ10) showed the most stable expression within the subset of traditional genes with their M values being comparable to those of most of the novel reference genes (Fig. 2). The novel gene with the highest M value, At1g62930, was chosen because it was represented in three Top100 lists (see Supplemental Table II), although none of these were related to development. Hence, it is not really surprising that it performed less well than other genes as a reference for normalization of developmental expression data. There was good agreement between the average expression stability (M) value and the calculated CV value for this set of genes: Linear regression analysis yielded a correlation coefficient (R2) of 0.91. When the geNORM algorithm was used on combined ATH1 data from all experimental series (Supplemental Fig. 7), the picture remained the same: Novel genes generally had a lower M value, i.e. more stable expression, than traditional genes, with the exception of UBQ10 (At4g05320).
Validation of Traditional and Novel Reference Genes by qRT-PCR
Affymetrix ATH1 chips have become a "gold standard" for Arabidopsis transcriptome analysis. However, for hybridization-based technologies, like Affymetrix chips, there is not a strict linear relationship between signal strength and transcript amount for different genes, as there is for qRT-PCR (Holland, 2002
A diverse set of 20 cDNA pools was synthesized as templates for RT-PCR. Total RNA from the different plant samples (Supplemental Table III) was isolated, DNase I digested, and reverse transcribed, with various quality controls (Fig. 3). To enable quantitative and qualitative assessment of the RT reaction, an equal amount of DNase I-digested total RNA from each sample was spiked with 30 pg cRNA of a foreign gene Lotus japonicus leghemoglobin-2 (LjLb2) before the RT reaction. Subsequently, first-strand cDNAs were analyzed for (1) equal amplification of LjLb2 cDNA and (2) the 5'/3' amplification ratio of GAPDH [(1 + EGAPDH5')CT GAPDH5'/(1 + EGAPDH3')CT GAPDH3'; approximately 2
PCR primers for 20 novel and five traditional reference genes were designed and tested in qRT-PCR reactions (see Supplemental Table II). To ensure maximum specificity and efficiency during PCR amplification of cDNA under a standard set of reaction conditions, primers were required to have melting temperatures (TM) of 60°C ± 1°C and to amplify short products, usually around 60 to 70 bp (see Supplemental Table II). Typically primers were also designed to give an amplicon located close to the 3' end of the transcript of interest (see Supplemental Table II), which should allow better amplification especially for cDNAs with GAPDH 5'/3' ratios <<1. In addition, when possible at least one primer of a pair was also designed to cover an exon-exon junction. Care was taken that the primers encompass all known transcript splice variants.
The specificity of PCR primers was tested using the 20 first-strand cDNAs described above (see Supplemental Table III and "Materials and Methods"). Twenty-two of the 25 primer pairs yielded unique PCR amplicons from our cDNA samples. The primers for UBQ10 did not produce amplicons from the two hormone-treatment cDNA pools, which were produced from Arabidopsis accession C24 (see Supplemental Table III). The primer pairs for three hypothetical genes, At1g47770, At2g07190, and At3g32260, yielded no amplicons from any cDNA sample, (Table II; Supplemental Table II). The 22 remaining primer pairs all amplified single PCR products of the expected size from the various cDNA pools, as shown by gel electrophoresis and melting-curve analyses performed by the PCR machine after 40 amplification cycles (Fig. 4, AD; Supplemental Figs. 8 and 9). A more stringent test of the specificity of PCR was performed by sequencing the products of four novel constitutive genes (Supplemental Fig. 10) and three products from two traditional reference genes (UBQ10, 5' and 3' amplicons of GAPDH; data not shown). In all seven cases, the sequence of the PCR product matched that of the intended target cDNA, thereby confirming the exquisite PCR specificity of the developed primer pairs (see also Czechowski et al., 2004
The number of cycles needed to reach a given fluorescence intensity during qPCR depends not only on the amount of cDNA in the extract but also on the amplification efficiency (E). In the ideal case, when the amount of cDNA is doubled in each reaction cycle, E = 1. PCR efficiency can be estimated with various methods (see http://www.gene-quantification.info/). The classical method uses threshold cycle (CT) values obtained from a series of template dilutions (Pfaffl, 2001
Figure 4, E to J, displays the 60 real-time PCR amplification plots for LjLb2 (Fig. 4E) and each of five reference genes. They were obtained with specific primers (see Supplemental Table II) for two traditional genes, UBQ10 and EF-1
The amplification curves for each gene were generally grouped closely together across all treatments and almost as close as those for LjLb2, the external reference. The stable expression of these genes inferred from the ATH1 data was therefore verified by our qRT-PCR experiments. An exception was EF-1
To analyze the expression stability M for the genes investigated by qRT-PCR, we used geNORM v.3.4 software (Vandesompele et al., 2002
Gene expression databases for Arabidopsis and other organisms are important resources to investigate the expression pattern of a given gene or gene families, to identify genes that respond to specific stimuli, and to search for coexpressed genes (http://www.ncbi.nlm.nih.gov/geo/; http://www.ebi.ac.uk/arrayexpress/; Steinhauser et al., 2004
Although the scope of the AtGenExpress database is great, it is certainly not all encompassing. Hence, it is possible that genes other than those identified here may exist that are better references for normalizing gene expression under special conditions. For instance, the developmental series contains data from various organs at different stages of development, but information about specific tissue or cell types is lacking. It is likely that the transcriptome of each cell type is very different from that of other cell types, and tissues and organs as a whole. Therefore, care is required in selecting reference genes for specific comparisons. As pointed out by Radoni
Traditional reference genes were chosen in the pre-genomic era based on their known or suspected housekeeping roles in basic cellular processes (e.g. protein translation, ubiquitin-dependent protein degradation), cell structure maintenance (e.g. cytoskeleton), or primary metabolism (e.g. glycolysis), and comprise a small number of gene families (see "Results"). Our study revealed that the polyubiquitin gene family contains very stably expressed members, and several stably expressed genes involved in the ubiquitin/26S proteasome pathway (Smalle and Vierstra, 2004
All genes with moderate or higher ATH1 mean expression in our chosen subset (Table II) were successfully validated and confirmed by qRT-PCR with respect to their high expression stability (Fig. 5) as well as their expression level. As pointed out before, there was good agreement between expression values calculated from the qRT-PCR data and the developmental ATH1 data with a linear correlation coefficient R2 = 0.88. This indicates that it should be straightforward to successfully validate by qRT-PCR also others of the many potential reference genes with moderate or higher ATH1 expression listed in Supplemental Table I. In contrast, for three hypothetical genes (Table II) that showed stable expression but had very low ATH1 MVs, no transcripts could be detected using qRT-PCR, even when a second primer pair was tested for each (data not shown). One possible explanation for this failure could be incorrect electronic gene model predictions, which make it difficult to design functional PCR primers. Another possibility is that these are pseudogenes and are not expressed, which is supported by the lack of any reported cDNAs, ESTs, or Multiple Parallel Signature Sequencing signatures in public databases. Thus, it is probable that the ATH1 expression values for these genes and the high percentage (80%) of Present calls initially used to filter the data arose by unspecific hybridization of the probe set. Another two genes in our chosen subset (At1g62390 and At4g38070) also had very low expression (Table II) and turned out to yield M values similar to those of traditional reference genes (Fig. 5). Again, this might indicate that low ATH1 expression values are not always reliable indicators for true gene expression, and shows that validation of stable gene expression for such genes (Supplemental Table I) is critical before they are used as references for sensitive and accurate gene expression by qRT-PCR. Still, these two genes and, in particular, AT5G12240 and AT5G15710 (Table II) demonstrate that it is perfectly possible to extract from ATH1 data and validate by qRT-PCR genes that have high expression stability but expression levels 1,000- to 10,000-fold lower than UBQ10, EF-1
We have shown here that it is possible to identify excellent reference genes from large collections of comprehensive transcriptome data obtained from DNA-array hybridization studies. However, care must be taken that hybridization data is truly gene-specific for the genes of interest. While this appears to be the case for most probe sets on the Arabidopsis ATH1 arrays, it is not true for all sets. For instance, the probe set 247644_s_at detects four EF-1 In conclusion, this analysis revealed that hundreds of genes in the Arabidopsis genome are more stably expressed and at lower levels than traditional reference genes. This was confirmed for a subset of genes by qRT-PCR analysis. The gene-specific primer pairs developed here for novel reference genes will enable more accurate normalization and quantification of small- and medium-scale gene expression studies in Arabidopsis by qRT-PCR in the future. Probes to these reference genes will also aid normalization of transcripts using other methods, such as RNA gel-blot analysis. Finally, orthologs of these novel reference genes could serve the same purposes in other species.
ATH1 Data Mining for Stably Expressed Genes
Affymetrix CEL files from each experimental series (see Table I; 721 ATH1 CEL files in total representing 323 experimental conditions) were processed using RMA software (Bolstad et al., 2003
Gene models, including information on exon/intron structure, 3' and 5' untranslated regions, and known splice variants for all investigated reference genes were downloaded from sequence viewer (SeqViewer) at TAIR (http://www.arabidopsis.org/). To facilitate RT-PCR measurement of transcripts of all investigated genes under a standard set of reaction conditions, qPCR primers were designed on these sequences using PrimerExpress 2.0 software (Applied Biosystems) and the following criteria: TM of 60°C ± 1°C and PCR amplicon lengths of 60 to 150 bp, yielding primer sequences with lengths of 20 to 30 nucleotides and guanine-cytosine contents of 35% to 55%. Primers were also designed to amplify close to the annotated 3' end of the transcripts, to encompass all known splice variants, and at least one primer of a pair was designed to cover an exon-exon junction if possible (see Supplemental Table II). The specificity of the resulting primer pair sequences was checked against the Arabidopsis (Arabidopsis thaliana) transcript database using TAIR BLAST (http://www.arabidopsis.org/Blast/). Specificity of the primer amplicons was checked by melting-curve analysis performed by the PCR machine after 40 amplification cycles and by gel-electrophoretic analysis. To that effect, primer amplicons were resolved on 4% (w/v) agarose gels (3:1 HR agarose; Amresco) run at 4 V cm1 in Tris-borate/EDTA buffer, along with a 50-bp DNA-standard ladder (Invitrogen GmbH). Identity of the short PCR products was checked by direct sequencing at AGOWA.
Arabidopsis (Col-0) wild-type plants were grown under long-day conditions (16-h day/8-h night) on GS90 soil (Gebr. Patzer). The following tissue samples/organs were harvested at the given time after sowing: rosette leaves and shoot apices (4 weeks); cauline leaves and stems (5 weeks); flowers, green siliques, and old rosette leaves (6 weeks); and mature seeds (8 weeks). Shoot and root samples were also harvested from 14-d-old Arabidopsis (Col-0) wild-type plants that were grown vertically on half-strength Murashige and Skoog medium (Murashige and Skoog, 1962
Total RNA from most of the samples was isolated using TRIZOL reagent (Invitrogen GmbH) as described (http://www.Arabidopsis.org/info/2010_projects/comp_proj/AFGC/RevisedAFGC/site2RnaL). RNA from hormone-treated plant materials was prepared using the Invisorb Spin Plant RNA mini kit (Invitek GmbH), according to the manufacturer's protocol. RNA samples from green siliques and seeds were prepared using a "hot borate" method (Wan and Wilkins, 1994
One microgram of pSPORT1 plasmid containing the Lotus japonicus LjLb2 full-length cDNA sequence (GenBank accession no. BI416412) was linearized by BamHI digestion, phenol/chloroform treated, ethanol precipitated, and in vitro transcribed using T7 RNA polymerase with Ambion's mMESSAGE kit (catalog no. 1340), according to the manufacturer's instructions. Concentration of the cRNA was measured using a NanoDrop ND-1000 spectrophotometer (NanoDrop Technologies), and the presence of a discrete 700-bp LjLb2 cRNA band was checked on an Agilent-2100 Bioanalyzer using RNA 6000 NanoChips (Agilent Technologies).
RNA concentration and integrity were measured before and after DNase I digestion with a NanoDrop ND-1000 UV-Vis spectrophotometer (NanoDrop Technologies) and an Agilent-2100 Bioanalyzer using RNA 6000 NanoChips (Agilent Technologies). Altogether 150 µg of total RNA were digested with Turbo DNA-free DNase I (Ambion) according to the manufacturer's instructions. Absence of genomic DNA contamination in DNase I-treated samples was tested by PCR using primers (5'-TTTTTTGCCCCCTTCGAATC-3' and 5'-ATCTTCCGCCACCACATTGTAC-3') designed to amplify an intron sequence of a reference gene (At5g65080), and primers (5'-ACTTTCATCAGCCGTTTTGA-3' and 5'-ACGATTGGTTGAATATCATCAG-3') designed to amplify a 633-bp genomic fragment of the ACT2 gene (At3g18780). DNase I-treated RNA was subsequently spiked with 30 pg of LjLb2 cRNA template, and RT reactions were performed with SuperScript III reverse transcriptase (Invitrogen GmbH), according to the manufacturer's instructions. Amplification from LjLb2 cDNA was achieved with primers 5'-TTCGCGGTGGTTAAAGAAGC-3' and 5'-TCCATTTGTCCCCAACTGCT-3' (primer efficiency 108% ± 2%, amplicon length 61 bp, distance from annotated transcript 3' end 212 bp). The 5'/3' ratio of GAPDH cDNA [(1 + EGAPDH5')CT GAPDH5'/(1 + EGAPDH3')CT GAPDH3'; approximately 2
PCR reactions were performed in an optical 384-well plate with an ABI PRISM 7900 HT sequence detection system (Applied Biosystems), using SYBR Green to monitor dsDNA synthesis. Reactions contained 3 µL 2x SYBR Green Master Mix reagent (Applied Biosystems), 1 µL of cDNA (1 ng/µL), and 200 nM of each gene-specific primer in a final volume of 6 µL. A master mix of sufficient cDNA and 2x SYBR Green reagent was prepared prior to dispensing into individual wells, to reduce pipetting errors and to ensure that each reaction contained an equal amount of cDNA. The following standard thermal profile was used for all PCR reactions: 50°C for 2 min, 95°C for 10 min, 40 cycles of 95°C for 15 s, and 60°C for 1 min. Amplicon dissociation curves, i.e. melting curves, were recorded after cycle 40 by heating from 60°C to 95°C with a ramp speed of 1.9°C min1. Data were analyzed using the SDS 2.2.1 software (Applied Biosystems). To generate a baseline-subtracted plot of the logarithmic increase in fluorescence signal (
To analyze gene expression stability, we used geNORM v.3.4 software (Vandesompele et al., 2002
We thank Lutz Nover, Detlef Weigel, and all other researchers that contributed to the Deutsche Forschungsgemeinschaft-, GARNET-, RIKEN-, and National Science Foundation-funded AtGenExpress dataset, and who agreed to make the data freely available to the research community prior to publication. We are also grateful to Thomas Ott at Max-Planck Institute for Molecular Plant Physiology (MPI-MPP) for providing the LjLb2 cDNA clone, to Florian Wagner and his team at RZPD Berlin (German Resource Center for Genome Research, Berlin) for expert Affymetrix array service, including all steps from total RNA to data acquisition, and to Rajendra Bari, Monika Bielecka, Dirk Hincha, Tomasz Kobylko, Janina Lisso, Rosa-Maria Morcuende, Ana-Silvia Nita, Daniel Osuna, Armin Schlereth, and Wenming Zheng at MPI-MPP for donations of total RNA samples. Received April 4, 2005; returned for revision May 9, 2005; accepted June 2, 2005.
1 This work was supported by the Max-Planck Society and the Bundesministerium für Bildung und Forschung-funded project GABI Verbund Arabidopsis III Gauntlets (Carbon and Nutrient Signaling: Test Systems, and Metabolite and Transcript Profiles; 0312277A).
[w] The online version of this article contains Web-only data. www.plantphysiol.org/cgi/doi/10.1104/pp.105.063743. * Corresponding author; e-mail scheible{at}mpimp-golm.mpg.de; fax 493315678136.
Altmann T, Weigel D, Nover L (2004) AtGenExpressEin multinational koordiniertes Programm zur Erforschung des Arabidopsis Transkriptoms. GenomXpress 3: 1314
Andersen CL, Jensen JK, Orntoft TF (2004) Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Cancer Res 64: 52455250
Bolstad BM, Irizarry RA, Astrand M, Speed TP (2003) A comparison of normalization methods for high density oligonucleotide array data based on bias and variance. Bioinformatics 19: 185193 Brunner AM, Yakovlev IA, Strauss SH (2004) Validating internal controls for quantitative plant gene expression studies. BMC Plant Biol 4: 14[CrossRef][Medline] Bustin SA (2002) Quantification of mRNA using real-time reverse transcription PCR RT-PCR: trends and problems. J Mol Endocrinol 29: 2329[Abstract]
Charrier B, Champion A, Henry Y, Kreis M (2002) Expression profiling of the whole Arabidopsis shaggy-like kinase multigene family by real-time reverse transcriptase-polymerase chain reaction. Plant Physiol 130: 577590 Coker JS, Davies E (2003) Selection of candidate housekeeping controls in tomato plants using EST data. Biotechniques 35: 740748[ISI][Medline] Czechowski T, Bari RP, Stitt M, Scheible W-R, Udvardi MK (2004) Real-time RT-PCR profiling of over 1400 Arabidopsis transcription factors: unprecedented sensitivity reveals novel root- and shoot-specific genes. Plant J 38: 366379[CrossRef][ISI][Medline] Dheda K, Huggett JF, Bustin SA, Johnson MA, Rook G, Zumla A (2004) Validation of housekeeping genes for normalizing RNA expression in real-time PCR. Biotechniques 37: 112119[ISI][Medline]
Gachon C, Mingam A, Charrier B (2004) Real-time PCR: what relevance to plant studies? J Exp Bot 55: 14451454
Goidin D, Mamessier A, Staquet M-J, Schmitt D, Berthier-Vergnes O (2001) Ribosomal 18S RNA prevails over glyceraldehyde-3-phosphate dehydrogenase and
Holland MJ (2002) Transcript abundance in yeast varies over six orders of magnitude. J Biol Chem 277: 1436314366 Kim B-R, Nam H-Y, Kim S-U, Kim S-I, Chang Y-J (2003) Normalization of reverse transcription quantitative-PCR with housekeeping genes in rice. Biotechnol Lett 25: 18691872[CrossRef][ISI][Medline] Lee PD, Sladek R, Greenwood CMT, Hudson TJ (2001) Control genes and variability: absence of ubiquitous reference transcripts in diverse mammalian expression studies. Genome Res 12: 292297
Lurin C, Andres C, Aubourg S, Bellaoui M, Bitton F, Bruyere C, Caboche M, Debast C, Gualberto J, Hoffmann B, et al (2004) Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis. Plant Cell 16: 20892103 Murashige T, Skoog F (1962) A revised medium for rapid growth and bioassays with tobacco tissue culture. Physiol Plant 15: 473497[CrossRef]
Pfaffl MW (2001) A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res 29: e45 Pfaffl MW, Tichopad A, Prgomet C, Neuvians TP (2004) Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeperExcel-based tool using pair-wise correlations. Biotechnol Lett 26: 509515[CrossRef][ISI][Medline]
Radoni Ramakers C, Ruijter JM, Deprez RH, Moorman AF (2003) Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci Lett 13: 6266 Redman JC, Haas BJ, Tanimoto G, Town CD (2004) Development and evaluation of an Arabidopsis whole genome Affymetrix probe array. Plant J 38: 545561[CrossRef][ISI][Medline] Sanderfoot AA, Raikhel NV (2002) The secretory system of Arabidopsis. In CR Somerville, EM Meyerowitz, eds, The Arabidopsis Book. American Society of Plant Biologists, Rockville, MD, doi/10.1199/tab.0098, http://www.aspb.org/publications/arabidopsis/
Scheible W-R, Morcuende R, Czechowski T, Fritz C, Osuna D, Palacios-Rojas N, Schindelasch D, Thimm O, Udvardi MK, Stitt M (2004) Genome-wide reprogramming of primary and secondary metabolism, protein synthesis, cellular growth processes, and the regulatory infrastructure of Arabidopsis in response to nitrogen. Plant Physiol 136: 24832499 Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Schölkopf B, Weigel D, Lohmann J (2005) A gene expression map of Arabidopsis development. Nat Genet 37: 501506[CrossRef][ISI][Medline]
Shen L, Gong J, Caldo RA, Nettleton D, Cook D, Wise RP, Dickerson JA (2005) BarleyBasean expression profiling database for plant genomics. Nucleic Acids Res 33: D614D618 Smalle J, Vierstra RD (2004) The ubiquitin 26s proteasome proteolytic pathway. Annu Rev Plant Biol 55: 555590[CrossRef][Medline]
Steinhauser D, Usadel B, Luedemann A, Thimm O, Kopka J (2004) CSB.DB: a comprehensive systems-biology database. Bioinformatics 20: 36473651 Suzuki T, Higgins PJ, Crawford DR (2000) Control selection for RNA quantitation. Biotechniques 29: 332337[ISI][Medline] Thellin O, Zorzi W, Lakaye B, De Borman B, Coumans B, Henne G, Grisar T, Igout A, Heinen E (1999) Housekeeping genes as internal standards: use and limits. J Biotechnol 75: 197200 Thimm O, Bläsing O, Gibon Y, Nagel A, Meyer S, Krüger P, Selbig J, Müller LA, Rhee SY, Stitt M (2004) MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J 37: 914939[CrossRef][ISI][Medline] Vandesompele J, De Preter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, Speleman F (2002) Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol 37: RESEARCH0034 Wan C-Y, Wilkins TA (1994) A modified hot-borate method significantly enhances the yield of high-quality RNA from cotton Gossypium hirsutum L. Anal Biochem 223: 712[CrossRef][ISI][Medline]
Zimmermann P, Hirsch-Hoffmann M, Hennig L, Gruissem W (2004) GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox. Plant Physiol 136: 26212632 This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||