|
|
||||||||
|
First published online May 8, 2008; 10.1104/pp.107.115535 Plant Physiology 147:1004-1016 (2008) © 2008 American Society of Plant Biologists OPEN ACCESS ARTICLE
CressExpress: A Tool For Large-Scale Mining of Expression Data from Arabidopsis1,[W],[OA]Section on Statistical Genetics, Department of Biostatistics, University of Alabama, Birmingham, Alabama 35294 (V.S., G.P.P., T.M, I.C.); and University of North Carolina at Charlotte, Bioinformatics Research Center, North Carolina Research Campus, Kannapolis, North Carolina 28081 (A.E.L.)
CressExpress is a user-friendly, online, coexpression analysis tool for Arabidopsis (Arabidopsis thaliana) microarray expression data that computes patterns of correlated expression between user-entered query genes and the rest of the genes in the genome. Unlike other coexpression tools, CressExpress allows characterization of tissue-specific coexpression networks through user-driven filtering of input data based on sample tissue type. CressExpress also performs pathway-level coexpression analysis on each set of query genes, identifying and ranking genes based on their common connections with two or more query genes. This allows identification of novel candidates for involvement in common processes and functions represented by the query group. Users launch experiments using an easy-to-use Web-based interface and then receive the full complement of results, along with a record of tool settings and parameters, via an e-mail link to the CressExpress Web site. Data sets featured in CressExpress are strictly versioned and include expression data from MAS5, GCRMA, and RMA array processing algorithms. To demonstrate applications for CressExpress, we present coexpression analyses of cellulose synthase genes, indolic glucosinolate biosynthesis, and flowering. We show that subselecting sample types produces a richer network for genes involved in flowering in Arabidopsis. CressExpress provides direct access to expression values via an easy-to-use URL-based Web service, allowing users to determine quickly if their query genes are coexpressed with each other and likely to yield informative pathway-level coexpression results. The tool is available at http://www.cressexpress.org.
Availability of abundant, high-quality data sets from microarray expression experiments has stimulated rapid progress in gene networks analysis for a variety of plant and animal species (Stuart et al., 2003 Many applications of this idea utilize variations of Pearson's correlation coefficient and linear regression to quantify coexpression relationships. Figure 1 presents an example scatter plot that illustrates the idea. Each point on the plot represents data from one array; x and y coordinates represent expression values for genes indicated on the horizontal and vertical axes, respectively. In this case, there is a strong positive relationship between the two genes' expression values; when one gene's expression is high, the other gene's expression is also high. Computing a linear regression between expression values for the two genes quantifies the strength of this relationship. This yields an r2 value, equivalent to the square of Pearson's correlation coefficient, and a P value that expresses the probability of obtaining the observed r2 value (or larger) assuming a random relationship between the two variables. The regression also yields a slope, which indicates the direction of the coexpression relationship. Larger r2 and smaller P values signal higher confidence in the coexpression relationship.
Altogether, these numbers summarize how closely two genes are coexpressed across multiple array experiments and provide a way for experimenters to identify and quantify relationships between genes. When these numbers are available for a large number of genes, they can be used to build coexpression graphs, or networks, in which highly correlated genes (nodes) are linked and less well-correlated genes are not. Studies analyzing coexpression networks have demonstrated that genes connected in the coexpression network often perform related functions, thus demonstrating biological relevance of the approach (for review, see Aoki et al., 2007 A number of groups have established Web-based interfaces for mining precomputed coexpression results and coexpression networks in Arabidopsis (Arabidopsis thaliana). To our knowledge, none offers ways to recompute the coexpression networks using subsets of experiments and arrays, perhaps because of technical challenges involved. Recomputing correlation between a query gene and all the genes in the genome is computationally intensive and cannot easily be done in real-time during a user's visit to a Web site. However, the coexpression networks arising from different inputs may vary greatly, depending on sample or tissue type. We address this problem by developing an easy-to-use Web tool (CressExpress) that allows users to select distinct tissue types and experiments to include in an analysis, which then executes the calculations offline. When the analysis finishes, users receive an e-mail linking to a zip file on the CressExpress site that contains a complete package of results, along with a record of all parameters and samples used as inputs to the experiment. By providing a complete report of results and inputs, CressExpress makes it convenient for users to integrate coexpression analysis into their research workflow.
CressExpress is an easy-to-use Web-based tool that allows researchers to set up and run coexpression analysis experiments using a variety of different data sets and sample types. To set up and run an experiment, users enter query identifiers and analysis parameters on the CressExpress Web site located at http://www.cressexpress.org. To begin an analysis, users click the "Run the Tool" link and then proceed through a series of screens (Fig. 2 ) that offer users the opportunity to vary quality control (QC) parameters, specify data release and array platforms, and select subsets of sample types to include in analysis. This latter feature can be particularly important for query genes that exert their effects in a tissue- or developmentally restricted fashion. At each step, a "help" icon links to a page describing the various options and how they would likely affect the analysis results. CressExpress also provides reasonable default choices so that users can easily perform pilot studies and quickly learn how the tool operates.
In step one, users choose a data release of expression values that will be used in the analysis. Currently, there are four data release options, each one representing different array collections and array processing methods (Table I ). Each release contains expression data harvested from the Nottingham Arabidopsis Stock Center (NASC) AffyWatch subscription service (Craigon et al., 2004
Step one also features an option to configure a QC setting for individual arrays. This QC setting is based on a Kolmogorov-Smirnov (KS) test of deleted residuals, which is described in detail elsewhere (Persson et al., 2005
In step two, users enter a list of up to 50 queries, using Arabidopsis Genome Initiative (AGI) gene names or probe set names to identify genes. To map AGI gene names onto probe set names, CressExpress uses the probe set-to-gene id annotations provided by The Arabidopsis Information Resource (TAIR). However, because these mappings are problematic in some cases (Cui and Loraine, 2006 In steps three and four, users choose sample types (step three) and experiments (step four) to include in the regression analysis for their query genes. In step three, CressExpress builds a list of sample types for arrays that met the QC and array type criteria specified in previous steps. Users can select all sample types (the default) or subsets of sample types from a menu listing, where the wording for each item on the list comes from the text provided by NASC. Because NASC obtains the textual description of sample types from experimenters who generated and submitted the original data, the sample type descriptions may use a variety of different terms meaning the same thing. Therefore, we advise users to read the entire list when attempting to limit their analyses to specific tissue types, because different text may have similar meanings. For example, users wishing to examine coexpression networks of flowering organs might select sample types labeled "flowers" and "flower buds" as well as "inflorescence." The default option for steps three and four is to use all available sample types. This default setting is mainly for the convenience of users wishing to run quick pilot experiments and become familiar with the tool and how it operates. To take full advantage of the CressExpress tool, we recommend that users choose sample types where the query gene products are expected to be expressed or active. For example, users interested in investigating the coexpression network surrounding genes involved in flowering should choose sample types that include flowers. Similarly, users interested in investigating pathways involved in photosynthesis should choose sample types derived from green shoots and leaves.
To demonstrate the effects of sample type filtering, Figure 4
presents a diagram showing output from a coexpression experiment in which 185 flowering-related genes were compared to each other. Each connection in the network represents an r2 value
In step five, users may configure a pathway-level coexpression (PLC) analysis, which identifies genes that are coexpressed with multiple query genes. Seen from the point of view of the larger coexpression network, PLC analysis identifies query genes' common neighbors, such that each neighbor is connected to two or more members of the query group. We previously used PLC analysis to identify candidate genes involved in metabolic pathways and cell wall biosynthesis, and, in general, we have found that genes identified as coexpressed not just with a single gene but with ensembles of genes acting together are often the best and more successful candidates to test (Persson et al., 2005
Interpreting and using the PLC functionality and its results requires understanding how the PLC algorithm operates, and so we describe it in detail here. The PLC method as implemented in CressExpress operates as described previously, with some differences in how results are ranked (Wei et al., 2006
The PLC method is most useful and relevant when at least two of the query genes are coexpressed with each other at the given PLC r2 threshold. One way to determine the best cutoff for determining coexpression in PLC is to examine the correlations between the query genes that are expected ahead of time to be coexpressed and then choose an r2 threshold that is lower than the smallest pairwise r2 between them. If this is done, then the CressExpress PLC will recover all genes that are coexpressed with the queries at least as well as any two query genes are coexpressed with each other. To find out the threshold r2 between queries, users can run the tool once, examine the list of coexpressed genes for each query individually to find out the cross-query r2 values, and then rerun the tool with a new PLC r2 threshold that is smaller than the lowest cross-query pair. Another method for finding the correlations between queries would be to use the CressExpress expression data direct access method in conjunction with a statistical analysis environment like R (R Development Core Team, 2008 The final step (step six) asks the user to enter a comma-separated list of one or more e-mail addresses that will receive e-mails reporting on the status of the analysis. Upon successful completion of the analysis, each address receives a "Job Completion" message containing a link to a compressed, archive folder (a zip file) stored temporarily on the CressExpress server. The zip file contains the full complement of analysis results as well as a record of all parameters used in the experiment (Table II ). CressExpress generates results files (in csv format) for each query gene, in which regression results comparing the query gene to all probe sets on the selected array are reported. The spreadsheet files are named after the query gene's matching probe set and can be loaded directly into Excel or any other program capable of reading comma-separated format files. Each row of data represents the results from a single linear regression comparing the query gene against another gene on the designated array and includes the linear regression P value, r2 value, and slope, along with the brief description of the target gene. These descriptions come from TAIR and are provided for the convenience of users as they scan results searching for interesting patterns in the types of genes that are most highly coexpressed with their queries.
The PLC results files include csv files reporting coexpressed neighbor genes and corresponding Web pages with hyperlinks to TAIR, allowing for rapid review of results. In addition, the PLC analysis generates a network file (coexp.sif) together with companion node and edge attribute files suitable for loading and visualization in Cytoscape, a popular network analysis and visualization tool (Shannon et al., 2003
Because CressExpress distributes data in bulk as one zip file, users typically find it relatively easy to track and store results from CressExpress analysis runs using their preferred electronic data archiving scheme. For example, users who incorporate results from CressExpress in published work might prefer to distribute the original zip file as part of supplementary data files on journal or lab Web sites. The CressExpress design philosophy is that each run of the CressExpress tool is a self-contained experiment and should not only be easy to repeat but also should be easy to record and incorporate into users' research workflow. By providing complete records of results and experimental parameters, CressExpress aims to make it easier for users to manage and track their in silico coexpression experiments.
CressExpress offers direct access to precomputed expression data via a simple URL-based method in which users access expression values for specific probe sets by encoding the requests as Web addresses. Using the direct access approach, users can retrieve expression values for individual genes and arrays, save the values to local files, or import them into directly into Web-enabled desktop programs like R or TableView (Johnson et al., 2003
Example Analysis: Cellulose Synthase Enzymes and Cell Wall Biosynthesis Following the procedures described above, we instigated a CressExpress experiment using the six primary and secondary cell wall genes (Table III ) as queries and a PLC r2 cutoff 0.36. Figure 5 shows a screen capture from the Cytoscape network visualization tool depicting the six query genes and their PLC-identified neighbors. We find that the secondary and primary cell wall CESA genes are coexpressed with different, nonoverlapping groups. Of the genes linked with secondary cell wall CESA genes (Fig. 5B), at least 17 have been investigated experimentally and found to exhibit secondary cell wall-related phenotypes and functions (Table IV ), while many more are annotated as having predicted functions related to carbohydrate synthesis or cell wall functions (Supplemental Data S1). This example illustrates how one might use CressExpress to tease apart the different functions of genes that share considerable similarity at the sequence level but which may play distinct roles in the plant body. In this case, it was already known that CESA4, CESA7, and CESA8 perform a different role from CESA1, CESA3, and CESA6, and we found that the coexpression analysis tends to confirm this view, because the two groups are coexpressed with nonoverlapping groups of genes, as determined by the PLC analysis. The same argument may be made for other closely related genes, potentially yielding new hypotheses regarding gene function even for members of closely related gene families.
Example Analysis: Glucosinolate Biosynthesis from Trp Glucosinolates are nitrogen- and sulfur-containing secondary metabolites that are derived from several different amino acids in plants, including Arabidopsis and many agriculturally important Brassicaceae species (Grubb and Abel, 2006
We used the AraCyc database hosted at the TAIR Web site to look up AGI codes for genes associated with the indolic glucosinolates pathway (Zhang et al., 2005
The PLC analysis revealed 155 different genes that are coexpressed (at r2 0.35) with two or more of the glucosinolate pathway genes; of these, seven are coexpressed with all six. We then used the BioMoby literature aggregator tool (LitRep, http://mips.gsf.de/proj/planet/araws/litRepSearch.html) to search for articles referencing this and the other top seven coexpressed genes (Table VI
). One gene (AT1G18590) has already been shown to play a role in glucosinolate biosynthesis (Piotrowski et al., 2004
We developed CressExpress, an easy-to-use, freely available Web-based tool that allows users to run coexpression analysis experiments using biologically relevant subsets of expression data harvested from public domain Affymetrix Arabidopsis expression microarrays. For each experiment, the tool recomputes coexpression relationships by performing a linear regression comparing each user-entered query gene to all other genes represented in the expression data, using a user-selected subset of experiments in the database. Computing the linear regressions typically requires several minutes, and so individual analyses are performed offline as distinct jobs. All analysis software executes on a server remote from the user's desktop, and users set up experiments using their Web browser by proceeding through a series of steps in which they enter query genes, choose QC parameters, and specify sample types to include in the analysis. At each point where a choice must be made, CressExpress provides links to help pages describing how the different parameters may affect results and also supplies reasonable defaults for users wanting to run preliminary pilot experiments. Thus, to operate the tool, users need only be able to operate a Web browser and have access to an e-mail account. When the experiment job completes, users receive an e-mail containing a link to a compressed zip file on the server containing results and output files.
Several groups have developed online, Web-based analysis tools that offer a variety of methods and approaches for analyzing and visualizing publicly available Arabidopsis expression data (Zimmermann et al., 2004
Another distinctive feature of CressExpress is that it provides comprehensive analysis results in formats aimed at facilitating downstream data-mining and analysis. After a run of the tool, users receive a set of simple, comma-separated plain text file for each query gene. Each of these files lists the r2, slope, and P values for each linear regression between the query and all possible target genes, along with a short textual description of each target to aid users as they explore the results. In general, CressExpress aims to provide data in ways that allow researchers to visualize and explore results using desktop visualization programs and analysis tools, such as Excel, TableView (Johnson et al., 2003
In the future, we plan to add several new features to CressExpress, focusing on three major goals: facilitating comparative coexpression analysis across species, making sample selection easier, and repackaging the software to allow easy deployment at other sites. To streamline array selection (step three), we plan to add a feature that will let users select samples based on their Plant Ontology term annotations as they become available (Avraham et al., 2008
Array Informatics CD media containing expression data were obtained from the NASC AffyWatch subscription service. Each CD contained CEL files with "raw" expression data, grouped into folders named for the investigator who contributed the data. Upon receipt of each CD, XML files describing each experiment were harvested from the NASC site, using the "passthru" parameter as described on the Web page located at http://arabidopsis.info/bioinformatics/narraysxml/index.html. Slide names associated with each experiment id were harvested from the XML files by extracting the content of "NASC:Name" tags. For about one-half of the CEL files, we were able to use CEL file names and slide names to connect slides with their corresponding CEL files and thus capture in our database the experimental group affiliation for each CEL file. For the remainder, NASC supplied (via e-mail) Excel spreadsheets that report the correspondence between CEL file names and NASC slide names. The mapping between slide name and CEL file names were also manually checked; cases where the names seemed to disagree or contradict each other were resolved via e-mail correspondence with NASC. The mapping is supplied as part of the direct access method.
CEL file processing steps, including background correction, probe set summarization, and normalization procedures, were done using functions in the Bioconductor "affy" package suite of tools (Gautier et al., 2004
The coexpression tool is based on large-scale linear regression analysis of expression values between genes of interest and the rest of the genes on a selected array using the methodology described previously (Persson et al., 2005
The following materials are available in the online version of this article.
The authors thank the Alabama Supercomputer Center for providing computational assistance with array processing steps, Jim Johnson for help with the TableView software, the School of Public Health MITS team for computer systems support, and Mikako Kawai for Web page and graphic design. We are particularly grateful for the staff of the NASC for their generosity and help with matching slide and CEL file names. Last, we thank the anonymous reviewers for their excellent comments on the manuscript. Received January 18, 2008; accepted April 29, 2008; published May 8, 2008.
1 This work was supported by the University of Alabama Center for Nutrient-Gene Interaction, by the National Institutes of Health National Cancer Institute (grant no. U54CA100949), and by the National Science Foundation (grant no. 061012 and Plant Genome award nos. 0217651 and 0501890). The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Ann E. Loraine (aloraine{at}uncc.edu).
[W] The online version of this article contains Web-only data.
[OA] Open Access articles can be viewed online without a subscription. www.plantphysiol.org/cgi/doi/10.1104/pp.107.115535 * Corresponding author; e-mail aloraine{at}uncc.edu.
Aoki K, Ogata Y, Shibata D (2007) Approaches for extracting practical information from gene co-expression networks in plant biology. Plant Cell Physiol 48: 381–390 Avraham S, Tung CW, Ilic K, Jaiswal P, Kellogg EA, McCouch S, Pujar A, Reiser L, Rhee SY, Sachs MM, et al (2008) The Plant Ontology Database: a community resource for plant structure and developmental stages controlled vocabulary and annotations. Nucleic Acids Res 36: D449–D454 Bechtold U, Murphy DJ, Mullineaux PM (2004) Arabidopsis peptide methionine sulfoxide reductase2 prevents cellular oxidative damage in long nights. Plant Cell 16: 908–919 Bender J, Fink GR (1998) A Myb homologue, ATR1, activates tryptophan gene expression in Arabidopsis. Proc Natl Acad Sci USA 95: 5655–5660 Brown DM, Zeef LA, Ellis J, Goodacre R, Turner SR (2005) Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 17: 2281–2295 Chandler J, Wilson A, Dean C (1996) Arabidopsis mutants showing an altered response to vernalization. Plant J 10: 637–644[CrossRef][ISI][Medline] Craigon DJ, James N, Okyere J, Higgins J, Jotham J, May S (2004) NASCArrays: a repository for microarray data generated by NASC's transcriptomics service. Nucleic Acids Res 32: D575–D577 Cui X, Loraine AE (2006) Global correlation analysis between redundant probe sets using a large collection of Arabidopsis ATH1 expression profiling data. In LSS Computational Systems Bioinformatics. World Scientific, Stanford, CA, pp 223–227 Ekman DR, Lorenz WW, Przybyla AE, Wolfe NL, Dean JF (2003) SAGE analysis of transcriptome responses in Arabidopsis roots exposed to 2,4,6-trinitrotoluene. Plant Physiol 133: 1397–1406 Gachon CM, Langlois-Meurinne M, Henry Y, Saindrenan P (2005) Transcriptional co-regulation of secondary metabolism enzymes in Arabidopsis: functional and evolutionary implications. Plant Mol Biol 58: 229–245[CrossRef][ISI][Medline] Gautier L, Cope L, Bolstad BM, Irizarry RA (2004) affy: Analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20: 307–315 Grubb CD, Abel S (2006) Glucosinolate metabolism and its control. Trends Plant Sci 11: 89–100[CrossRef][ISI][Medline] Halkier BA, Gershenzon J (2006) Biology and biochemistry of glucosinolates. Annu Rev Plant Biol 57: 303–333[CrossRef][Medline] Hirai MY, Sugiyama K, Sawada Y, Tohge T, Obayashi T, Suzuki A, Araki R, Sakurai N, Suzuki H, Aoki K, et al (2007) Omics-based identification of Arabidopsis Myb transcription factors regulating aliphatic glucosinolate biosynthesis. Proc Natl Acad Sci USA 104: 6478–6483 Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4: 249–264[Abstract] Johnson JE, Stromvik MV, Silverstein KA, Crow JA, Shoop E, Retzel EF (2003) TableView: portable genomic data visualization. Bioinformatics 19: 1292–1293 Kiyosue T, Yamaguchi-Shinozaki K, Shinozaki K (1993) Characterization of two cDNAs (ERD11 and ERD13) for dehydration-inducible genes that encode putative glutathione S-transferases in Arabidopsis thaliana L. FEBS Lett 335: 189–192[CrossRef][ISI][Medline] Ko JH, Beers EP, Han KH (2006) Global comparative transcriptome analysis identifies gene network regulating secondary xylem development in Arabidopsis thaliana. Mol Genet Genomics 276: 517–531[CrossRef][Medline] Levy YY, Mesnage S, Mylne JS, Gendall AR, Dean C (2002) Multiple roles of Arabidopsis VRN1 in vernalization and flowering time control. Science 297: 243–246 Lillig CH, Schiffmann S, Berndt C, Berken A, Tischka R, Schwenn JD (2001) Molecular and catalytic properties of Arabidopsis thaliana adenylyl sulfate (APS)-kinase. Arch Biochem Biophys 392: 303–310[CrossRef][ISI][Medline] Loeffler C, Berger S, Guy A, Durand T, Bringmann G, Dreyer M, von Rad U, Durner J, Mueller MJ (2005) B1-phytoprostanes trigger plant defense and detoxification responses. Plant Physiol 137: 328–340 Manfield IW, Jen CH, Pinney JW, Michalopoulos I, Bradford JR, Gilmartin PM, Westhead DR (2006) Arabidopsis Co-expression Tool (ACT): web server tools for microarray-based gene expression analysis. Nucleic Acids Res 34: W504–W509 Niyogi KK, Fink GR (1992) Two anthranilate synthase genes in Arabidopsis: defense-related regulation of the tryptophan pathway. Plant Cell 4: 721–733 Obayashi T, Kinoshita K, Nakai K, Shibaoka M, Hayashi S, Saeki M, Shibata D, Saito K, Ohta H (2007) ATTED-II: a database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis. Nucleic Acids Res 35: D863–D869[CrossRef][ISI][Medline] Persson S, Wei H, Milne J, Page GP, Somerville CR (2005) Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets. Proc Natl Acad Sci USA 102: 8633–8638 Piotrowski M, Schemenewitz A, Lopukhina A, Muller A, Janowitz T, Weiler EW, Oecking C (2004) Desulfoglucosinolate sulfotransferases from Arabidopsis thaliana catalyze the final step in the biosynthesis of the glucosinolate core structure. J Biol Chem 279: 50717–50725 R Development Core Team (2008) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna Redman JC, Haas BJ, Tanimoto G, Town CD (2004) Development and evaluation of an Arabidopsis whole genome Affymetrix probe array. Plant J 38: 545–561[CrossRef][ISI][Medline] Saito K, Hirai MY, Yonekura-Sakakibara K (2008) Decoding genes with coexpression networks and metabolomics: majority report by precogs. Trends Plant Sci 13: 36–43[CrossRef][ISI][Medline] Sawa S, Demura T, Horiguchi G, Kubo M, Fukuda H (2005) The ATE genes are responsible for repression of transdifferentiation into xylem cells in Arabidopsis. Plant Physiol 137: 141–148 Schuhegger R, Rauhut T, Glawischnig E (2007) Regulatory variability of camalexin biosynthesis. J Plant Physiol 164: 636–644[CrossRef][ISI][Medline] Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13: 2498–2504 Stuart JM, Segal E, Koller D, Kim SK (2003) A gene-coexpression network for global discovery of conserved genetic modules. Science 302: 249–255 Trivedi P, Edwards JW, Wang J, Gadbury GL, Srinivasasainagendra V, Zakharkin SO, Kim K, Mehta T, Brand JP, Patki A, et al (2005) HDBStat!: a platform-independent software suite for statistical analysis of high dimensional biology data. BMC Bioinformatics 6: 86[CrossRef][Medline] Ubeda-Tomas S, Edvardsson E, Eland C, Singh SK, Zadik D, Aspeborg H, Gorzsas A, Teeri TT, Sundberg B, Persson P, et al (2007) Genomic-assisted identification of genes involved in secondary growth in Arabidopsis utilising transcript profiling of poplar wood-forming tissues. Physiol Plant 129: 415–428[CrossRef] Wagner U, Edwards R, Dixon DP, Mauch F (2002) Probing the diversity of the Arabidopsis glutathione S-transferase gene family. Plant Mol Biol 49: 515–532[CrossRef][ISI][Medline] Wei H, Persson S, Mehta T, Srinivasasainagendra V, Chen L, Page GP, Somerville C, Loraine A (2006) Transcriptional coordination of the metabolic network in Arabidopsis. Plant Physiol 142: 762–774 Wilkinson M, Schoof H, Ernst R, Haase D (2005) BioMOBY successfully integrates distributed heterogeneous bioinformatics Web Services. The PlaNet exemplar case. Plant Physiol 138: 5–17 Wille A, Zimmermann P, Vranova E, Furholz A, Laule O, Bleuler S, Hennig L, Prelic A, von Rohr P, Thiele L, et al (2004) Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana. Genome Biol 5: R92[CrossRef][Medline] Zhang P, Foerster H, Tissier CP, Mueller L, Paley S, Karp PD, Rhee SY (2005) MetaCyc and AraCyc. Metabolic pathway databases for plant research. Plant Physiol 138: 27–37 Zhong W, Sternberg PW (2006) Genome-wide prediction of C. elegans genetic interactions. Science 311: 1481–1484 Zimmermann P, Hirsch-Hoffmann M, Hennig L, Gruissem W (2004) GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox. Plant Physiol 136: 2621–2632
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP |