First published online February 13, 2003; 10.1104/pp.102.014928
Plant Physiol, March 2003, Vol. 131, pp. 1313-1326
Whole-Genome Comparison of Leucine-Rich Repeat Extensins in
Arabidopsis and Rice. A Conserved Family of Cell Wall Proteins Form a
Vegetative and a Reproductive Clade1,[w]
Nicolas
Baumberger,2
Brigitte
Doesseger,
Romain
Guyot,
Anouck
Diet,
Ronald L.
Parsons,
Mark A.
Clark,
M.P.
Simmons,
Patricia
Bedinger,
Stephen A.
Goff,
Christoph
Ringli, and
Beat
Keller*
Institute of Plant Biology, University of Zurich, CH-8008 Zurich,
Switzerland (N.B., B.D., R.G., A.D., C.R., B.K.); Department of
Biology, Colorado State University, Fort Collins, Colorado 80523-1878
(R.L.P., M.A.C., M.P.S., P.B.); and Torrey Mesa Institute, Syngenta,
3115 Meryfield Row, San Diego, California 92121 (S.A.G.)
 |
ABSTRACT |
We have searched the Arabidopsis and rice (Oryza
sativa) genomes for homologs of LRX1, an
Arabidopsis gene encoding a novel type of cell wall protein
containing a leucine-rich repeat (LRR) and an extensin domain. Eleven
and eight LRX (LRR/EXTENSIN) genes have
been identified in these two plant species, respectively. The
LRX gene family encodes proteins characterized by a
short N-terminal domain, a domain with 10 LRRs, a cysteine-rich motif, and a variable C-terminal extensin-like domain. Phylogenetic analysis performed on the conserved domains indicates the existence of two major
clades of LRX proteins that arose before the eudicot/monocot divergence
and then diversified independently in each lineage. In Arabidopsis,
gene expression studies by northern hybridization and
promoter::uidA fusions showed that the two
phylogenetic clades represent a specialization into "reproductive"
and "vegetative" LRXs. The four Arabidopsis genes of
the "reproductive" clade are specifically expressed in pollen,
whereas the seven "vegetative" genes are predominantly expressed in
various sporophytic tissues. This separation into two expression
classes is also supported by previous studies on maize (Zea
mays) and tomato (Lycopersicon esculentum)
LRX homologs and by information on available rice ESTs.
The strong conservation of the amino acids responsible for the putative
recognition specificity of the LRR domain throughout the family
suggests that the LRX proteins interact with similar ligands.
 |
INTRODUCTION |
With the completion of the
Arabidopsis genome sequence, it became clear that many Arabidopsis
genes are members of multigene families. Although this had already been
suggested by the analysis of expressed sequence tag (EST) databases and
by classical gene searches, the availability of the full gene set of a
plant provides the unique opportunity to get a complete inventory of
all the members of a gene family. Among the 25,500 genes predicted in the Arabidopsis genome, 65% are members of a multigene family and 37%
belong to families of more than five members (Arabidopsis Genome
Initiative, 2000 ). Although the predicted total
gene number of Arabidopsis is significantly larger than that of
other sequenced multicellular eukaryotes such as
Caenorhabditis elegans (19,000; C.
elegans Sequencing Consortium, 1998 ) or
Drosophila melanogaster (13,600; Adams et al.,
2000 ), the absolute number of gene families and singletons
(11,601 in Arabidopsis) is comparable in all these organisms
(Arabidopsis Genome Initiative, 2000 ). This indicates that frequent gene duplications and consequently large gene families are a distinctive feature of the Arabidopsis genome, and possibly of
all plant genomes. With the recent publication of a high-quality draft
from two different subspecies, rice (Oryza sativa) is the second plant whose genome can be comprehensively investigated. Depending on the stringency applied in gene prediction, the rice genome
contains between 32,277 and 61,668 genes (Goff et al., 2002 ; Yu et al., 2002 ). With 77% of the genes
distributed in about 15,000 multigene families, it appears that this
larger number of genes is at least partly due to an increase in the
number of copies per family. Both Arabidopsis and rice contain large
duplicated segments, suggesting that they have undergone
one or several polyploidization events during their
evolutionary history. Monocots and eudicots diverged 180 to 240 million years ago (Wolfe et al., 1989 ;
Goremykin et al., 1997 ; Soltis et al.,
2002 ), and the main polyploidization or large
duplication events still detectable in the genome of Arabidopsis and
rice are estimated to have occurred around 112 million years and 40 to
50 million years ago, respectively (Vision et al., 2000 ;
Goff et al., 2002 ). Combined with local tandem
duplications, polyploidization is a mechanism that generates multigene
families and is a major source of evolutionary novelty. With the
availability of the Arabidopsis and rice genomes, it is now possible to
study and understand more precisely how and when gene families arose, how they were amplified, and which new biological functions were derived in the monocots and eudicots. In addition to basic knowledge on
evolutionary processes, these comparative studies should also help to
attribute a function to every identified gene.
Recently, a new type of modular cell wall protein containing a Leu-rich
repeat (LRR) and an extensin domain was described in eudicots and
monocots (Rubinstein et al., 1995a ; Baumberger et
al., 2001 ; Stratford et al., 2001 ). LRRs are
frequently implicated in protein-protein interactions and, in plants, a
large subclass of receptor-like kinases has extracellular LRRs in the
receptor domain (Shiu and Bleecker, 2001 ). Of those,
several have been shown to work in signal transduction during
development or to participate in pathogen recognition and defense
(Song et al., 1995 ; Torii et al., 1996 ;
Clark et al., 1997 ; Li and Chory, 1997 ; Jinn et al., 2000 ). Extensins form an abundant group of
cell wall structural proteins belonging to the family of Hyp-rich
glycoproteins. They are defined by the presence of the repeated
pentapeptide Ser(Hyp)4 (Kieliszewski et
al., 1990 ; Kieliszewski and Lamport, 1994 ),
where most Hyp and Ser residues are glycosylated (Wilson and
Fry, 1986 ). One proposed function of extensins is to reinforce the polysaccharidic structure of the wall by cross-linking to each
other and/or other cell wall components. It has also been postulated
that extensins might fix the shape of the cell at the end of the
expansion phase (Carpita and Gibeaut, 1993 ). Such a function in cell morphogenesis is supported by the recent finding that
an extensin gene is required for correct cell morphogenesis during
Arabidopsis embryogenesis (Hall and Cannon,
2002 ).
The Arabidopsis LRX1 (LRR/EXTENSIN1) gene was
shown to be involved in root hair morphogenesis because LRX1
mutants develop root hairs with aberrant shape (Baumberger et
al., 2001 ). The LRX1 protein is specifically targeted to the
root hair cell wall, where it is insolubilized. Similarly, PEX1, a
pollen-specific maize (Zea mays) LRX, was immunolocalized in
the intine layer of the pollen grain and the callosic sheath of the
pollen tube (Rubinstein et al., 1995a ,
1995b ). Together, the properties of LRR and extensins,
the cell wall localization of LRX proteins, and the phenotype of
lrx1 mutants suggest that LRXs are potentially involved in
the regulation of cell wall expansion in response to signaling.
To better understand the function and the evolutionary history of LRR
extensins, we have characterized the LRX family of Arabidopsis and
rice. We have compared the protein organization of the identified gene
family members and further studied the expression of the Arabidopsis
LRX genes. Our results reveal that LRX proteins fall into
two different subclasses defined by their phylogenetic relationship and
their preferential expression either in pollen or in vegetative tissues. The LRX phylogeny suggests that this family has
independently evolved in monocots and eudicots from two common
ancestral genes specifically expressed in either vegetative or
reproductive tissues.
 |
RESULTS |
Genome-Wide Searches Identified 11 LRX Genes in
Arabidopsis and Eight in Rice
To identify the complete LRX gene family, we searched
the genomes of Arabidopsis (Arabidopsis Genome Initiative,
2000 ) and rice subsp. japonica (Goff et al.,
2002 ), which we recently got access to, using the tBLASTn
algorithm (Altschul et al., 1997 ) and the LRR protein
sequence of LRX1 as a query. Among all the sequences showing some
homology to the LRR domain of LRX1 (over 100), we selected for further
examination those with an E value under 10e-22. Sequences encoding an
extensin-like domain in the C terminus were retained and reiteratively
blasted against the databases. The reiterative search did not provide
additional sequences. All the LRX sequences had an E value under 10e-75
and were compacted at the top of the list of hits (depending on the
query, 25-60 sequences were retained following the criterion of an E
value under 10e-22), indicating that the initial cutoff value of 10e-22 was appropriate. In total, this survey identified 11 Arabidopsis LRX genes, including the already characterized Arabidopsis
LRX1 gene, and eight novel rice genes (Table
I). The LRX genes are dispersed on the five Arabidopsis chromosomes and on seven rice chromosomes. The nomenclature used to identify the individual genes has
been developed based on previous work (Rubinstein et al.,
1995a , 1995b ; Baumberger et al.,
2001 ; this study). The gene family is referred to as the
LRX gene family. The specific members of the
"vegetative" clade (see below) are referred to as LRX
genes with the addition of genus-species modifiers (i.e.
LRX1 is now AtLRX1, and we propose that
TOML-4 [Zhou et al., 1992 ] be renamed LeLRX1). The specific members of the "reproductive clade
(see below) are referred to as PEX genes with a similar
modifier (i.e. Pex1 [Rubinstein et al.,
1995a ] is now ZmPEX1). We believe that this system
minimizes confusion, while still maintaining the links to the previous
published work.
Because all the AtLRX (Arabidopsis LRX)
genes appear to be located on pairs of duplicated chromosomal segments
(Blanc et al., 2000 ; Vision et al.,
2000 ), we used the Dotter program (Sonnhammer and
Durbin, 1995 ) to align the sequences of the bacterial
artificial chromosomes (BACs) harboring the LRX genes
and to verify the conservation of the genes flanking each candidate.
This analysis confirmed that AtLRX1/AtLRX2,
AtLRX3/AtLRX4, and AtPEX1/AtPEX2, as well as
AtPEX3/PEX4, form gene pairs, which arose by segmental
duplication. Therefore, these gene pairs can be considered as paralogs
(Fig. 1). The three remaining genes,
AtLRX5, AtLRX6, and AtLRX7, are also
located on duplicated chromosomal segments. However, the second copy of
each gene has been lost by sequence rearrangement as revealed by the
analysis of the corresponding BAC sequences (data not shown).

View larger version (13K):
[in this window]
[in a new window]
|
Figure 1.
Position of the 11 Arabidopsis LRX
genes on the five Arabidopsis chromosomes. The large chromosomal
duplications harboring the LRX genes are
represented by shaded boxes with identical pattern, and paralogs are
linked by arrows. AtLRX5, 6, and 7 are
single genes, although they are located on duplicated segments.
Centromeres are indicated by constrictions in the chromosome schematic
representation. The chromosomal duplications are deduced from
Blanc et al. (2000) and Vision et al.
(2000) . Gene positions on the chromosomes are approximated
according to the mapping of the BAC clones available at The Arabidopsis
Information Research Web site (http://www.arabidopsis.org).
|
|
The prediction of the AtLRX open reading frames (ORFs) was
verified by visually checking the three-frame translations of the genomic sequences and by comparing the predicted proteins with the
AtLRX1, ZmPEX1, and ZmPEX2 sequences
(Rubinstein et al., 1995a ; Baumberger et al.,
2001 ; Stratford et al., 2001 ). The
OsLRX ORFs were determined similarly, after correction of
the ambiguities and sequencing errors by comparison with ESTs and rice
subsp. indica genomic sequences (Yu et al.,
2002 ). At the time point of the analysis, ORF sequences of two
OsLRXs candidate genes were incomplete. One of them,
OsLRX3, was included in the analysis because its ORF
sequence only lacks the 5' end, which is poorly conserved in the
LRX family. The second candidate was not considered further,
because its identification as a putative LRX gene only relies on its homology with the LRR domain of AtLRX1,
without any possibility to verify the rest of the sequence and the
presence of an extensin domain.
Because previously reported LRX genes lack introns
(Rubinstein et al., 1995a ; Baumberger et al.,
2001 ; Stratford et al., 2001 ), we wished to
determine whether any AtLRX genes contained introns. If
available, the presence or absence of introns was verified by comparing
the genomic sequences with corresponding EST sequences. Two genes
(AtLRX6 and AtLRX7) encode proteins with very
short and atypical extensin domains, and, therefore, could contain
unannotated introns in the extensin domain leading to a perceived
premature truncation. For these genes, the 3' end of the cDNA was
experimentally confirmed by RACE-PCR, verifying that the annotated gene
sequences are correct. Two genes, AtPEX3 and
AtLRX5, are annotated as containing introns. For
AtPEX3, starting the ORF at position 54,949 of the BAC F19G14 (accession no. AC006438) results in a complete ORF that
predicts a protein with a better signal sequence compared with the
annotated protein. For AtLRX5, the predicted intron results in an unusual sequence motif nested within the extensin domain. To
reexamine the nucleotide sequence in this region, we sequenced PCR
products covering the predicted intron and flanking sequences. As
suspected, the published genomic sequence contains an extra nucleotide
at position 35,198 of the BAC F28A21 (accession AL035526). Correction
of this sequencing error eliminates the requirement for the predicted
intron and eliminates the unusual sequence motif from the extensin
domain. The complete corrected DNA and protein sequences of the rice
LRX are available in the supplemental data (see www.plantphysiol.org).
From our results, we conclude that all of the AtLRX genes
lack introns.
The LRR Domain of LRX Proteins Is Strongly Conserved between
Monocots and Eudicots
To provide a general description of the LRX family, we
compared the 11 Arabidopsis LRX protein sequences with the eight rice homologs. We also included in the analysis homologs from maize (ZmPEX1 and ZmPEX2; Rubinstein et al.,
1995a ; Stratford et al., 2001 ), tomato
(Lycopersicon esculentum; LeLRX1 and
LePEX1; Zhou et al., 1992 ; Baumberger
et al., 2001 ; Stratford et al., 2001 ), and from Nicotiana tabacum (NtPEX1; Wong,
2001 ).
The general organization of the LRX proteins after cleavage of the
predicted signal peptide (Nielsen et al., 1997 ) consists of a 54- to 102-amino acid-long N-terminal domain, which is followed by
a 236- to 240-amino acid-long domain of LRRs, separated from the
C-terminal extensin-like domain by a short Cys-rich region of 39 to 50 amino acids (Fig. 2, A and B). Three
regions can be delimited in the N-terminal domain: a hypervariable
region of 11 to 20 amino acids and two flanking regions of predicted
-helices. The first -helix is truncated or entirely missing in
eight of the LRX proteins (AtLRX1, 2, 6, 7, LeLRX, and OsLRX1, 3, and 4) but is
always present in the PEX proteins. In contrast, the second -helix
is present throughout the family and forms the beginning of a very
well-conserved region of the N-terminal domain (Fig. 2, B and C). The
hypervariable region is sometimes conserved between proteins of the
same plant family (LePEX/NtPEX and
ZmPEX1/ZmPEX2/OsPEX1/OsPEX2) and between subsets of the Arabidopsis proteins
(AtPEX1/AtPEX3/AtPEX4 and
AtLRX4/AtLRX5; Fig. 2C).

View larger version (35K):
[in this window]
[in a new window]
|
Figure 2.
Domain organization of the LRX proteins. A,
Schematic representation of the conserved domain organization of the
LRX proteins. The different domains are drawn to scale and their
identity is indicated (SP, predicted signal peptide; and , position
of an -helix). The N-terminal domain contains two predicted
-helices (gray oblique stripes) that flank a hypervariable region
(light-gray box). The region of the N-terminal domain beginning with
the second predicted -helix is strongly conserved (stippled box). The C terminus (black oblique stripes) is conserved
between several of the LRX proteins. B, Consensus amino acid sequence
of each conserved domain. Capital letters, Residues that are conserved
in at least 80% of the Arabidopsis, rice, maize, and tomato
LRX genes; x, Non-conserved residues; and a, Any aliphatic
amino acids. The 10 LRRs are aligned to the plant extracellular LRR
consensus sequence, and positions matching the consensus are indicated
in bold letters. The predicted -strand/ -turn structural motif is
framed. Lower script numbers indicate the number of non-conserved
residues between two consensus amino acids. C, N-terminal domain
sequences of the LRX proteins. The -helix regions and the
hypervariable regions are indicated at the top. Conserved residues are
in bold, and regions of particular homology are framed. D, C terminus
of the LRX proteins. Conserved residues are indicated in bold, and
sequences sharing a same consensus are grouped together. The consensus
motifs are shown on the right of the sequences. Capital letters,
Residues conserved in 50% or more of the sequences; minuscule letters,
residues common in less than 50% of the sequences.
|
|
The LRR domain is composed of nine complete repeats of 23 to 25 amino
acids flanked by a 10th degenerated terminal repeat (Fig. 2B). The LRX
LRRs match the plant extracytoplasmic LRR consensus sequence
LxxLxxLxLxxNxLxGxIPxxLGx, where L represents a conserved Leu or other
aliphatic residues, and x can be any amino acid (Kajava, 1998 ). The identity of the LRR domain between the different
members of the LRX family ranges from 96.6% to 50%. The first three
LRRs, as well as the sixth one, show the strongest conservation
throughout the family. Protein tertiary structure prediction and
crystallographic studies indicate that the protein-protein interaction
surface of LRR domains is constituted by the juxtaposition of the
xxLxLxx motif of each repeat in which the variable residues are exposed to the solvent and determine the specificity of the interaction (Fig.
2B; Kobe and Deisenhofer, 1995 ; Papageorgiou et
al., 1997 ). Therefore, we compared the conservation of the
solvent exposed residues with the conservation of the amino acids at
non-consensus positions in the LRRs of the complete Arabidopsis LRX
family. For this purpose, the solvent-exposed residues of each LRR were extracted from the sequence and joint together into a separate file for
each AtLRX protein. All pair-wise similarities were
calculated between these artificial sequences, and the values were
averaged. The same calculation was performed on the rest of the LRR
domain sequence, excluding the LRR consensus residues, which are likely to be under strong negative selection pressure (Parniske et al., 1997 ). The average similarity is 72% ± 11% for the
solvent-exposed residues (xxLxLxx) against 48% ± 13% for the
variable amino acids not directly involved in the interaction. This
suggests that the purifying selection tended to preserve the
recognition specificity of the LRR domain.
The Cys-rich region that follows the LRR domain also forms a
characteristic signature of LRX proteins. This hinge-region consists of
five Cys residues regularly spaced by 10 to 18 variable amino acids
and two well-conserved Asp and Gln residues
(Cx10-18Cx10-14NCx6-7Qx4Cx9-11C). The strong conservation of these Cys suggests that disulfide bonds are
important in the conformation and/or function of LRX proteins.
The Extensin Domain of the LRX Family Is Highly
Variable
In contrast to the high similarity observed between the N-terminal
and LRR domain of all the predicted LRX proteins, the extensin domain
is extremely variable, both in length and motif organization (Table
II). The S(P)4-n
motif characteristic of the extensins is present in a high copy number
in most of the LRX proteins. However, some of the identified LRX
proteins contain a very short extensin domain with only one or two
S(P)4-n motifs (AtLRX7 and OsLRX3) or contain only shorter stretches of Pro residues
(OsLRX1 and OsLRX2). Most of the LRX extensin
domains contain additional subdomains that lack the specific
SP(4-n) motifs but are made of other repeats or
are enriched in specific amino acids, which are frequently Ser, Val,
Lys, Gln, and Thr. The alignment of the extensin domain was difficult,
even between duplicated genes, mostly because of the
frequent insertion and deletion of particular repeats. As a
consequence, we manually screened single-extensin domains for patterns
and higher order repeats and compared these patterns between the
different LRX proteins. An analysis performed with the
motif-discovering algorithm MEME (Bailey and Elkan,
1994 ) essentially confirmed the manual search and was used to
refine the motif and consensus definition where necessary. Table II
compiles the different repeats identified in each extensin domain and
reports motifs that are present in more than one LRX protein. Repeats are frequently clustered together and define subdomains, which have a
specific signature on hydropathy plots (data not shown). For example,
the repeat SPPPPVH constitutes a conserved subdomain of variable copy
number in four of the Arabidopsis LRX proteins (AtPEX1-4). This SPPPPVH subdomain makes up
essentially the entire extensin moiety of AtPEX3, whereas in
its paralog (AtPEX4), it is preceded by a different domain
absent from AtPEX3. Similarly, the motif VKSPAPVSPPPP is
present in numerous copies, with small variations, in several of the
monocot LRXs identified so far (ZmPEX1, ZmPEX2, OsPEX1, and OsPEX2).
Interestingly, the most conserved part of the extensin domain is the C
terminus. Monocot and dicot PEX proteins have the consensus terminal
motif ilPP(i/f)(i/L/m)ghqYaSPPPP(m/q)FqGY, whereas two other related
motifs, yegxplPPvigVSYxASPPPPpxx(f/y)Y and
KLPFPPVYGVx(y/a)(a/y)SxPPPP(v/s)KPYN, terminate the extensin domain of
AtLRX1 to 5, LeLRX, and OsLRX1 and 2, respectively (Fig. 2D; Table II). The remaining LRX proteins have less
conserved C termini.
The Expression of the Arabidopsis LRX Genes Is Tissue
Specific
The identification of several corresponding EST sequences for most
of the Arabidopsis LRX genes indicated that they are
expressed during normal plant development, with the notable exception
of AtLRX2 and AtLRX7, for which no ESTs were
found (Table I). We characterized the expression pattern of each
AtLRX gene by northern hybridization of total RNA isolated
from various organs (Fig. 3). Transcripts
of AtPEX1 to 4, which share the highest
similarity with the ZmPEX genes and form two pairs of
paralogs, were almost exclusively detected in flowers. Transcripts of
all AtPEX genes are readily detected in mature anthers,
pollen, and pollinated carpels, and AtPEX1 may be expressed
at a very low level in unpollinated carpels (Fig. 3). AtLRX2
to 6, which are phylogenetically more related to
AtLRX1 (Baumberger et al., 2001 ), display a
more diverse expression pattern. AtLRX2 (N. Baumberger, M. Steiner, U. Ryser, B. Keller, and C. Ringli, unpublished
data) and AtLRX6 are specifically expressed in roots,
whereas AtLRX5 mRNA is mainly found in flowers, young
leaves, and roots and is present in low amounts in the other organs.
The two paralogous genes, AtLRX3 and AtLRX4, are
both expressed in all organs, with the difference that
AtLRX4 has a higher expression in roots and young leaves
than AtLRX3.

View larger version (53K):
[in this window]
[in a new window]
|
Figure 3.
Northern-blot analysis of the Arabidopsis
LRX transcript levels. Total RNA was extracted from roots
(Rt), young developing leaves and cotyledons (yL), rosette mature
leaves (rL), cauline leaves (cL), floral stems (St), flower buds and
opened flowers (Fl), stamens (S), carpels (Ca), pollen (Po), and
pollinated carpels (PCa). Roots and young developing leaves were
harvested from 14-d-old Columbia seedlings grown vertically on
solidified Murashige and Skoog medium. All the other material was
harvested from 35- to 40-d-old Columbia plants. Carpels were harvested
from unopened flowers with young stamens to reduce pollen
contamination. For northern analysis, 5 µg (2 µg for pollen) of
total RNA was hybridized with 32P-labeled
gene-specific probes amplified by PCR from genomic DNA. Ribosomal RNAs
were used as loading control (lower). No signal was obtained with the
LRX7 probe and the result is not shown.
AtLRX2 data will be presented elsewhere (N. Baumberger, M. Steiner, U. Ryser, B. Keller, and C. Ringli, unpublished
data).
|
|
To assess more precisely the spatial and temporal regulation of
AtLRX gene expression, the promoter region of each
LRX gene was fused to the bacterial -glucuronidase (GUS)
reporter gene (uidA) and transformed into
plants. The activity of the uidA reporter gene was tested
in at least five independent transgenic T2
lines. pAtPEX2, pAtPEX3,
and pAtPEX4::uidA plants showed a clear
staining of the mature pollen grains and, upon pollen germination, of
the elongating pollen tubes (Fig. 4, A
and B). No other floral tissue showed any GUS activity in these
transgenic plants. These results confirm the flower-specific signal
obtained by northern hybridization and suggest that these genes
constitute orthologs of the ZmPEX1, ZmPEX2, and
LePEX genes that were also shown to be expressed in pollen
(Rubinstein et al., 1995a ; Stratford et al.,
2001 ). pAtLRX6::uidA expression was
observed in secondary roots emerging from the primary root (Fig. 4C).
GUS staining at the apex was still visible during elongation of the
lateral roots once they had broken through the cortex of the primary
roots but gradually faded with age (data not shown). In older roots
(>7 d post-germination), a moderate level of expression was detected
in the central cylinder of primary and secondary roots (Fig. 4C).
pAtLRX5::uidA showed a very similar pattern of
expression in emerging lateral roots at a very early stage of
dedifferentiation of the pericycle cells (Fig. 4D). However, in
contrast to pAtLRX6::uidA, a high GUS activity was
detected in very young emerging leaves and stipules at the center of
the pAtLRX5::uidA plant rosette (Fig. 4E). A more
diffuse staining persisted later in the expanding leaves, mostly in the
petiole (Fig. 4E). During flower development,
pAtLRX5::uidA expression was restricted to the
carpels, the stamen filament, and the abscission zone of the floral
whorls (Fig. 4F). Consistent with the northern data,
pAtLRX3::uidA and
pAtLRX4::uidA plants showed the same pattern of
expression. In these plants, GUS activity was restricted to the root
vascular tissues (Fig. 4G) and in the veins of the cotyledons, the
developing leaves, and the sepals (Fig. 4, H and I). A moderate level
of expression was also observed in the rest of the leaf and sepal
tissues. The pAtLRX7::uidA construct induced a
strong GUS activity in pollen grains, whereas no signal could be
detected in flowers by northern hybridization (data not shown).
However, we could amplify AtLRX7 transcripts by RACE-PCR
from flower total RNA as template, demonstrating that this gene is at
least expressed in flowers, and possibly in other tissues. Therefore,
it is possible that a low stability of the AtLRX7 transcript
prevents its detection by northern hybridization.

View larger version (89K):
[in this window]
[in a new window]
|
Figure 4.
Histochemical localization of LRX
expression in LRX promoter::uidA
transgenic plants. A and B, pPEX3::uidA
expression. C, pLRX6::uidA expression.
D through F, pLRX5::uidA expression. G
through I, pLRX4::uidA expression. A,
Open flower. Mature pollen grains show a strong expression of
pPEX3::uidA. B, Pollen germinating on a
stigma. C, Emerging lateral roots, showing strong expression of
pLRX6::uidA in the lateral root
meristem, as well as in the vascular tissue of the mature primary root.
D, Root of a 7-d-old seedling; GUS activity is restricted to the
emerging lateral roots. E, Fourteen-day-old seedlings;
pLRX5::uidA is expressed in the leaf
primordia and stipules (inset image) at the basis of the rosette
leaves. A moderate activity persisted during further expansion of the
leaves, preferentially in petioles. F, Opened flowers and flower buds.
GUS activity is detected in the carpels and faintly in the pedicels.
Carpel expression seems higher in the style directly under the stigma
and at the base of the organ and the junction with the receptacle
(abscission zone). G, Primary mature roots; the vascular bundle is
strongly stained. H, Fourteen-day-old seedlings; the inset picture
shows a leaf stained for a shorter time to reveal the stronger
expression in vascular tissue. I, Open flower; GUS activity is detected
in sepals, mostly in the veins. Bars = 1 mm (A, F, and I), 50 µm
(B-D), 2 mm (E and H), 350 µm (E, inset image), and 200 µm
(G).
|
|
Monocot and Eudicot LRX Genes Cluster in Two Mixed
Phylogenetic Clades
To study the phylogeny of the LRX proteins, the protein sequences
were aligned, and trees were generated by the maximum parsimony method.
Because of uncertain alignment, the signal peptide, the hypervariable
region between the predicted -helices in the amino terminus,
as well as the extensin domain except the last 24 residues of the
carboxy terminus, were removed before the phylogenetic analysis. A
phylogenetic analysis excluding the conserved C termini of the extensin
domain gave a very similar tree topology (data not shown). The aligned
regions were composed of 389 amino acid positions. Of these 389 positions, 254 were parsimony informative for the amino acid-based
analysis. Seven parsimony-informative gap characters were included in
all analyses. Two most parsimonious gene trees of 1,721 steps were
found in all 1,000 replicates. The ensemble consistency index
(Kluge and Farris, 1969 ) of these trees was 0.65 (excluding uninformative characters) and the ensemble retention index
(Farris, 1989 ) was 0.64. The strict consensus tree
(Schuh and Polhemus, 1980 ; Sokal and Rohlf,
1981 ) with jackknife support values mapped is presented in
Figure 5. In the jackknife tree,
OsPEX3 was resolved as the sister group of the clade that includes ZmPEX1 and OsPEX1 with weak (54%)
support.

View larger version (18K):
[in this window]
[in a new window]
|
Figure 5.
Phylogenetic analysis of the LRX family. The
strict consensus of the two most parsimonious amino acid based
phylogenetic trees is reproduced, and the jackknife support values
mapped above the branches. The number of steps was 1,721, the ensemble
consistency index of the most parsimonious trees was 0.65 (excluding
uninformative characters), and the ensemble retention index was 0.64. Monocot and dicot LRX proteins cluster in a vegetatively expressed LRX
and a reproductive-expressed LRX clade.
|
|
A phylogenetic analysis was also performed on 778 nucleotide characters
corresponding to the first and second codon positions. Among those
characters, 442 were parsimony informative. One most parsimonious gene
tree of 2,157 steps was found in all 1,000 replicates with an ensemble
consistency index of 0.45 (excluding uninformative characters) and an
ensemble retention index of 0.58. This tree topology was nearly
identical to that inferred by using amino acid characters and only
differed with respect to branches that were weakly supported in one or
both trees (see supplemental data at www.plantphysiol.org).
The phylogenetic analysis shows that the LRX sequences fall in two
distinct clades, which comprise both eudicot (Arabidopsis and tomato)
and monocot LRXs (rice and maize; Fig. 5). Remarkably, this
classification of the LRX genes into two clades almost
perfectly overlaps with the LRX expression patterns: the
first clade, which clusters the pollen-expressed AtPEX1 to
4, also contains ZmPEX1, ZmPEX2, and
LePEX, which were all shown to be specifically expressed in
pollen (this manuscript; Rubinstein et al., 1995a ;
Stratford et al., 2001 ). The second phylogenetic clade
contains all the Arabidopsis LRX genes expressed in
vegetative tissues and AtLRX7, for which the expression
profile remains unclear. The expression of the OsLRX genes,
based on ESTs isolated from different tissues, indicate that this
classification also holds true for rice. For OsPEX1 to
3, ESTs were isolated from flowering panicles. ESTs of
OsLRX1, 2, and 5 were isolated from
roots, shoots, and whole seedlings (Table I) and, thus, fit in the
clade of genes expressed in vegetative tissue. For OsLRX3
and 4, a final conclusion cannot be drawn because no or only
one EST from flowering panicles, respectively, was isolated. A more
detailed analysis will be necessary to conclusively classify these two genes.
 |
DISCUSSION |
The LRX phylogeny suggests that the origin and initial expansion
of the LRX family predate the divergence of monocots and eudicots 160 to 240 million years ago (Wolfe et al., 1989 ;
Goremykin et al., 1997 ; Soltis et al.,
2002 ). The first step in the diversification of the
LRX gene family possibly occurred by the duplication of a
unique ancestral gene resulting in the two clades observed in modern
angiosperms. The separation between eudicot and monocot LRX sequences
within each of the clades derived from this gene duplication,
indicating that the LRX family expanded and diversified after the divergence between the two lineages. Although the absence of
LRX candidate gene sequences from more ancient taxa in
current databases prevents us from reliably rooting the phylogenetic
tree, the evolutionary scenario presented above is the most probable because it is the most parsimonious. Placing the origin of the family
anywhere else than between the two postulated reproductive and
vegetative clades would imply multiple gene extinction events in
each angiosperm lineage. The expression of the LRX genes of Arabidopsis, maize, and tomato reveals that they can be classified as
vegetatively expressed or pollen-expressed genes, two categories that
almost completely overlap with the phylogenetic clades. The AtLRX7 gene seems to be an exception: It is expressed in
pollen although it belongs to the vegetative LRX phylogenetic clade. However, its pollen-specific expression was only demonstrated by the
promoter::uidA fusion and no mRNA was detectable
by northern analysis, suggesting that AtLRX7 mRNA may
be unstable. The expression profiles of the OsLRX genes,
deduced from EST database mining, so far corroborate the
suggested classification. ESTs corresponding to
OsLRX1, 2, and 5 of the vegetative
clade were isolated from different vegetative tissue and those of
OsPEX1 to 3 from flowering panicles. In two
cases, EST data are either absent (OsLRX3) or possibly
incomplete (OsLRX4). Thus, a final conclusion on the classification of the whole rice LRX gene family will
require a more detailed analysis of the gene expression profiles.
The existence of "vegetative" and "reproductive" LRX
clades suggests that the two ancestral LRX genes had
acquired their tissue specificity before the monocot-eudicot
divergence. Such a clear specialization between reproductive and
vegetative isovariants was also observed in the actin and profilin gene
families (McDowell et al., 1996 ; Meagher et al.,
1999 ; Kandasamy et al., 2002 ). It was proposed
that the initial separation between vegetative and reproductive actins
was contemporary with and perhaps contingent upon the invention of new
developmental pathways involved in the formation of vegetative organs
from reproductive ones (Meagher et al., 1999 ). Similar
to actins (Ringli et al., 2002 ), LRXs are possibly involved in cell morphogenesis, as demonstrated for
AtLRX1 (Baumberger et al., 2001 ). Thus, the
same evolutionary mechanisms might have driven the specialization of
LRX genes into reproductive and vegetative forms. However,
in contrast to the LRX family, the vegetative and
reproductive actin clades expanded before the radiation of the
angiosperms, as indicated by the existence of subclasses comprising
monocot and dicot sequences within each clade (Meagher et al.,
1999 ).
The localization of the most closely related AtLRX genes on
duplicated chromosomal regions indicates that the last expansion of the
LRX family in Arabidopsis occurred during whole or partial genome duplication events, followed, in some cases, by selective gene
loss (Blanc et al., 2000 ; Grant et al.,
2000 ; Ku et al., 2000 ; Vision et al.,
2000 ). Three, and possibly four, rounds of duplications
generated five vegetative subclasses with distinct expression patterns,
and two rounds of duplications created two reproductive subclasses with
identical expression patterns. The conservation of a very similar
expression pattern in each copy of the paired genes suggests that the
functional specialization of these genes happened before their
duplication. According to the estimation of Vision et al.
(2000) and the synteny observed between tomato and Arabidopsis
(Ku et al., 2000 ), most of the duplications in the
Arabidopsis genome occurred shortly after the divergence between
asterids (tomato) and rosids (Arabidopsis) 112 to 156 million years
ago. It is surprising that genes duplicated so long ago have maintained
the same expression pattern, whereas the estimated half-life of a
duplicated gene copy is only 3.2 million years before it is silenced
(Lynch and Conery, 2000 ). This might happen if the
duplicated LRX genes have acquired useful new functions
(neofunctionalization) or if two copies are necessary to achieve a
sufficient level of expression (sub-functionalization; Force et
al., 1999 ). Complementation of the Arabidopsis lrx1
mutant with different LRX genes and isolation of other
lrx mutants should help to resolve this issue.
Interestingly, the seven vegetative Arabidopsis LRX genes
form five subclasses with distinct expression patterns. This mirrors the number of rice genes in the vegetative clade. Similarly, the two
pairs of Arabidopsis PEX paralogs correspond to two rice
PEX genes (the phylogenetic position of the third one,
OsPEX3, is not well resolved). Because the LRX
genes have evolved independently in moncots and eudicots, this might
indicate that a minimum of six to seven differently regulated
LRXs is required for the development of modern angiosperms.
It will be interesting to learn whether the evolution has led to the
formation of the same vegetative expression patterns in different
LRX subclasses in monocots and in eudicots.
The high variability of the extensin domain throughout the LRX family
is in strong contrast to the high conservation of the LRR domain and
raises the question of whether or not different extensin domains have
been recruited multiple times by a single LRR protein. However, the
presence of similar C termini in different LRXs, both within the same
species and between species, rather suggests that the formation of a
chimeric LRR extensin by domain shuffling probably took place before
the first gene duplication. Except for the C terminus, which is
particularly conserved between monocot and dicot PEX genes,
similarities to some extent are only observed between the extensin
domains of genes belonging to the same expression subclass
(AtLRX1/AtLRX2,
AtLRX3/AtLRX4, and AtPEX1-4) or
between putative orthologs (ZmPEXs and
OsPEX1/OsPEX2). It is possible that the
variability of the extensin domains reflects an adaptation to
particular cell walls. For instance, the composition of the pollen tube
cell walls is clearly different from the composition of other cell
types, being rich in callose and poor in cellulose (Taylor and
Hepler, 1997 , and refs. therein). Primary cell walls also differ by a number of structural and biochemical specificities between Poaceae (type II cell walls) and other angiosperms (type I cell
walls; Carpita and Gibeaut, 1993 ). One possible
consequence of the difference in the primary cell wall might be the
presence of the amino acid motif YxY, important for intramolecular
cross-linking, exclusively in extensin domains of dicot LRXs. In
general, it seems that the extensin domains are under a relatively mild
selection pressure, and only some critical structural motifs are
conserved. The insertion/deletion of repeats and the insertion of
single amino acids have been shown to conserve conformation and
function of some Pro-rich proteins (Rabanal et al.,
1993 ; Schmidt et al., 1994 ). It is likely that
replication slippage and unequal crossing over occur more frequently in
the highly repetitive G/C-rich sequences encoding the extensin-like
domain than in the LRR-encoding sequence. These two mechanisms could
generate the observed variability in the extensin domain by sequence
extension and repeat shuffling. The higher similarities between genes
belonging to the same expression subclass then would rather be a
consequence of a more recent divergence than the sign of a functional
specialization to a specific type of cell wall. Experiments involving
complementation of the lrx1 mutant (Baumberger et
al., 2001 ) by chimeric AtLRX constructs containing
different extensin domains will address this question. It is
interesting to note that none of the LRXs contains the motif VYK,
thought to be important for intermolecular cross-linking (Schnabelrauch et al., 1996 ). Because protein
immobilization in the cell wall was demonstrated for AtLRX1
and ZmPEX1 (Baumberger et al., 2001 ;
Rubinstein et al., 1995b ), it is likely that this process is mediated by other motifs than those so far identified in extensins.
LRRs frequently participate in protein-protein interactions, and
crystallographic studies indicate that they collectively form an open
horseshoe structure (Kobe and Deisenhofer, 1994 , 1995 ; Papageorgiou et al., 1997 ). In each
LRR, a region of the consensus sequence (xxLxLxx) is predicted to form
a -strand/ -turn structure in which the variable residues (x) are
exposed to the solvent and determine the specificity of the
interaction. This region of the LRR is subjected to diversifying
selection in plant resistance genes (McDowell et al.,
1998 ; Meyers et al., 1998 ; Ellis et al.,
1999 ; Van der Hoorn et al., 2001 ). The
observation that among different LRXs, the solvent-exposed amino acids
are well conserved or even identical (AtLRX3/4 and
AtLRX6/7) suggests that the LRX proteins interact with very
similar or identical ligands. However, we observed that overexpression
of the LRR domain of AtLRX1, despite its ability to
sequester the ligand of the native AtLRX1 protein and induce
a dominant negative effect in root hairs, had no influence in the rest
of the plant where other LRXs are active (Baumberger et al.,
2001 ). This result indicates that the flanking regions of the
LRRs might contribute to the specificity of the interaction with the
ligand as observed for the LRRs of the tomato and flax
(Linum usitatissimum) resistance genes
Cf-4, L6, and L7 (Ellis et al.,
1999 ; Van der Hoorn et al., 2001 ). The existence
of family- or paralog/ortholog-specific regions adjacent to the
conserved region of the N-termini of the LRX proteins supports this notion.
Mutations in the Arabidopsis LRX1 gene result in root hairs
that swell, branch, or abort, suggesting that AtLRX1 is
involved in tip growth, an extreme case of polarized growth restricted to root hairs and pollen tubes (Baumberger et al.,
2001 ). Therefore, it is particularly intriguing to note that at
least four Arabidopsis LRX genes are expressed in mature and
possibly in germinating pollen as demonstrated for the
ZmPEX1 gene (Rubinstein et al., 1995a ,
1995b ). Thus, the primary function of LRX proteins might be to control cell polarization or to locally regulate cell wall expansion during tip growth. For example, the PEX proteins may regulate
the polarized growth of pollen tubes, possibly in response to a signal
from the pistil. However, because LRX genes are also expressed in cells that expand by diffuse growth, the LRX proteins might have multiple functions depending on the cellular context. Instead of acting during cell expansion, LRXs might function in the
modification of localized cell wall domains during the differentiation process. The preferential expression of AtLRX3 and
AtLRX4 in vascular tissue, for instance, might be related to
a role of LRX proteins in the maturation of xylem vessels, a process
that requires the elimination of the terminal cell walls of the xylem
elements. The functional analysis of several Arabidopsis LRX
genes currently undertaken in our laboratories should provide
additional insight into the function of these intriguing proteins.
 |
MATERIALS AND METHODS |
Plant Material and Growth Conditions
Arabidopsis ecotype Columbia was used in all expression studies.
Plants were grown either in soil or on one-half-strength Murashige and
Skoog medium, supplemented with 2% (w/v) Suc and 0.6% (w/v)
Phytagel (Sigma, Buchs, Switzerland), under continuous light, and at
24°C. Transgenic seedlings were selected on one-half-strength Murashige and Skoog plates with 0.8% (w/v) Phytoagar
(Invitrogen, Basel) supplemented with 50 µg mL 1
kanamycin. All analyses using transgenic lines were performed in the
T2 or T3 generation.
Database Search and Gene Annotation
Arabidopsis and rice (Oryza sativa)
LRX sequences were retrieved by tBLASTn searches of the
Arabidopsis genome sequence database at the National Center for
Biotechnology Information (http://www.ncbi.nlm.nih. gov/) and the SYD
database at the Syngenta Torrey Mesa Research Institute
(http://www.tmri.org/index.html), respectively. The preliminary search
was performed with the amino acid sequence of AtLRX1
with the exclusion of the extensin domain (AY026364). Retrieved
sequences with a cutoff E value above 1 × 10 10 were
translated in the three frames to visually check for the presence of a
putative extensin-like domain downstream of the region of homology with
LRX1. LRX candidates were all recovered with a cutoff value well above
1 × 10 20, indicating that the stringency of the
search criterion was appropriate. Selected candidates were then used as
query in a second round tBLASTn search that did not identify new candidates.
Gene predictions were visually verified using the AtLRX1
and ZmPEX1 (Z34465) protein sequences as predictive
models. Ambiguities were solved by comparison of the genomic sequences
with EST or cDNA sequences, if available. For rice sequences, which
have a lower accuracy than the Arabidopsis sequences, the ORF sequences were also validated by comparison with the genomic sequences of the
rice subsp. indica (http://btn.genomics. org.cn/rice).
In case of doubt, corrections were made in favor of the highest
homology with AtLRX1 and ZmPEX1 protein sequences.
A 466-bp fragment containing the predicted AtLRX7 intron
was amplified by PCR (using the oligonucleotide pair
5'-ccattgtaggcccgactccatcgtc-3' and 5'-gagtgtattggcgttggtggaggtg-3'),
subcloned into pGEMT-Easy according to the manufacturer's protocol
(Promega, Madison, WI), and sequenced (Macromolecular Resources,
Colorado State University, Fort Collins).
Sequence Alignment and Phylogenetic Analysis
Amino acid sequences were aligned by use of the default
alignment parameters in ClustalX (Thompson et al.,
1997 ) and then manually adjusted by use of the alignment
criterion presented by Zurawski and Clegg (1987) , in
which gaps are considered as characters and the number of evolutionary
events is minimized. Regions of individual sequences that remained
ambiguously aligned with this criterion were coded as uncertain
("?") for the phylogenetic analysis. The corresponding
alignment of DNA sequences was created manually. The analysis with the
motif-detecting algorithm MEME was performed at
http://bioweb.pasteur.fr/seqanal/motif/meme/meme.html.
Aligned sequences were input into MacClade (Maddison and
Maddison, 1992 ) for phylogenetic analysis. Sequence gaps that
were unambiguously aligned were scored as additional characters by use
of "complex indel coding" (Simmons and Ochoterena,
2000 ). Parsimony-based tree searches were performed using PAUP*
(Swofford, 1998 ). Three tree searches were conducted:
using amino acid characters and using nucleotide characters from first
and second codon positions. Each tree search was performed using 1,000 replicates with equal character weighting, random taxon addition, and
tree bisection reconnection, which swapped to completion for every
search. Relative levels of branch support were determined using
jackknife support (Farris, 1989 ). One thousand
replicates were performed with 10 tree bisection reconnection searches
held per replicate. Jackknife support values were mapped onto the most
parsimonious tree or the strict consensus of the most parsimonious
trees, respectively.
Outgroup rooting was attempted using the Leu-rich region from LePRK1 of
tomato (Lycopersicon esculentum; Muschietti et
al., 1998 ) and CLV1 of Arabidopsis (Clark et al.,
1997 ). LRRs are comprised of a series of 23 to 25 residue
repeats, of which there are 9.5 in the LRX gene family, six in LePRK1,
and 21 in CLV1. Unfortunately, which repeats from the outgroup
sequences aligned with which repeats from the ingroup sequences could
not be determined. Therefore, duplicate gene rooting (Gogarten
et al., 1989 ; Iwabe et al., 1989 ; Donoghue and Mathews, 1998 ) was used to root the gene
tree between the vegetative- and reproductive-expressed paralogs.
The maize (Zea mays) and tomato LRX sequences used in
the analysis are recorded in the databases under the following
accession numbers: ZmPEX1, Z34465;
ZmPEX2, AF159297; and LePEX1,
AF159296.
Northern Analysis and RACE-PCR
RNA was extracted from various organs with the Trizol reagent
(Invitrogen) following the manufacturer's instructions. Pollen was
harvested and ruptured essentially as described (Huang et al.,
1997 ). Five micrograms of total RNA (2 µg for pollen) was separated by agarose gel electrophoresis, transferred onto a
GenescreenPlus nylon membrane (Invitrogen), and hybridized with
32P-labeled probes specific to each LRX
gene. The fragment used as probes covered the sequence coding for the
LRR domain and were amplified from Arabidopsis genomic DNA by PCR,
cloned into pGEM-T easy vectors (Promega), and sequenced. The actual
transcript end of the AtLRX6 and AtLRX7
genes was determined by 3'-RACE-PCR using the GeneRacer Kit
(Invitrogen) and gene-specific primers. The resulting PCR products were
cloned and sequenced. The specificity of the probes used for the
northern experiment was verified by Southern hybridization. Each probe
gave a different pattern, indicating that they are gene specific.
Constructs and Plant Transformation
For the each LRX
promoter::uidA fusion construct
(pLRX::uidA), 1.5 kb of the promoter region
were amplified by PCR from genomic DNA by PCR, sequenced, and fused to
the bacterial uidA gene in the vector pGPTV-KAN
(Becker et al., 1992 ). The T-DNA constructs were
transformed into Agrobacterium tumefaciens GV3101, and
plant transformation was performed following the floral dip method
described by Clough and Bent (1998) . Transgenic plants
were selected on Murashige and Skoog agar plates containing kanamycin
transferred on soil and allowed to set seed. T2 and
T3 generations were analyzed for GUS activity.
GUS Histochemical Analysis
Histochemical staining for GUS activity was performed by
incubation in 100 µg mL 1 5-bromo-4-chloro-3-indolyl
glucuronide in 50 mM Na-phosphate buffer (pH 6.8), 10 mM EDTA, 0.5 mM
K3Fe(CN)6, 0.5 mM
K4Fe(CN)6, and 0.1% (v/v) Triton X-100 at
37°C for 3 to 16 h. The material was then fixed and the pigments
removed by incubation in 70% (v/v) ethanol for several hours.
Observations were made on a Leica stereomicroscope LZM125 and a Leica
microscope Laborlux equipped with a Wild camera MPS52 (Leica,
Glattbrugg, Switzerland).
 |
ACKNOWLEDGMENT |
We thank Martin Parniske for insight into the LRR sequence analysis.
 |
FOOTNOTES |
Received September 24, 2002; returned for revision November 5, 2002; accepted December 12, 2002.
1
This work was supported by the Swiss National
Science Foundation (grant nos. 31-51055.97 and 41-6 419.00) and by
the National Science Foundation (grant no. 0091976 to P.A.B.).
2
Present address: Sainsbury Laboratory, John Innes
Centre Colney, NR4 7UH Norwich, UK.
[w]
The online version of this article contains Web-only
data. The supplemental material is available at
www.plantphysiol.org.
*
Corresponding author; e-mail bkeller{at}botinst.unizh.ch; fax
41-1-6348204.
Article, publication date, and citation information can be found at
www.plantphysiol.org/cgi/doi/10.1104/pp.102.014928.
 |
LITERATURE CITED |
-
Adams MD, Celniker SE, Holt RA, Evans CA, Gocayne JD, Amanatides PG, Scherer SE, Li PW, Hoskins RA, Galle RF, et al
(2000)
The genome sequence of Drosophila melanogaster.
Science
287: 2185-2195[Abstract/Free Full Text]
-
Altschul SF, Madden TL, Schaffer AA, Zhang JH, Zhang Z, Miller W, Lipman DJ
(1997)
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Nucleic Acids Res
25: 3389-3402[Abstract/Free Full Text]
-
Arabidopsis Genome Initiative
(2000)
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.
Nature
408: 796-815[CrossRef][Medline]
-
Bailey TL, Elkan C
(1994)
Fitting a mixture model by expectation maximization to discover motifs in biopolymers.
In
R Altman, D Brutlog, P Karp, R Lathrop, D Searls, eds, Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology. American Association for Artificial Intelligence Press, Menlo Park, CA, pp 28-36
-
Baumberger N, Ringli C, Keller B
(2001)
The chimeric leucine-rich repeat/extensin cell wall protein LRX1 is required for root hair morphogenesis in Arabidopsis thaliana.
Genes Dev
15: 1128-1139[Abstract/Free Full Text]
-
Becker D, Kemper E, Schell J, Masterson R
(1992)
New plant binary vectors with selectable markers located proximal to the left T-DNA border.
Plant Mol Biol
20: 1195-1197[CrossRef][ISI][Medline]
-
Blanc G, Barakat A, Guyot R, Cooke R, Delseny I
(2000)
Extensive duplication and reshuffling in the Arabidopsis genome.
Plant Cell
12: 1093-1101[Abstract/Free Full Text]
-
Carpita NC, Gibeaut DM
(1993)
Structural models of primary cell walls in flowering plants: consistency of molecular structure with the physical properties of the walls during growth.
Plant J
3: 1-30[CrossRef][ISI][Medline]
-
C. elegans Sequencing Consortium
(1998)
Genome sequence of the nematode C. elegans: a platform for investigating biology.
Science
282: 2012-2018[Abstract/Free Full Text]
-
Clark SE, Williams RW, Meyerowitz EM
(1997)
The CLAVATA1 gene encodes a putative receptor kinase that controls shoot and floral meristem size in Arabidopsis.
Cell
89: 575-585[CrossRef][ISI][Medline]
-
Clough SJ, Bent AF
(1998)
Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana.
Plant J
16: 735-743[CrossRef][ISI][Medline]
-
Donoghue MJ, Mathews S
(1998)
Duplicate genes and the root of angiosperms, with an example using phytochrome sequences.
Mol Phylogenet Evol
9: 489-500[CrossRef][ISI][Medline]
-
Ellis JG, Lawrence GJ, Luck JE, Dodds PN
(1999)
Identification of regions in alleles of the flax rust resistance gene L that determine differences in gene-for-gene specificity.
Plant Cell
11: 495-506[Abstract/Free Full Text]
-
Farris JS
(1989)
The retention index and the rescaled consistency index.
Cladistics
5: 417-419[ISI]
-
Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J
(1999)
Preservation of duplicate genes by complementary, degenerative mutations.
Genetics
151: 1531-1545[Abstract/Free Full Text]
-
Goff SA, Ricke D, Lan TH, Presting G, Wang RL, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al
(2002)
A draft sequence of the rice genome (Oryza sativa L. ssp japonica).
Science
296: 92-100[Abstract/Free Full Text]
-
Gogarten JP, Kibak H, Dittrich P, Taiz L, Bowman EJ, Bowman BJ, Manolson MF, Poole RJ, Date T, Oshima T, et al
(1989)
Evolution of the vacuolar H+-ATPase: implications for the origin of eukaryotes.
Proc Natl Acad Sci USA
86: 6661-6665[Abstract/Free Full Text]
-
Goremykin VV, Hansmann S, Martin WF
(1997)
Evolutionary analysis of 58 proteins encoded in six completely sequenced chloroplast genomes: revised molecular estimates of two seed plant divergence times.
Plant Syst Evol
206: 337-351[CrossRef]
-
Grant D, Cregan P, Shoemaker RC
(2000)
Genome organization in dicots: genome duplication in Arabidopsis and synteny between soybean and Arabidopsis.
Proc Natl Acad Sci USA
97: 4168-4173[Abstract/Free Full Text]
-
Hall Q, Cannon MC
(2002)
The cell wall hydroxyproline-rich glycoprotein RSH is essential for normal embryo development in Arabidopsis.
Plant Cell
14: 1161-1172[Abstract/Free Full Text]
-
Huang S, An YQ, McDowell JM, McKinney EC, Meagher RB
(1997)
The Arabidopsis ACT11 actin gene is strongly expressed in tissues of the emerging inflorescence, pollen, and developing ovules.
Plant Mol Biol
33: 125-139[CrossRef][ISI][Medline]
-
Iwabe N, Kuma K, Hasegawa M, Osawa S, Miyata T
(1989)
Evolutionary relationship of archaebacteria, eubacteria, and eukaryotes inferred from phylogenetic trees of duplicated genes.
Proc Natl Acad Sci USA
86: 9355-9359[Abstract/Free Full Text]
-
Jinn TL, Stone JM, Walker JC
(2000)
HAESA, an Arabidopsis leucine-rich repeat receptor kinase, controls floral organ abscission.
Genes Dev
14: 108-117[Abstract/Free Full Text]
-
Kajava AV
(1998)
Structural diversity of leucine-rich repeat proteins.
J Mol Biol
277: 519-527[CrossRef][ISI][Medline]
-
Kandasamy MK, McKinney EC, Meagher RB
(2002)
Plant profilin isovariants are distinctly regulated in vegetative and reproductive tissues.
Cell Motil Cytoskelet
52: 22-32[CrossRef][ISI][Medline]
-
Kieliszewski MJ, Lamport DT
(1994)
Extensin: repetitive motifs, functional sites, post-translational codes, and phylogeny.
Plant J
5: 157-172[CrossRef][ISI][Medline]
-
Kieliszewski MJ, Leykam JF, Lamport DTA
(1990)
Structure of the threonine-rich extensin from Zea mays.
Plant Physiol
92: 316-326[Abstract/Free Full Text]
-
Kluge AG, Farris JS
(1969)
Quantitative phyletics and evolution of Anurans.
Syst Zool
18: 1-32
-
Kobe B, Deisenhofer J
(1994)
The leucine-rich repeat: a versatile binding motif.
Trends Biol Sci
19: 415-421
-
Kobe B, Deisenhofer J
(1995)
A structural basis of the interactions between leucine-rich repeats and protein ligands.
Nature
374: 183-186[CrossRef][Medline]
-
Ku HM, Vision T, Liu JP, Tanksley SD
(2000)
Comparing sequenced segments of the tomato and Arabidopsis genomes: large-scale duplication followed by selective gene loss creates a network of synteny.
Proc Natl Acad Sci USA
97: 9121-9126[Abstract/Free Full Text]
-
Li JM, Chory J
(1997)
A putative leucine-rich repeat receptor kinase involved in brassinosteroid signal transduction.
Cell
90: 929-938
|