Novel and Stress-Regulated MicroRNAs and Other Small RNAs from Arabidopsis W

Similar documents
Bacterial Gene Finding CMSC 423

Protein sequence alignment using binary string


Biology 12 January 2004 Provincial Examination

The Cell T H E C E L L C Y C L E C A N C E R

The transfer RNA genes in Oryza sativa L. ssp. indica

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

Cloning and Characterization of MicroRNAs from Rice W

he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003)

CASE TEACHING NOTES. Decoding the Flu INTRODUCTION / BACKGROUND CLASSROOM MANAGEMENT

Bacterial Gene Finding CMSC 423

Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Bi 8 Lecture 17. interference. Ellen Rothenberg 1 March 2016

Regulation of Gene Expression in Eukaryotes

General aspects of bacteriology, bacterial structure and growth. Che-Hsin Lee, Ph.D. Department of Biological Sciences National Sun Yat-senUniversity

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets

Genomic and evolutionary aspects of chloroplast trna in monocot plants

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins.

Biology 12 November 1999 Provincial Examination

Phenomena first observed in petunia

Molecular Cell Biology - Problem Drill 10: Gene Expression in Eukaryotes

Overview: Conducting the Genetic Orchestra Prokaryotes and eukaryotes alter gene expression in response to their changing environment

In silico identification of mirnas and their targets from the expressed sequence tags of Raphanus sativus

RNA and Protein Synthesis Guided Notes

Transcriptional control in Eukaryotes: (chapter 13 pp276) Chromatin structure affects gene expression. Chromatin Array of nuc

Beta Thalassemia Case Study Introduction to Bioinformatics

Supplementary Figures

L I F E S C I E N C E S

MicroRNAs, RNA Modifications, RNA Editing. Bora E. Baysal MD, PhD Oncology for Scientists Lecture Tue, Oct 17, 2017, 3:30 PM - 5:00 PM

Ch. 18 Regulation of Gene Expression

Supplementary Table 3. 3 UTR primer sequences. Primer sequences used to amplify and clone the 3 UTR of each indicated gene are listed.

ANALYSIS OF SOYBEAN SMALL RNA POPULATIONS HLAING HLAING WIN THESIS

PROVINCIAL EXAMINATION MINISTRY OF EDUCATION BIOLOGY 12 GENERAL INSTRUCTIONS

High AU content: a signature of upregulated mirna in cardiac diseases

Processing of RNA II Biochemistry 302. February 13, 2006

PROVINCIAL EXAMINATION MINISTRY OF EDUCATION BIOLOGY 12 GENERAL INSTRUCTIONS

Identification of novel and candidate mirnas in rice by high throughput sequencing. and J-K Zhu,

The Synthetic Machinery of the Cell

Criteria for Annotation of Plant MicroRNAs

Human Genome: Mapping, Sequencing Techniques, Diseases

Eukaryotic Gene Regulation

sirna count per 50 kb small RNAs matching the direct strand Repeat length (bp) per 50 kb repeats in the chromosome

MicroRNA in Cancer Karen Dybkær 2013

Youhua Chen 1,2. 1. Introduction. 2. Materials and Methods

c Tuj1(-) apoptotic live 1 DIV 2 DIV 1 DIV 2 DIV Tuj1(+) Tuj1/GFP/DAPI Tuj1 DAPI GFP

Circular RNAs (circrnas) act a stable mirna sponges

RECAP (1)! In eukaryotes, large primary transcripts are processed to smaller, mature mrnas.! What was first evidence for this precursorproduct

mirna Dr. S Hosseini-Asl

Research Article Base Composition Characteristics of Mammalian mirnas

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Processing of RNA II Biochemistry 302. February 14, 2005 Bob Kelm

Biology 12 JANUARY Course Code = BI. Student Instructions

RNA Processing in Eukaryotes *

BIOLOGY 621 Identification of the Snorks

Molecular Biology (BIOL 4320) Exam #2 May 3, 2004

Chapter 10 - Post-transcriptional Gene Control

Supplemental Data. Shin et al. Plant Cell. (2012) /tpc YFP N

Prediction of Plant MicroRNA Targets

MODULE 3: TRANSCRIPTION PART II

Supplementary Figure 1 Transcription assay of nine ABA-responsive PP2C. Transcription assay of nine ABA-responsive PP2C genes. Total RNA was isolated

Analysis of small RNAs from Drosophila Schneider cells using the Small RNA assay on the Agilent 2100 bioanalyzer. Application Note

Breast cancer. Risk factors you cannot change include: Treatment Plan Selection. Inferring Transcriptional Module from Breast Cancer Profile Data

Genetics. Instructor: Dr. Jihad Abdallah Transcription of DNA

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

Studying Alternative Splicing

Genetics and Genomics in Medicine Chapter 6 Questions

The Biology and Genetics of Cells and Organisms The Biology of Cancer

Patterns of hemagglutinin evolution and the epidemiology of influenza

Prokaryotes and eukaryotes alter gene expression in response to their changing environment

Processing of RNA II Biochemistry 302. February 18, 2004 Bob Kelm

TRANSLATION: 3 Stages to translation, can you guess what they are?

Supplementary Document

Figure mouse globin mrna PRECURSOR RNA hybridized to cloned gene (genomic). mouse globin MATURE mrna hybridized to cloned gene (genomic).

BIOLOGY 12 NOVEMBER 2000 STUDENT INSTRUCTIONS

Identification of target genes of micrornas in retinoic acid-induced neuronal differentiation*

Beta Thalassemia Sami Khuri Department of Computer Science San José State University Spring 2015

Genome-Wide Profiling and Analysis of Arabidopsis sirnas

Phylogenetic analysis of human and chicken importins. Only five of six importins were studied because

REGULATED AND NONCANONICAL SPLICING

Figure S1. Analysis of genomic and cdna sequences of the targeted regions in WT-KI and

Protein Synthesis and Mutation Review

High-resolution identification and abundance profiling of cassava (Manihot esculenta Crantz) micrornas

AN INTRODUCTION TO GENETICS. The First Ankle in a Series about the Significance of Genetic Science for Catholic Health Care

Improved annotation of C. elegans micrornas by deep sequencing reveals structures associated with processing by Drosha and Dicer

Biology 12. Examination Booklet 2008/09 Released Exam June 2009 Form A DO NOT OPEN ANY EXAMINATION MATERIALS UNTIL INSTRUCTED TO DO SO.

Novel RNAs along the Pathway of Gene Expression. (or, The Expanding Universe of Small RNAs)

Chapter 11 How Genes Are Controlled

MicroRNA and Male Infertility: A Potential for Diagnosis

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Where Splicing Joins Chromatin And Transcription. 9/11/2012 Dario Balestra

Polyomaviridae. Spring

DNA codes for RNA, which guides protein synthesis.

Translation Activity Guide

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

microrna Presented for: Presented by: Date:

MicroRNAs and Their Regulatory Roles in Plants

PROTEIN SYNTHESIS. It is known today that GENES direct the production of the proteins that determine the phonotypical characteristics of organisms.

RNA-seq Introduction

Supplementary Figure 1 MicroRNA expression in human synovial fibroblasts from different locations. MicroRNA, which were identified by RNAseq as most

MicroRNA-mediated incoherent feedforward motifs are robust

Transcription:

The Plant Cell, Vol. 16, 2001 2019, August 2004, www.plantcell.org ª 2004 American Society of Plant Biologists Novel and Stress-Regulated MicroRNAs and Other Small RNAs from Arabidopsis W Ramanjulu Sunkar and Jian-Kang Zhu 1 Institute for Integrative Genome Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 MicroRNAs (mirnas) and short interfering RNAs (sirnas) are small noncoding RNAs that have recently emerged as important regulators of mrna degradation, translational repression, and chromatin modification. In Arabidopsis thaliana, 43 mirnas comprising 15 families have been reported thus far. In an attempt to identify novel and abiotic stress regulated mirnas and sirnas, we constructed a library of small RNAs from Arabidopsis seedlings exposed to dehydration, salinity, or cold stress or to the plant stress hormone abscisic acid. Sequencing of the library and subsequent analysis revealed 26 new mirnas from 34 loci, forming 15 new families. Two of the new mirnas from three loci are members of previously reported mir171 and mir319 families. Some of the mirnas are preferentially expressed in specific tissues, and several are either upregulated or downregulated by abiotic stresses. Ten of the mirnas are highly conserved in other plant species. Fifty-one potential targets with diverse function were predicted for the newly identified mirnas based on sequence complementarity. In addition to mirnas, we identified 102 other novel endogenous small RNAs in Arabidopsis. These findings suggest that a large number of mirnas and other small regulatory RNAs are encoded by the Arabidopsis genome and that some of them may play important roles in plant responses to environmental stresses as well as in development and genome maintenance. INTRODUCTION Whereas the roles of proteins as regulatory factors in development and adaptation to the environment are well known, the involvement of regulatory small RNA molecules is just emerging (Carrington and Ambros, 2003). MicroRNAs (mirnas) and short interfering RNAs (sirnas) are two classes of small noncoding RNAs with important roles in the regulation of gene expression in plants and animals. mirnas are an abundant class of tiny RNAs that are often found to be conserved evolutionarily across species boundaries and are capable of regulating the expression of protein-coding genes in multicellular eukaryotes (Bartel, 2004). mirnas are ;20 to 24 nucleotides in length, singlestranded noncoding RNAs derived predominantly from intergenic regions. mirna genes are transcribed as long pre-mirna transcripts ranging from 70 to 300 nucleotides that form foldback structures in which mature mirnas reside in either the 59 or 39 arm and are processed by DICER (Ambros et al., 2003a; Bartel and Bartel, 2003; Bartel, 2004). mirnas are known to silence genes posttranscriptionally by targeting cognate mrnas for degradation (Llave et al., 2002a; Palatnik et al., 2003; Tang et al., 2003) or translation repression (Lee et al., 1993; Reinhart et al., 2000; Aukerman and Sakai, 2003; Chen, 2004). 1 To whom correspondence should be addressed. E-mail jian-kang. zhu@ucr.edu; fax 909-827-7115. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the Instructions for Authors (www.plantcell.org) is: Jian-Kang Zhu (jian-kang.zhu@ucr.edu). W Online version contains Web-only data. Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.104.022830. The mirna profile of Caenorhabditis elegans is probably the most complete, with the number of validated mirna genes being ;95 (Ambros et al., 2003b; Lim et al., 2003). The total number of mirnas in each organism is unknown but is estimated to represent ;1% of the number of coding genes (Grad et al., 2003; Lai et al., 2003; Lim et al., 2003; Bartel, 2004). This prediction is based on the fact that mirnas are derived from evolutionarily conserved hairpin precursor RNAs. The total number of mirnas in a plant is unknown but could also be ;1% of the total coding genes (;250 to 300 mirnas in Arabidopsis thaliana). Three groups have independently reported mirnas from Arabidopsis (Llave et al., 2002b; Park et al., 2002; Reinhart et al., 2002) and yet the number of mirnas identified appears to be far from saturation. So far, only 43 mirnas falling into 15 families have been reported in Arabidopsis (Bartel and Bartel, 2003; Bartel, 2004). Several of these mirnas were demonstrated to play critical roles in leaf or flower development by targeting relevant developmental regulators for mrna cleavage or translational repression (Aukerman and Sakai, 2003; Palatnik et al., 2003; Chen, 2004; Juarez et al., 2004; Kidner and Martienssen, 2004). Consistent with the important roles of mirnas in the development of plants, the mutants dcl1, hen1-1, and hyl1 that are affected in mirna accumulation all have severe developmental defects (Lu and Federoff, 2000; Chen et al., 2002; Golden et al., 2002; Schauer et al., 2002; Han et al., 2004; Vazquez et al., 2004). Phenotypic defects in these mutants could be attributed, at least in part, to blocking or lowered levels of mirna production. sirnas are double-stranded 21- to 26-nucleotide small RNAs derived from the processing of long double-stranded RNAs (Waterhouse et al., 2001; Plasterk, 2002). sirnas were originally discovered in plants undergoing transgene silencing or infection with viruses (Hamilton and Baulcombe, 1999; Zamore et al.,

2002 The Plant Cell 2000). Endogenous small RNAs with characteristic features of sirnas have been cloned from Arabidopsis (Llave et al., 2002b; Reinhart et al., 2002). Double-stranded RNA may be derived from the transcription of inverted-repeat loci because of converging promoters or may be the product of host- or viral-encoded RNAdependent RNA polymerases (Dalmay et al., 2000; Mourrain et al., 2000; Aravin et al., 2001; Grishok et al., 2001; Sijen et al., 2001). sirnas assemble in complexes termed RNA-induced silencing complexes and target homologous RNA sequences for endonucleolytic cleavage, which is referred to as RNA interference in animals, posttranscriptional gene silencing in plants, and quelling in fungi (Hamilton and Baulcombe, 1999; Hammond et al., 2000; Zamore et al., 2000). The biological roles of sirnamediated RNA silencing include protection of the genome against mobile DNA elements (Ketting et al., 1999; Tabara et al., 1999; Wu-Scharf et al., 2000) and resistance against viruses (Voinnet, 2001; Waterhouse et al., 2001). In addition to PTGS, sirnas may be involved in transcriptional gene silencing by directing DNA methylation and chromatin modification (Mette et al., 2000; Aufsatz et al., 2002; Gong et al., 2002; Hamilton et al., 2002; Volpe et al., 2002; Zilberman et al., 2003). A distinguishing feature of plants is that they are sessile and thus have to cope with, rather than move to avoid, adverse environments. Plants have evolved sophisticated mechanisms to adapt to environmental stresses (Zhu, 2002). Abiotic stresses, such as drought, salinity, and cold, regulate the expression of thousands of genes in plants at both transcriptional and posttranscriptional levels. We are interested in exploring potential roles of regulatory small RNAs in abiotic stress tolerance and gene regulation. This study was performed with two objectives. One is to identify new mirnas and sirnas in Arabidopsis, given that only a small number of these small RNAs have been reported, and thus there is likely more to be found. The other objective is to discover mirnas that might be regulated by abiotic stresses. We constructed a library of small RNAs from seedlings treated with cold, dehydration, salinity, and the plant stress hormone abscisic acid (ABA). Through library sequencing and analysis, we found 15 new families of mirnas and many novel putative sirnas. Some of the mirnas are preferentially expressed in certain tissues, and several are either upregulated or downregulated by abiotic stresses. Our results have important implications for gene regulation under abiotic stresses and also contribute significantly to the long-term goal of having a complete profile of mirnas and sirnas in Arabidopsis. RESULTS Identification of 15 New Families of mirnas from Stress-Treated Seedlings To identify mirnas that might be regulated by abiotic stress, the expression of known Arabidopsis mirnas under dehydration, salt, cold, and ABA treatments was tested by RNA gel blot analysis. However, we did not find substantial stress regulation on any of the known mirnas (see Supplemental Figure C online; data not shown). Therefore, to identify novel mirnas and mirnas that might be regulated by abiotic stresses, we constructed a library of small RNA species from pooled seedlings exposed to cold, dehydration, salinity, and ABA. Small RNAs (15 to 26 bp) were recovered by size fractionation on denaturing polyacrylamide gels, eluted from the gel, ligated to 39 and 59 adapters, amplified, cloned, and sequenced. A BLAST search against the Arabidopsis database revealed that a major portion (90%) of the ;2500 cloned sequences appears to be breakdown products from rrnas, trnas, and small nuclear RNAs. The proportion (9%) of putative regulatory small RNAs is slightly lower than published results, presumably because the library was made from seedlings subjected to different abiotic stresses that may cause enhanced breakdown of noncoding RNAs (rrnas, trnas, and nucleolar RNAs). Several clones matched to chloroplast or mitochondrial sequences and may represent degradation or possibly regulatory products of organellar RNAs. A few clones do not have perfect matches to the Arabidopsis genome sequence and appear to be derived from RNAs of fungal/bacterial/viral origin. Thirteen previously published families of mirnas from Arabidopsis are represented in our sequences. These include mir156, mir157, mir158, mir159, mir161, mir163, mir166, mir168, mir169, mir171, mir172, mir173, and mir319/mirjaw (Llave et al., 2002b; Park et al., 2002; Reinhart et al., 2002; Palatnik et al., 2003). The frequency of any one mirna appearing in our sequenced library varied from 1 to 30 times (Table 1; see Supplemental Table 1 online). Importantly, we identified many new mirnas and putative sirnas. The ability of RNA corresponding to the genomic sequences surrounding the clones, to fold into hairpin mirna precursors, was applied to distinguish mirnas from other potential small regulatory RNAs. This criterion, together with phylogenetic conservation of mature mirnas, is now generally accepted as evidence for the existence of a mirna (Ambros et al., 2003a). We analyzed mirna precursors of up to 300 nucleotides for hairpin structures using the mfold program (see Supplemental Figure A online) (Zuker, 2003). The analysis revealed that we have cloned 21 new mirnas from Arabidopsis that correspond to 29 loci (Table 1). Furthermore, on analyzing the Arabidopsis genome, we were able to predict three homologs of the cloned mirnas mir399b and mir399f based on sequence similarity, the capacity of their precursor sequences to adopt hairpin structures, and their conservation in rice (Oryza sativa) (Figure 1F). One homolog for each of the cloned mirnas, mir389a.1 and mir407a, is also predicted in Arabidopsis (Figures 1C and 1G). Taken together, the total number of new mirnas identified in this study is 26, which come from 34 loci. Twenty-four of the newly identified mirnas define 15 new families (Table 1, Figure 1). Two of the new mirnas, mir171b and mir319c, belong to the previously reported mir171 and mir319/mirjaw families, respectively. Genomic Organization of the mirna Genes Twenty of the mirnas identified in this study, including the five predicted mirnas, correspond to single copy loci in the Arabidopsis genome, whereas the other six mirnas (mir399b and c, mir393a and b, mir398b and c, mir405a, b, c, and d, mir171b and c, mir389a.1, and mir389b.1) correspond to multiple loci (two to four) that are scattered in the genome (Table 1). Out of

Novel mirnas from Arabidopsis 2003 Table 1. Novel mirnas from Arabidopsis mirna Gene mirna Sequence (59/39) Length Fold- Back Arm Chr. Conserved in Other Plants Distance to Nearest Gene Expression Rice Lotus Medicago Populus MIR171b UUGAGCCGUGCCAAU AUCACG (3) 21 Yes 39 1 2.80 kb upstream At1g62040 MIR171c 21 Yes 39 1 1.27 kb downstream At1g11730 MIR319c UUUGGACUGAAGGGA 20 Yes 39 2 2.90 kb downstream GCUCC (5) At2g40800 MIR389a.1 UUCUAAGUCCAACAU 21 Yes 39 1 0.51 kb downstream AGCGUA (5) At1g50050 MIR389b.1 21 Yes 39 2 0.66 kb downstream At2g27395 MIR389b.3 UUUUAAGUCUAACAU 20 Yes 39 2 0.70 kb downstream AGCGU (5) At2g27395 MIR390a.2 CUAUCCAUCCUGAGUU 21 Yes 39 2 2.50 kb upstream UCAUU (1) At2g38330 MIR393a UCCAAAGGGAUCGCAUU 22 Yes 59 2 2.63 kb downstream GAUCC (30) At2g39880 MIR393b 22 Yes 59 3 2.02 kb upstream At3g55735 MIR397a UCAUUGAGUGCAGCGU 22 Yes 59 4 0.50 kb upstream UGAUGU (3) At4g05110 MIR397b UCAUUGAGUGCAUCGU 22 Yes 59 4 0.81 kb upstream UGAUGU (4) At4g13560 MIR398a UGUGUUCUCAGGUCA 21 Yes 39 2 0.35 kb upstream CCCCUU (3) At2g03450 MIR398b UGUGUUCUCAGGUCACC 21 Yes 39 5 0.10 kb upstream CCUG (7) At5g14550 MIR398c 21 Yes 39 5 0.40 kb upstream At5g14570 MIR399b UGCCAAAGGAGAGUUG 21 Yes 39 1 1.30 kb downstream CCCUG (2) At1g63000 MIR399c 21 Yes 39 5 1.30 kb downstream At5g62160 MIR399f UGCCAAAGGAGAUUUG 21 Yes 39 2 0.85 kb upstream CCCGG (6) At2g34210 MIR400 UAUGAGAGUAUUAUA 21 Yes 59 1 0.31 kb downstream AGUCAC (4) At1g32580 MIR401 CGAAACUGGUGUCGAC 21 Yes 59 4 4.10 kb downstream CGACA (1) At4g08102 MIR402 UUCGAGGCCUAUUAAAC 22 Yes 59 1 First intron of CUCUG (3) At1g77230 MIR403 UUAGAUUCACGCACAAA 21 Yes 39 2 1.70 kb upstream CUCG (2) At2g47280 MIR404 AUUAACGCUGGCGGUUG 24 Yes 59 1 1.73 kb upstream CGGCAGC (1) At1g31360 MIR405a AUGAGUUGGGUCUAACC 24 Yes 39 2 1.44 kb upstream CAUAACU (2) At2g22670 MIR405b 24 Yes 39 3 0.50 kb upstream At4g05510 MIR405c 24 Yes 39 5 0.66 kb upstream At5g50720 MIR405d 24 Yes 39 5 0.65 kb upstream At5g50720 MIR406 UAGAAUGCUAUUGUAAU 21 Yes 39 1 4.24 kb downstream CCAG (2) At1g52180 MIR407a UUUAAAUCAUAUACUUU 21 Yes 59 2 0.49 kb upstream UGGU (10) At2g32700 d þ þ d þ d d nd d þ þ nt þ þ d nt þ þ þ d þ þ þ nt þ þ nt d þ þ d nd d d þ nd nd nd d (Continued)

2004 The Plant Cell Table 1. (continued). mirna Gene mirna Sequence (59/39) Length Fold- Back Arm Chr. Conserved in Other Plants Distance to Nearest Gene Expression Rice Lotus Medicago Populus MIR408 AUGCACUGCCUCUUCC CUGGC (4) 21 Yes 39 2 Antisense polarity to At2g47020 d þ þ þ The sequences given represent the longest mirna sequence identified by cloning. Frequency of cloning is indicated in parentheses after the mirna sequences. Chromosomal (Chr.) positions and distance to the nearest annotated gene are indicated. mirna location in the predicted hairpin structure is specified (59 or 39 arm). Expression of some of the mirnas were confirmed by RNA blot analysis; d, nd, and nt denote detected, tested but not detected, and not tested because of a potential cross-hybridization problem, respectively. mirnas that are conserved in other plants are indicated. the 34 loci, 15, 9, and 5 come from chromosome 2, 1, and 5, respectively, and the remaining five loci are represented in chromosomes 3 and 4. Based on sequence homology and predicted fold-back structures, we were able to identify some mirna homologs in rice, Lotus, Medicago, and Populus. Thirty-two mirna loci matched with intergenic regions, whereas two loci correspond to either an intron or 39 untranslated region (UTR) of a gene. mir408 is located in the antisense polarity to the 39 UTR of At2g47020 (Table 1). mir402 originates from the intron of gene At1g77230, in the same orientation as the pre-mrna. Small RNAs corresponding to the introns or exons of pre-mrnas in the same polarity may be mirnas or may simply be general degradation products. The mirnas can be ascertained by the capacity of their precursors to form hairpin structures, together with expression studies. The mir402 precursor sequence has the ability to adopt a long and typical foldback structure (see Supplemental Figure A online). In addition, this mirna can be detected as a discrete band of 22 bp on RNA gel blots (Figure 5E). These results show that it is an mirna and not a nonspecific degradation product from the intron. mir399 and its homologs correspond to six genomic loci (a, b, c, d, e, and f; b and c are identical in sequence but come from two different loci [Table 1]) in Arabidopsis with identical sequences or only a slight change in sequence. A sequence identical to mir399f is also found in rice, but the rice precursor sequence does not appear to form a fold-back structure; therefore, it cannot be ascertained as mirna. However, rice has nine other loci with sequences that differ from the Arabidopsis mir399f in only one or two nucleotides (Figure 1F). These six loci from Arabidopsis and nine loci from rice all have the ability to form fold-back structures. Arabidopsis mir399 loci, mir399f, mir399d, and mir399e are located within a 2.0-kb region (Figure 2), and mir399f and mir399d are oriented in opposite directions. This organization of the two mirnas within 2.0 kb and in opposite orientations is also conserved in rice (Figure 2). The only difference is that an additional locus mir399e exists in Arabidopsis in the same orientation as mir399f, but this is absent in rice. The conserved arrangement of these mirnas suggests that some important aspects of their regulation may also be evolutionarily conserved. Although mir389 has three loci in Arabidopsis, only two loci (on chromosomes 1 and 2) are able to adopt fold-back structures (Figures 3A and 3B). The predicted fold-back structure of mir389b.1 from chromosome 2 may give rise to two mirnas that belong to the same family, one each from the 59 and 39 arms (Figure 3B). However, only one of the two (from the 39 arm, mir389b.1) is represented in the library, and the other (on the 59 arm, mir389b.2) is predicted based on sequence identity. Because mir389b.1 and its predicted homolog on the 59 arm are not located directly opposite to each other (Figure 3B), this represents a novel type of mirna gene organization. Conserved mirna Gene Families Based on sequence similarity, the mirnas were grouped into families. Five of the mirna families (mir399, mir393, mir398, mir171b, and mir408) are highly conserved in rice (Table 1, Figure 1). Some are also conserved in Lotus and/or Medicago or Populus (Table 1, Figure 1). The mir399 family is represented by six members in Arabidopsis and nine members in rice (Figure 1F) and is one of the largest mirna families in plants. mir171b is represented by one and seven members in Arabidopsis and rice, respectively, with slight changes in nucleotide sequence (Figure 1A). The mir398 family has two members each in Arabidopsis and rice (Figure 1E). mir393 is represented by one member in Arabidopsis and two members in rice (Table 1, Figure1D). Two members of mir389a.1 and mir407 with slight change in sequence exist in Arabidopsis (Figures 1C and 1G). In some cases, the number of family members is greater in rice than in Arabidopsis. For example, mir156 (Reinhart et al., 2002) and mir171b and mir399 (this study) are represented by more family members in rice. One mirna (mir319c) was represented by five clones in our library, and the sequence is highly similar to the previously cloned mir319/mirjaw and mir159 (Reinhart et al., 2002; Palatnik et al., 2003). mirjaw has two genomic loci, one each on chromosomes 3 and 5 (Palatnik et al., 2003). However, mir319c is derived from a distinct locus on chromosome 2 with one nucleotide change. mir319c and mirjaw have the same size (20 nucleotides), and the only difference is that mir319c has an additional U at its 59 end. The presence of an additional U in mir319c makes it more similar to mir159 because both have three uridines at the 59 end. This may explain why we detected two equally strong signals at 20- and 21-nucleotide size positions using mir319c as a probe (Figure 5G). The mir319c probe likely cross-hybridizes with mir159, thus the 20- and 21-bp

Figure 1. mirna Families in Arabidopsis and Other Plants (Rice, Lotus, and Medicago). mirnas cloned in this study are labeled in pink. The letter P following mirna loci (mir399, mir389b.2, and mir407b) denotes predicted mirnas based on similarity with the cloned sequences and the ability of their precursor sequences to form hairpin structures. mir399 family is represented by six loci in the Arabidopsis (A.t.) genome. Only five loci (a, b, c, d, and e) are shown in Figure 1A because the locus d is represented twice with identical sequence (Table 1). Identical and conserved nucleotides are labeled in red. Blue letters represent changes in nucleotide sequences in some family members. The location of the predicted Arabidopsis mirnas in the chromosomes is shown to the right of the sequence. Sequences were aligned with the ClustalW program (Thompson et al., 1994). All mirnas from rice (O.s.), Lotus, and Medicago are from computer prediction only.

2006 The Plant Cell Figure 2. Conserved Genomic Organization of the mir399 mirna Cluster in Arabidopsis and Rice. mir399 loci d, e, and f are located within a 2.0-kb region on chromosome 2 in Arabidopsis, and part of this cluster is conserved in rice (chromosome 5). The hairpin precursor is indicated as a box, and the position of the mirna in the precursor is shown as thick vertical lines. Arrow on top of the box indicates the orientation of mirnas. signals appear to correspond to mir319c and mir159, respectively (Figure 5G). This is in contrast with the mir319a/mirjaw probe, which detects a stronger signal at its expected size of 20 bp and a weaker signal at 21 bp corresponding to mir159 (Palatnik et al., 2003). The mir319c sequence is conserved in Lotus (Table 1, Figure 1B). Predicted Targets of the mirnas To predict potential targets of the mirnas identified in this study, we searched various databases for Arabidopsis mrnas with sequence complementarity to the mirnas, allowing up to three mismatches without any gap (Rhoades et al., 2002) and treating G:U pair as a match. In some cases, there is no predictable target with three or fewer mismatches, and the best match is regarded as a putative target (Table 2). Complementarity to the 59 end of the mirna has been given due importance while predicting a target (Bartel, 2004). This complementarity requirement includes a 7-nucleotide perfect or near perfect match near the 59 terminus of the mirna (Lai et al., 2003; Lewis et al., 2003; Stark et al., 2003; Doench and Sharp, 2004). The 39 region of the mirna is less critical (Bartel, 2004) but can modulate its activity under certain circumstances (Doench and Sharp, 2004). Our analysis predicted 51 potential targets for 21 of the newly identified mirnas (Table 2). The plant mirna target sites are predominantly found within the protein-coding segment of the mrnas (Rhoades et al., 2002), whereas animal mirnas appear to primarily target the 39 UTRs (Enright et al., 2003; Lewis et al., 2003; Stark et al., 2003). Consistent with the earlier findings in Arabidopsis (Rhoades et al., 2002), 47 of our predicted target sites fall in the open reading frames (ORFs). Five of the predicted target sites lie in the 39 UTR regions (At1g07650 [mir404], At1g12010 and At5g49870 [mir407a], At1g54870 [mir407b], and At2g47020 [mir408]). Interestingly, two mirnas (mir399 and mir398) exhibit complementarity to the 59 UTR regions of their respective targets. mir399 appears to target four sites in the 59 UTR of At2g33770 (a putative ubiquitin conjugating enzyme E2) (Figure 4), whereas the mir398 family is likely to target the 59 UTR of At3g15640 (a member of cytochrome c oxidase). We found that if more than one mrna is predicted as the target of a mirna, these targets tend to belong to the same gene family. For instance, mir400 is predicted to target more than 10 genes of the pentatricopeptide repeat (PPR) protein family; mir393 appears to target five genes belonging to the TIR1 (an F-box protein important for signaling by the phytohormone auxin) family; mir406 has the potential to target two members of Figure 3. Predicted Fold-Back Structures of mir389a.1 Precursors with Flanking Sequences on Chromosome 1 and Chromosome 2. (A) Chromosome 1. (B) Chromosome 2. The fold-back structure shown in (B) is expected to yield two mirnas, one each on 39 and 59 arms. The cloned mirna sequence is labeled in red, and the predicted one is labeled in blue.

Novel mirnas from Arabidopsis 2007 Figure 4. Schematic Representation of the At2g33770 mrna Showing the 59 UTR, ORF, and 39 UTR Regions. Predicted mirna targeting sites in the 59 UTR region are shown (black rectangles). The enlarged portion shows the pairing between mir399f and the target sites. spliceosome-related genes; mir397 appears to target two members of the laccase family; and mir389a.1 is capable of targeting four unknown proteins in the same family (Table 2). mir319c, a homolog of mir319/mirjaw, is likely to target the Teosinite branched1, cycloidea, and proliferating cell nuclear antigen factors family of transcription factors (Palatnik et al., 2003). mir399 family members are predicted to target the same gene, At2g33770 (ubiquitin conjugating enzyme E2). mirnas that are represented by more than one member may recognize a consensus target sequence and, hence, might act on a common target or they might have tissue/stage-specific roles. In addition, different members of an mirna family with slight changes in nucleotide sequence might also target different genes. For example, mir399b mirna displays complementarity with a gene encoding a vesicle-associated membrane protein (At4g00170) in addition to the common target (At2g33770) of this family of mirnas. mir397a is likely to target members of the laccase family, whereas another family member, mir397b, displays better complementarity with another gene (At3g60250, a casein kinase). Therefore, a slight change in mirna sequence may alter its target preferences, distinguishing its role from that of other members of the same family. The previously reported Arabidopsis mir161 is predicted to target PPR genes At1g06580(3), At1g62720(3), and At1g62670(3) (numbers in parenthesis are mismatches) (Rhoades et al., 2002). We have discovered an mirna, mir400, that is different from mir161 and yet it displays perfect or near perfect complementarity with some of the predicted targets of mir161 [At1g06580(0), At1g62720(1), and At1g62670(1)]. This suggests that two completely different mirnas may target the same gene. The attack on one gene by more than one mirna could be a matter of cooperative regulation. Alternatively, it may indicate a redundant control of target gene expression by multiple mirna genes or the deployment of different mirnas for the regulation of a target in a developmental- or tissue-specific manner. Indeed, mir161 and mir400 have different spatial expression patterns. mir161 is strongly expressed in the stem and inflorescence (Reinhart et al., 2002), whereas mir400 is almost below the detection limit in stem and inflorescence but is moderately expressed in leaves and roots (Figure 5H). The PPR motif has been proposed to mediate macromolecular interactions. More than 400 genes encoding PPR proteins have been reported in Arabidopsis, and the majority are predicted to reside in either the chloroplast or mitochondrion (Small and Peeters, 2000). Because the PPR gene family is very large and diverse, it is conceivable that diverse mirnas might be required for their regulation. Interestingly, PPR proteins have not been reported from animals. Some of the mirna complementary sites are highly conserved in different members of the same gene family in Arabidopsis or in mrna targets from different plant species (Table 3). For instance, mir393 complementary sites are highly conserved in F-box proteins in Arabidopsis, rice, and maize (Zea mays) (Table 3). mir397 is predicted to target laccases, and the target sites are conserved in Arabidopsis, rice, tobacco (Nicotiana tabacum), and Populus (Table 3). mir408 may target mrnas encoding plantacyanins, and the complementary sites are conserved in rice, maize, chickpea (Cicer arietinum), and spinach (Spinacia oleracea) (Table 3). The complementary sites of mir389a.1 are conserved in mrnas that encode proteins belonging to the same family (Table 3). The complementary sites are also conserved in spliceosomal proteins, which are predicted to be targeted by mir406. The conservation in the target sequence is observed not only at the amino acid level but also at the nucleotide level, including sequences at codon position 3 (Table 3). Furthermore, the sequence conservation in the target genes is markedly higher in the section corresponding to the 59 end of the mirnas (Table 3). Tissue and Developmental Expression Patterns of mirnas Preferential expression of an mirna in specific tissues or developmental stages may suggest a role in development for the mirna. We investigated the tissue distribution of the newly identified mirnas by RNA gel blot analysis on total RNA samples from various tissues of mature plants as well as young seedlings. We were able to detect a signal of the expected size for 13 mirnas identified through cloning (Figure 5). The validation of other members of mir399, mir398, mir397, and predicted members of mir389 and mir407 were not attempted because of potential cross-hybridization. Some of the small RNAs were not detected on RNA gel blots but are still considered here as mirnas because they were cloned and their precursors can adopt a hairpin structure. The mirnas that could not be detected on RNA gel blots appeared only once in our library. A correlation was reported previously between the number of times a mirna

2008 The Plant Cell Table 2. Predicted Targets of Newly Identified mirnas mirna Target Genes with Number of Mismatches in Parentheses Target Protein(s) MIR171b and c At2g45160(2), At4g00150(2), Scarecrow-like transcription factors and At3g60630(2) MIR389a.1 At5g18040(1), At5g18065(1), At4g29760(2), Unknown proteins At1g51670(2), and At4g29770(3) MIR389b.3 At4g28670(2) Protein kinase MIR390a.2 At3g49250(3), At5g14050(3), and At2g39480(3) Unknown protein; WD-40 repeat protein; ABC transporter MIR393 At3g26810(2), At1g12820(2), At3g62980(2), TIR1 (an F-box protein) family At3g26830(2), and At4g03190(3) MIR397a At2g29130(2) and At2g38080(2) Laccases MIR397b At3g60250(2), At2g29130(3), Casein kinase II; laccases and At2g38080(3) MIR398a At3g15640(3) and At1g08050(5) Cytochrome c oxidase; zinc finger (C3HC4-type ring finger) protein MIR398b At3g15640(4) and At1g08050(4) Cytochrome c oxidase; Zinc finger (C3HC4-type ring finger) protein MIR399b and c At2g33770 and At4g00170(3) Ubiquitin conjugating enzyme (UCE); vesicle-associated membrane protein MIR399f At2g33770 (zero to two mismatches, Putative UCE2 up to four target sites) MIR400 At1g06580(0), At1g62720(1), At1g62670(1), At1g62590(2), PPR proteins At1g22960(2), At1g20300(2), At2g31400(2), At5g39710(2), At2g31400(2), At3g16010(3), and At4g19440(3) MIR401 At2g06095(1), At3g42350(2), Unknown proteins and At2g13270(3) MIR402 At4g34060(1) ROS1-like, a putative DNA glycosylase MIR403 At3g07780(5) Unknown protein MIR404 At1g07650(1) LRR-TM protein kinase MIR405 At5g48280(3) Unknown protein MIR406 At2g42510(3) and At1g54380(3) Spliceosomal proteins MIR407a At1g17770(3), At3g13100(3), At5g49870(3), and At1g12010(3) SET domain protein; ABC transporter; lectin; ACC oxidase MIR407b At1g54870(3) Short-chain dehydrogenase/reductase MIR408 At2g47020(0) and At2g02850(2) Peptide chain release factor; plantacyanin For each potential target gene, the number of mismatches between the mirna and the mrna is indicated in parentheses. appeared in the library and its expression level. mirnas identified in just one or a few clones have been difficult to detect on RNA gel blots, whereas those isolated many times are more easily detected (Grad et al., 2003; Lim et al., 2003; Kim et al., 2004). Some mirnas may be expressed at low levels that are below the experimental detection threshold by RNA gel blot analysis, perhaps because of restricted tissue/cell-type expression (Grad et al., 2003; Lim et al., 2003; Kim et al., 2004). Most of the previously known mirnas in Arabidopsis are strongly expressed in floral tissues (Llave et al., 2002b; Park et al., 2002; Reinhart et al., 2002). This pattern is mirrored by mir319c, which is expressed at moderate levels in leaves, roots, and young seedlings but is very strongly expressed in stems and inflorescence (Figure 5G). Similarly, mir393 is ubiquitously detected in all samples but is strongest in the inflorescence, indicating that this mirna might be important in silencing genes in reproductive tissues (Figure 5J). Also, mir402 and mir171b appear to have relatively lower levels of expression in leaves and young seedlings than in other tissues (Figures 5E and 5F). However, other mirnas in this study have quite divergent expression patterns. The most striking is that of mir398b, which could not be detected in the inflorescence (Figure 5A) and is preferentially expressed in leaves and young seedlings that consist mostly of young leaves. Similarly, mir399f, mir389b.3, and mir389a.1 are expressed more strongly in leaves and young seedlings, although they are also expressed in stems, roots, and inflorescence (Figures 5B to 5D). mir400 and mir397b show low but detectable expression in leaves, roots, and young seedlings but are undetectable in stems and inflorescence tissues (Figures 5H and 5I). mir403 and mir407 appear to be uniformly expressed in all tissues (Figures 5K and 5L). mir408 is strongly expressed in seedlings but weakly in all the tissues from adult plants (Figure 5M). We also examined the expression of the mirnas in dcl1-9 and hen1-1 mutant seedlings and compared the mutants to their

Novel mirnas from Arabidopsis 2009 Figure 5. Expression Patterns of mirnas Cloned from Arabidopsis. RNA gel blots of total RNA isolated from different tissues and 2-week-old seedlings were probed with labeled oligonucleotides. The blots also included RNA from dcl1-9 and hen1-1 mutants and their wild type, Landsberg erecta. The samples are leaf (L), stem (S), root (R), inflorescence (I), 2-week-oldseedlings (Se), Landsberg erecta (Ler), dcl1-9 (dcl), and hen1-1 (hen). The trna and 5S rrna bands were visualized by ethidium bromide staining of polyacrylamide gels and serve as loading controls. wild-type background (Landsberg erecta ecotype). Consistent with the important roles of DCL1 (Reinhart et al., 2002) and HEN1 (Park et al., 2002) in mirna accumulation, most of the mirnas show substantially reduced expression in hen1-1, and several display reduced expression in dcl1-9 (Figure 5). The mirnas that could be detected in the hen1 mutant are slightly longer than in the wild-type plants (Figure 5). We were unable to detect precursors for the mirnas, even in the dcl1 and hen1 mutants. The inability to detect precursors of Arabidopsis mirnas has been noted before (Park et al., 2002; Reinhart et al., 2002). That the dcl1 mutation does not impair the expression of many of the mirnas may be because the Arabidopsis genome has several Dicer-like genes that might be functionally redundant for some of the mirnas. Abiotic Stress Regulation of mirnas To test whether the expression of any of the mirnas is regulated by abiotic stresses, RNA gel blot analysis was performed on 2-week-old seedlings grown on MS-agar plates without stress treatment or treated with cold (08C for 24 h), 100-mM ABA for 3 h, or dehydration (seedlings in agar plates were exposed to dry air), or seedlings were transferred to a blotting paper saturated with

2010 The Plant Cell Table 3. mirna Complementary Sites in Potential mrna Targets Conserved in Arabidopsis and Other Plant Species Target Gene mirna/mrna Peptide Sequence CCU AGU UAC GCU AGG GAA ACC U 59 mir393 At1g12820(2) GaA aca AUG CGA UCC CUU UGG A ETMRSLWM At3g26830(2) GaA aca AUG CGA UCC CUU UGG A ETMRSLWM At3g26810(2) GaA aca AUG CGA UCC CUU UGG A ETMRSLWM At3g62980(2) GaG aca AUG CGA UCC CUU UGG A ETMRSLWM At4g03190(3) GaG acc AUG CGA UCC CUU UGG A ETMRSLWM ZmAY105491(2) GaG aca AUG CGA UCC CUU UGG A ETMRSLWM ZmAY109095(2) GaG aca AUG CGA UCC CUU UGG A ETMRSLWM OsAK121600(2) GaG aca AUG CGA UCC CUU UGG A ETMRSLWM OsAK072338(2) GaG aca AUG CGA UCC CUU UGG A ETMRSLWM OsAK100862(2) GaG aca AUG CGA UCC CUU UGG A ETMRSLWM UG UAG UUG CGA CGU GAG UUA CU 59 mir397a At2g29130(2) ua AUC AAU GCU GCA CUC AAU GA LINAALND At2g38080(2) ua GUC AAC GCU GCA CUU AAU GA LVNAALNE OsNM-190623(3) uc AUC AAC GCU GCA guc AAU Gu IINAAVNV OsNM-190556(1) uc AUC AAC GCU GCG CUC AAU GA LINAALND OsNM-192960(3) uc AUC AAC GCU GCA guc AAc GA IINAAVND PtY13773(3) ua AUC AAC GCU GCA CUC AAc GA LINAALND NtU43542(1) uc AUC AAC GCU GCG CUC AAU GA VINAALNE CG GUC CCU UCU CCG UCA CGU A 59 mir408 At2g02850(2) cc aag GGA AGA GGC AGU GCA U AKGRGSAS OsAK107381(2) cc CAG GGA AGA GGC AGU GCA g AQGRGSAA ZmAY108638(2) cu CAG GGA AGA GGC AGU GCG a AQGRGSAT SoU76296(1) GU CAG GGA AGA GGC AGU GCA a GQGRGSAR CaAJ012693(2) cu UuG GGA AGA GGC AGU GCA U ALGRGSAL G CAC UAU AAC CGU GCC GAG UU 59 mir171b At2g45160(2) a GgG AUA UUG GCG CGG CUC AA QGILARLN At3g60630(2) a GgG AUA UUG GCG CGG CUC AA QGILARLN At4g00150(2) g GgG AUA UUG GCG CGG CUC AA QGILARLN OsAK100757(2) a GaG AUA UUG GCG CGG CUC AA REILARLN OsAK120317(2) a GaG AUA UUG GCG CGG CUC AA REILARLN OsAK101035(1) C GaG AUA UUG GCG CGG CUC AA REILARLN OsAK106868(2) g GaG AUA UUG GCG CGG CUC AA REILARLN OsNM-197667(1) C GaG AUA UUG GCG CGG CUC AA REILARLN AT GCG AUA CAA CCU GAA UCU U 59 mir389a.1 At5g18065(1) ca CGC UAU GUU GGA CUU AGG G SRYVGLRD At5g18040(1) ca CGC UAU GUU GGA CUU AGG G SRYVGLRD At4g29760(2) ca CaC UAU GUU GGA CUU AGA G THYVGLRD At4g29770(2) ca CGa UAU GUU GGA CUU AGA G TRYVGLRD GAC CUA AUG UUA UCG UAA GAU 59 mir406 At1g54380(3) gau GAU UAC AAU AGC AUU UUG DDYNSIL At2g42510(3) gac GAU UAC AAU AGC AUU UUG DDYNSIL For each gene, the nucleotide sequence of the mirna complementary site is broken into codons corresponding to the reading frame of the mrna. mirna 59 end is indicated. Mismatches are shown in lower case, and the number of mismatches is indicated in parentheses. The peptide sequence of the mirna complementary site is shown. At, Arabidopsis thaliana; Os, Oryza sativa; Zm, Zea mays; Nt, Nicotiana tabacum; Ca, Cicer arietinum; So, Spinach oleracea; Pt,Populus trichocarpus. 300-mM NaCl for 5 h. We found that the expression of several of the mirnas is either upregulated or downregulated by one or more of the stress treatments (Figure 6). mir393 is strongly upregulated by cold, dehydration, NaCl, and ABA treatments (Figure 6A). mir397b and mir402 are slightly upregulated by all the stress treatments, whereas mir319c appears to be upregulated by cold but not dehydration, NaCl, or ABA (Figures 6B to 6D). Interestingly, mir389a.1 appears to be downregulated by all of the stress treatments (Figure 6E). Other Novel Endogenous Small RNAs Large censuses of small RNA cdna libraries have revealed a significant class of other 21- to 22-nucleotide RNAs (Reinhart

Novel mirnas from Arabidopsis 2011 and Bartel, 2002; Ambros et al., 2003b; Aravin et al., 2003). These small RNAs differ from mirnas in that they do not arise from hairpin precursors. Nevertheless, they display a strong size bias that suggests regulated processing by a Dicer-like activity. Furthermore, unlike some mirnas, these small RNAs are generally not conserved phylogenetically. The recent discovery of a large class of tiny noncoding RNAs from C. elegans that share some features with mirnas is an example for exceptional cases like this (Ambros et al., 2003b). These tiny noncoding RNAs are not even conserved within Caenorhabditis species (Ambros et al., 2003b). Another class of small RNAs, repeat-associated silencing RNAs, has been reported from Drosophila (Aravin et al., 2003). Repeat-associated silencing RNAs were noticed to be derived potentially from genomic regions rich in repeats (Aravin et al., 2003). In this study, we have identified 102 endogenous small RNAs (putative sirnas) that cannot be designated as mirnas (Table 4). The putative sirnas represent a heterogeneous class with respect to size and origin. The size ranged between 15 and 25 nucleotides, although the majority are in the range of 20 to 24 nucleotides. The small RNAs do not exhibit nucleotide preferences at their 59 end. The majority have a single locus in the Arabidopsis genome, although some (28) correspond to multiple loci (2 to 64). Twelve of the small RNAs corresponding to multiple loci appear to be derived from repeat-rich regions or centromeric regions (Table 4). Two small RNAs (P22-H04 and P43-A12) are derived from putative retrotransposons. The vast majority is found in intergenic regions, but some match to coding regions and introns in the antisense polarity. We consider those in the sense polarity as possible degradation products. Some of the putative sirnas have perfect complementarity with transposases, LOB, AIG1, 23s rrna, F-box protein, or pre-mrna splicing factor. Some sirnas display complementarity with their putative target genes with one to three mismatches. For instance, P78- C02 is complementary to At2g27152 (AAO3, a drought stressinducible ABA biosynthesis enzyme) with two mismatches, whereas P52-B05 is complementary with a putative lipoxygenase with three mismatches. Some of these small RNAs may actually be mirnas but are not considered as mirnas at this time because the precursor sequences cannot adopt a fold-back structure using the mfold program. We were unable to predict targets for the majority of the putative sirnas identified. These small RNAs might target promoter regions of protein coding genes for transcriptional gene silencing (Park et al., 2002; Ambros et al., 2003b; Finnegan and Matzke, 2003; Bartel, 2004). The existence of diverse and vast numbers of endogenous small RNAs suggest that they may play important roles in the silencing of endogenous genes, in addition to their proposed roles in defense against transposons and viral invasion (Voinnet, 2002; Ambros et al., 2003b). DISCUSSION Small noncoding RNAs have recently emerged as important regulators of both transcriptional and posttranscriptional gene silencing. The identification of the entire set of small RNAs from an organism is of fundamental importance and would lay the foundation for understanding gene regulation involving small RNAs. The endeavor is as important as mining genes that code for proteins. The establishment of a comprehensive list of mirnas from any organism will be instrumental for not only gene regulation studies but also for genome organization (e.g., transposon activity and chromatin assembly), phylogenetic comparison, comparative studies of development, and other evolutionary analysis. By sequencing a library of small RNAs from seedlings treated with different abiotic stresses, we are able to add 15 new families of mirnas to the previously published 15 families in Arabidopsis. In addition, we identified new members of the previously reported mir171 and mir319 families (Table 1, Figure 1). Some mirnas are encoded by single copy genes, others by multiple genes. Most mirna genes come from intergenic regions, but a sizable minority of mirnas (about one-quarter of the human mirna genes) occurs in the introns of protein coding genes, preferentially in the same orientation as the mrnas (Bartel, 2004). All mirnas identified in this study are intergenic in origin with two exceptions: mir402 is derived from the first intron of At1g77230, and mir408 is in the antisense polarity to the 39 UTR of At2g47020. The majority of the mirnas described in this study are located in the 39 arm of the fold-back structure (Table 1). Most mirnas have a 59 U preference that is similar to earlier reports (Bartel and Bartel, 2003; Bartel, 2004), although a few mirnas begin with a 59 A or C. Analysis of the thermodynamic profiles of mirnas from plants revealed that the strand having low internal stability at the 59 end is retained within the RNAinduced silencing complex (Khvorova et al., 2003). Some of the mirnas reported in this study conform to the features described by Khvorova et al. (2003) and Schwarz et al. (2003), although some show considerable variability (see Supplemental Table 2 online). Ten of the mirnas that we have identified have clear orthologs or homologs in rice, Lotus, Medicago, or Populus, suggesting that they conduct important and perhaps conserved functions. On the other hand, many of the mirnas are not conserved. These nonconserved mirnas may play more species-specific roles. Computational prediction of putative targets for mirnas in plants has been facilitated by their perfect or near perfect complementarity to target genes (Rhoades et al., 2002). By contrast, bioinformatic approaches to identifying mrna targets for animal mirnas are complicated by the partial complementarity between an mirna and its target (Lewis et al., 2003). Sixtyone targets, predominantly transcription factors, were predicted previously in Arabidopsis (Rhoades et al., 2002; Bartel and Bartel, 2003). The predicted targets that are not transcription factors include PPR mrnas by mir161, DCL-1 by mir162, and Argonaute by mir168 (Rhoades et al., 2002). In this study, we were able to predict 51 genes that are likely targeted by the mirnas described here (Table 2). These targets include metabolic enzymes (laccases and cytochrome c oxidase), enzymes in the ubiquitination pathway (ubiquitin conjugating enzyme E2 and TIR1/ubiquitin ligase), transcription factors (Scarecrow-like transcription factors), signal transduction components (protein kinase, casein kinase, and leucine-rich repeat transmembrane [LRR-TM] kinase), and genes involved in RNA processing (spliceosomal proteins and PPR proteins), protein synthesis (peptide

2012 The Plant Cell Figure 6. Regulation of mirna Expression by Abiotic Stresses. Shown are RNA gel blots of total RNA isolated from 2-week-old seedlings untreated or treated with different abiotic stresses. The trna and 5S rrna bands were visualized by ethidium bromide staining of polyacrylamide gels and serve as loading controls. chain release factor), and transcriptional gene silencing [DNA glycosylase and Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain proteins]. Our predicted targets appear to be involved in more diverse biological processes and functions and not predominantly in transcriptional regulation as predicted in earlier studies (Rhoades et al., 2002). It was hypothesized that a complementary site in the coding region of an mrna may lead to the cleavage of the target, whereas target sites located in 39 UTRs might attenuate translation (Bartel, 2004). Consistent with this view, Arabidopsis mirnas that target ORFs have largely been shown to be involved in the cleavage of target mrnas (Llave et al., 2002a; Kasschau et al., 2003; Palatnik et al., 2003; Xie et al., 2003; Vazquez et al., 2004). Perhaps as an exception, Arabidopsis mir172 represses translation despite having a complementary site within the ORF of its target mrna, AP2 (Aukerman and Sakai, 2003; Chen, 2004). The complementary sites for the animal mirnas have been predicted to reside in the 39 UTRs of target genes (Lewis et al., 2003; Bartel, 2004). This bias might reflect a mechanistic preference for translational repression over target cleavage. Only in a few cases are 39 UTRs predicted as the target sites for plant mirnas. For example, the complementary sites for Arabidopsis mirnas, mir156/157, mir156, and mir169, reside in the 39 UTRs of At1g53160, At2g33810, At1g17590, and At1g54160, respectively (Rhoades et al., 2002). The target sites for five of the predicted targets described here are within the 39 UTR regions. Until now, no mirna from animals or plants has been predicted to target the 59 UTR of a gene. In this work, we find that two mirna families (mir399 and mir398) are likely to target 59 UTRs. mir398 family members are complementary to the 59 UTR of At3g15640 (cytochrome c oxidase). The mir399 has multiple complementary sites (four in Arabidopsis; Figure 4) within the 59 UTR of At2g33770 (putative ubiquitin conjugating enzyme E2). Future functional studies on the mir398 and mir399 families of mirnas may reveal a novel mode of action of these mirnas on their targets. The presence of multiple mirna complementary sites is known in animal targets (Lee et al., 1993; Wightman et al., 1993; Lai, 2003; Bartel, 2004). The presence of multiple target sites within the same target implies that in some cases more than one mirna complementary site is required in the target gene for its effective downregulation. Indeed, it has been shown that increasing the number of mirna/sirna binding sites increases the degree of repression of a target gene (Doench et al., 2003). The targeting of the 39 UTR of At2g47020 by mir408 is particularly interesting. The translation of At2g47020 mrna may be repressed by mir408. At2g47020 is predicted to encode a peptide chain release factor important in mrna translation. This suggests that the translational machinery may be targeted by an mirna for translational regulation. The targeting of a ROS1- like DNA glycosylase and a SET domain protein by mir402 and mir407 mirnas, respectively, is also very interesting. ROS1 is a DNA glycosylase/lyase required for the repression of sirnatriggered transcriptional gene silencing by DNA demethylation (Gong et al., 2002). SET domain proteins are also important in transcriptional gene silencing by functioning in histone Lys methylation (Malagnac et al., 2002). These, together with previous evidence for the targeting of DCL1 by an mirna (Xie et al., 2003), point to an important role of mirnas in the regulation of machineries involved in gene silencing. The concentration and activities of many cellular proteins are regulated by the ubiquitin pathway. Ubiquitination requires a cascade of three enzymatic activities for activating (E1), conjugating (E2), and ligating (E3) ubiquitin to a substrate. In this study, we have identified two mirnas (mir399 and mir393) that may target components (ubiquitin conjugating enzyme E2 and ubiquitin ligase/tir1) of the ubiquitination pathway. The result suggests a role for mirnas in regulating the process of ubiquitination. Fungal laccases are well known for their roles in morphogenesis, pathogenicity, and lignin degradation (Mayer and Christopher, 2002). The physiological roles of higher plant