Supplemental Figure 1. Small RNA size distribution from different soybean tissues.

Similar documents
MicroRNAs as master regulators of the plant NB-LRR defense gene family via the production of phased, trans-acting sirnas

RNA-Seq Atlas of Glycine max: A guide to the Soybean Transcriptome

ANALYSIS OF SOYBEAN SMALL RNA POPULATIONS HLAING HLAING WIN THESIS

Eukaryotic small RNA Small RNAseq data analysis for mirna identification

Table S1. Relative abundance of AGO1/4 proteins in different organs. Table S2. Summary of smrna datasets from various samples.

High-resolution identification and abundance profiling of cassava (Manihot esculenta Crantz) micrornas

Supplemental Figure 1. Genes showing ectopic H3K9 dimethylation in this study are DNA hypermethylated in Lister et al. study.

he micrornas of Caenorhabditis elegans (Lim et al. Genes & Development 2003)

Identification of mirnas in Eucalyptus globulus Plant by Computational Methods

Improved annotation of C. elegans micrornas by deep sequencing reveals structures associated with processing by Drosha and Dicer

Small RNA and PARE sequencing in flower bud reveal the involvement of srnas in endodormancy release of Japanese pear (Pyrus pyrifolia 'Kosui')

Cross species analysis of genomics data. Computational Prediction of mirnas and their targets

Supplemental Figure S1. Expression of Cirbp mrna in mouse tissues and NIH3T3 cells.

Genome-Wide Analysis of leafbladeless1-regulated and Phased Small RNAs Underscores the Importance of the TAS3 ta-sirna Pathway to Maize Development

Bi 8 Lecture 17. interference. Ellen Rothenberg 1 March 2016

Transcriptome-wide analysis of microrna expression in the malaria mosquito Anopheles gambiae

sirna count per 50 kb small RNAs matching the direct strand Repeat length (bp) per 50 kb repeats in the chromosome

Differentially expressed microrna cohorts in seed development may contribute to poor grain filling of inferior spikelets in rice

Supplementary Figure S1. Gene expression analysis of epidermal marker genes and TP63.

MicroRNA in Cancer Karen Dybkær 2013

Obstacles and challenges in the analysis of microrna sequencing data

Genome-wide identification and functional analysis of lincrnas acting as mirna targets or decoys in maize

Hands-On Ten The BRCA1 Gene and Protein

Nature Structural & Molecular Biology: doi: /nsmb.2419

Supplementary Figure 1

COMPUTATIONAL ANALYSIS OF SMALL RNAs IN MAIZE MUTANTS WITH DEFECTS IN DEVELOPMENT AND PARAMUTATION. Reza Hammond

Small RNAs and how to analyze them using sequencing

mirna Dr. S Hosseini-Asl

In silico identification of mirnas and their targets from the expressed sequence tags of Raphanus sativus

A putative MYB35 ortholog is a candidate for the sex-determining genes in Asparagus

Circular RNAs (circrnas) act a stable mirna sponges

Prediction of micrornas and their targets

RIP-seq of BmAgo2-associated small RNAs reveal various types of small non-coding RNAs in the silkworm, Bombyx mori

A WIDESPREAD IMPACT ON SMALL RNAS AND GENE NETWORKS IN RICE MSP1/OSTDL1A MUTANTS, PARTNERS WITH KEY ROLES IN ANTHER DEVELOPMENT.

mir2118-triggered phased sirnas are differentially expressed during the panicle development of wild and domesticated African rice species

Supplementary Figure 1: Attenuation of association signals after conditioning for the lead SNP. a) attenuation of association signal at the 9p22.

Supplemental Information For: The genetics of splicing in neuroblastoma

genomics for systems biology / ISB2020 RNA sequencing (RNA-seq)

Introduction. Introduction

Zhao et al. BMC Bioinformatics (2017) 18:180 DOI /s

Transcriptome profiling of the developing male germ line identifies the mir-29 family as a global regulator during meiosis

Small RNAs and how to analyze them using sequencing

Figure S1 Expression of AHL gene family members in diploid (Ler Col) and triploid (Ler

MicroRNAs and Their Regulatory Roles in Plants

V16: involvement of micrornas in GRNs

SUPPLEMENTAL INFORMATION

Supplemental Figures Legends and Supplemental Figures. for. pirna-guided slicing of transposon transcripts enforces their transcriptional

Supplementary Figure 1

BWA alignment to reference transcriptome and genome. Convert transcriptome mappings back to genome space

Soft Agar Assay. For each cell pool, 100,000 cells were resuspended in 0.35% (w/v)

Genome-Wide Identification of mirnas Responsive to Drought in Peach (Prunus persica) by High-Throughput Deep Sequencing

Research Article Base Composition Characteristics of Mammalian mirnas

CRS4 Seminar series. Inferring the functional role of micrornas from gene expression data CRS4. Biomedicine. Bioinformatics. Paolo Uva July 11, 2012

SUPPLEMENTAL DATA AGING, July 2014, Vol. 6 No. 7

MODULE 3: TRANSCRIPTION PART II

mirnas involved in the development and differentiation of fertile and sterile flowers in Viburnum macrocephalum f. keteleeri

IDENTIFICATION OF IN SILICO MIRNAS IN FOUR PLANT SPECIES FROM FABACEAE FAMILY

Genome-wide identification of non-coding RNAs interacted with micrornas in soybean

Supplementary information for: Human micrornas co-silence in well-separated groups and have different essentialities

Open Flower. Juvenile leaf Flowerbud. Carpel 35 NA NA NA NA 61 NA 95 NA NA 15 NA 41 3 NA

Identification of novel and candidate mirnas in rice by high throughput sequencing. and J-K Zhu,

Nature Immunology: doi: /ni Supplementary Figure 1. Characteristics of SEs in T reg and T conv cells.

Patterns of Histone Methylation and Chromatin Organization in Grapevine Leaf. Rachel Schwope EPIGEN May 24-27, 2016

Genome-Wide Profiling and Analysis of Arabidopsis sirnas

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1. Differential expression of mirnas from the pri-mir-17-92a locus.

ONLINE SUPPLEMENTAL INFORMATION

Host Double Strand Break Repair Generates HIV-1 Strains Resistant to CRISPR/Cas9

Santosh Patnaik, MD, PhD! Assistant Member! Department of Thoracic Surgery! Roswell Park Cancer Institute!

VirusDetect pipeline - virus detection with small RNA sequencing

Small RNA Sequencing. Project Workflow. Service Description. Sequencing Service Specification BGISEQ-500 SERVICE OVERVIEW SAMPLE PREPARATION

RNA-seq Introduction

MODULE 4: SPLICING. Removal of introns from messenger RNA by splicing

STAT1 regulates microrna transcription in interferon γ stimulated HeLa cells

Genome-Wide Identification and Comparative Analysis of Conserved and Novel MicroRNAs in Grafted Watermelon by High-Throughput Sequencing

Figure S2. Distribution of acgh probes on all ten chromosomes of the RIL M0022

High AU content: a signature of upregulated mirna in cardiac diseases

Dicer-2 Processes Diverse Viral RNA Species

MicroRNAs and Cancer

Variant Classification. Author: Mike Thiesen, Golden Helix, Inc.

Cloning and characterization of small RNAs from Medicago truncatula reveals four novel legume-specific microrna families

Criteria for Annotation of Plant MicroRNAs

Figure S1. Analysis of endo-sirna targets in different microarray datasets. The

Identifying mirnas in RNA viruses

Alternative RNA processing: Two examples of complex eukaryotic transcription units and the effect of mutations on expression of the encoded proteins.

Nature Medicine: doi: /nm.4439

Arabidopsis thaliana small RNA Sequencing. Report

MIR retrotransposon sequences provide insulators to the human genome

Evidence for the biogenesis of more than 1,000 novel human micrornas

Supplementary Figure 1: Features of IGLL5 Mutations in CLL: a) Representative IGV screenshot of first

a) List of KMTs targeted in the shrna screen. The official symbol, KMT designation,

Pirna Sequence Variants Associated With Prostate Cancer In African Americans And Caucasians

Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation

Characterization and comparison of flower bud micrornas from yellow-horn species

mirna regulation in the early development of barley seed

Gene silencing pathways found in the green alga Volvox carteri reveal insights into evolution and origins of small RNA systems in plants

Mature microrna identification via the use of a Naive Bayes classifier

The Cornucopia of Small RNAs in Plant Genomes

RNA interference induced hepatotoxicity results from loss of the first synthesized isoform of microrna-122 in mice

Nature Genetics: doi: /ng.3731

Transcription:

Supplemental Figure 1. Small RNA size distribution from different soybean tissues. The size of small RNAs was plotted versus frequency (percentage) among total sequences (A, C, E and G) or distinct sequences (B, D, F, and H). Library codes and detailed information about each library are indicated in Supplemental Data set 1A. A B C D E F G H 1

Supplemental Figure 2. Leaf small RNA size distributions in Arabidopsis and soybean. Size distribution of total small RNAs ([A], [C]) and distinct small RNAs ([B], [D]) in Arabidopsis and soybean. The bottom panels of [C] and [D] provide details of the categories of 21-nt small RNAs in soybean, in the form of a pie chart. A Arabidopsis thaliana B Arabidopsis thaliana Frequency (%) Abundance Leaf-PCt1 Frequency (%) Distinct Leaf-PCt1 Small RNA length (nt) Small RNA length (nt) C Glycine max D Glycine max Frequency (%) Abundance Leaf-WW3023_1 Frequency (%) Distinct Leaf-WW3023_1 Small RNA length (nt) Small RNA length (nt) 2

Supplemental Figure 3. Examples of mirbase-annotated mirnas in our four categories of assessment. The mirbase-annotated mirna of interest is circled in red, with a red arrowhead for additional emphasis. One example of each type that we identified is shown, including the following: (A) Weaklyexpressed loci. (B) sirna-like mirna loci. (C) Marginal mirna loci. (D) Typical mirna loci. A B gma-mir1522 C gma-mir1520a D gma-mir4345 gma-mir156b 3

Supplemental Figure 4. The pipeline for the identification of novel mirna and mirna variants. 69 small RNA libraries 1 st Filter Set 1) Genome matched excluding t/r/sn/snornas 2) Abundance 50 TP5M in at least one library 3) Size between 18-26 nt 4) Hits to soybean genome between 1 and 20 2 nd Filter Set 1) Select two most abundant sequences 2) Modified mireap 3 rd Filter Set 1) Strand bias: sense/total 0.9 2) Abundance bias: (top1+top2)/total 0.7 3) Within or nearby (50 nt) to known t/r/sn/snorna 4 th Filter Set Stem-loop structure prediction (CentroidFold) 5 th Filter Set Exclude known mirnas Candidates : 124,526,447 Known mirnas: 370 Candidates : 29,133 Known mirnas: 198 Candidates : 2,523 (4,074 precursors) Known mirnas: 120 Candidates : 180 (361 precursors) Known mirnas: 71 Candidates : 151 (332 precursors) Known mirnas: 71 22 novel mirnas (33 precursors) 44 new variants of known mirnas 4

Supplemental Figure 5. Abundance profiles of the mir171 family. The log2-transformed values of normalized read abundance (TP5M) in flower, leaf and nodule tissues are displayed as a heat map. The mirna members with the same sequences were collapsed into one row. Log2 transformation: 0 2 4 6 8 10 12 5

Supplemental Figure 6. Secondary structures for the stem-loop region of precursors of the soybean mirnas that are triggers of PHAS loci. Sequences highlighted in red are the mature mirna sequence. (A) mirna triggers with asymmetric bulges in the precursors; the asymmetric bulge in the stem-loop is indicated by a purple arrow, combined with a zoomed-in view of the nucleotides found at the bulge. (B next page) mirna triggers lacking an asymmetric bulge in the precursor. 6

Supplemental Figure 6. (continued) 7

Supplemental Figure 7. PhasiRNAs from the ARF4 locus are triggered by tasiarfs. Small RNA and PARE data demonstrate that like ARF3/ETT (Figure 4), the ARF4 locus is targeted by 21-nt tasiarfs at both cleaved and non-cleaved sites, triggering secondary sirnas biogenesis via the two-hit mechanism. At the top, the cleavage position is indicated with a black arrow in the tasiarf-arf4 sequence alignment. See the legend of Figure 4 for a full description of the images and symbols. 204/248 Glyma12g07560 5 AAGGUCUUGCAAGGUCAAGAA 3 ARF4 o SCORE: 1.0 3 UUCCAGAAUGUUCCAGUUCUU 5 tasiarf Cleaved target Chr.12:5194617 Non-cleaved target Chr.12:5194821 Phasing Score 198bp Small RNA abundances PARE RNA abundances 5194616 Small RNA data tasiarf target site PARE data 8

Supplemental Figure 8. Several soybean loci related to arogenate dehydrogenase-encoding transcripts are abundant sources of 21-nt phasirnas. (A) An intergenic (unannotated) region on chromosome 14 approximately between positions 15,772,000 and 15,775,000 generates an abundant set of secondary sirnas (blue dots are 21-mers, green are 22- mers). This sequence is predicted to form a long hairpin structure (data not shown); this is recognizable by the presence of an inverted repeat at the loop region (orange box). The blue boxes in the center indicate tandem repeats. Other features are as described in the legend of Figure 4. (B) Two arogenate dehydrogenase-encoding genes in soybean are PHAS loci. (C) A tree of the arogenate dehydrogenase proteins encoded in the soybean genome, with gene identifiers at right. The clade highlighted in pink contains the two loci producing phasirnas. The tree was produced using from the alignment generated by CLUSTALW. A Chr.14:15,772,000; Arogenate dehydrogenase-like B Glyma17g13150; Arogenate dehydrogenase Glyma05g07870; Arogenate dehydrogenase C 0.01 Glyma17g13150 Glyma05g07870 Glyma13g01050 Glyma15g06340 Glyma14g05990 Glyma11g35760 Glyma18g02650 9

Supplemental Figure 9. Seed-specific PHAS loci from MYB5 genes. Small RNA plots and phasing scores of MYB5 phasirna precursors ([A] is Glyma05g04900, [B] is Glyma17g15270). The target binding sites for the mirna triggers are indicated with green triangles and the cleavage positions are indicated with black arrows in the sequence alignment at the top. In the lower panels, the green and red bars show the levels of normalized read abundances for phasirnas and mirna triggers, respectively, in different tissues for these two MYB5 genes. A Glyma05g04900 5 UGGAAUUCUCAUUUGUGCAAAA 3 MYB5 SCORE: 1.5 3 ACCUUAUGAGUAAACUCGUUCU 5 mir828 Cleaved target Chr.5:4076458 B Glyma17g15270 5 UGGAAUUCUCAUUUGUGCAAAA 3 MYB5 SCORE: 1.5 3 ACCUUAUGAGUAAACUCGUUCU 5 mir828 Cleaved target Chr.17:11955510 Small RNA abundances Small RNA abundances Phasing Score Phasing Score Glyma05g04900 Glyma17g15270 10

Supplemental Figure 10. Decrease of mir390 and TAS3a and TAS3b read abundances during nodule development. Normalized read abundances of mir390 and 21-nt phasirnas generated from TAS3a and TAS3b in developing nodules, from 10 to 30 days after inoculation, measured over an interval of five days. 12000 Reads abundance (TP5M) 10000 8000 6000 4000 2000 TAS3a TAS3b mir390 0 10 15 20 25 30 Stage of nodule development (in days-post-inoculation) (Day) 11

Supplemental Figure 11. Co-localization of legume anther- or flower-specific PHAS loci and their mirna triggers. (A) Small RNA abundances of mir4392 and phasirnas generated from the adjacent PHAS locus; both sets of small RNAs accumulate exclusively in soybean flower tissues, particularly in anthers. The RNA-seq read abundances of the mirna and PHAS precursors confirm the exclusive expression of both loci in flower tissues. (B) The target site of mir4392 on the soybean chromosome is indicated with brown lines, and the position of the mir4392 locus and the mature mirna is indicated with a red arrow. (C) The target site of mir5754 on Medicago chromosome 4 is indicated in brown. The locus generating mir5754 is nearby, with the mature mirna indicated with a red arrow. A B Glycine max gma-mir4392 target gma-mir4392 C Medicago truncatula Medtr4g076050 Medtr4g076060 mtr-mir5754 target mtr-mir5754 target mtr-mir5754 12

Supplemental Figure 12. A model of transposable element control in legume pollen by pollenspecific phasirnas. The MIRNA gene and the PHAS genes localized in the adjacent region are shown as blue and green horizontal bars, respectively. The long black line represents a portion of chromosome region. The MIRNA gene transcribes a mirna precursor, which will be subsequently processed to the 22-nt mirna by a dicer-like protein (DCL, normally DCL1). The 22-nt mirna will bind to an Argonaute (AGO), targeting the transcript produced from the PHAS gene. A set of 21-nt phasirnas will then be generated from the PHAS gene transcript. These phasirnas, mediated by AGO, will target hundreds of transposable elements (TEs) genome-widely and probably control those TEs via transcription (TGS) or posttranscriptional gene silencing (PTGS) mechanisms. The twenty soybean chromosomes are shown in green vertical bars and the locations of predicted TE targets are indicated as red triangles. 13