Supplementary Figure 1: sgrna library generation and the length of sgrnas for the functional screen. (a) A diagram of the retroviral vector for sgrna

Similar documents
Supplementary Figure 1

TailorMix Stranded mrna Sample Preparation Kit

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

File name: Supplementary Information Description: Supplementary figures and supplementary tables. File name: Peer review file Description:

i-stop codon positions in the mcherry gene

Guidelines for sequencing SCC libraries All libraries made using SCC supplied hydrogel after are V3 libraries

GENERAL INFORMATION...

NEXTFLEX Small RNA Barcode Primers - Set A (For Illumina Platforms) Catalog #NOVA (Kit contains 96 reactions)

NEXTflex PCR-Free Barcodes - 6 (For Illumina Platforms) Catalog # (Kit contains 48 reactions) Bioo Scientific Corp V12.

Zhang et al., RepID facilitates replication Initiation. Supplemental Information:

sherwood - UltramiR shrna Collections

Nature Biotechnology: doi: /nbt Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1. ChIP-seq genome browser views of BRM occupancy at previously identified BRM targets.

Supporting Information

Supplementary Figure 1

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids.

Percent survival. Supplementary fig. S3 A.

Supplementary Figure 1

Supplementary Figure 1 An overview of pirna biogenesis during fetal mouse reprogramming. (a) (b)

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Schematic representation of the endogenous PALB2 locus and gene-disruption constructs

SUPPLEMENTARY INFORMATION

Parts of a standard FastQC report

Supporting Information

Protocol for cloning SEC-based repair templates using Gibson assembly and ccdb negative selection

Supplementary Figure 1 Activities of ABEs using extended sgrnas in HEK293T cells.

Student Learning Outcomes (SLOS)

Nature Immunology: doi: /ni.3694

NEBNext Multiplex Small RNA Library Prep Set for Illumina Set 1, Set 2, Index Primers 1 48 and Multiplex Compatible

Supplementary Information

of NOBOX. Supplemental Figure S1F shows the staining profile of LHX8 antibody (Abcam, ab41519), which is a

B. Transgenic plants with strong phenotype (%)

Dharmacon Edit-R Lentiviral sgrna Pooled Screening Libraries

Supplementary Information

SUPPLEMENTARY INFORMATION

Nature Methods: doi: /nmeth Supplementary Figure 1. Pilot CrY2H-seq experiments to confirm strain and plasmid functionality.

embryos. Asterisk represents loss of or reduced expression. Brackets represent

Revised: RG-RV2 by Fukuhara et al.

Integrated Omics Study Delineates the Dynamics of Lipid Droplets in Rhodococcus Opacus PD630

SUPPLEMENTARY INFORMATION

Supplementary Figure 1. Phenotype and genotype of cultured and transplanted S1 KCST (A) Brightfield and mcherry fluorescence images of the spheres

CloneTracker XP 10M Barcode-3 Library with RFP-Puro

AD BD TOC1. Supplementary Figure 1: Yeast two-hybrid assays showing the interaction between

Cleanup. Total Time 2.5 hr

Spermatazoa. Sertoli cells. Morc1 WT adult. Morc1 KO adult. Morc1 WT P14.5. Morc1 KO P14.5. Germ cells. Germ cells. Spermatids.

Supplementary Methods

Supplementary Figure 1

Nature Methods: doi: /nmeth Supplementary Figure 1. Construction of a sensitive TetR mediated auxotrophic off-switch.

Supplementary Figure 1

Donor DNA Utilization during Gene Targeting with Zinc- finger Nucleases

Supplementary Figure 1.

Supplemental Figure 1 A

Nature Genetics: doi: /ng.3556 INTEGRATED SUPPLEMENTARY FIGURE TEMPLATE. Supplementary Figure 1

Supplementary Information Targeting fidelity of adenine and cytosine base editors in mouse embryos

Nature Methods: doi: /nmeth.4396

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Biotechnology: doi: /nbt.4166

b number of motif occurrences

A CRISPR/Cas9 Vector System for Tissue-Specific Gene Disruption in Zebrafish

Supplemental Data. Zhou et al. (2016). Plant Cell /tpc

Supplementary Figure 1 Generation of migg1-yf mice. (A) Targeting strategy. Upper panel: schematic organization of the murine ɣ1 immunoglobulin

Supplementary Information Design of small molecule-responsive micrornas based on structural requirements for Drosha processing

Analyzing an individual sequence in the Sequence Editor

Nature Structural and Molecular Biology: doi: /nsmb Supplementary Figure 1. Validation of CDK9-inhibitor treatment.

SUPPLEMENTARY FIGURES AND TABLES

Bootcamp: Molecular Biology Techniques and Interpretation

Supplemental Information

Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C

Supplementary Fig. S1. SAMHD1c has a more potent dntpase activity than. SAMHD1c. Purified recombinant SAMHD1c and SAMHD1c proteins (with

Nature Genetics: doi: /ng.2949

Rassf1a -/- Sav1 +/- mice. Supplementary Figures:

Allele-specific locus binding and genome editing by CRISPR at the

Supplementary Figure S1. Immunodetection of full-length XA21 and the XA21 C-terminal cleavage product.

Somatic Primary pirna Biogenesis Driven by cis-acting RNA Elements and Trans-Acting Yb

Supplemental Table/Figure Legends

High-resolution Identification and Separation of Living Cell Types by Multiple microrna-responsive Synthetic mrnas

SUPPLEMENTARY FIGURES

SUPPLEMENTARY INFORMATION

Nature Biotechnology: doi: /nbt Supplementary Figure 1. In vitro validation of OTC sgrnas and donor template.

Nature Biotechnology: doi: /nbt Supplementary Figure 1

A Guide to CRISPR/Cas9

Supplementary Figure 1. Nature Structural & Molecular Biology: doi: /nsmb.3494

Chemical and/or enzymatic deamination across various sequencing methods to localize modified cytosines.

Cell line CDH1 mrna AS-paRNA S-paRNA

Molecular Biology: DNA sequencing

Supplementary Figure 1

Non-coding Function & Variation, MPRAs II. Mike White Bio /5/18

Supplementary Figure 1 qrt-pcr expression analysis of NLP8 with and without KNO 3 during germination.

Supplementary Figure S1. The tetracycline-inducible CRISPR system. A) Hela cells stably

1. Primers for PCR to amplify hairpin stem-loop precursor mir-145 plus different flanking sequence from human genomic DNA.

Supplementary Figure 1. NORAD expression in mouse (A) and dog (B). The black boxes indicate the position of the regions alignable to the 12 repeat

Supplemental Data. Na Xu et al. (2016). Plant Cell /tpc

SUPPLEMENTARY INFORMATION

TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA

Tutorial. In Silico Cloning. Sample to Insight. March 31, 2016

Construct Design and Cloning Guide for Cas9-triggered homologous recombination

Supplemental Figure 1.

Fig. S1. Clustering analysis of expression array and ChIP-PCR assay in the ARF3 locus. (A) Typical examples of the transgenic plants used for

CRISPR-dCas9 mediated TET1 targeting for selective DNA demethylation at BRCA1 promoter

Chinese Academy of Sciences, Datun Road, Chaoyang District, Beijing, 10101, China

Transcription:

Supplementary Figure 1: sgrna library generation and the length of sgrnas for the functional screen. (a) A diagram of the retroviral vector for sgrna expression. It contains a U6-promoter-driven sgrna expression cassette, and a PGK-promoter-driven puromycin resistance gene. Restriction enzyme sites for cloning of the sgrna library are also indicated. (b) A list of 17 mouse mirna or mirna clusters included as input for the Molecular Chipper procedure. (c) An empty control vector (Ctrl), an sgrna vector with G+19mer design (G+19) and an sgrna vector with G+20mer design (G+20) were transduced into the mir-142-3p reporter cell line. Top: representative FACS plot demonstrating efficient inhibition of mir-142 activity. Bottom: Design of the G+19 and G+20 sgrnas, with mir-142 genomic sequence in the middle. Yellow highlighted regions reflect mature mir-142 mirna sequences. NGG PAM (CCA on the antisense strand) is shown in red. Note that the first base G in both G+19mer and G+20mer is a mismatch on the target sequence. 1

Supplementary Figure 2: Additional data for the sgrna-based functional screen. (a) Enrichment plots similar to those in Fig 3a is shown. Left panel, sgrna enrichments in mir- 142 are shown for high- and med-gfp populations (both representing near complete ablation of mir-142 expression) in each of the three biological triplicates. X-axis indicates position in bp. Horizontal black bars indicate the locations of mature mirnas. Blue and red indicate enriched sgrnas that were mapped to sense and antisense strand, respectively. The positions of the last targeting-domain base are used to reflect the positions of sgrnas. Blue and red boxes indicate 5 - and 3 -hit regions. Yellow box highlights sgrnas mapped to the pre-mir-142 region (including mature mir-142-5p and mir-142-3p, and loop region in between). (b) Enrichment plots for sgrnas mapped to mir-126 from the high-gfp populations are shown, in biological triplicates. 2

Supplementary Figure 3: ESCScanner analysis of sgrna enrichment in the functional screen. (a) Diagram for the principle of the ESCScanner algorithm, which is used to scan for enriched clusters of sgrnas from the screen. ESCScanner calculates the probability of observing a pattern of sgrna enrichment within a scanning window. Detected sgrnas are shown in orange and enriched sgrnas are shown in blue peaks. (b) ESCScanner plots are shown for the mir- 142 region for the three biological replicates. Data for the Low-GFP samples are shown. Specifically, a 21-bp scanning window was used and the algorithm was applied independently to the three biological replicates. X-axis indicates the position of the center of the scanning window. Y-axis indicates the log10 (probability). Horizontal black bars indicate the locations of mature mirnas. Blue and red boxes indicate 5 - and 3 -hit regions. Yellow box highlights sgrnas mapped to the pre-mir-142 region (including mature mir-142-5p and mir-142-3p, and loop region in between). Pink box indicates the position of the control 3 deletion (CtrlΔ3 ) for Supplementary Fig. 4. (c) ESCScanner plots are shown for the mir-126 region for the three biological replicates. Data for the Low-GFP samples are shown. 3

Supplementary Figure 4. mirna processing reporter activities for additional mir-142 mutants. (a) A Genome Browser snap shot of RNA seq data of murine spleen cells is shown. Data are from the CSHL long RNA-seq trace. The black bars at the bottom indicate relative positions of the 5 - hit region, mir-142-5p, mir-142-3p and the 3 - hit region. Note that abundant RNA signals can be observed from >1kb upstream of the hit regions on mir-142. (b) Designs of additional mir- 142 mutants cloned into a mirna processing reporter. Designs are similar to those in Fig. 4a. Additional mutants include a control 3 deletion (CtrlΔ3, 20 bp deletion), deleting hairpin from wildtype pri-mir-142 (ΔH), deleting hairpin from the three mutant pri-mir-142 constructs as in Fig 4a (LΔ3 ΔH, SΔ3 ΔH, and Δ5 ΔH). The narrow vertical blue bar upstream of the 3 -hit region depicts a putative CNNC site, which was not disrupted by the deletions. (c) Processing reporter activities of the indicated reporter constructs were determined in the BaF3 cells. * P<0.05; **P<0.01; NS: not significant. N=3 biological replicates. Data from a representative experiment out of two performed. Error bars represent s.d. (d) GFP fluorescence of the indicated processing reporters were determined in the indicated cell lines. N=3 biological replicates. Data were normalized to the mean GFP level in WT construct, and are from a representative experiment out of two performed. Error bars represent s.d. (e) Processing efficiencies of the indicated human primir-142 mutant reporters were determined in BaF3 cells. **P<0.01; student s t-test. N=3 4

biological replicates. Data from a representative experiment out of two performed. Error bars represent s.d. 5

Supplementary Figure 5: Compatibility of Molecular-Chipper-generated sgrna library with VQR-Cas9. (a) A NGA-PAM sgrna in our library, mapped to the mir-142 loop, was cloned and transduced into BaF3 mir-142-3p reporter cells (no Cas9), reporter cells expressing the wildtype Cas9 (WT-Cas9) or reporter cells expressing VQR-Cas9 (with dsredexpress as selection marker (VQR-Cas9-dsRed)). GFP and mcherry expression levels were determined by flow cytometry. Representative flow cytometry plots are shown, with numbers indicating the percentage of cells within the gates. (b) Representative flow cytometry plots are shown for sgrna library transduced BaF3 mir-142-3p reporter cells (no Cas9), the reporter cells expressing only WT- Cas9, the reporter cells expressing only VQR-Cas9 (with dsredexpress as a marker), or the reporter cells expressing both WT-Cas9 and VQR-Cas9. Number indicates the frequency of the gated populations. (c) Quantified % of GFP+ cells (reflecting low mir-142 activity) for data in b. N=3 biological replicates. Error bars represent s.d. Numbers at each bar indicate average % of GFP+ cells. (d-e) Experiments similar to (a-c) were performed, except that a lentiviral vector with zeocin as selection marker was used. Similar results were observed. 6

Supplementary Figure 6: Additional data of the sgrna library complexity and enrichment. (a) The saturation effect of the Molecular-Chipper-generated sgrna library. The indicated number of mapped sequence reads were randomly selected from all sequence reads, and the number of unique sgrnas was determined in each set of randomly selected sequences. Note that 12310 unique sgrnas were observed with 160,000 randomly selected mapped deep sequencing reads. At this level, the median distance of neighboring NGG-PAM sgrnas is 10 bp. (b) A histogram of the log2(enrichment) scores of all NGG-PAM sgrnas in our library is shown. Data are based on all experimental pairs of samples (low-gfp, med-gfp and high-gfp versus neg-gfp). There were 2.1% of NGG-PAM sgrnas with >1 of log2(enrichment). 7

Supplementary Table 1. Primer Sequences for mirnas and their Flanking Regions mirna construct Primer sequences to amplify mirna genomic fragments Sense Antisense Size of PCR (bp) mmu mir 16 1 TACTATTGAGGTGCTAGGAG CTAAAACCCATGGCCTTGTG 484 mmu mir 23b GGTCCCTAAGGTATTGGTCT CGGCCATAATTAAAGGGCTG 408 mmu mir 23a_mir27a_miR 24 2_cluster TGTGTGGTGAGGTGTACCTA TTAATGCAAGGGTTACCCGC 780 hsa mir 24 1 AAACCCAGGTGCATCAAGGA AACACGTGGCAAATGCTCAG 495 hsa mir 26a 2 AAGGAACACTTGTGCAGGCT CTGTGCACTGACAGAAAAGC 485 hsa mir 26b TCTGCACTACACCCAGGTT CAGGAACAGTGGAGGAGGA 467 hsa mir 27b GGTTCCTGGCATGCTGATTT TTCTGTGACTCCCAATACAC 506 hsa mir 29a CCTGAAGTAAGTGTCCAGTG ACACACCCACCATCACTATG 488 hsa mir 29b 1 GAGGGCTCAGTTACCATTTG ATGTAGGTCTTCATCCGAGC 471 hsa mir 29b 2_mir 29c_cluster ATGTTGAATGGATTTGGTTCTT TAACAGTACTACATGTCAGAAA 1026 hsa mir 125b 2 GTTCTGATCTATAGGTGGCC GCCATATCTTTAGAGTACCG 433 hsa mir 126 GCTGAGCTGAACATGGAAGA AGATCCAGGCGCTAGTCAG 524 mmu mir 142 ATGTTGAGTCACCACCCACA TATCCTCACGGACGTGCATT 458 hsa mir 146 CCACATGCCCAGCATGTTTA CAATCCAACATGACTCCCTC 402 hsa mir 155 CTGAAGTCTACCTTGCCTTC CATGTGGGCTTGAAGTTGAG 472 hsa mir 222 AGGAAGTGAATCTAAAGGTAG GACTTCATCCTTCATCAAACT 426 mmu mir 451 GACCTTGGCTGGGATATCAT TCTTTGGCACAGTGAAGAGG 362 mirna Constructs: the murine mirna(s) included as input for sgrna library production. Sense: Sequences of sense primer in PCR from murine genomic DNA to amplify the corresponding mirna. Antisense: Sequences of antisense primer in PCR from murine genomic DNA to amplify the corresponding mirna. Size: Size of mirna plus flanking regions for each mirna or mirna cluster in bp.

Supplementary Table 2. Candidate sgrnas that disrupt mir 142 expression samples sgrna Gene Strand Position Length PAM Sam1HighGFP Sam1MedGFP Sam1LowGFP Sam2HighGFP Sam2MedGFP Sam2LowGFP Sam3HighGFP Sam3MedGFP Sam3LowGFP gtcaccacccacaaggccca mir 142 1 27 21 NGG 8.005411 0 0 8.765141 5.276985 2.542325 0 0 1.084265 ccacccacaaggcccaggg mir 142 1 30 20 NGG 4.312409 0 5.527111 4.379338 4.927867 0 3.338627 0 0 cacccacaaggcccaggg mir 142 1 30 19 NGG 6.232725 1.235737 0 0 0 0 5.263332 0 0 ccagggcgggccctctagg mir 142 1 43 20 NGG 6.861898 4.800835 0 0 0 1.250508 6.706058 0 9.550644 accgctccaccctgcctg mir 142 0 50 19 NGG 7.318162 0 2.199697 0.685036 0.131761 4.491559 6.270585 0 0 gaccgctccaccctgcctg mir 142 0 50 20 NGG 3.785013 7.001066 4.656849 6.740451 0 0.433952 7.309786 0 0 ggaccgctccaccctgcctg mir 142 0 50 21 NGG 4.72831 0 0 0 0 0 8.885907 0 0 cccctccgtgtaacttccca mir 142 0 71 21 NGG 2.290962 0 5.266295 9.526861 0.991583 0.023999 1.813299 0 0 ccctccgtgtaacttccca mir 142 0 71 20 NGG 9.602919 0 0 5.705461 0 0 0 0 0 tagtagtgctttctacttta mir 142 0 183 21 NGG 1.392517 4.402974 3.989113 0.249048 2.336784 3.677034 0 0 0 cactactaacagcactgga mir 142 1 213 20 NGG 0 0 2.763994 2.686821 3.424543 2.806901 0 3.2806 0 agtgcactcatccataaagt mir 142 0 230 21 NGG 10.757184 4.325043 0 0 0 8.031991 0 8.252572 0 gatgagtgcactgtgggctt mir 142 1 257 21 NGG 2.73708 7.55579 4.943738 7.093705 0 2.060173 0 0 7.242432 atgagtgcactgtgggctt mir 142 1 257 20 NGG 13.454269 10.589747 0 0 2.00231 3.472131 0.10548 0 0 cggagaccacgccacgccg mir 142 1 276 20 NGG 4.606831 4.297514 0 5.790099 0 0.397631 7.627885 8.304456 4.831378 agggggccgcggcgtggcg mir 142 0 267 20 NGG 5.146399 5.679537 0.703752 7.247352 8.631424 1.507806 5.943239 4.823648 7.603144 ggtggcagggggccgcggcg mir 142 0 272 21 NGG 3.220919 9.216321 3.125319 3.852086 0 0 2.84218 6.94886 5.391727 gtggcagggggccgcggcg mir 142 0 272 20 NGG 6.663501 0 0 0.904868 0 0.624069 2.692474 5.996965 6.312793 ggggagcctggccaaatga mir 142 0 341 20 NGG 5.119338 0 0 0 0 0 2.700889 0 0 gcattcgagagagcggctg mir 142 0 425 20 NGG 0 4.351384 0 0 0 0 6.759109 0 0 gggcaggggcctttattaa mir 142 1 112 20 NGG 11.330621 0.421764 2.535514 0 2.016078 0.91099 0.175869 0 0 cggagaccacgccacgcc mir 142 1 275 19 NGG 0.178753 1.762801 0 2.364408 0 0 3.604713 5.017566 1.595674 agaccacgccacgccgcgg mir 142 1 279 20 NGG 2.851105 0 0 0 0 0 4.034206 0 0 gtggcagggggccgcggc mir 142 0 273 19 NGG 6.477961 0 0 0 0 0 4.305152 4.98104 4.073721 gaccgctccaccctgcct mir 142 0 51 19 GTGG 4.342598 8.627086 5.31793 4.51125 0 0 5.993208 0 0 accgctccaccctgcct mir 142 0 51 18 GTGG 7.149953 0 0 0 0 0 4.457155 0 0 Candidate sgrnas were derived by the criteria in Methods. sgrna: Sequences of sgrnas, without first base of G. Only sgrnas located before "NGG" PAM (hiligted in yellow) or "GTGG" PAM (hilighted in blue) are shown. Gene: The mirna gene in which the sgrna is located. Strand: The strand to which the sgrna is mapped. "1" stands for "+" strand, "0" stands for " " strand. Position: The position of sgrna relative to the input mirna sequence. Position is calculated by the position of the last base of the targeting domain in sgrnas. Length: The length of sgrna targeting domain (including the first base of G, which is not shown in the sequence). Samples: The log2 enrichment level for each sgrna in each indicated sample is shown. Log2 enrichment level below 0 is shown as 0.

Supplementary Table 3. Samples and Barcodes for Deep Sequencing Barcode Primer Barcode Screen populations CGTGAT ATCACG Sam1NegGFP CTGATC GATCAG Sam1LowGFP GGACGG CCGTCC Sam1MedGFP CTCTAC GTAGAG Sam1HighGFP CACTGT ACAGTG Sam2NegGFP TTGACT AGTCAA Sam2LowGFP GTAGCC GGCTAC Sam2MedGFP TACAAG CTTGTA Sam2HighGFP TTTCAC GTGAAA Sam3NegGFP CGAAAC GTTTCG Sam3LowGFP CGTACG CGTACG Sam3MedGFP CCACTC GAGTGG Sam3HighGFP GCGGAC GTCCGC Unsorted Barcode: Barcode sequence in sequencing reads. Primer Barcode: Barcode sequence in reverse PCR primer to amplily sgrnas for deep sequencing. These barcodes are reverse complement of those listed in "Barcode". Screen populations: Cells with different GFP levels were sorted and/or re sorted from the BaF3 mir 142 3p reporter cell line transduced by the sgrna library in 3 biological replicates, sample 1, sample 2 and sample 3.