COMPAS for the Analysis of SELEX Experiments

Similar documents
Next-generation analysis of deep sequencing data: Bringing light into the black box of phage display experiments

Application Note. NGS Analysis of B-Cell Receptors & Antibodies by AptaAnalyzer -BCR

Module Overview. Lecture

Module Overview. Lecture. DNA library synthesis (PCR) Introduction

In Silico DNA-Based Aptamer Selection and Construction

TECHNICAL NOTE Phosphorylation Site Specific Aptamers for Cancer Biomarker Peptide Enrichment and Detection in Mass Spectrometry Based Proteomics

Identification and characterization of DNA aptamers specific for

New Product: X-Aptamer Selection Kit. Updated April 2015

What is an Aptamer? smallest unit of repeating structure

Get to Know Your DNA. Every Single Fragment.

Think Aptamer! for therapeutics, diagnostics & analysis. Marine Laville, CEO Jean-Jacques Toulmé, CSO. Think Aptamer

Box 7905, Engineering Building I, North Carolina State University, Raleigh, NC 27695, Phone: Fax:

Characterization of Aptamer Binding using SensíQ SPR Platforms

Supporting Information

A New Peptide Drug Modality; Helix-Loop-Helix Technology. Interprotein Corporation

Evolutionary Methods in B iotechnology

Pioneering Clinical Omics

Synthetic vaccine research and development. Comprehensive and innovative synthetic biology solutions and technologies

Module I: Introduction Lecture 1 4 February, 2010

SELECTION OF MOLECULAR APTAMERS FOR

Sample to Insight. Dr. Bhagyashree S. Birla NGS Field Application Scientist

Module I: Introduction

Opportunities for Accelerating Cell Line Development and Beyond

Module Overview. Lecture

Directed Evolution Methods for Protein Engineering

Personalized CAR-T Immunotherapy Platform

BunDLE-seq (Binding to Designed Library, Extracting and Sequencing) -

Custom Antibody Services. Antibodies Designed Just for You. HuCAL Recombinant Monoclonal Antibody Generation Service

Amplicons, Heteroduplexes and Enzymes - Proper Processing Elevates Detection of CRISPR Gene Editing Events

Electronic Supplementary Information. Evolved polymerases facilitate selection of fully 2 - OMe- modified aptamers

Global Leader in Viral Vector Technologies

SUPPLEMENTARY MATERIAL AND METHODS

Combinatorial RNA libraries

The Need for Scientific. Data Annotation. Alick K Law, Ph.D., M.B.A. Marketing Manager IBM Life Sciences.

Deep Sequencing technologies

Multiplexed microcolumn-based process for efficient selection of RNA aptamers

Motivation From Protein to Gene

Applicazioni biotecnologiche

Products & Services. Customers. Example Customers & Collaborators

This Technical Note describes these characterization studies in more detail.

Systematic Evolution of Ligands by Exponential Enrichment SELEX

BioFarma USEF. Expertise in drug discovery projects

Evolving Affinity Enhanced Antibodies: High Mutational Rates for Discovery of Coupled Mutations

Human Genomics. Higher Human Biology

Supplementary information Bi-specific Aptamers mediating Tumour Cell Lysis

How to Screen a Billion Drug Candidates?

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.

Next Generation Sequencing of HLA: Challenges and Opportunities in the era of Precision Medicine. Dr. Paul Keown, 2016

Supplementary Materials. for. array reveals biophysical and evolutionary landscapes

COMPUTER AIDED DRUG DESIGN: AN OVERVIEW

Nature Methods: doi: /nmeth Supplementary Figure 1. Construction of a sensitive TetR mediated auxotrophic off-switch.

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

Module 2 overview SPRING BREAK

Drug Discovery Targeting the GHKL Superfamily - HSP90, DNA Gyrase, Histidine Kinases

Performance Characteristics drmid Dx for Illumina NGS systems

Introduction to Drug Design and Discovery

Basic Information. I.1 If your company performs biotechnology research and development, uses a. The Categories of Biotechnology Applications.

Accurate Nucleic Acid Analysis is Critical to Successful Sequencing. Steve Siembieda VP Commercialization November 10, 2016

Antibody Discovery at Evotec

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Welcome! Introduction to High Throughput Genomics December Norwegian Microarray Consortium FUGE Bioinformatics platform

Matthew Tinning Australian Genome Research Facility. July 2012

Cat. Nos. NT09115, NT091120, NT , NT , and NTBC0950 DISCONTINUED

SMRT Analysis Barcoding Overview (v6.0.0)

Building DNA Libraries to Explore the Combinatorial Design Space. Dr. Yifan Li Senior Scientist GenScript USA Inc.

Partnered Discovery of High-Quality Antibody Drug Candidates. Company Presentation

SureSelect XT HS. Target Enrichment

Pharma&Biotech. Developability Assessment Platform Reduce Attrition Rates, Improve Clinical Safety and Avoid the Valley of Death

Advanced Therapeutic Antibody Discovery with Multiplexed Screening

Genome-Wide Survey of MicroRNA - Transcription Factor Feed-Forward Regulatory Circuits in Human. Supporting Information

Site directed mutagenesis, Insertional and Deletion Mutagenesis. Mitesh Shrestha

Position1: Senior Scientist/Principal Scientist

RNA regulation RNA-Protein Interactions II

B. Incorrect! Ligation is also a necessary step for cloning.

High-throughput Transcriptome analysis

Next-generation sequencing and quality control: An introduction 2016

Course Overview: Mutation Detection Using Massively Parallel Sequencing

Genetics Lecture 21 Recombinant DNA

Recombinant Antibody Production in Therapeutic Antibody Projects. Keshav Vasanthavada Senior Marketing Specialist, GenScript April 7, 2016

ChIP. November 21, 2017

Bi 8 Lecture 4. Ellen Rothenberg 14 January Reading: from Alberts Ch. 8

CRISPR/Cas9 Gene Editing

Recent years have witnessed an expansion in the disciplines encompassing drug

The MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications.

Module 2 overview SPRING BREAK

Expressed genes profiling (Microarrays) Overview Of Gene Expression Control Profiling Of Expressed Genes

Fatchiyah

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS

Name with Last Name, First: BIOE111: Functional Biomaterial Development and Characterization MIDTERM EXAM (October 7, 2010) 93 TOTAL POINTS

Expression Array System

Detection of FactorV Leiden (R506Q)

Supplementary Materials for. Combinatory screening of DNA aptamers for molecular imaging of HER2 in cancer

HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly

Gene Expression Technology

* + * RecA * + + for RecA-dependent strand + + MIT Department of Biology 7.28, Spring Molecular Biology

Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput

Genome annotation & EST

Introduction to CRISPR/Cas9 Background DNA Cleavage and Repair (NHEJ and HDR) Alternative Cas9 Variants Delivery of Cas9 and sgrna Library Products

Mutation analysis in cell-free DNA from cancer patients

Transcription:

COMPAS for the Analysis of SELEX Experiments COMPAS (COMmon PAtternS) is a software tool that was especially developed to harness the technology of next generation sequencing (NGS) to bring light into the black box of in vitro selection experiments. The functionalities of COMPAS enable (A) quality control of combinatorial starting libraries, (B) improved analysis of enriched libraries as well as (C) optimization of identified leads.! 1) 2) 3) 4) 5) 6) star%ng lib. A B C COMPAS op%mized lead 1) design & synthesis of starting lib." 2) in vitro selection! 3)! Next Generation Sequencing! 4) identification of monoclones" 5) screening of monoclones" 6) lead optimization" Workflow! COMPAS can be used to leverage and optimize existing selection platforms. After the selection procedure the PCR-amplicons of successive selection rounds can be indexed and combined for an economic NGS. COMPAS provides functionalities to assign dataset sequences to its origin-experiment. Now, analysis can be performed on data of defined experiments (e.g. selection rounds).! NGS COMPAS- Analysis in vitro selec%on procedure e.g. SELEX NGS- amplicon genera%on starting library screening round 1 screening round 2 repetitive cycles of enrichment and amplification indexing different experiments economic sequencing by combination of indexed amplicons in one lane individual and comparative analysis of experiments

Quality Check and Optimization of SELEX Libraries! Check of random distribution of nucleotides! Diverse combinatorial starting libraries with random regions built up by homogeneously distributed nucleotides motifs are essential for the development of ligands with the desired binding characteristics.! Distribution of nucleotides!... before library optimization!... after library optimization! COMPAS gives the probability of nucleotides over the random regions (diagram and table view), thus helps to control and optimize the synthesis of optimal libraries! Check random distribution of short motifs! The distribution of motifs gives a meassure for the diversity of the starting pool on the level of motifs. As motifs are the parts that finally mediate binding to the target molecule of interest, equally random distributed libraries are the better source for high quality ligands.! Distribution of motifs!... before library optimization!... after library optimization! The non optimized library shows an unbalanced distribution of motifs (here 6 Nt).! Whereas the optimal pool shows an Gaussian distribution of motifs.! è The better the distribution of nucleotides, the better the distribution of motifs, the better is the ligand space, the better the chance to end up with high quality ligands!

Quality Check and Optimization of SELEX Libraries! Check length of random region lengths at high resolu?on Also the length of the random region can be checked. In terms of random region length the figured library is fine. About 1.8 Mio sequences are of correct length!! Zooming allows to analyze at a resolution of a couple of sequences! COMPAS gives an overview of lengths of the variable regions of interest! Identification of sequences with higher copy number/check for contaminations! A typical starting libraries is expected to consist to 100% out of unique sequences. If you work with overrepresented libraries you can check the copy number of full length sequences here.! Also contaminations can be identified: Load name-databases with already known clones (e.g. plasic binders, clones binding to His-tags, ) to identify them by name.!

Analysis of Enriched Libraries! COMPAS Modes for Identification of Ligands! COMPAS has different methods for ligand identification. The conventional modes Sequence Counting and Fuzzy Search as well as the advanced Co-occurrence mode! The most powerful method is the Co-occurrence mode. It identifies ligands in the context of sequence families (patterns). Non-identical sequences that contain two frequent motifs build up so called protopatterns, which are subsequently grouped into sequence families according to a user defined degree of similarity of the entire sequence. This mode is even capable to identify a relevant sequence family if each of their members is still unique.!

Analysis of Enriched Libraries! Identification of aptamers in the context of families! Example of three sequence families identified with the co-occurrence mode. Non identical but related sequences (with a user defined degree of homology) are grouped into the same family. The patterns contain additional information: The consensus sequence including their relative and absolute number as well as the sequences of each family member including their absolute and relative number.! Radioactive filter binding analysis of aptamers predicted by COMPAS reveals:! 44 aptamers could be identified after the first SELEX round with the COMPAS approach ( ) whereas only 15 aptamers could be identified after nine SELEX cycles with the conventional approach ( ).! Affinity and selectivity of software predicted aptamers are in the same range as conventionally identified clones.! All clones identified with the conventional approach could be also identified with the COMPAS approach.!

Comparison of Enriched Libraries! COMPAS compares different experiments on a monoclonal level. Experiments can be the subsequent selection cyles (see A) as well as completely independent selection cycles that have been performed against closely related targets (see B)! A)!In silico tracking of monoclonal sequences over consecutive SELEX cycles! Ligand prediction from the first selection round can be confirmed on the in silico level.! Evolution behavior of ligands can be analyzed on monoclonal level.! The increase or decrease of defined candidates can be ascribed to the stringency that was applied in the respective SELEX round. è Learn how to select.! B)!In silico identification of aptamers addressing defined binding sites! Differential selection experiments against closely related targets - against wild type protein (wt) as well as against the target protein mutated at binding site of interest (mt) - enable the identification of differentially binding monoclones by comparative in silico analysis.! Discriminatory binding of clone MP 1-4 could already be predicted on the in silico level.! Selectivity for the wild-type protein could be confirmed in binding assay.!

Lead Optimization! Secondary screening of relevant sequence families enables the identification of conserved and variable regions! Once a relevant monoclonal sequence has been identified, the focus can be set on the other family members of this family.! Having a closer look on defined families: clustering of all members belonging to a relevant aptamer family! COMPAS gives additional information to which extend nucleotides can be replaced against others.! Already information on the in silico level that would have to be generated in conventional approaches in laborious doped selection experiments.!

Summary! The integration of NGS and COMPAS analysis into the in vitro selection workflow enables!... quality control of starting libraries.! Diversity! Distribution of NTs over sequence positions! Distribution of motifs! Distribution of random region length!... improved analysis of enriched libraries.! Ligand prediction in very early enrichment cycles! Identification of rare ligands (not accessible by conventional cloning and sequencing)! Prediction of ligands addressing defined binding sites! Clustering of ligands in families!... lead optimization and advanced patenting strategies.! Comprehensive information of conserved and variable sequence regions at high resolution! about With its complementary team of scientists in the fields of physics, molecular biology, informatics and medicine, AptaIT bundles knowledge for the development of high performance software solutions to leverage biomedical research and drug discovery.! AptaIT developed the software COMPAS, which efficiently harnesses the technology of deep sequencing for the discovery of therapeutic lead structures. In doing so, AptaIT s software COMPAS optimizes and shortens the laborious and cost intensive drug development process.!! AptaIT offers:! expertise and support to implement COMPAS in drug discovery procedures! service analysis of NGS datasets derived from in vitro selection experiments! licenses of the software tool COMPAS (coming soon)! AptaIT GmbH! Goethestraße 52! D-80336 Munich, Germany!! Phone:!+49 (0) 89-59918 123! Fax:!+49 (0) 89-59918 125! Email:!info@aptaIT.de! web:!www.aptait.de!