Bayer Computational Life Science at AWS
February 21st, 2017
Dr. Oliver Fortmeier
Agenda
01 Bayer
02 Use Case: Infrastructure for Genome Assembly
03 Summary
Our Mission
Bayer: Science For A Better Life
The steadily growing and aging global population has a need for new and better medicines and for an adequate supply of safe food. Our innovations offer answers to these challenges. We invent new molecules which can positively influence the biochemical processes in living organisms with the goal of improving the quality of life. That is what our mission "Bayer: Science For A Better Life" stands for.
Our Business Areas
- Pharmaceuticals: prescription drugs
- Consumer Health: over-the-counter medicines, dietary supplements, dermatology products, foot care and sunscreen
- Crop Science: innovative crop protection and seeds
- Animal Health
Our Products
Exemplary Life Science Use Case: Genome Assembly
Image: http://assets.illumina.com/content/dam/illumina-marketing/images/techniques/solutions/complex-disease-inset.jpg
- Genome assembly demands substantial compute power (high-performance computing).
- Assembling a plant genome may be more complex than assembling a human genome.
Whole genome shotgun sequencing
Image partially taken from "Computational Biology Methods and Their Application to the Comparative Genomics of Endocellular Symbiotic Bacteria of Insects." Biol. Procedures Online (2009). Accessed via SpringerImages.
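To make the idea behind shotgun assembly concrete, here is a minimal Python sketch that rebuilds a short sequence from overlapping reads with a greedy longest-overlap merge. It is a toy illustration only, not the pipeline used in the presented use case: production assemblers work on overlap or de Bruijn graphs and must handle sequencing errors, repeats and reverse complements. The function names and the `min_len` threshold are assumptions made for this example.

```python
from itertools import permutations
from typing import List


def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b`.

    Returns 0 if no overlap of at least `min_len` characters exists.
    """
    max_len = min(len(a), len(b))
    for n in range(max_len, min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads: List[str], min_len: int = 3) -> List[str]:
    """Repeatedly merge the two reads with the longest overlap.

    Stops when a single contig remains or no pair overlaps by
    at least `min_len` characters.
    """
    contigs = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while len(contigs) > 1:
        best_len, best_a, best_b = 0, None, None
        for a, b in permutations(contigs, 2):
            n = overlap(a, b, min_len)
            if n > best_len:
                best_len, best_a, best_b = n, a, b
        if best_len == 0:
            break  # remaining contigs cannot be joined
        contigs.remove(best_a)
        contigs.remove(best_b)
        contigs.append(best_a + best_b[best_len:])
    return contigs


if __name__ == "__main__":
    # Three error-free reads covering a made-up 15-base sequence.
    reads = ["CATCCGA", "AGCTTAGG", "TAGGCATC"]
    print(greedy_assemble(reads))
```

Every merge step compares all pairs of contigs, so the cost grows quickly with the number of reads. That scaling is one reason real genome assembly needs high-performance computing.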
General setup: secure multi-account
Diagram components: Data Account, Compute Account, VPC Compute, VPC Provision
Compute Account: virtual High-Performance Computing (vhpc)
Diagram components (VPC Compute):
- Endpoint to S3 data
- Access to AWS API
- Security groups (SG): VPC internal, Login, DataMover, Monitor, Batch
- EC2 instances: Login, DataMover, Monitor, Batch, compute fleet, file system
- Placement group
Design considerations
- Security: clearly defined entry and exit points; no internet access; IAM roles for instances instead of access keys
- Performance: placement group; sizing of machines
- Usability: auto-scaling via the batch system
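Two of the design considerations above, IAM roles instead of access keys and a placement group for the compute fleet, correspond to request parameters of the EC2 `RunInstances` API. The sketch below only builds the parameter dictionary in the shape boto3's `run_instances` accepts, so it runs without AWS credentials. The helper name and all IDs are hypothetical placeholders, not values from the presented setup.

```python
from typing import Any, Dict, List


def build_compute_launch_params(
    ami_id: str,
    instance_type: str,
    count: int,
    subnet_id: str,
    security_group_ids: List[str],
    instance_profile_name: str,
    placement_group_name: str,
) -> Dict[str, Any]:
    """Assemble keyword arguments for an EC2 RunInstances request.

    Hypothetical helper for illustration: it makes no AWS call itself.
    """
    return {
        "ImageId": ami_id,
        "InstanceType": instance_type,
        "MinCount": count,
        "MaxCount": count,
        # Private subnet and security groups define the entry and exit points.
        "SubnetId": subnet_id,
        "SecurityGroupIds": list(security_group_ids),
        # Instances receive temporary credentials through an IAM role
        # (attached via an instance profile), so no access keys are stored.
        "IamInstanceProfile": {"Name": instance_profile_name},
        # A placement group keeps the compute fleet close together
        # for low-latency networking.
        "Placement": {"GroupName": placement_group_name},
    }


if __name__ == "__main__":
    # All identifiers below are made-up placeholders.
    params = build_compute_launch_params(
        ami_id="ami-00000000",
        instance_type="c4.8xlarge",
        count=4,
        subnet_id="subnet-00000000",
        security_group_ids=["sg-00000000"],
        instance_profile_name="vhpc-compute-profile",
        placement_group_name="vhpc-compute-pg",
    )
    print(params)
    # With boto3 installed and credentials configured, the request could be
    # sent as: boto3.client("ec2").run_instances(**params)
```

Keeping parameter construction separate from the API call makes the launch configuration easy to review and test, which fits an Infrastructure as Code workflow.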
Workflow: Provisioning a vhpc
Diagram stages, in order: Base RHEL/CentOS AMI, Base Bayer AMI, vhpc AMIs, EC2 instances, vhpc instances
Infrastructure as Code
- Agility and flexibility
- Quality
- Security
- Immutable infrastructure
Summary
Take-home messages
- Presented use case: genome assembly
- Infrastructure for virtual HPCs, covering security, performance and flexibility
- Automated provisioning of on-demand cloud services makes R&D projects feasible
- Innovative R&D projects require high flexibility:
  - Requirements are not known at the beginning of projects
  - Requirements are constantly changing
- Objective: compute anywhere, i.e., bring the code to the data
Our requests
- Increased network performance
- Encrypted Elastic File System (EFS)
- AWS API listening on a smaller range of IPs
- Transparency of resources
Our benefits
- Infrastructure as Code: flexibility and security
- On-demand and scalable: feasibility and performance
- Secure environments: usability for life science
- Stable environment: enterprise ready
Disclaimer
Forward-Looking Statements: This presentation may contain forward-looking statements based on current assumptions and forecasts made by Bayer Group management. Various known and unknown risks, uncertainties and other factors could lead to material differences between the actual future results, financial situation, development or performance of the company and the estimates given here. These factors include those discussed in Bayer's public reports, which are available on the Bayer website at www.bayer.com. The company assumes no liability whatsoever to update these forward-looking statements or to conform them to future events or developments.
Legal Notice: The product names designated with a trademark symbol are brands of the Bayer Group or our distribution partners and are registered trademarks in many countries.
Thank you for your attention