China National Grid --- BioNode. Jun Wang Beijing Genomics Institute

Similar documents
GREG GIBSON SPENCER V. MUSE

High peformance computing infrastructure for bioinformatics

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES

Genetics and Bioinformatics

Introduction to BIOINFORMATICS

Frumkin, 2e Part 1: Methods and Paradigms. Chapter 6: Genetics and Environmental Health

From Proteomics to Systems Biology. Integration of omics - information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

Overview of Health Informatics. ITI BMI-Dept

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Deakin Research Online

What are proteomics? And what can they tell us about seed maturation and germination?

The Ensembl Database. Dott.ssa Inga Prokopenko. Corso di Genomica

Research Powered by Agilent s GeneSpring

2017 Qualifying Examination

Bioinformatics. Dick de Ridder. Tuinbouw Digitaal, 12/11/15

Computational Challenges of Medical Genomics

Introduction to Bioinformatics

Era with Computational Biology/Toxicology

Agilent Genomics Software Future Directions

Grand Challenges in Computational Biology

Introduction. CS482/682 Computational Techniques in Biological Sequence Analysis

Rapid Transcriptome Characterization for a nonmodel organism using 454 pyrosequencing

Advanced Bioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2018

Bayer Pharma s High Tech Platform integrates technology experts worldwide establishing one of the leading drug discovery research platforms

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.

Bioinformatics Analysis of Nano-based Omics Data

SYLLABUS FOR BS BIOINFORMATICS (4-YEAR DEGREE PROGRAMME)

Fundamentals of Bioinformatics: computation, biology, computational biology

Transcriptome Assembly and Evaluation, using Sequencing Quality Control (SEQC) Data

Introduction to Bioinformatics

MARINE BIOINFORMATICS & NANOBIOTECHNOLOGY - PBBT305

Array-Ready Oligo Set for the Rat Genome Version 3.0

Cory Brouwer, Ph.D. Xiuxia Du, Ph.D. Anthony Fodor, Ph.D.

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

The Human Toxome Project a test case for pathway identification by multiomics. Thomas Hartung

Bioinformatics and computational tools

Introduction to 'Omics and Bioinformatics

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

The RNA tools registry

Random matrix analysis for gene co-expression experiments in cancer cells

Biology 644: Bioinformatics

METABOLOMICS: OPPORTUNITIES AND CHALLENGES

From genome-wide association studies to disease relationships. Liqing Zhang Department of Computer Science Virginia Tech

Outline and learning objectives. From Proteomics to Systems Biology. Integration of omics - information

Data Intensive Scientific Discovery Vijay Chandru

IPA : Maximizing the Biological Interpretation of Gene, Transcript & Protein Expression Data with IPA

Pioneering Clinical Omics

Minimum Information About a Microarray Experiment (MIAME) Successes, Failures, Challenges

Territory Account Manager (MD, NC or TX based)

Network System Inference

Proteomics and Cancer

Suberoylanilide Hydroxamic Acid Treatment Reveals. Crosstalks among Proteome, Ubiquitylome and Acetylome

Retrieval of gene information at NCBI

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

Soil invertebrates as a genomic model to study pollutants in the field

Nanoparticle risk assessment. practical solutions. systems toxicology

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis

Introduction to Bioinformatics and Gene Expression Technology

Capabilities & Services

Genomics and Bioinformatics GMS6231 (3 credits)

Introduction to Bioinformatics

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Advances in Biomedical Research at Comenius University Bratislava

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies

CSC 121 Computers and Scientific Thinking

AIT - Austrian Institute of Technology

Classification and Learning Using Genetic Algorithms

Bioinformatics for Proteomics. Ann Loraine

Tooling up for Functional Genomics

NPTEL VIDEO COURSE PROTEOMICS PROF. SANJEEVA SRIVASTAVA

Advanced Technology in Phytoplasma Research

The PHOENIX Center: the hub of proteomics in the age of big data

Introduction to iplant Collaborative Jinyu Yang Bioinformatics and Mathematical Biosciences Lab

Data representation for clinical data and metadata

The Gene Ontology Annotation (GOA) project application of GO in SWISS-PROT, TrEMBL and InterPro

Jade Q. Clement Associate Professor (Tenured) Department of Chemistry

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

ECS 234: Introduction to Computational Functional Genomics ECS 234

FROM DISCOVERY TO INSIGHT

DEVELOPING WEB TOOLS FOR DATA MINING AND ANALYSIS OF SAGE

What is Bioinformatics?

Multivariate Methods to detecting co-related trends in data

Introduction to Bioinformatics

ATIP Avenir Program 2018 Young group leader

A draft sequence of bread wheat chromosome 7B based on individual MTP BAC sequencing using pair end and mate pair libraries.

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview

ECS 234: Introduction to Computational Functional Genomics ECS 234

OPTICHINA. Breeding to Optimise Chinese Agriculture

Welcome! Introduction to High Throughput Genomics December Norwegian Microarray Consortium FUGE Bioinformatics platform

E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research

RESEARCH METHODOLOGY, BIOSTATISTICS AND IPR

Computers in Biology and Bioinformatics

Genome Informatics. Systems Biology and the Omics Cascade (Course 2143) Day 3, June 11 th, Kiyoko F. Aoki-Kinoshita

Next Generation Bioinformatics on the Cloud

Genomes contain all of the information needed for an organism to grow and survive.

Transcription:

China National Grid --- BioNode Jun Wang Beijing Genomics Institute

Core of life science and bio-tech: Getting, Mining, Applying the basic life information

Old China meets New China?

Sequencing, sequencing, and sequencing IS IT MY TURN? Typing, typing, and typing Functional analysis SNP in population and individuals HealthHealth-related Microarray Proteomics

Building Upon Genomics Cancer genomics Toxicogenomics Pharmacogenomics Systems Physiomics genomics Mechnomics Proteomics Metabolomics Transcriptomics & Proteomics Genoinformatics Genomics All gene products interact, construct distinct pathways, mechanisms and conduct physiological activities. All genes function through their RNA or protein products. All living organisms have their genomes. All the genes are encoded by the genomes.

Engine And Wheels Information generators : engine Information mangers and analyzer : wheels A car can not go fast without them.

Core of Computational Biology: Data analysis and mining Algorithm and software development High performance computing

http://biogrid.genomics.org.cn

System Architecture

Data Grid Three Main Parts Share/Integration/Analysis Rice/Chicken/Silkworm Genome Data National Data Bank? Computing Grid High Performance Computing Special Computing Services on Bioinformatics Software Packages? Knowledge Grid Distributed Annotation System Cooperation of Large Sequencing Project

Data Grid Rice Genome Database Data Download Computing Map Services View Over View Scaffold Gene View View cdna View Compare View

Data Grid Chicken Variation Database Data Download Computing Services MapView TraceView XML

Data Grid Silkworm Database Data & Statistics Over View Scaffold View MapView Search Report Tools&Services Download Schema

Computing Grid Based on CNGrid Five Main Node Specific Bioinformatics Application Genome Analysis Gene Predication Sequencing Alignment 20 TeroFlops 200 TB storage 10 TB memory

Computing Grid

Knowledge Grid Distributed Annotation System

Knowledge Grid Large scale Sequencing Project Cooperation Project-Oriented Project-Collaborated Real Time Management

Status quo Data Stat. Data Grid :1T Computing Grid:50G Access Stat. 2005-01-05~2005-10-25 Total:2,958,748 IPs 10,200 IPs/Day Services Stat. Average 2000 per Day

Funded projects High Performance Computer & Core Software Bioinformatics Apply Grid (BAG) of Chinese High-Tech Research & Development Plan. Applications in China Next Generation Internet (CNGI) Demonstration & Application of Bioinformatics Supported by NSFC Grid Computing Environment of Bioinformatics Supported by MOST

Developing More data More computing service based on the software package More applications based on the grid technology More information and experience shared

The science will get more complex

Distinct Steps To Systematically Understand Genome And Its Biology Improve Health And Design Better Crops Link Networks In A Background of Organisms Define Cellular Processes Into Systematic Networks Technology Integration Map Gene Products To Cellular Processes Map Genes And Their Expression Identify Genes And Other Elements Determine Genome Sequence and variation Project Integration

For Bio Part ultimate dream is to go from molecular studies to models of biology at all physical+temporal scales

from OMICS to systems biology

For Medical part: ultimate dream is develop effective predictive, preventive and personalized health care programs

Sino-UK collaboration(bgi-sanger) Ortholog Gene Database Integration of different software packages Human curation after automatic process HPC based on all sequenced genome

Collaboration Wellcome Trust Sanger Institute (UK) Center for Biological Sequence Analysis (Denmark) Dept. Proteomics and Signal Transduction Max-Planck Institute for Biochemistry (Germany)

Acknowledgement Chen Jie, Yuan Haifeng, Dai Mingtao, and many others at BGI s grid computing group Data generating platform at BGI Institute of Computing Technology, CAS