China National Grid --- BioNode Jun Wang Beijing Genomics Institute
Core of life science and bio-tech: Getting, Mining, Applying the basic life information
Old China meets New China?
Sequencing, sequencing, and sequencing IS IT MY TURN? Typing, typing, and typing Functional analysis SNP in population and individuals HealthHealth-related Microarray Proteomics
Building Upon Genomics Cancer genomics Toxicogenomics Pharmacogenomics Systems Physiomics genomics Mechnomics Proteomics Metabolomics Transcriptomics & Proteomics Genoinformatics Genomics All gene products interact, construct distinct pathways, mechanisms and conduct physiological activities. All genes function through their RNA or protein products. All living organisms have their genomes. All the genes are encoded by the genomes.
Engine And Wheels Information generators : engine Information mangers and analyzer : wheels A car can not go fast without them.
Core of Computational Biology: Data analysis and mining Algorithm and software development High performance computing
http://biogrid.genomics.org.cn
System Architecture
Data Grid Three Main Parts Share/Integration/Analysis Rice/Chicken/Silkworm Genome Data National Data Bank? Computing Grid High Performance Computing Special Computing Services on Bioinformatics Software Packages? Knowledge Grid Distributed Annotation System Cooperation of Large Sequencing Project
Data Grid Rice Genome Database Data Download Computing Map Services View Over View Scaffold Gene View View cdna View Compare View
Data Grid Chicken Variation Database Data Download Computing Services MapView TraceView XML
Data Grid Silkworm Database Data & Statistics Over View Scaffold View MapView Search Report Tools&Services Download Schema
Computing Grid Based on CNGrid Five Main Node Specific Bioinformatics Application Genome Analysis Gene Predication Sequencing Alignment 20 TeroFlops 200 TB storage 10 TB memory
Computing Grid
Knowledge Grid Distributed Annotation System
Knowledge Grid Large scale Sequencing Project Cooperation Project-Oriented Project-Collaborated Real Time Management
Status quo Data Stat. Data Grid :1T Computing Grid:50G Access Stat. 2005-01-05~2005-10-25 Total:2,958,748 IPs 10,200 IPs/Day Services Stat. Average 2000 per Day
Funded projects High Performance Computer & Core Software Bioinformatics Apply Grid (BAG) of Chinese High-Tech Research & Development Plan. Applications in China Next Generation Internet (CNGI) Demonstration & Application of Bioinformatics Supported by NSFC Grid Computing Environment of Bioinformatics Supported by MOST
Developing More data More computing service based on the software package More applications based on the grid technology More information and experience shared
The science will get more complex
Distinct Steps To Systematically Understand Genome And Its Biology Improve Health And Design Better Crops Link Networks In A Background of Organisms Define Cellular Processes Into Systematic Networks Technology Integration Map Gene Products To Cellular Processes Map Genes And Their Expression Identify Genes And Other Elements Determine Genome Sequence and variation Project Integration
For Bio Part ultimate dream is to go from molecular studies to models of biology at all physical+temporal scales
from OMICS to systems biology
For Medical part: ultimate dream is develop effective predictive, preventive and personalized health care programs
Sino-UK collaboration(bgi-sanger) Ortholog Gene Database Integration of different software packages Human curation after automatic process HPC based on all sequenced genome
Collaboration Wellcome Trust Sanger Institute (UK) Center for Biological Sequence Analysis (Denmark) Dept. Proteomics and Signal Transduction Max-Planck Institute for Biochemistry (Germany)
Acknowledgement Chen Jie, Yuan Haifeng, Dai Mingtao, and many others at BGI s grid computing group Data generating platform at BGI Institute of Computing Technology, CAS