Genomes summary. Bacterial genome sizes

Similar documents
How does the human genome stack up? Genomic Size. Genome Size. Number of Genes. Eukaryotic genomes are generally larger.

BENG 183 Trey Ideker. Genotyping. To be covered in one 1.5 hr lecture

Multiple choice questions (numbers in brackets indicate the number of correct answers)

GENETICS EXAM 3 FALL a) is a technique that allows you to separate nucleic acids (DNA or RNA) by size.

CHAPTER 21 GENOMES AND THEIR EVOLUTION

3. human genomics clone genes associated with genetic disorders. 4. many projects generate ordered clones that cover genome

Chapter 20 DNA Technology & Genomics. If we can, should we?

Enzyme that uses RNA as a template to synthesize a complementary DNA

Chapter 6 Genomic Architecture. Molecular Structure of Genes and Chromosomes

A) (5 points) As the starting step isolate genomic DNA from

Computational Biology I LSM5191 (2003/4)

3I03 - Eukaryotic Genetics Repetitive DNA

Unit 6: Molecular Genetics & DNA Technology Guided Reading Questions (100 pts total)

7.03, 2005, Lecture 20 EUKARYOTIC GENES AND GENOMES I

POPULATION GENETICS studies the genetic. It includes the study of forces that induce evolution (the

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.

GENES AND CHROMOSOMES II

Biology 105: Introduction to Genetics PRACTICE FINAL EXAM Part I: Definitions. Homology: Reverse transcriptase. Allostery: cdna library

Name_BS50 Exam 3 Key (Fall 2005) Page 2 of 5

BIOLOGY - CLUTCH CH.20 - BIOTECHNOLOGY.

Genomics and Gene Recognition Genes and Blue Genes

CHAPTER 21 LECTURE SLIDES

Molecular Biology (2)

"PhD Course Fall 2017"

A. Incorrect! This statement is true. Transposable elements can cause chromosome rearrangements.

CHAPTERS , 17: Eukaryotic Genetics

_ DNA absorbs light at 260 wave length and it s a UV range so we cant see DNA, we can see DNA only by staining it.

Biotechnology Chapter 20

Lecture 12. Genomics. Mapping. Definition Species sequencing ESTs. Why? Types of mapping Markers p & Types

LECTURE 20. Repeated DNA Sequences. Prokaryotes:

Biology 201 (Genetics) Exam #3 120 points 20 November Read the question carefully before answering. Think before you write.

Applicazioni biotecnologiche

CHAPTERS 16 & 17: DNA Technology

SELECTED TECHNIQUES AND APPLICATIONS IN MOLECULAR GENETICS

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction

BIOTECHNOLOGY. Sticky & blunt ends. Restriction endonucleases. Gene cloning an overview. DNA isolation & restriction

Motivation From Protein to Gene

BIO 202 Midterm Exam Winter 2007

Gene mutation and DNA polymorphism

Chapter 20. Gene creatures, partii: jumping genes and junk DNA. Prepared by Woojoo Choi

Introduction to some aspects of molecular genetics

GENE REGULATION. Gene regulation occurs at the level of transcription or production of mrna

Bio 101 Sample questions: Chapter 10

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule;

d. reading a DNA strand and making a complementary messenger RNA

Chapter 15 Gene Technologies and Human Applications

Unit 8: Genomics Guided Reading Questions (150 pts total)

Chapter 5. Structural Genomics

Chapter 10 Genetic Engineering: A Revolution in Molecular Biology

Before starting, write your name on the top of each page Make sure you have all pages

Some representative viruses

The Little Things About the Little Things Inside of Us The Eukaryotic Genome and Its Expression

Molecular Genetics Techniques. BIT 220 Chapter 20

Genomes: What we know and what we don t know

Research techniques in genetics. Medical genetics, 2017.

Concepts: What are RFLPs and how do they act like genetic marker loci?

Genomes and Their Evolution

Methods for Working with DNA and RNA

2. Outline the levels of DNA packing in the eukaryotic nucleus below next to the diagram provided.

Genome annotation & EST

I. Prokaryotic Gene Regulation. Figure 1: Operon. Operon:

Computational gene finding

You are genetically unique

Authors: Vivek Sharma and Ram Kunwar

Test Bank for Molecular Cell Biology 7th Edition by Lodish

Introducing new DNA into the genome requires cloning the donor sequence, delivery of the cloned DNA into the cell, and integration into the genome.

MOLECULAR TYPING TECHNIQUES

Genome Projects. Part III. Assembly and sequencing of human genomes

CHAPTER 9 DNA Technologies

CS313 Exercise 1 Cover Page Fall 2017

Unit 6 DNA ppt 3 Gene Expression and Mutations Chapter 8.6 & 8.7 pg

Genome research in eukaryotes

Bio 121 Practice Exam 3

Sept 2. Structure and Organization of Genomes. Today: Genetic and Physical Mapping. Sept 9. Forward and Reverse Genetics. Genetic and Physical Mapping

Human genetic variation

Chapter 20 Recombinant DNA Technology. Copyright 2009 Pearson Education, Inc.

DESIGNER GENES - BIOTECHNOLOGY

Genes found in the genome include protein-coding genes and non-coding RNA genes. Which nucleotide is not normally found in non-coding RNA genes?

2054, Chap. 14, page 1

Gene is the basic physical and functional unit of heredity. A Gene, in molecular terms,

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

From DNA to Protein: Genotype to Phenotype

Midterm 1 Results. Midterm 1 Akey/ Fields Median Number of Students. Exam Score

Moayyad Al-shafei. Mohammad Tarabeih. Dr Ma'mon Ahram. 1 P a g e

Degenerate site - twofold degenerate site - fourfold degenerate site

Map-Based Cloning of Qualitative Plant Genes

Chapter 8: Recombinant DNA. Ways this technology touches us. Overview. Genetic Engineering

Chromosomal Mutations. 2. Gene Mutations

Introduction to Bioinformatics

Learning Genetic Transposition with an Interactive Computer Program

COURSE OUTLINE Biology 103 Molecular Biology and Genetics

Chapter 13: Biotechnology

Chapter 9 Genetic Engineering

Biotechnology and Genomics in Public Health. Sharon S. Krag, PhD Johns Hopkins University

Problem Set 8. Answer Key

Methods that do not require growth in laboratory PCR

Chapter 20 Biotechnology

BIOLOGY 205 Midterm II - 19 February Each of the following statements are correct regarding Eukaryotic genes and genomes EXCEPT?

2014 Pearson Education, Inc. CH 8: Recombinant DNA Technology

Using mutants to clone genes

Transcription:

Genomes summary 1. >930 bacterial genomes sequenced. 2. Circular. Genes densely packed. 3. 2-10 Mbases, 470-7,000 genes 4. Genomes of >200 eukaryotes (45 higher ) sequenced. 5. Linear chromosomes 6. On average, ~50% of gene functions known. 7. Human genome: <40,000 genes code for >120,000 proteins. Large gene families (e.g. 500 protein kinases) 98% of human DNA is noncoding. ~3% of human DNA = simple repeats (satellites, minisatellites, microsatellites) ~50% of DNA = mobile elements (DNA transposons, retrotransposons (LTR and nonltr) & pseudogenes) Bacterial genome sizes Predicted genes in bacterial species Mycoplasma genitalium 470 Mycoplasma mycoides 985 E. coli 4,288 B. anthracis 5,508 P. aeruginosa 5,570 Mycobacterium leprae 1,604 Mycobacterium tuberculosis 3,995 + ~930 sequenced microbial genomes (http://www.ncbi.nlm.nih.gov/sutils/genom_table.cgi) Small and large 1

Genome sizes Gene density down in mammals Bacterial genomes are circular and densely packed with genes - 1 E. coli. Genes (circles 1 & 2). B. anthracis. Genes (circles 1 & 2). 2

Bacterial genomes are circular and densely packed with genes - 2 M. tuberculosis (4.41 MB). Genes (circles 1 & 2). M. leprae (4.41 MB). Genes (circles 1 & 2), 1116 pseudogenes (circles 3 & 4). Representative gene arrangements in 50 kb segments of yeast, fly and human DNA. Few yeast genes contain introns (exons are blue). Genes above and below the line are transcribed in opposite directions. 3

Numbers and types of genes in different eukaryotes About half the genes encode proteins of unknown function. Human genome: <2% ORFs & 48% repeats Human genome: <40,000 genes Average ~3 proteins/gene 98% of DNA is noncoding Individuals 99.9% identical (1 difference/1000 bp means many markers for mapping). Large families of repeats. 481 sequences >200 bp that are absolutely conserved in mouse. Large gene families (E.g. ~500 Ser/Thr protein kinases many Zn 2+ fingers, etc.) 4

Human genome: individuals 99.9% identical For every 1000 people... Sequencing revealed one major allele for most genes in populations Human populations have not been genetically isolated for very long (~2-3 M years) Many variations have not had time to spread throughout populations. Human genome: individuals 0.1% different For every person... Lots of variation! 3.2 x 10 9 bp/genome x 0.001 changes/bp = 5

Human genome: individuals 0.1% different For every person... Lots of variation! 3.2 x 10 9 bp/genome x 0.001 changes/bp = 3.2 x 10 6 changes/genome Human genome: individuals 0.1% different For every person... Lots of variation! 3.2 x 10 9 bp/genome x 0.001 changes/bp = 3.2 x 10 6 changes/genome Two major types of variation SNPs Repeated DNA - short to long repeats Variations produce RFLPs (Restriction Fragment Length Polymorphisms)! 6

SNPs Single Nucleotide Polymorphisms (Changes of a single base) Some are neutral Some alter gene function Identifying SNPs Phenotype (disease), e.g Sickle cell anemia Sequencing genes/cdnas Restriction digest RFLPs Restriciton Fragment Length Polymorphisms (Changes of restriction enzyme sites) 7

RFLPs Restriciton Fragment Length Polymorphisms (Changes of restriction enzyme sites) For every random 3 x 10 6 SNPs: ~1/256 will be in 4-base restriction sites --> ~10 4 RFLPs for EACH four-base cutter! ~1/4096 will be in 6-base restriction sites --> ~ 7.5 x 10 2 RFLPs for EACH six-base cutter! Lots of markers (RFLPs) to map genes by linkage to RFLPs Human genome: 48% repeats Human genome: <40,000 genes Average ~3 proteins/gene 95% of DNA is noncoding Individuals 99.9% identical (1 difference/1000 bp means many markers for mapping). Large families of repeats. Satellites (micro, mini and conventional) Transposons Retrotransposons 8

Satellites Microsatellites: 1-13 bps in ~150 bp arrays Minisatellites: 15-100 bps in 1-5 kb arrays Satellites: 14-500 bps in 20-100 kb arrays Origins of length polymorphisms in simplesequence repeats. Generation of length differences by unequal crossing over in meiosis 9

Southern blotting detects DNA sequences by hybridization 1. Digest DNA using restriction enzyme(s) 2. Run gel 3. Transfer DNA from gel to (nitrocellulose) paper. 4. Denature DNA, hybridize probe DNA, and wash off excess probe. 5. Detect the probe on the paper. E.g. by autoradiography. Different distributions of minisatellites Three repeats (a, b, c) in 3 people (1, 2, 3) Southern blot of HinfI-digested DNA 10

RFLPs -- DNA finger print in a murder case Southern blot of DNA samples digested with a restriction enzyme Human genome: 48% repeats Human genome: <40,000 genes Average ~3 proteins/gene 95% of DNA is noncoding Individuals 99.9% identical (1 difference/1000 bp means many markers for mapping). Large families of repeats. Satellites (micro, mini and conventional) Transposons Retrotransposons 11

Two major classes of mobile elements Proks and euks DNA intermediate Eukaryotes RNA intermediate Some consequences of repeat sequences in eukaryotes Genomic diversity in individuals and species. The most common retrotransposon sequences in the human genome are derived from endogenous retroviruses (ERVs). Most of these >440,000 sequences consist only of isolated LTRs, which arise from recombination between the ends. Gene families arise by duplication and divergence. Pseudogenes arise from RT acting on mrnas. New genes arise by exon shuffling. 12

Exon shuffling may create new proteins in eukaryotes Mechanism 1: Recombination between homologous interspersed repeats in the introns of separate genes would produce a new combination of exons. Exon shuffling may create new proteins in eukaryotes Mechanism 2: Transposition of an exon (a) DNA hopping of flanking transposons (b) Reverse transcription of a LINE RNA extending into the 3 exon of gene 1 can produce a DNA that gives gene 2 a new 3 exon upon integration. 13

Possible results of exon shuffling 1. Modular proteins (with alternate splicing patterns). E.g. Fibronectin gene and mrna. 2. Separate proteins that form a complex in one organism are sometimes fused into a single polypeptide chain in another organism. C. elegans Ade 5,7,8 Yeast Pur2 Yeast Pur3 Genomes summary 1. >930 bacterial genomes sequenced. 2. Circular. Genes densely packed. 3. 2-10 Mbases, 470-7,000 genes 4. Genomes of >200 eukaryotes (45 higher ) sequenced. 5. Linear chromosomes 6. On average, ~50% of gene functions known. 7. Human genome: <40,000 genes code for >120,000 proteins. Large gene families (e.g. 500 protein kinases) 98% of human DNA is noncoding. ~3% of human DNA = simple repeats (satellites, minisatellites, microsatellites) ~50% of DNA = mobile elements (DNA transposons, retrotransposons (LTR and nonltr) & pseudogenes) 14

Model for DNA transposition in bacteria Structure of a eukaryotic LTR retrotransposon Two kinds: LINEs and SINEs Long Interspersed Elements: encode proteins including RT Short Interspersed Elements: deletion of protein-coding region ORF1=RNA binding protein; ORF2=RT and endonuclease. 15

Experiments with yeast Ty elements demonstrated an RNA intermediate Introns lost in transposed Tys! Summary: Two major classes of mobile elements Proks and euks DNA intermediate Eukaryotes RNA intermediate LINEs and SINEs 16