Epigenetics Evolution and Replacement Histones: Evolutionary Changes at Drosophila H2AvD

Size: px
Start display at page:

Download "Epigenetics Evolution and Replacement Histones: Evolutionary Changes at Drosophila H2AvD"

Transcription

1 Journal of Data Mining in Genomics &Proteomics Journal of Data Mining in Genomics & Proteomics Matsuo and Kakubayashi, J Data Mining Genomics Proteomics 2017, 8:1 DOI: / Research Article OMICS International Epigenetics Evolution and Replacement Histones: Evolutionary Changes at Drosophila H2AvD Matsuo Y 1-3* and Kakubayashi N 1 1 Graduate School of Integrated Arts and Sciences, Tokushima University, Tokushima, Japan 2 Institute of Socio-Arts and Sciences, Tokushima University, Tokushima, Japan 3 Graduate School of Science and Technology, Tokushima University, Tokushima, Japan * Corresponding author: Matsuo Y, Graduate School of Science and Technology, Tokushima University, Minamijosanjima-cho 2-1, Tokushima, , Japan, Tel: ; matsuo.yoshinori@tokushima-u.ac.jp Received date: Nov 1, 2016; Accepted date: Nov 18, 2016; Published date: Nov 30, 2016 Copyright: 2016 Matsuo Y, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Abstract The evolutionary changes in the Drosophila H2A and H2AvD genes, which encode histones, were analyzed using the sequences of 12 Drosophila sp. for understanding the evolution of histone replacement and epigenetics. The Ball gene, coding for a histone threonine kinase, was located head-to-head with the H2AvD gene in seven Drosophila sp. A strongly conserved DNA sequence was also found in the region upstream of the H2AvD gene; this sequence is most likely a transcriptional signal, because the sequence was also conserved in four other Drosophila sp. that did not have an upstream Ball gene. The SPARC gene, coding for a calcium-binding domain, was located tail-to-tail in the region downstream of the H2AvD gene in 11 Drosophila sp. studied. A moderately conserved DNA sequence was found in the H2AvD gene region at the splicing site in the first intron. Different codon usages for the H2A and H2AvD genes were found for 11 of 17 amino acids, and codon usages characteristic of replacement histones (H2AvD, H4r, H3.3A and H3.3B) were found for amino acids. Codon usage was considerably different at several histone modification sites in the H2A gene. These results suggested that unlike the H3.3 and H4r genes, not only post-transcriptional control, but also transcriptional control played a role in the H2AvD gene. In addition to posttranscriptional controls, such as splicing and translation, the development of a control system for transcription must have occurred during the evolution of histone replacement and epigenetic systems. Keywords: Epigenetics; Histone replacement; H2A; H2AvD; Drosophila Introduction Since epigenetics plays an important role in phenotypic expression and many biological phenomena [1-5], the evolutionary mechanism of phenotypic evolution can be studied by investigating the evolution of epigenetics [6,7]. Histones, which maintain the nucleosome structure, are proteins involved in the packing of DNA [8]. There are two types of histones according to the timing of its expression: Replicationdependent (C: canonical type) and replication-independent (R: replacement or variant type) histones [9]. Nucleosome remodelling is thought to be caused by changes in the nucleosome structure due to the modification of histones or the replacement of histones with different modification states [1-3,5]. Gene expression is controlled partly by nucleosome remodelling, that is, phenotypic expression is controlled in part by epigenetics [3,10]. Studying how these two types of histones (C and R) have evolved may enable us to understand the evolution of histone replacement and epigenetics, leading to the elucidation of the evolutionary mechanism of phenotypes [6,7]. Analyses of genetic variability, molecular evolution, concerted evolution, codon bias and GC content have been conducted and reported for histone genes of the C type in Drosophila [11-21]. In addition to H3.3A, H3.3B and H4r, H2AvD is known to be a histone of the R type in Drosophila [22,23]. H2AvD was first reported to be a variant of histone 2A (H2A) [22]. Unlike the H2A gene, the H2AvD gene is a single-copy gene containing three introns [22,23]. Histone is a basic protein and is composed of many basic amino acids, lysine and arginine. Histones for H2A and H2AvD in Drosophila are small proteins of 123 and 140 amino acids, respectively. The primary structures are evolutionary conservative but not highly as H3 nor H4. Together with histone 3 (H3) and histone 4 (H4), H2A and histone 2B (H2B) are assembled into the histone core of the nucleosome [8]. H2AvD takes part in nucleosome remodelling by replacing H2A [22,24-25]. Recently, analyses of the histone genes of the R type, H4r, H3.3A and H3.3B, have been reported in Drosophila [6,7]. In this paper, the H2AvD gene was analyzed as a remaining R type histone gene for analysis in Drosophila. Materials and Methods Data analysis The nucleotide sequences of the regions flanking and including the histone 2A genes (H2A and H2AvD) from each of 12 Drosophila sp. (D. simulans, D. sechellia, D. melanogaster, D. yakuba, D. erecta, D. ananassae, D. pseudoobscura, D. persimilis, D. willistoni, D. mojavensis, D. virilis and D. grimshawi) were obtained from either FlyBase (FB ) or the DNA Databank of Japan (DDBJ) (Table 1). Clustal W (ver. 2.1) was used for multiple sequence alignment of DNA and protein [26]. Codon usage analysis The codon usage of each amino acid in two histone 2A genes (H2A and H2AvD) was calculated.

2 Page 2 of 10 Species H2A gene H2AvD gene D. melanogaster X14215 (DDBJ) CG5499 (FlyBase) D.simulans AB (DDBJ) GD21310 (FlyBase) D.sechellia AB (DDBJ) GM10348 (FlyBase) D.yakuba AB (DDBJ) GE23696 (FlyBase) D.erecta AB (DDBJ) GG11506 (FlyBase) D. ananassae GF20540 (FlyBase) GF16458 (FlyBase) D. pseudoobscura AB (DDBJ) GA18930 (FlyBase) D. persimilis GL17803 (FlyBase) GL27227 (FlyBase) D. willistoni GK19250 (FlyBase) GK13546 (FlyBase) D. mojavensis GI23444 (FlyBase) GI23052 (FlyBase) D. virilis GJ15536 (FlyBase) GJ15170 (FlyBase) D. grimshawi GH23875 (FlyBase) GH10352 (FlyBase) Table 1: Analysis of the H2A and H2AvD genes in 12 Drosophila sp. Nucleotide sequence data were obtained from FlyBase (annotation ID) or the DDBJ (accession number). The number of use of each codon was summed over 12 Drosophila sp. Because differences in codon usage are known to be present among Drosophila sp. for a given gene, to compare under the same conditions, the summed data of 12 species were analyzed. Results Genomic organization of the H2AvD gene in 12 Drosophila sp. In contrast to the tandem cluster array of C type histone genes [11-13,16-17,19-21,27], H2AvD is a single-copy gene with three introns located at IIIR97D3 in Drosophila melanogaster (Figure 1) [22,23]. The last exon contains a long 3 untranslated region. Neighbouring genes and the transcriptional orientation may provide information on gene expression control, e.g., whether they are coregulated or not, as found for the H3-H4 gene pair [28]. The genomic arrangements of the H2AvD gene in the 12 Drosophila sp., whose complete genomes have been sequenced, are shown diagrammatically in Figure 2. The genes in the upstream and downstream regions of the H2AvD gene and its transcriptional orientation are also shown. Although the genomic information for the H2AvD gene of D. virilis was available, no information was available for the upstream and downstream regions of the gene (Figure 2). Figure 1: Gene structure of H2AvD in Drosophila melanogaster. The chromosomal location of the gene is IIIR97D3. The coding region is colored in red.

3 For seven Drosophila sp. (D. simulans, D. sechellia, D. melanogaster, D. yakuba, D. erecta, D. ananassae and D. willistoni), the H2AvD gene was paired with the Ball gene in a head-to-head manner (Figure 2). The Ball gene encodes an enzyme with histone threonine kinase activity, suggesting its functional relatedness to the H2AvD gene in histone modification. Since the H2AvD-Ball gene pairing was not observed in all 11 species, a functional relation, if any, is restricted to Page 3 of 10 the 7 species. In addition, the DSRB gene, which encodes a doublestranded RNA-binding domain, was located upstream of the H2AvD gene in 2 species (D. pseudoobscura and D. persimilis). A gene encoding pre-mrna splicing Prp18-interacting factor was located upstream of the H2AvD gene in the two remaining species (D. mojavensis and D. grimshawi). Figure 2: Gene arrangements of Ball (light green), H2AvD (red), SPARC (blue), DSRB (yellow) and Splicing factor (purple) in Drosophila genomes. The orientation of transcription is indicated by arrows. The phylogenetic relationship between the 12 Drosophila sp. is indicated on the right side of the Figure. The outside regions of the H2AvD gene in D. virilis were not available (dotted regions). A SPARC gene was located downstream of the H2AvD gene in a tail-to-tail configuration in all 11 species for which the nucleotide sequences were available (except for D. virilis). The SPARC gene encodes Osteonectin-like protein that contains a calcium-binding domain. The conservation of the H2AvD and SPARC gene pairing in a broad range of Drosophila sp. suggests its involvement in the control of expression. Conserved nucleotide sequences in the H2AvD gene DNA sequences with important functions can be detected by investigating their evolutionary conservation between distantly related species. The upstream, downstream and intron regions of the H2AvD gene were investigated in the 12 Drosophila sp. A sequence of 17 base pairs (bp) that was strongly conserved among the 12 species, 5 -CN(C/ T)T(G/C)ACGGCGTCT(T/C)(T/A)T-3, was identified in the upstream region of the H2AvD gene (Figure 3). The DNA sequence was located bp upstream of the transcription start site of the H2AvD gene, suggesting that it is an important signal for transcription. The sequence was located between two divergently transcribed genes, Ball and H2AvD, in 7 species. However, the possibility of a coregulation for both genes is small, because the sequence was also present in four other Drosophila sp. that did not have a Ball gene in the upstream region of the H2AvD gene. None of the other DNA sequences in the upstream regions of the H2AvD gene were found to be conserved among the 12 species. Other DNA sequences that were found to be conserved within the gene are shown in Figures 4 and 5. Short DNA sequences (3-9 bp) were intermittently preserved in the region of exon 1 and intron 1 in the H2AvD gene (Figure 4). One of the conserved sites in Figure 4 corresponded to a first splicing site of exon 1 and was considered to be a recognition sequence for a splicing complex or spliceosome. None of the other splicing sites showed any conservative sequences, except at exact donor and acceptor sites. In the regions for intron 2 and the 3 untranslated regions of exon 4 in the H2AvD gene (Figure 5), short DNA sequences (3-8 bp) were intermittently preserved. The functional importance of these sequences is unknown. A DNA fragment insertion of approximately 2 kb in length was found in exon 4 of D. yakuba. For all 11 species, the H2AvD gene was paired with the SPARC gene in a tail-to-tail manner (Figure 2); however, the distance between the two genes, as measured by the transcriptional region in D. melanogaster, was only 17 bp, and no conserved sequence was recognized in this region. These facts suggested that the H2AvD gene is controlled, at the least, by transcription via a signal sequence upstream of the gene.

4 Page 4 of 10 Figure 3: Comparison of the nucleotide sequences in the upstream region of the H2AvD genes in 12 Drosophila sp. The nucleotide sequences of 12 Drosophila sp. are aligned. Asterisks under the sequences indicate nucleotide identity at the respective sites in the 12 Drosophila sp. The regions are labeled with the same colors as in Figure 1. MEL: D. melanogaster, SIM: D. simulans, SEC: D. sechellia, YAK: D. yakuba, ERE: D. erecta, ANA: D. ananassae, PSE: D. pseudoobscura, PER: D. persimilis, WIL: D. willistoni, VIR: D. virilis, MOJ: D. mojavensis, GRI: D. grimshawi. Figure 4: Comparison of the nucleotide sequences of the first exon and the first intron of the H2AvD genes in 12 Drosophila sp. The nucleotide sequences of 12 Drosophila sp. are aligned. Asterisks under the sequences indicate nucleotide identity at the respective sites in the 12 Drosophila sp. The regions are labeled with the same colors as in Figure 1. The second segment corresponds to the splicing site. MEL: D. melanogaster, SIM: D. simulans, SEC: D. sechellia, YAK: D. yakuba, ERE: D. erecta, ANA: D. ananassae, PSE: D. pseudoobscura, PER: D. persimilis, WIL: D. willistoni, VIR: D. virilis, MOJ: D. mojavensis, GRI: D. grimshawi.

5 Page 5 of 10 Figure 5: Comparison of the nucleotide sequences of the second intron and the fourth exon of the H2AvD genes in 12 Drosophila sp. The nucleotide sequences of 12 Drosophila sp. are aligned. Asterisks under the sequences indicate nucleotide identity at the respective sites in the 12 Drosophila sp. The regions are labeled with the same colors as in Figure 1. MEL: D. melanogaster, SIM: D. simulans, SEC: D. sechellia, YAK: D. yakuba, ERE: D. erecta, ANA: D. ananassae, PSE: D. pseudoobscura, PER: D. persimilis, WIL: D. willistoni, VIR: D. virilis, MOJ: D. mojavensis, GRI: D. grimshawi. Evolutionary changes in the coding regions of the H2A and H2AvD genes in Drosophila The alignment of amino acid sequences of the histones encoded by the H2A and H2AvD genes of 12 Drosophila sp. are shown in Figure 6. The amino acid sequences, excluding the start methionine codon, for H2A were 123 residues in length and were completely identical among the 12 species. No amino acid substitution was found. The amino acid sequences of the histones encoded by the H2AvD gene of 12 Figure 6: Comparison of the amino acid sequences of the H2A and H2AvD proteins in 12 Drosophila sp. The amino acid sequences of 12 Drosophila sp. are aligned. Asterisks under the sequences indicate amino acid identity at the respective sites in the 12 Drosophila sp. MEL: D. melanogaster, SIM: D. simulans, SEC: D. sechellia, YAK: D. yakuba, ERE: D. erecta, ANA: D. ananassae, PSE: D. pseudoobscura, PER: D. persimilis, WIL: D. willistoni, VIR: D. virilis, MOJ: D. mojavensis, GRI: D. grimshawi.

6 Page 6 of 10 Drosophila sp. were 140 amino acid residues in length and they were also highly similar (Figure 6). Three amino acid substitutions were found in H2AvD among the 12 Drosophila sp.: a D-E substitution at position 123, a T-S substitution at position 124, and a V-P substitution at position 134 (Figure 6). The average amino acid identity between the sequences of the two H2AvD proteins was 99.3%, and the differences were more frequently found near the C-terminal end. As described previously [15,19-20], several amino acid substitutions and indels were found between H2A and H2AvD (Figure 6). The total length of the two histones differed by 17 amino acid residues. The codon usage for amino acids and the GC content at the third codon position were compared between the two H2A genes (H2A and H2AvD) (Figures 7 and 8). Significant differences in codon bias were found in the H2A genes for 11 of 17 amino acids (Table 2). The remaining three amino acids were Met, Trp, and Cys. The former two were coded by a unique codon and the latter was not used in H2A. Seven amino acids, including Lys and Arg, showed highly significant differences (Table 2). Of these 11 amino acids, six (Gln, Asn, Ile, Ala, Arg and Lys) were identified as amino acids with significant differences in previous studies of the H3 and H4 genes [6,7]. The pattern of codon usage for these six amino acids is shown in Figure 9. The codon usage for three amino acids, excluding Lys, Ile and Asn, indicated respective patterns for the C and R types. For example, for the Arg sites, CGU was predominant for the C type, but CGC was predominant for the R type, except for the H3.3B gene. The R type more predominantly used G or C at the third codon position when compared to the C type. In the codon usage for the five remaining amino acids (Val, Pro, Tyr, Gly and Ser), a significant difference was found for H2A, but neither for H3 nor H4, as shown in Figure 10. Similar to the previous pattern, many amino acids, with the exception of Ser, had codon usage patterns that were distinctive for each histone type, C or R. This can also be seen by the comparison of GC content at the third codon position for the two H2A genes (Figure 8). Previous analyses of the H3 and H4 genes showed higher GC content at the third codon position in R-type genes [6,7]. As for the H2A gene (Figure 8), most species, with the exception of three, showed a similar tendency. In D. pseudoobscura, D. persimilis and D. mojavensis, the GC content of the two H2A genes was approximately the same. Figure 7: Codon usage in the H2A and H2AvD histone genes in Drosophila. Numbers within the parentheses indicate the number of residues of that amino acid in the H2A and H2AvD of D. melanogaster. Codon usage was also analyzed site-by-site to study the relationship between codon usage and histone modification for H2A (Figure 11). Four histone modifications (methylation, acetylation, phosphorylation and ubiquitylation) for four amino acids (Lys, Arg, Thr and Ser) were analyzed [2,29]. For the Ser sites, UCU was used most of the time at positions C1 (first site of the C type) and R124 (124th site of the R type). C1 is a phosphorylation site. For the Thr sites, ACC was predominant at positions R103 and C119, unlike in other sites. C119 is a modification site. For the Arg sites, AGA was predominant only at position C76. C76 is a methylation site. For the Lys sites, AAA was predominant at positions R4, C5, C8, R77 and R79. C5 and C8 are acetylation sites. These results indicated that amino acid sites showing a different codon bias when compared to other sites are frequently histone modification sites. This is consistent with the results obtained from previous studies on H3 and H4 genes, suggesting a connection between translation and histone modification.

7 Page 7 of 10 Figure 8: GC content at the third codon positions of the H2A (blue) and H2AvD (red) genes in Drosophila. Discussion Studying how the two H2A genes (H2A and H2AvD) have functionally differentiated and evolved is expected to provide information that will enable us to understand the evolution of histone replacement and epigenetics. The SPARC gene was located downstream of the H2AvD gene in all Drosophila sp. studied. It is not easy to understand why the Osteonectin-like gene was located tail-totail. It is possible that the two genes, H2AvD and SPARC, have to be expressed under similar nucleosome conditions for some reasons and are thus located next to each other, or that their close proximity was simply coincidental. The Ball gene was located upstream of the H2AvD gene in seven of 11 Drosophila sp. studied. Although the Ball gene encoding an enzyme with histone threonine kinase activity is functionally related to the H2AvD gene, it remains unknown why the relation is restricted to these 7 species. Is it related to a specific feature of these 7 species? A strongly conserved DNA sequence was found in the region upstream of the H2AvD gene. The conserved sequence was located bp upstream of the gene, indicating an important role for it in the transcription of the H2AvD gene. Functional divergence in transcriptional control was found for the H2AvD gene, but not for the H3.3A, H3.3B or H4r genes [6,7]. The genes for histones of the R type contained several introns [23,30-31]. A possibility of expression control via splicing was found for the H3.3B and H2AvD genes from the observation of conservative sequences at the first splicing site [6,7]. The reasons why they were found only in the first intron remain unknown, but we speculated that the first intron plays some roles in expressing the genes. It is also unclear whether the Splicing factor gene located upstream of the H2AvD gene in 2 species, D. mojavensis and D. grimshawi, is relevant for the splicing site in the first intron of the H2AvD gene. The two H2A proteins, H2A and H2AvD, showed high similarity in amino acid sequence and are therefore supposed to have a similar function [22,32]. However, compared to the differences in amino acids between the C and R types, the differences within the H2A proteins (H2A and H2AvD) seemed to be small. A possible reason for this is that if the two proteins diverged from a common ancestor, they diverged fairly early and well before the divergence of Drosophila [33]. Alternatively, the two types of proteins may have evolved as similar, but different proteins under different selection pressures. The differences between the two types of proteins may result in some functional differences [29,34]. In addition to the purifying selection caused by strong constraints on the amino acid sequence as a histone 2A, the weak selection for synonymous sites, the force of which is small as it is affected by the population size [14,18,20,35], operates differently for the C and R types. The difference in codon usage observed for the two types was possibly caused by weak selection [18,20,36]. The distinctive codon usage for the C or R types could affect the speed and efficiency of translation, leading to different amounts of gene expression. Consistent with previous results for the H3 and H4 genes [6,7], a marked codon bias was found for some amino acid sites that are supposed to be modified. The stronger bias at histone modification sites when compared to other sites indicated a higher selection coefficient at the modification site. As for the timing of histone modification, it is thought to have occurred post-translationally. However, a recent report showed that one of the histone modifications had occurred during translation [37]. These results suggested that histones could be modified through specific trnas. It is highly likely that the accumulated weak selection acting on many sites enabled the generation of different types of histones. Changes in gene structure as well as in the control of transcription, splicing and translation may have caused the evolution of epigenetics, leading to the evolution of phenotypes.

8 Page 8 of 10 Figure 9: Comparison of the codon usage in replacement histones and replication-dependent histones of the H2A, H3 and H4 genes in Drosophila. Codon usages of six amino acids that were significantly different in H2A, H3 and H4 are compared. Figure 10: Comparison of the codon usage in replacement histones and replication-dependent histones of the H2A, H3 and H4 genes in Drosophila. Codon usages of five amino acids that were significantly different in H2A, but not in H3 nor H4, are compared.

9 Page 9 of 10 Figure 11: Codon usage at each amino acid site of the respective histone 2A genes in Drosophila. The locations of histone modifications are indicated by * acetylation, ** methylation, *** acetylation and ubiquitylation and # phosphorylation after the position number. The protein length of H2A from human/mouse and Drosophila differed by six amino acids. Indels in human/mouse and Drosophila may have caused gaps in the amino acid numbers defined from the N-terminal end for each species; therefore, caution is needed regarding the position numbers of the modification sites. Amino acid χ 2 d.f. Tyr 4.19 * 1 Thr Gly 8.87 * 2 a Arg *** 5 Lys *** 1 Leu Ala *** 3 His Val *** 3 Asp Asn 4.91 * 1 Ile 7.73 * 2 Gln *** 1 Pro *** 3 Ser *** 5 Glu Phe Table 2: χ 2 tests for codon bias in the H2A and H2AvD genes in 12 Drosophila sp. *** P<0.001, ** P<0.01, * P<0.05, d.f.: degree of freedom, a When the expected value was smaller than 1, it was combined with the next smallest value. Acknowledgement This research was supported by a Grant-in-Aid for Scientific Research to Y. M. from the Ministry of Education, Culture, Sports, Science and Technology of Japan. References 1. Grunstein M (1997) Histone acetylation in chromatin structure and transcription. Nature 389: Kouzarides T (2007) Chromatin modifications and their function. Cell 128: Feng S, Jacobsen SE, Reik W (2010) Epigenetic reprogramming in plant and animal development. Science 330:

10 Page 10 of Rothbart SB, Strahl BD (2014) Interpreting the language of histone and DNA modifications. Biochem Biophys Acta 1839: Weber CM, Henikoff S (2014) Histone variants: dynamic punctuation in transcription. Genes Dev 28: Yamamoto Y, Watanabe T, Nakamura M, Kakubayashi N, Saito Y, et al. (2016) Epigenetics evolution and replacement histones: Evolutionary changes at Drosophila H4r. J Phylogenet Evol Biol 4: Matsuo Y, Kakubayashi N (2016) Epigenetics evolution and replacement histones: Evolutionary changes at Drosophila H3.3A and H3.3B. J Phylogenet Evol Biol 4: Kornberg RD (1977) Structure of chromatin. Annu Rev Biochem 46: Schumperli D (1986) Cell-cycle regulation of histone gene expression. Cell 45: Maze I, Noh KM, Soshnev AA, Allis CD (2014) Every amino acid matters: essential contributions of histone variants to mammalian development and disease. Nature Reviews 15: Lifton RP, Goldberg ML, Karp RW, Hogness DS (1977) The organization of the histone genes in Drosophila melanogaster: functional and evolutionary implications. Proc Cold Spring Harbor Symp Quant Biol 42: Matsuo Y, Yamazaki T (1989) trna derived insertion element in histone gene repeating unit of Drosophila melanogaster. Nucleic Acids Res 17: Matsuo Y, Yamazaki T (1989) Nucleotide variation and divergence in the histone multigene family in Drosophila melanogaster. Genetics 122: Matsuo Y (2000) Evolutionary change of codon usage for the histone gene family in Drosophila melanogaster and Drosophila hydei. Mol Phylogenet Evol 15: Matsuo Y (2000) Molecular evolution of the histone 3 multigene family in the Drosophila melanogaster species subgroup. Mol Phylogenet Evol 16: Tsunemoto K, Matsuo Y (2001) Molecular evolutionary analysis of a histone gene repeating unit from Drosophila simulans. Genes Genet Syst 76: Kakita M, Shimizu T, Emoto M, Nagai M, Takeguchi M, et al. (2003) Divergence and heterogeneity of the histone gene repeating units in the Drosophila melanogaster species subgroup. Genes Genet Syst 78: Matsuo Y (2003) Evolution of the GC content of the histone 3 gene in seven Drosophila species. Genes Genet Syst 78: Nagoda N, Fukuda A, Nakashima Y, Matsuo Y (2005) Molecular characterization and evolution of the repeating units of histone genes in Drosophila americana: coexistence of quartet and quintet units in a genome. Insect Mol Biol 14: Matsuo Y (2006) Genomic structure and evolution of the histone gene family in Drosophila. Current Topics in Genetics 2: Nakashima Y, Higashiyama A, Ushimaru A, Nagoda N, Matsuo Y (2016) Evolution of GC content in the histone gene repeating units from Drosophila lutescens, D. takahashii and D. pseudoobscura. Genes Genet Syst 91: van Daal A, White EM, Gorovsky MA, Elgin SCR (1988) Drosophila has a single copy of the gene encoding a highly conserved histone H2A variant of the H2A F/Z type. Nucleic Acids Res 16: van Daal A, Elgin SCR (1992) A histone variant, H2AvD, is essential in Drosophila melanogaster. Mol Biol Cell 3: Dalton S, Robins AJ, Harvey RP, Wells JRE (1989) Transcription from the intron-containing chicken histone H2A F gene is not S-phase regulated. Nucleic Acids Res 17: Mizuguchi G, Shen X, Landry J, Wu WH, Sen S, Wu C (2004) ATP-driven exchange of histone H2A Z variant catalyzed by SWR1 chromatin remodeling complex. Science 303: Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: Kremer H, Hennig W (1990) Isolation and characterization of a Drosophila hydei histone DNA repeat unit. Nucleic Acids Res 18: Salzler HR, Tatomer DC, Malek PY, McDaniel SL, Orlando AN, et al. (2013) A sequence in the Drosophila H3-H4 promoter triggers 1 Histone Locus Body assembly and biosynthesis of replication-coupled histone mrnas. Dev Cell 24: Li KK, Luo C, Wang D, Jiang H, Zheng YG (2012) Chemical and biochemical approaches in the study of histone methylation and demethylation. Med Res Rev 32: Akhmanova AS, Bindels PC, Xu J, Miedema K, Kremer H, et al. (1995) Structure and expression of histone H3.3 genes in Drosophila melanogaster and Drosophila hydei. Genome 38: Akhmanova A, Miedema K, Hennig W (1996) Identification and characterization of the Drosophila histone H4 replacement gene. FEBS Lett 388: Clarkson MJ, Wells JRE, Gibson F, Saint R, Tremethick DJ (1999) Regions of variant histone His2AvD required for Drosophila development. Nature 399: Talbert P, Ahmad K, Almouzni G, Ausio J, Berger F, et al. (2012) A unified phylogeny-based nomenclature for histone variants. Epigenet Chromatin 5: Tanabe M, Kouzmenko AP, Ito S, Sawatsubashi S, Suzuki E, et al. (2008) Activation of facultatively silenced Drosophila loci associates with increased acetylation of histone H2AvD. Genes to Cells 13: Ohta T (1972) Population size and rate of evolution. J Mol Evol 1: Akashi H (1995) Inferring weak selection from patterns of polymorphism and divergence at silent sites in Drosophila DNA. Genetics 139: Rivera C, Saavedra F, Alvarez F, Diaz-Celis C, Ugalde V, et al. (2015) Methylation of histone H3 lysine 9 occurs during translation. Nucleic Acids Res 43:

BS 50 Genetics and Genomics Week of Oct 24

BS 50 Genetics and Genomics Week of Oct 24 BS 50 Genetics and Genomics Week of Oct 24 Additional Practice Problems for Section Question 1: The following table contains a list of statements that apply to replication, transcription, both, or neither.

More information

Outline. Annotation of Drosophila Primer. Gene structure nomenclature. Muller element nomenclature. GEP Drosophila annotation projects 01/04/2018

Outline. Annotation of Drosophila Primer. Gene structure nomenclature. Muller element nomenclature. GEP Drosophila annotation projects 01/04/2018 Outline Overview of the GEP annotation projects Annotation of Drosophila Primer January 2018 GEP annotation workflow Practice applying the GEP annotation strategy Wilson Leung and Chris Shaffer AAACAACAATCATAAATAGAGGAAGTTTTCGGAATATACGATAAGTGAAATATCGTTCT

More information

Lecture for Wednesday. Dr. Prince BIOL 1408

Lecture for Wednesday. Dr. Prince BIOL 1408 Lecture for Wednesday Dr. Prince BIOL 1408 THE FLOW OF GENETIC INFORMATION FROM DNA TO RNA TO PROTEIN Copyright 2009 Pearson Education, Inc. Genes are expressed as proteins A gene is a segment of DNA that

More information

Agenda. Web Databases for Drosophila. Gene annotation workflow. GEP Drosophila annotation projects 01/01/2018. Annotation adding labels to a sequence

Agenda. Web Databases for Drosophila. Gene annotation workflow. GEP Drosophila annotation projects 01/01/2018. Annotation adding labels to a sequence Agenda GEP annotation project overview Web Databases for Drosophila An introduction to web tools, databases and NCBI BLAST Web databases for Drosophila annotation UCSC Genome Browser NCBI / BLAST FlyBase

More information

(a) Which enzyme(s) make 5' - 3' phosphodiester bonds? (c) Which enzyme(s) make single-strand breaks in DNA backbones?

(a) Which enzyme(s) make 5' - 3' phosphodiester bonds? (c) Which enzyme(s) make single-strand breaks in DNA backbones? EXAMPLE QUESTIONS AND ANSWERS 1. Topoisomerase does which one of the following? (a) Makes new DNA strands. (b) Unties knots in DNA molecules. (c) Joins the ends of double-stranded DNA molecules. (d) Is

More information

BIOSTAT516 Statistical Methods in Genetic Epidemiology Autumn 2005 Handout1, prepared by Kathleen Kerr and Stephanie Monks

BIOSTAT516 Statistical Methods in Genetic Epidemiology Autumn 2005 Handout1, prepared by Kathleen Kerr and Stephanie Monks Rationale of Genetic Studies Some goals of genetic studies include: to identify the genetic causes of phenotypic variation develop genetic tests o benefits to individuals and to society are still uncertain

More information

Basic concepts of molecular biology

Basic concepts of molecular biology Basic concepts of molecular biology Gabriella Trucco Email: gabriella.trucco@unimi.it Life The main actors in the chemistry of life are molecules called proteins nucleic acids Proteins: many different

More information

7.016 Problem Set 3. 1 st Pedigree

7.016 Problem Set 3. 1 st Pedigree 7.016 Problem Set 3 Question 1 The following human pedigree shows the inheritance pattern of a specific disease within a family. Assume that the individuals marrying into the family for all generations

More information

7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web.

7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web. MIT Department of Biology 7.014 Introductory Biology, Spring 2005 Name: Section : 7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions

More information

1. DNA, RNA structure. 2. DNA replication. 3. Transcription, translation

1. DNA, RNA structure. 2. DNA replication. 3. Transcription, translation 1. DNA, RNA structure 2. DNA replication 3. Transcription, translation DNA and RNA are polymers of nucleotides DNA is a nucleic acid, made of long chains of nucleotides Nucleotide Phosphate group Nitrogenous

More information

Basic concepts of molecular biology

Basic concepts of molecular biology Basic concepts of molecular biology Gabriella Trucco Email: gabriella.trucco@unimi.it What is life made of? 1665: Robert Hooke discovered that organisms are composed of individual compartments called cells

More information

Problem Set Unit The base ratios in the DNA and RNA for an onion (Allium cepa) are given below.

Problem Set Unit The base ratios in the DNA and RNA for an onion (Allium cepa) are given below. Problem Set Unit 3 Name 1. Which molecule is found in both DNA and RNA? A. Ribose B. Uracil C. Phosphate D. Amino acid 2. Which molecules form the nucleotide marked in the diagram? A. phosphate, deoxyribose

More information

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012 Bioinformatics ONE Introduction to Biology Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012 Biology Review DNA RNA Proteins Central Dogma Transcription Translation

More information

Annotating Fosmid 14p24 of D. Virilis chromosome 4

Annotating Fosmid 14p24 of D. Virilis chromosome 4 Lo 1 Annotating Fosmid 14p24 of D. Virilis chromosome 4 Lo, Louis April 20, 2006 Annotation Report Introduction In the first half of Research Explorations in Genomics I finished a 38kb fragment of chromosome

More information

7.014 Solution Set 4

7.014 Solution Set 4 7.014 Solution Set 4 Question 1 Shown below is a fragment of the sequence of a hypothetical bacterial gene. This gene encodes production of HWDWN, protein essential for metabolizing sugar yummose. The

More information

A Zero-Knowledge Based Introduction to Biology

A Zero-Knowledge Based Introduction to Biology A Zero-Knowledge Based Introduction to Biology Konstantinos (Gus) Katsiapis 25 Sep 2009 Thanks to Cory McLean and George Asimenos Cells: Building Blocks of Life cell, membrane, cytoplasm, nucleus, mitochondrion

More information

Draft 3 Annotation of DGA06H06, Contig 1 Jeannette Wong Bio4342W 27 April 2009

Draft 3 Annotation of DGA06H06, Contig 1 Jeannette Wong Bio4342W 27 April 2009 Page 1 Draft 3 Annotation of DGA06H06, Contig 1 Jeannette Wong Bio4342W 27 April 2009 Page 2 Introduction: Annotation is the process of analyzing the genomic sequence of an organism. Besides identifying

More information

Understanding Genes & Mutations. John A Phillips III May 16, 2005

Understanding Genes & Mutations. John A Phillips III May 16, 2005 Understanding Genes & Mutations John A Phillips III May 16, 2005 Learning Objectives Understand gene structure Become familiar with genetic & mutation databases Be able to find information on genetic variation

More information

Annotation of contig27 in the Muller F Element of D. elegans. Contig27 is a 60,000 bp region located in the Muller F element of the D. elegans.

Annotation of contig27 in the Muller F Element of D. elegans. Contig27 is a 60,000 bp region located in the Muller F element of the D. elegans. David Wang Bio 434W 4/27/15 Annotation of contig27 in the Muller F Element of D. elegans Abstract Contig27 is a 60,000 bp region located in the Muller F element of the D. elegans. Genscan predicted six

More information

Thr Gly Tyr. Gly Lys Asn

Thr Gly Tyr. Gly Lys Asn Your unique body characteristics (traits), such as hair color or blood type, are determined by the proteins your body produces. Proteins are the building blocks of life - in fact, about 45% of the human

More information

36. The double bonds in naturally-occuring fatty acids are usually isomers. A. cis B. trans C. both cis and trans D. D- E. L-

36. The double bonds in naturally-occuring fatty acids are usually isomers. A. cis B. trans C. both cis and trans D. D- E. L- 36. The double bonds in naturally-occuring fatty acids are usually isomers. A. cis B. trans C. both cis and trans D. D- E. L- 37. The essential fatty acids are A. palmitic acid B. linoleic acid C. linolenic

More information

Daily Agenda. Warm Up: Review. Translation Notes Protein Synthesis Practice. Redos

Daily Agenda. Warm Up: Review. Translation Notes Protein Synthesis Practice. Redos Daily Agenda Warm Up: Review Translation Notes Protein Synthesis Practice Redos 1. What is DNA Replication? 2. Where does DNA Replication take place? 3. Replicate this strand of DNA into complimentary

More information

Bi Lecture 3 Loss-of-function (Ch. 4A) Monday, April 8, 13

Bi Lecture 3 Loss-of-function (Ch. 4A) Monday, April 8, 13 Bi190-2013 Lecture 3 Loss-of-function (Ch. 4A) Infer Gene activity from type of allele Loss-of-Function alleles are Gold Standard If organism deficient in gene A fails to accomplish process B, then gene

More information

Algorithms in Bioinformatics ONE Transcription Translation

Algorithms in Bioinformatics ONE Transcription Translation Algorithms in Bioinformatics ONE Transcription Translation Sami Khuri Department of Computer Science San José State University sami.khuri@sjsu.edu Biology Review DNA RNA Proteins Central Dogma Transcription

More information

Protein Synthesis. Application Based Questions

Protein Synthesis. Application Based Questions Protein Synthesis Application Based Questions MRNA Triplet Codons Note: Logic behind the single letter abbreviations can be found at: http://www.biology.arizona.edu/biochemistry/problem_sets/aa/dayhoff.html

More information

Annotation of contig62 from Drosophila elegans Dot Chromosome

Annotation of contig62 from Drosophila elegans Dot Chromosome Abstract: Annotation of contig62 from Drosophila elegans Dot Chromosome 1 Maxwell Wang The goal of this project is to annotate the Drosophila elegans Dot chromosome contig62. Contig62 is a 32,259 bp contig

More information

Unit 1. DNA and the Genome

Unit 1. DNA and the Genome Unit 1 DNA and the Genome Gene Expression Key Area 3 Vocabulary 1: Transcription Translation Phenotype RNA (mrna, trna, rrna) Codon Anticodon Ribosome RNA polymerase RNA splicing Introns Extrons Gene Expression

More information

From Gene to Protein. How Genes Work

From Gene to Protein. How Genes Work From Gene to Protein How Genes Work 2007-2008 The Central Dogma Flow of genetic information in a cell How do we move information from DNA to proteins? DNA RNA protein replication phenotype You! Step 1:

More information

ENZYMES AND METABOLIC PATHWAYS

ENZYMES AND METABOLIC PATHWAYS ENZYMES AND METABOLIC PATHWAYS This document is licensed under the Attribution-NonCommercial-ShareAlike 2.5 Italy license, available at http://creativecommons.org/licenses/by-nc-sa/2.5/it/ 1. Enzymes build

More information

Agenda. Annotation of Drosophila. Muller element nomenclature. Annotation: Adding labels to a sequence. GEP Drosophila annotation projects 01/03/2018

Agenda. Annotation of Drosophila. Muller element nomenclature. Annotation: Adding labels to a sequence. GEP Drosophila annotation projects 01/03/2018 Agenda Annotation of Drosophila January 2018 Overview of the GEP annotation project GEP annotation strategy Types of evidence Analysis tools Web databases Annotation of a single isoform (walkthrough) Wilson

More information

Adaptive Molecular Evolution. Reading for today. Neutral theory. Predictions of neutral theory. The neutral theory of molecular evolution

Adaptive Molecular Evolution. Reading for today. Neutral theory. Predictions of neutral theory. The neutral theory of molecular evolution Adaptive Molecular Evolution Nonsynonymous vs Synonymous Reading for today Li and Graur chapter (PDF on website) Evolutionary EST paper (PDF on website) Neutral theory The majority of substitutions are

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Ch 17 Practice Questions MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) Garrod hypothesized that "inborn errors of metabolism" such as alkaptonuria

More information

7.014 Quiz II Handout

7.014 Quiz II Handout 7.014 Quiz II Handout Quiz II: Wednesday, March 17 12:05-12:55 54-100 **This will be a closed book exam** Quiz Review Session: Friday, March 12 7:00-9:00 pm room 54-100 Open Tutoring Session: Tuesday,

More information

11 questions for a total of 120 points

11 questions for a total of 120 points Your Name: BYS 201, Final Exam, May 3, 2010 11 questions for a total of 120 points 1. 25 points Take a close look at these tables of amino acids. Some of them are hydrophilic, some hydrophobic, some positive

More information

Disease and selection in the human genome 3

Disease and selection in the human genome 3 Disease and selection in the human genome 3 Ka/Ks revisited Please sit in row K or forward RBFD: human populations, adaptation and immunity Neandertal Museum, Mettman Germany Sequence genome Measure expression

More information

BIO 311C Spring Lecture 36 Wednesday 28 Apr.

BIO 311C Spring Lecture 36 Wednesday 28 Apr. BIO 311C Spring 2010 1 Lecture 36 Wednesday 28 Apr. Synthesis of a Polypeptide Chain 5 direction of ribosome movement along the mrna 3 ribosome mrna NH 2 polypeptide chain direction of mrna movement through

More information

iclicker Question #28B - after lecture Shown below is a diagram of a typical eukaryotic gene which encodes a protein: start codon stop codon 2 3

iclicker Question #28B - after lecture Shown below is a diagram of a typical eukaryotic gene which encodes a protein: start codon stop codon 2 3 Bio 111 Handout for Molecular Biology 4 This handout contains: Today s iclicker Questions Information on Exam 3 Solutions Fall 2008 Exam 3 iclicker Question #28A - before lecture Which of the following

More information

DNA.notebook March 08, DNA Overview

DNA.notebook March 08, DNA Overview DNA Overview Deoxyribonucleic Acid, or DNA, must be able to do 2 things: 1) give instructions for building and maintaining cells. 2) be copied each time a cell divides. DNA is made of subunits called nucleotides

More information

Degenerate Code. Translation. trna. The Code is Degenerate trna / Proofreading Ribosomes Translation Mechanism

Degenerate Code. Translation. trna. The Code is Degenerate trna / Proofreading Ribosomes Translation Mechanism Translation The Code is Degenerate trna / Proofreading Ribosomes Translation Mechanism Degenerate Code There are 64 possible codon triplets There are 20 naturally-encoding amino acids Several codons specify

More information

Gene Regulation Biology

Gene Regulation Biology Gene Regulation Biology Potential and Limitations of Cell Re-programming in Cancer Research Eric Blanc KCL April 13, 2010 Eric Blanc (KCL) Gene Regulation Biology April 13, 2010 1 / 21 Outline 1 The Central

More information

Chapter 17 Lecture. Concepts of Genetics. Tenth Edition. Regulation of Gene Expression in Eukaryotes

Chapter 17 Lecture. Concepts of Genetics. Tenth Edition. Regulation of Gene Expression in Eukaryotes Chapter 17 Lecture Concepts of Genetics Tenth Edition Regulation of Gene Expression in Eukaryotes Chapter Contents 17.1 Eukaryotic Gene Regulation Can Occur at Any of the Steps Leading from DNA to Protein

More information

Control of Eukaryotic Gene Expression (Learning Objectives)

Control of Eukaryotic Gene Expression (Learning Objectives) Control of Eukaryotic Gene Expression (Learning Objectives) 1. Compare and contrast chromatin and chromosome: composition, proteins involved and level of packing. Explain the structure and function of

More information

Genomics and Gene Recognition Genes and Blue Genes

Genomics and Gene Recognition Genes and Blue Genes Genomics and Gene Recognition Genes and Blue Genes November 3, 2004 Eukaryotic Gene Structure eukaryotic genomes are considerably more complex than those of prokaryotes eukaryotic cells have organelles

More information

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS 1 CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS * Some contents are adapted from Dr. Jean Gao at UT Arlington Mingon Kang, PhD Computer Science, Kennesaw State University 2 Genetics The discovery of

More information

Genes and Proteins. Objectives

Genes and Proteins. Objectives Genes and Proteins Lecture 15 Objectives At the end of this series of lectures, you should be able to: Define terms. Explain the central dogma of molecular biology. Describe the locations, reactants, and

More information

Station 1: DNA Structure Use the figure above to answer each of the following questions. 1.This is the subunit that DNA is composed of. 2.

Station 1: DNA Structure Use the figure above to answer each of the following questions. 1.This is the subunit that DNA is composed of. 2. 1. Station 1: DNA Structure Use the figure above to answer each of the following questions. 1.This is the subunit that DNA is composed of. 2.This subunit is composed of what 3 parts? 3.What molecules make

More information

Mutation Rates and Sequence Changes

Mutation Rates and Sequence Changes s and Sequence Changes part of Fortgeschrittene Methoden in der Bioinformatik Computational EvoDevo University Leipzig Leipzig, WS 2011/12 From Molecular to Population Genetics molecular level substitution

More information

Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below).

Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below). Protein Synthesis Instructions The purpose of today s lab is to: Understand how a cell manufactures proteins from amino acids, using information stored in the genetic code. Assemble models of four very

More information

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl NUCLEIC ACIDS ECS129 Instructor: Patrice Koehl Nucleic Acids Nucleotides DNA Structure RNA Synthesis Function Secondary structure Tertiary interactions Wobble hypothesis DNA RNA Replication Transcription

More information

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl NUCLEIC ACIDS ECS129 Instructor: Patrice Koehl Nucleic Acids Nucleotides DNA Structure RNA Synthesis Function Secondary structure Tertiary interactions Wobble hypothesis DNA RNA Replication Transcription

More information

Drosophila ficusphila F element

Drosophila ficusphila F element 5/2/2016 CONTIG52 Drosophila ficusphila F element Vahag Kechejian BIO434W Abstract Contig52 is a 35,000 bp region located on the F element of Drosophila ficusphila. Genscan predicts six features in the

More information

Gene function at the level of traits Gene function at the molecular level

Gene function at the level of traits Gene function at the molecular level Gene expression Gene function at the level of traits Gene function at the molecular level Two levels tied together since the molecular level affects the structure and function of cells which determines

More information

MODULE 5: TRANSLATION

MODULE 5: TRANSLATION MODULE 5: TRANSLATION Lesson Plan: CARINA ENDRES HOWELL, LEOCADIA PALIULIS Title Translation Objectives Determine the codons for specific amino acids and identify reading frames by looking at the Base

More information

Important points from last time

Important points from last time Important points from last time Subst. rates differ site by site Fit a Γ dist. to variation in rates Γ generally has two parameters but in biology we fix one to ensure a mean equal to 1 and the other parameter

More information

CH 17 :From Gene to Protein

CH 17 :From Gene to Protein CH 17 :From Gene to Protein Defining a gene gene gene Defining a gene is problematic because one gene can code for several protein products, some genes code only for RNA, two genes can overlap, and there

More information

Reading for today. Adaptive Molecular Evolution. Predictions of neutral theory. The neutral theory of molecular evolution

Reading for today. Adaptive Molecular Evolution. Predictions of neutral theory. The neutral theory of molecular evolution Final exam date scheduled: Thursday, MARCH 17, 2005, 1030-1220 Reading for today Adaptive Molecular Evolution Li and Graur chapter (PDF on website) Evolutionary EST paper (PDF on website) Page and Holmes

More information

Biological Sciences 50 Practice Exam 2

Biological Sciences 50 Practice Exam 2 NAME: Fall 2005 TF: Biological Sciences 50 Practice Exam 2 A. Be sure to write your name on the top of each of page of the examination. B. Write each answer only on the same page as the pertinent question.

More information

Big Idea 3C Basic Review

Big Idea 3C Basic Review Big Idea 3C Basic Review 1. A gene is a. A sequence of DNA that codes for a protein. b. A sequence of amino acids that codes for a protein. c. A sequence of codons that code for nucleic acids. d. The end

More information

The Structure of RNA. The Central Dogma

The Structure of RNA. The Central Dogma 12-3 12-3 RNA and Protein Synthesis The Structure of RNA The Central Dogma Phenotype A gene is a SEQUENCE of DNA that codes for a protein (or functional RNA). Phenotype is the individual s observable trait

More information

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. Evidence Based Annotation. GEP goals: Evidence for Gene Models 08/22/2017

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. Evidence Based Annotation. GEP goals: Evidence for Gene Models 08/22/2017 Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l

More information

Nucleic acid and protein Flow of genetic information

Nucleic acid and protein Flow of genetic information Nucleic acid and protein Flow of genetic information References: Glick, BR and JJ Pasternak, 2003, Molecular Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, Washington DC, pages.

More information

Transcription is the first stage of gene expression

Transcription is the first stage of gene expression Transcription is the first stage of gene expression RNA synthesis is catalyzed by RNA polymerase, which pries the DNA strands apart and hooks together the RNA nucleotides The RNA is complementary to the

More information

Transcription and Translation

Transcription and Translation Transcription and Translation Central Dogma of Molecular The flow of information in the cell starts at DNA, which replicates to form more DNA. Information is then transcribed into RNA, and then it is translated

More information

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. GEP goals: Evidence Based Annotation. Evidence for Gene Models 12/26/2018

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. GEP goals: Evidence Based Annotation. Evidence for Gene Models 12/26/2018 Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l

More information

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall Biology Biology 1 of 39 12-3 RNA and Protein Synthesis 2 of 39 Essential Question What is transcription and translation and how do they take place? 3 of 39 12 3 RNA and Protein Synthesis Genes are coded

More information

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall Biology Biology 1 of 39 12-3 RNA and Protein Synthesis 2 of 39 12 3 RNA and Protein Synthesis Genes are coded DNA instructions that control the production of proteins. Genetic messages can be decoded by

More information

RNA: Transcription and Triplet Code

RNA: Transcription and Triplet Code RNA: Transcription and Triplet Code There are Five Kinds of RNA, All of Which are Templated from DNA. The first type of RNA is trna. The "t" stands for "transfer". This RNA is the RNA that transfers amino

More information

The Genetic Code and Transcription. Chapter 12 Honors Genetics Ms. Susan Chabot

The Genetic Code and Transcription. Chapter 12 Honors Genetics Ms. Susan Chabot The Genetic Code and Transcription Chapter 12 Honors Genetics Ms. Susan Chabot TRANSCRIPTION Copy SAME language DNA to RNA Nucleic Acid to Nucleic Acid TRANSLATION Copy DIFFERENT language RNA to Amino

More information

You are genetically unique

You are genetically unique BNF 5106 - Lecture 1 Genetics, Genes, Genetic codes, and Mutations You are genetically unique Since each parent has 23 pairs of chromosomes, the probability that each parent gives twice the same chromosomes

More information

From DNA to Protein. Chapter 14

From DNA to Protein. Chapter 14 From DNA to Protein Chapter 14 What do genes code for? How does DNA code for cells & bodies? How are cells and bodies made from the instructions in DNA? DNA proteins cells bodies The Central Dogma Flow

More information

Find this material useful? You can help our team to keep this site up and bring you even more content consider donating via the link on our site.

Find this material useful? You can help our team to keep this site up and bring you even more content consider donating via the link on our site. Find this material useful? You can help our team to keep this site up and bring you even more content consider donating via the link on our site. Still having trouble understanding the material? Check

More information

This is the knowledge that you should understand upon completing this section:

This is the knowledge that you should understand upon completing this section: DN 11 Syllabus hecklist This is the knowledge that you should understand upon completing this section: 11.1 DN DN occurs bound to proteins in chromosomes in the nucs and as unbound DN in the mitochondria.

More information

Key Area 1.3: Gene Expression

Key Area 1.3: Gene Expression Key Area 1.3: Gene Expression RNA There is a second type of nucleic acid in the cell, called RNA. RNA plays a vital role in the production of protein from the code in the DNA. What is gene expression?

More information

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule;

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule; Evolution of knowledge about gene G. Mendel Hereditary factors W.Johannsen, 1909 G.W.Beadle, E.L.Tatum, 1945 Ingram, 1957 Actual concepts The gene hereditary unit located in chromosomes Hypotheses One

More information

Differential Gene Expression

Differential Gene Expression Biology 4361 Developmental Biology Differential Gene Expression June 19, 2008 Differential Gene Expression Overview Chromatin structure Gene anatomy RNA processing and protein production Initiating transcription:

More information

03-511/711 Computational Genomics and Molecular Biology, Fall

03-511/711 Computational Genomics and Molecular Biology, Fall 03-511/711 Computational Genomics and Molecular Biology, Fall 2011 1 Problem Set 0 Due Tuesday, September 6th This homework is intended to be a self-administered placement quiz, to help you (and me) determine

More information

The Stringent Response

The Stringent Response The Stringent Response When amino acids are limiting a response is triggered to shut down a wide range of biosynthetic processes This process is called the Stringent Response It results in the synthesis

More information

Genes found in the genome include protein-coding genes and non-coding RNA genes. Which nucleotide is not normally found in non-coding RNA genes?

Genes found in the genome include protein-coding genes and non-coding RNA genes. Which nucleotide is not normally found in non-coding RNA genes? Midterm Q Genes found in the genome include protein-coding genes and non-coding RNA genes Which nucleotide is not normally found in non-coding RNA genes? G T 3 A 4 C 5 U 00% Midterm Q Which of the following

More information

NGS Approaches to Epigenomics

NGS Approaches to Epigenomics I519 Introduction to Bioinformatics, 2013 NGS Approaches to Epigenomics Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Contents Background: chromatin structure & DNA methylation Epigenomic

More information

BIOLOGY - CLUTCH CH.17 - GENE EXPRESSION.

BIOLOGY - CLUTCH CH.17 - GENE EXPRESSION. !! www.clutchprep.com CONCEPT: GENES Beadle and Tatum develop the one gene one enzyme hypothesis through their work with Neurospora (bread mold). This idea was later revised as the one gene one polypeptide

More information

Biology: The substrate of bioinformatics

Biology: The substrate of bioinformatics Bi01_1 Unit 01: Biology: The substrate of bioinformatics What is Bioinformatics? Bi01_2 handling of information related to living organisms understood on the basis of molecular biology Nature does it.

More information

Eukaryotic Gene Structure

Eukaryotic Gene Structure Eukaryotic Gene Structure Terminology Genome entire genetic material of an individual Transcriptome set of transcribed sequences Proteome set of proteins encoded by the genome 2 Gene Basic physical and

More information

8/21/2014. From Gene to Protein

8/21/2014. From Gene to Protein From Gene to Protein Chapter 17 Objectives Describe the contributions made by Garrod, Beadle, and Tatum to our understanding of the relationship between genes and enzymes Briefly explain how information

More information

Unit IX Problem 3 Genetics: Basic Concepts in Molecular Biology

Unit IX Problem 3 Genetics: Basic Concepts in Molecular Biology Unit IX Problem 3 Genetics: Basic Concepts in Molecular Biology - The central dogma (principle) of molecular biology: Information from DNA are transcribed to mrna which will be further translated to synthesize

More information

Chem 250 Answer Key In-class Quiz #3v1

Chem 250 Answer Key In-class Quiz #3v1 age 1 of 6 Quiz #3 ame. Chem 250 Answer Key In-class Quiz #3v1 This exam is composed of 20 questions. lease scan them all before starting. As discussed in the course syllabus, honesty and integrity are

More information

Transcriptomics. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona

Transcriptomics. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Transcriptomics Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Central dogma of molecular biology Central dogma of molecular biology Genome Complete DNA content of

More information

Outline. Pseudogenes. Pseudo-genes. The genetic code (DNA version) What is a gene? What is a gene? Dead genes Vitamin C Urate oxidase. Alan R.

Outline. Pseudogenes. Pseudo-genes. The genetic code (DNA version) What is a gene? What is a gene? Dead genes Vitamin C Urate oxidase. Alan R. Pseudogenes Alan R. Rogers January 15, 2016 Dead genes Vitamin C Urate oxidase ψmyh16 GBA Globins 1 / 35 2 / 35 Pseudo-genes Genes are DNA sequences that code for protein. Some genes are broken and cannot

More information

Chapter 13. The Nucleus. The nucleus is the hallmark of eukaryotic cells; the very term eukaryotic means having a "true nucleus".

Chapter 13. The Nucleus. The nucleus is the hallmark of eukaryotic cells; the very term eukaryotic means having a true nucleus. Chapter 13 The Nucleus The nucleus is the hallmark of eukaryotic cells; the very term eukaryotic means having a "true nucleus". Fig.13.1. The EM of the Nucleus of a Eukaryotic Cell 13.1. The Nuclear Envelope

More information

Unit II Problem 3 Genetics: Summary of Basic Concepts in Molecular Biology

Unit II Problem 3 Genetics: Summary of Basic Concepts in Molecular Biology Unit II Problem 3 Genetics: Summary of Basic Concepts in Molecular Biology - The central dogma (principle) of molecular biology: Information from DNA are transcribed to mrna which will be further translated

More information

Lecture 11: Gene Prediction

Lecture 11: Gene Prediction Lecture 11: Gene Prediction Study Chapter 6.11-6.14 1 Gene: A sequence of nucleotides coding for protein Gene Prediction Problem: Determine the beginning and end positions of genes in a genome Where are

More information

Biomolecules: lecture 6

Biomolecules: lecture 6 Biomolecules: lecture 6 - to learn the basics on how DNA serves to make RNA = transcription - to learn how the genetic code instructs protein synthesis - to learn the basics on how proteins are synthesized

More information

MATH 5610, Computational Biology

MATH 5610, Computational Biology MATH 5610, Computational Biology Lecture 2 Intro to Molecular Biology (cont) Stephen Billups University of Colorado at Denver MATH 5610, Computational Biology p.1/24 Announcements Error on syllabus Class

More information

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words).

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words). 1 Quiz1 Q1 2011 Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words) Value Correct Answer 1 noncovalent interactions 100% Equals hydrogen bonds (100%) Equals H-bonds

More information

Problem: The GC base pairs are more stable than AT base pairs. Why? 5. Triple-stranded DNA was first observed in 1957. Scientists later discovered that the formation of triplestranded DNA involves a type

More information

From Gene to Protein. How Genes Work (Ch. 17)

From Gene to Protein. How Genes Work (Ch. 17) From Gene to Protein How Genes Work (Ch. 17) What do genes code for? How does DNA code for cells & bodies? how are cells and bodies made from the instructions in DNA DNA proteins cells bodies The Central

More information

Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid

Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid Supplementary Figures Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid sequence of Drosophila RCC1. Same colors are for Figure 1 with sequence of β-wedge that interacts with Ran in

More information

GENETICS - CLUTCH CH.10 TRANSCRIPTION.

GENETICS - CLUTCH CH.10 TRANSCRIPTION. !! www.clutchprep.com CONCEPT: OVERVIEW OF TRANSCRIPTION Transcription is the process of using DNA as a template to RNA RNA polymerase is the enzyme that transcribes DNA - There are many different types

More information

Genetics Biology 331 Exam 3B Spring 2015

Genetics Biology 331 Exam 3B Spring 2015 Genetics Biology 331 Exam 3B Spring 2015 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) DNA methylation may be a significant mode of genetic regulation

More information

CLEP Biology - Problem Drill 11: Transcription, Translation and The Genetic Code

CLEP Biology - Problem Drill 11: Transcription, Translation and The Genetic Code CLEP Biology - Problem Drill 11: Transcription, Translation and The Genetic Code No. 1 of 10 1. Three types of RNA comprise the structural and functional core for protein synthesis, serving as a template

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam Chapter 17 Genes to Proteins Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. The following questions refer to Figure 17.1, a simple metabolic

More information