Partial Sequence of the Mitochondrial Genome of Littorina saxatilis: Relevance to Gastropod Phylogenetics*

Similar documents
Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below).

Lecture 10, 20/2/2002: The process of solution development - The CODEHOP strategy for automatic design of consensus-degenerate primers for PCR

Disease and selection in the human genome 3

PROTEIN SYNTHESIS Study Guide

Lecture 19A. DNA computing

Codon Bias with PRISM. 2IM24/25, Fall 2007

PGRP negatively regulates NOD-mediated cytokine production in rainbow trout liver cells

ORFs and genes. Please sit in row K or forward

NAME:... MODEL ANSWER... STUDENT NUMBER:... Maximum marks: 50. Internal Examiner: Hugh Murrell, Computer Science, UKZN

Det matematisk-naturvitenskapelige fakultet

DNA sentences. How are proteins coded for by DNA? Materials. Teacher instructions. Student instructions. Reflection

Lecture 11: Gene Prediction

Degenerate Code. Translation. trna. The Code is Degenerate trna / Proofreading Ribosomes Translation Mechanism

Homework. A bit about the nature of the atoms of interest. Project. The role of electronega<vity

Protein Synthesis. Application Based Questions

Primer Design Workshop. École d'été en géné-que des champignons 2012 Dr. Will Hintz University of Victoria

Project 07/111 Final Report October 31, Project Title: Cloning and expression of porcine complement C3d for enhanced vaccines

1. DNA, RNA structure. 2. DNA replication. 3. Transcription, translation

Supplementary. Table 1: Oligonucleotides and Plasmids. complementary to positions from 77 of the SRα '- GCT CTA GAG AAC TTG AAG TAC AGA CTG C

G+C content. 1 Introduction. 2 Chromosomes Topology & Counts. 3 Genome size. 4 Replichores and gene orientation. 5 Chirochores.

Arabidopsis actin depolymerizing factor AtADF4 mediates defense signal transduction triggered by the Pseudomonas syringae effector AvrPphB

Important points from last time

Molecular Level of Genetics

Electronic Supplementary Information

Figure S1. Characterization of the irx9l-1 mutant. (A) Diagram of the Arabidopsis IRX9L gene drawn based on information from TAIR (the Arabidopsis

Protein Structure Analysis

Genomics and Gene Recognition Genes and Blue Genes

Level 2 Biology, 2017

Biomolecules: lecture 6

Supplemental Data Supplemental Figure 1.

Evolution of protein coding sequences

CONVERGENT EVOLUTION. Def n acquisition of some biological trait but different lineages

Cat. # Product Size DS130 DynaExpress TA PCR Cloning Kit (ptakn-2) 20 reactions Box 1 (-20 ) ptakn-2 Vector, linearized 20 µl (50 ng/µl) 1

Chapter 3: Information Storage and Transfer in Life

Lezione 10. Bioinformatica. Mauro Ceccanti e Alberto Paoluzzi

The combination of a phosphate, sugar and a base forms a compound called a nucleotide.

ANCIENT BACTERIA? 250 million years later, scientists revive life forms

(a) Which enzyme(s) make 5' - 3' phosphodiester bonds? (c) Which enzyme(s) make single-strand breaks in DNA backbones?

Supporting Information. Copyright Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim, 2006

7.016 Problem Set 3. 1 st Pedigree

Biomolecules: lecture 6

Multiplexing Genome-scale Engineering

Supporting information for Biochemistry, 1995, 34(34), , DOI: /bi00034a013

Chemistry 121 Winter 17

Supplement 1: Sequences of Capture Probes. Capture probes were /5AmMC6/CTG TAG GTG CGG GTG GAC GTA GTC

Supplementary Figure 1A A404 Cells +/- Retinoic Acid

iclicker Question #28B - after lecture Shown below is a diagram of a typical eukaryotic gene which encodes a protein: start codon stop codon 2 3

Hes6. PPARα. PPARγ HNF4 CD36

strain devoid of the aox1 gene [1]. Thus, the identification of AOX1 in the intracellular

Bioinformatics CSM17 Week 6: DNA, RNA and Proteins

PCR analysis was performed to show the presence and the integrity of the var1csa and var-

SAY IT WITH DNA: Protein Synthesis Activity by Larry Flammer

Supplemental Data. mir156-regulated SPL Transcription. Factors Define an Endogenous Flowering. Pathway in Arabidopsis thaliana

How life. constructs itself.

p-adic GENETIC CODE AND ULTRAMETRIC BIOINFORMATION

Human Gene,cs 06: Gene Expression. Diversity of cell types. How do cells become different? 9/19/11. neuron

Y-chromosomal haplogroup typing Using SBE reaction

INTRODUCTION TO THE MOLECULAR GENETICS OF THE COLOR MUTATIONS IN ROCK POCKET MICE

TRANSCRIPTION. Renáta Schipp


Fishy Amino Acid Codon. UUU Phe UCU Ser UAU Tyr UGU Cys. UUC Phe UCC Ser UAC Tyr UGC Cys. UUA Leu UCA Ser UAA Stop UGA Stop

A Zero-Knowledge Based Introduction to Biology

Expression of Recombinant Proteins

Add 5µl of 3N NaOH to DNA sample (final concentration 0.3N NaOH).

Folding simulation: self-organization of 4-helix bundle protein. yellow = helical turns

2

Table S1. Bacterial strains (Related to Results and Experimental Procedures)

Supplemental material

Thr Gly Tyr. Gly Lys Asn

for Programmed Chemo-enzymatic Synthesis of Antigenic Oligosaccharides

Gene synthesis by circular assembly amplification

Deoxyribonucleic Acid DNA. Structure of DNA. Structure of DNA. Nucleotide. Nucleotides 5/13/2013

ΔPDD1 x ΔPDD1. ΔPDD1 x wild type. 70 kd Pdd1. Pdd3

Genomic Sequence Analysis using Electron-Ion Interaction

UNIT I RNA AND TYPES R.KAVITHA,M.PHARM LECTURER DEPARTMENT OF PHARMACEUTICS SRM COLLEGE OF PHARMACY KATTANKULATUR

National PHL TB DST Reference Center PSQ Reporting Language Table of Contents

Protein Synthesis: Transcription and Translation

Just one nucleotide! Exploring the effects of random single nucleotide mutations

Honors packet Instructions

evaluated with UAS CLB eliciting UAS CIT -N Libraries increase in the

Supplemental Data. Bennett et al. (2010). Plant Cell /tpc

FROM DNA TO GENETIC GENEALOGY Stephen P. Morse

DNA Begins the Process

Supporting Information

BIOSTAT516 Statistical Methods in Genetic Epidemiology Autumn 2005 Handout1, prepared by Kathleen Kerr and Stephanie Monks

Dierks Supplementary Fig. S1

Supporting Online Information

RPA-AB RPA-C Supplemental Figure S1: SDS-PAGE stained with Coomassie Blue after protein purification.

Supplementary Information. Construction of Lasso Peptide Fusion Proteins

Overexpression Normal expression Overexpression Normal expression. 26 (21.1%) N (%) P-value a N (%)

II 0.95 DM2 (RPP1) DM3 (At3g61540) b

Chapter 10. The Structure and Function of DNA. Lectures by Edward J. Zalisko

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Codon bias and gene expression of mitochondrial ND2 gene in chordates

SUPPLEMENTARY INFORMATION

Legends for supplementary figures 1-3

SUPPLEMENTAL TABLE S1. Additional descriptions of plasmid constructions and the oligonucleotides used Plasmid or Oligonucleotide

Supplementary Materials for

Creation of A Caspese-3 Sensing System Using A Combination of Split- GFP and Split-Intein

Transcription:

J Mol Evol (1999) 48:348 359 Springer-Verlag New York Inc. 1999 Partial Sequence of the Mitochondrial Genome of Littorina saxatilis: Relevance to Gastropod Phylogenetics* Craig S. Wilding, Peter J. Mill, John Grahame School of Biology, The University of Leeds, Leeds LS2 9JT, UK Received: 4 June 1998 / Accepted: 20 August 1998 Abstract. A 8022 base pair fragment from the mitochondrial DNA of the prosobranch gastropod Littorina saxatilis has been sequenced and shown to contain the complete genes for 12 transfer RNAs and five protein genes (CoII, ATPase 6, ATPase 8, ND1, ND6), two partial protein genes (CoI and cyt b), and two ribosomal RNAs (small and large subunits). The order of these constituent genes differs from those of other molluscan mitochondrial gene arrangements. Only a single rearrangement involving a block of protein coding genes and three trna translocations are necessary to produce identical gene orders between L. saxatilis and K. tunicata. However, only one gene boundary is shared between the L. saxatilis gene order and that of the pulmonate gastropod Cepaea nemoralis. This extends the observation that there is little conservation of mitochrondrial gene order amongst the Mollusca and suggests that radical mitochondrial DNA gene rearrangement has occurred on the branch leading to the pulmonates. Key words: Littorina saxatilis Mitochondrial genome Gastropod phylogenetics Gene order Mollusca * The nucleotide sequence data reported in this paper will appear in the DDBJ/EMBL/GenBank nucleotide sequence databases with accession number AJ132137. Correspondence to: C.S. Wilding Introduction With few exceptions, the circular mitochondrial genome of the Metazoa contains 13 protein coding genes, 2 ribosomal RNAs (rrna genes), and 22 transfer RNAs (trna genes). Although the number and nature of constituent genes remain constant, their organization around the circle does not, resulting in considerable differences in mitochondrial genome organization between taxa (Wolstenholme 1992). However, because of the immense number of possible arrangements of the constituent genes, it is unlikely that taxa will converge on identical arrangements independently, and thus identical gene order, or the presence of shared gene boundaries, is likely to indicate common ancestry. Thus analysis of mtdna gene order is expected to provide important data for understanding deep phylogenies (Boore and Brown 1994b; Boore et al. 1995). The elucidation of complete mitochondrial sequences from a range of taxa has allowed limited testing of this theory, and indeed it seems that closely related taxa typically do display similar gene orders, and within a phylum, gene order is typically conserved. Thus the 37 constituent genes of the mtdna are arranged in the same order for most vertebrates examined thus far (see Wolstenholme 1992, and references therein). The exceptions concern the marsupials, where there has been some rearrangement of trna genes (Pääbo et al. 1991); birds, where ND6 and trna glu have been transposed relative to cytochrome b; trna thr and trna pro (Desjardins and Morais 1990; Desjardins et al. 1990); and minor rearrangements in certain reptiles argued to result from movement of the light strand origin of replication (Macey et al. 1997). This tendency for a consensus gene order within a phylum also holds for the Echinodermata, where echinoids differ from asteroids in only a single transversion (Smith et al. 1993). Even within the largest phylum, the arthropods, gene orders

349 for the protein coding and ribosomal rrna genes are constant (Staton et al. 1997) and only the position of transfer RNAs varies (Boore et al. 1995; Staton et al. 1997). In contrast, within the Mollusca, there are divergences in mitochondrial gene order. This is the second largest phylum, comprising seven classes: Aplacophora (e.g., solenogasters), Monoplacophora (Neoplina), Polyplacophora (chitons), Scaphopoda (tusk shells), Bivalvia (clams and mussels), Gastropoda (slugs and snails), and Cephalopoda (octopods and squids). The phylogeny of the Mollusca is ambiguous. A number of alternative phylogenies have been proposed by various authors (Boore and Brown 1994b; Ponder and Linberg 1996, 1997), with even some debate still ongoing concerning monophyly of the phylum (Boore and Brown 1994b). Mitochondrial gene order may provide valuable information for phylogeny construction or resolution of polytomies. Despite the diversity of form within the phylum and the evidence that mtdna gene order can provide valuable information for phylogeny reconstruction, few molluscan mitochondrial gene orders are known. The only available mitochondrial gene arrangements are for the polyplacophoran Katharina tunicata (Boore and Brown, 1994a), the bivalve mollusk Mytilus edulis (Hoffmann et al. 1992), and three pulmonate gastropods, Albinaria coerulea (Hatzoglou et al. 1995), Cepaea nemoralis (Terrett et al. 1996), and Euhadra herklotsi (Yamazaki et al. 1997). These three groupings, the Polyplacophora, mytilid bivalves, and pulmonate gastropods, all display substantial differences in mitochondrial gene order, suggesting that mitochondrial gene order may provide useful characters for phylogeny analysis in this phylum. However, no information is yet available on the extent of gene order rearrangements within a class [although Yamazaki et al. (1997) describe limited differences within the pulmonates] and such information is vital to understand the great differences already seen between classes. Here we report on sequence information of approximately half the mitochondrial DNA of the prosobranch gastropod Littorina saxatilis (Olivi). The availability of further mtdna gene orders such as this is essential for examining the usefulness of mtdna gene organisation for the study of mollusk relationships. PCR Amplification of Mitochondrial DNA Amplification of mtdna was initially undertaken in an effort to obtain sequence to enable primers to be designed for the C terminus of the cytochrome oxidase gene for use in population-level sequencing studies (Wilding et al. unpublished data) of this region, which is known to be highly polymorphic in insects (Lunt et al. 1996). Preliminary sequence of the N terminus of the gene was obtained using the universal CoI primers of Folmer et al. (1994). From this a new primer (CoI-long) was designed and used in conjunction with each of the universal cyt b primers of Kocher et al. (1989). Because we had no prior information on gene order, CoI-long was used with each cyt b primer. The Expand Long Template PCR System (Boehringer Mannheim) was employed to reduce PCR errors to an absolute minimum. This uses Taq enzyme as the main polymerase with a low-concentration proofreading Pwo enzyme. Approximately 8 kb of mtdna was PCR amplified from genomic DNA using the primers CoI-long (5 -GCT CAT GCT GGG GGC TC-3 ) and cyt b-rev (5 -AGG GAA CTT TTT CTC CAT CTC TGT- 3 ). Two fragments of 5614 and 2237 bp were amplified. From sequence information these were shown to be near-contiguous and the interim sequence was then amplified using primers (ATPF 5 -ATG CCT CAA CTA TCC CCT-3 and ATPR 5 -CTA ATG AGG ATG TTC GTC TGA-3 ) designed from the original sequence data. This PCR fragment was then sequenced using the amplification primers and one internal primer. Both larger fragments were cloned into PCR-script SK+ (Stratagene). Subcloning of the two larger fragments was performed through religation following restriction digestion with a variety of enzymes recognizing sites within the inserts and vector MCS (Fig. 1). The original and subclones were sequenced using standard sequencing primers (M13R and M13-20) by MWG Biotechnology (Milton Keynes, UK). Where these subclones did not adequately cover the sequence, internal sequencing primers were designed and sequencing was performed on an ABI377. The sequencing strategy is depicted in Fig. 1. Sequence Analysis Open reading frames were detected using MAP in the GCG suite of programs (Devereux et al. 1984), initially using the Drosophila yakuba genetic code (Clary and Wolstenholme 1985), and genes identified using BLASTX (Altschul et al. 1995). Open reading frames with limited homology to mitochondrial proteins were identified through comparison of hydropathy profiles with those of the chiton Katharina tunicata (Boore and Brown 1994a) and Drosophila yakuba (Wolstenholme and Clary 1985). Hydropathy profiles (Kyte and Doolittle 1982) were constructed using PEPPLOT in the GCG package. Transfer RNA genes were identified using the trna scan program (Lowe and Eddy 1997). Alignment of amino acid sequences was achieved using CLUSTALW (Thompson et al. 1994). Results and Discussion Materials and Methods DNA Extraction Total genomic DNA was extracted from the head foot region of two Littorina saxatilis collected from the Gann estuary, Pembrokeshire, Wales, using the protocol of Ashburner (1989). Two large filaments (5614 and 2237 bp) of L. saxatilis mtdna were PCR amplified and sequenced. Paradoxically each had been primed by a single (but different) primer the larger fragment by CoI-long and the smaller by cyt b-rev. This self-priming was probably mediated by the low annealing temperature used in the PCR amplification. Bridging sequence was obtained to yield a

350 Fig. 1. Sequencing strategy of cloned L. saxatilis mtdna fragments. Two fragments (A and B) were PCR amplified and cloned (see text) and the intervening fragment amplified using primers complementary to the ends of the cloned regions. Subcloning was performed with EcoRI, HindIII, KpnI, PstI, SacI, and SalI. Arrows below fragments depict the orientation and length of sequences obtained using universal sequencing primers. Arrows above the sequence depict the orientation and length of sequences obtained using primers designed from the Littorina sequence. Below: Inferred gene order. CoI and CoII, subunits I and II of cytochrome c oxidase. A6 and A8, subunits 6 and 8 of mitochondrial ATP synthase. s-rrna and l-rrna, small (12S) and large (16S) subunit ribosomal RNA. ND1 and ND6, subunits 1 and 6 of NADH dehydrogenase. Cyt b, cytochrome b apoenzyme. Transfer RNA genes are denoted by their single-letter amino acid codes. All genes are transcribed from left to right except those that are underlined. total of 8022 bp of mitochondrial sequence (Fig. A1). The exact length of L. saxatilis mtdna has not been determined. However, Crossland (1994) crudely estimated the size at 14.5 kbp from a clone library probed with an L. littorea clone, suggesting that there is approximately 55% of the L. saxatilis mtdna cloned in the present study. Gene content and order The 8 kb of L. saxatilis partial mtdna sequence reported here contains genes for 7 proteins (ATPase 6, ATPase 8, CoI, CoII, cytochrome b, ND1, and ND6), 2 ribosomal RNAs (large and small subunits), and 12 trnas (Fig. 1). All genes are transcribed from the same strand with the exception of a single block of eight trnas. No protein coding genes overlap, although trna trp and trna gln havea4bpoverlap and trna gly and trna glu overlap by 2 bp. Only 78 bp of the 8022 bp sequence (0.97%) have not been ascribed a function; 48 bp of this apparent noncoding sequence is represented by small (<10 bp) intergenic regions. The remaining 30 bp is situated between the CoI and the CoII genes. The polyplacophoran K. tunicata (Boore et al. 1994) and the four gastropods for which mtdna gene order information is available [A. coerulea (Hatzoglou et al. 1995), C. nemoralis (Terrett et al. 1996), E. herklotsi (Yamazaki et al. 1997), and L. saxatilis (this study)] all contain an ATPase 8 gene. However, this gene is absent in the bivalve M. edulis (Hoffmann et al. 1992). As in K. tunicata but unlike for the pulmonates, the L. saxatilis ATPase 8 gene follows on directly from ATPase 6. Terrett et al. (1996) suggested that, as ATPase 8 has been lost independently from the nematode lineage and from M. edulis, a broader depth of sampling was needed to clarify where the loss had occurred. It seems likely that the gene is universally present in the gastropods but a wider range of sampling throughout the Mollusca is necessary to understand the distribution of ATPase 8 in this phylum. The gene arrangement of L. saxatilis mtdna resembles that of the chiton (Boore and Brown 1994a) far more closely than it does the gene orders of pulmonate gastropods, e.g., C. nemoralis (Fig. 2). Disregarding trna translocations, only one rearrangement is necessary to produce identical gene orders (of the seven protein coding genes and two rrna genes sequenced in the present study) between L. saxatilis and K. tunicata, while at least seven separate translocations of protein coding regions are necessary to produce an arrangement similar to that of C. nemoralis. The 12 trna portions are also more similar between L. saxatilis and K. tunicata; only three movements are necessary to produce identical orders. Comparisons of the trna orders between L. saxatilis and C. nemoralis reveal a radically different gene order, with only one (trna glu ; E, Fig. 2) at the same gene boundary (A6 s-rrna) in both organisms. One trna rearrangement is particularly interesting. In L. saxatilis the arrangement 1-rRNA/tRNA leu(uur) /

351 Fig. 2. Comparison of mitochondrial gene order in three species of mollusk: Littorina saxatilis (this study), Katharina tunicata (Boore and Brown 1994a), and Cepaea nemoralis (Terrett et al. 1996). All genes are transcribed from left to right except where underlined. Transpositions of protein coding genes are denoted by lines connecting genes or blocks of genes when marked by a bar, and inversions are denoted by lines with encircling arrows. trna leu(cun) /ND1 is found. In K. tunicata (Boore and Brown 1994a) and also, and more pertinently, in the prosobranch gastropod Plicopurpurata columellaris (Boore et al. 1995), the arrangement is 1-rRNA/ trna leu(cun) /trna leu(uur) /ND1. Littorina and the muricid Plicopurpurata are both members of the Caenogastropoda, to which the Littorinidae form the most basal group (Winnepenninckx et al. 1998), and therefore since Plicopurpurata appears to have the ancestral gene arrangement, the trna translocation seen in Littorina is presumably limited to the Littorinidae or part of this group. As in most mtdnas the partial L. saxatilis mtdna is strongly AT rich (66.94% AT). This AT bias does seem to affect codon usage. For the third position of codons, 1210 (76.5%) end in A or T and the 3 most common codons, TTT (F), TTA (L), and ATT (I), are composed solely of A and T. Translation Initiation and Termination Prediction of translation initiation and termination sites can be undertaken with absolute certainty only through transcript mapping. Here we infer these sites through the presence of apparent stop and start codons bounding open reading frames of a length comparable to those of other mtdnas. The six protein coding genes for which we have sequenced the initiation sites all have methionine as an initiation codon. Acceptance of this results in genes of lengths comparable to those of other mollusks (K. tunicata and C. nemoralis; Table 1). All genes identified end in complete (TAA or TAG) termination codons. There appear to be no incomplete (TA or T) codons as in C. nemoralis and K. tunicata. There is an alternative ATA start codon four codons upstream of the predicted ATG start of ATPase 6. This would reduce the noncoding region between ATPase 8 and ATPase 6 from 13 to 1 bp. We have chosen the ATG codon, as this results in an ATPase 6 gene of standard length with good alignment with the initial amino acid sequence of the K. tunicata sequence (K. tunicata, MMMDIFSSFDDN; L. saxatilis, MLVDIFSSFDDN; differences boldfaced). Genetic Code The mitochondrial genetic code differs from the universal genetic code for all mtdnas (Wolstenholme 1992). The genetic code used here is that used to translate Drosophila, Katharina, and Cepaea mitochondrial genomes. The differences between this code and the standard genetic code are discussed with reference to L. saxatilis.

352 Table 1. Protein Inferred sizes, a initiation and termination codons, and calculated identity b of seven genes in three mollusk species Percentage amino acid identity Number of amino acids Initiation and termination codons Littorina/ Littorina/ Littorina/ Littorina Katharina Cepaea Katharina Cepaea Drosophila Littorina Katharina Cepaea ATP 6 231 230 186 52.06 34.05 36.92 ATG TAG ATG TAA TA ATP 8 52 53 53 40.00 24.76 30.48 ATG TAG ATG TAG TAA CoI 371 (partial) 513 509 84.99 69.81 74.93? TAA ATG T TAA CoII 228 229 217 70.90 41.80 50.44 ATG TAA ATG TAG TAA Cyt b 211 (partial) 379 380 75.83 59.10 75.83 ATG? ATG TAA TAA ND1 312 316 294 58.60 47.19 52.52 ATG TAA ATG TAA TA ND6 170 166 164 38.70 22.16 30.81 ATG TAG ATG T TA a CoI and cyt b in L. saxatilis are incomplete. b Identities calculated following alignment with CLUSTALW (Thompson et al. 1994) and dividing the number of identical amino acids by the average length of the compared sequences. Table 2. Codon usage in L. saxatilis mtdna a Amino acid Codon N % Amino acid Codon N % Amino acid Codon N % Amino acid Codon N % Phe TTT 97 6.14 Ser TCT 45 2.85 Tyr TAT 45 2.85 Cys TGT 5 0.32 (GAA) TTC 37 2.34 (UGA) TCC 23 1.45 (GUA) TAC 25 1.58 (GCA) TGC 9 0.57 Leu TTA 103 6.51 TCA 23 1.45 TER TAA 3 0.19 Trp TGA 41 2.59 (UAA) TTG 11 0.70 TCG 3 0.19 TAG 3 0.19 (UCA) TGG 5 0.32 Leu CTT 57 3.61 Pro CCT 37 2.34 His CAT 24 1.52 Arg CGT 8 0.51 (UAG) CTC 21 1.33 (UGG) CCC 5 0.32 (GUG) CAC 18 1.14 (UCG) CGC 6 0.38 CTA 45 2.85 CCA 22 1.39 Gln CAA 29 1.83 CGA 18 1.14 CTG 5 0.32 CCG 1 0.06 (UUG) CAG 2 0.13 CGG 1 0.06 Ile ATT 103 6.51 Thr ACT 27 1.71 Asn AAT 33 2.09 Ser AGT 16 1.01 (GAU) ATC 25 1.58 (UGU) ACC 16 1.01 (GUU) AAC 15 0.95 (GCU) AGC 17 1.08 Met ATA 47 2.97 ACA 26 1.64 Lys AAA 30 1.90 AGA 19 1.20 (CAU) ATG 11 0.70 ACG 3 0.19 (UUU) AAG 5 0.32 AGG 2 0.13 Val GTT 50 3.16 Ala GCT 58 3.67 Asp GAT 19 1.20 Gly GGT 33 2.09 (UAC) GTC 11 0.70 (UGC) GCC 26 1.64 (GUG) GAC 18 1.14 (UCC) GGC 26 1.64 GTA 47 2.97 GCA 24 1.52 Glu GAA 41 2.59 GGA 35 2.21 GTG 4 0.25 GCG 1 0.06 (UUC) GAG 2 0.13 GGG 14 0.89 a N, number of occurrences; %, percentage of total amino acids. ATA Codes for Methionine. ATA encodes isoleucine in the standard genetic code but methionine in the mtdna of many organisms. The ATA codon occurs 47 times in the L. saxatilis sequence (Table 2). Of the corresponding positions in the K. tunicata sequence, 28 encode methionine (21 ATA, 7 ATG), 8 encode leucine (6 TTA, 2 TTG), 2 encode isoleucine (ATT), 2 phenylalanine (TTT), and 7 for other amino acids. The much higher proportion of corresponding amino acids corresponding to methionine as opposed to isoleucine suggests ATA encodes methionine in L. saxatilis. TGA Specifies Tryptophan, Not Termination. TGA is a common codon in the L. saxatilis sequence, accounting for 2.6% of all codons (Table 2). Specification of TGA as a termination codon would severely truncate all proteins. Of the 41 occurrences of TGA, the homologous position in the sequence of K. tunicata was tryptophan in 37 cases (also 2 phenylalanine TTT, 1 serine TGT, and 1 leucine TTA). We therefore accept TGA as specifying tryptophan as in Drosophila. AGA and AGG Specify Serine. AGA and AGG encode a variety of amino acids in various taxa. In the universal code, AGA specifies arginine but in D. yakuba codes for serine. In L. saxatilis both AGA and AGG codons are represented, although AGG is rare, occurring only twice (0.13%). When aligned to the K. tunicata amino acid sequence the two AGG codons align with serine (AGC and AGA), while, for the 19 AGA codons, seven correspond to serine. The remaining 12 align with a variety of other amino acids but none with arginine. We believe this to confirm that AGA/AGG specifies serine in L. saxatilis.

353 Fig. 3. Kyte and Doolittle (1982) hydropathy profiles for three genes (ND6, ATPase 6, and ATPase 8) in three species, Littorina saxatilis (this study), Katharina (Boore and Brown 1994a), and Drosophila yakuba (Clary and Wolstenholme 1985). Profiles were computed using PEPPLOT with a search window of 7. Values along the x-axis are the amino acid position, while the y-axis represents hydrophobicity: values above zero are hydrophobic, and negative values are hydrophilic. Protein Coding Genes Seven protein coding genes were identified. We have no information on the presence/absence of the other six genes usually found in metazoan mtdna. CoI, CoII, ND1, and cyt b could all be identified unambiguously from amino acid similarity to other mitochondrial genomes. ATPase 6, ATPase 8, and ND6 showed only weak similarity to the corresponding sequences of other animals. However, hydropathy profiles (Fig. 3) provide strong evidence that these genes are correctly identified. The lengths of the five fully sequenced genes appear more similar to K. tunicata than to C. nemoralis (Table 1). The similarity of lengths for L. saxatilis, K. tunicata, and D. yakuba but dissimilarity to C. nemoralis suggests that the coding sequences of Cepaea are unusually short.

354 Fig. 4. Predicted secondary structure of 12 trna genes in L. saxatilis.

355 Fig. 5. Potential secondary structure of the noncoding region between CoI and CoII. Numbers refer to the position in the L. saxatilis mitochondrial sequence (Fig. A1). The bar represents the termination codon at the end of CoI. L. saxatilis sequences also appear more similar in amino acid sequence to K. tunicata than to C. nemoralis (Table 1). In fact, sequence identity is greater between L. saxatilis and D. yakuba than between L. saxatilis and C. nemoralis over all seven genes. Transfer RNAs Twelve sequences could be folded into typical trna structures (Fig. 4). Five of these have mismatches in the amino-acyl stem. Three of these (trna pro, trna cys, and trna val ) involve only 1 bp, while trna tyr and trna gln involve two and three, respectively. For trna gln a better match would be possible if the A at nucleotide 3135 were dropped. This would have a perfect trna. Sequencing of an alternative clone was undertaken to check if this represented a PCR error, but the sequence was identical. All of the trna secondary structures predicted here demonstrate both a variable loop and a T C arm. There is no evidence of TV replacement loops as have been demonstrated in gastropods (Yamazaki et al. 1997) but not in the chiton (Boore and Brown 1994a). Transfer RNA genes are believed to act as processing signals for cleavage of the polycistronic transcript into gene-specific RNAs (Ojala et al. 1981). However, three gene boundaries (CoI/CoII, A8/A6, and ND6/Cyt b) are not separated by a trna. At the CoI/CoII boundary there is stem-loop structure within the apparent noncoding region which may serve a similar purpose (see Fig. 5). Comparable structures for the other two boundaries would have to be folded from within coding sequence. Ribosomal RNAs Determination of the absolute 3 end of the small and large subunit RNAs can be reliably performed only using S1 hybridization-protection analysis, while 5 RACE can be used to locate the 5 terminus (e.g., see Pont-Kingdon et al. 1994). We did not determine these exactly. Instead the termini of these genes have been given as abutting the neighboring genes to produce a s-rrna of 964 bp and a 1-rRNA of 1382 bp. These rrna genes are larger than those of the chiton (826 and 1275 bp) but not uncharacteristically large when other organisms are considered, e.g., s-rrna and 1-rRNA are 945 and 1245 bp in Mytilus (Hoffmann et al. 1992), 955 and 1570 bp in Mus musculus (Bibb et al. 1981), and 880 and 1525 bp in echinoderms (Jacobs et al. 1988). Evolutionary Implications Substantial differences in gene order exist between three distantly related mollusk groups, the bivalves represented by M. edulis (Hoffmann et al. 1992), the chitons represented by K. tunicata (Boore and Brown 1994a), and the pulmonates (Hatzoglou et al. 1995; Terrett et al. 1996; Yamazaki et al. 1997). In this study we show that the considerable differences seen in mtdna gene order between classes of Mollusca are also seen within a class (the Gastropoda), with the mtdna of L. saxatilis having a radically different gene order from that of the pulmonate gastropods (Yamazaki et al. 1997). Indeed, the mtdna of L. saxatilis appears more similar to that of K. tunicata (Boore and Brown 1994a) but still differs due to a substantial inversion and three trna translocations. Even within the prosobranch gastropods there is evidence of mtdna gene rearrangement, with switching of the position of trna leu(cun) and trna leu(uur) when L. saxatilis mtdna is compared with that of Plicopurpurata (Boore et al. 1995). The most recent morphological phylogeny of the mollusks (Ponder and Linberg, 1997) places the Littorinidae in the caenogastropod clade, which in turn forms the apogastropod clade with the Pulmonata and Opisthobranchia. This clade is separated from the Polyplacophora by numerous groups including the Archaegastropoda, Neritopsina, Patellogastropoda, and monoplacophorans (Ponder and Linberg, 1996; Fig. 5). In this phylogeny, therefore, and indeed under any of the alternatives (see Ponder and Linberg 1996), the Littorinidae appear much more closely related to members of the Pulmonata than to the very distant Polyplacophora. Given that convergence on similar gene orders is unlikely, the resemblance of L. saxatilis mtdna to K. tunicata mtdna suggests that some of the gene order is plesiomorphic and maintained throughout most of the Gastropoda, the pulmonate branch being an important exception. It is unclear in the absence of additional genome organizations whether the translocations that distinguish pulmonate mtdna from prosobranch (or polyplacophoran) mtdna occurred on the branch leading to the Opisthobranchia and Pulmonata or on the pulmonate branch. To examine this hypothesis further requires data on additional molluscan mitochondrial genomes.

356 Appendix Fig. A1. Sequence of 8022 bp of L. saxatilis mtdna.

357 Fig. A1. Continued.

358 Fig. A1. Continued.

359 Acknowledgment. This research was supported by the MAST-3 program of the European Commission under Contract MAS3-CT95-0042 (AMBIOS). References Altschul SF, Gish W, Miller W, Myers EW, Lipman DL (1995) Basic local alignment search tool. J Mol Biol 215:403 410 Ashburner M (1989) Drosophila, a laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring, Harbor, NY Bibb MJ, van Etten RA, Wright CT, Walberg MW, Clayton DA (1981) Sequence and gene organization of mouse mitochondrial DNA. Cell 26:167 180 Boore JL, Brown WM (1994a) Complete DNA sequence of the mitochondrial genome of the black chiton, Katharina tunicata. Genetics 138:423 443 Boore JL, Brown WM (1994b) Mitochondrial genomes and the phylogeny of mollusks. Nautilus S2:61 78 Boore JL, Brown WM (1995) Complete sequence of the mitochondrial DNA of the annelid worm Lumbricus terrestris. Genetics 141:305 319 Boore JL, Collins TM, Stanton D, Daehler TL, Brown WM (1995) Deducing the pattern of arthropod phylogeny from mitochondrial DNA rearrangements. Nature 376:163 165 Clary DO, Wolstenholme DR (1985) The mitochondrial DNA of Drosophila yakuba: Nucleotide sequence, gene organization, and genetic code. J Mol Evol 22:252 271 Crossland S (1994) Genetic variation in Littorina species, PhD thesis. University of Leeds, Leeds Desjardins P, Morais R (1990) Sequence and gene organization of the chicken mitochondrial genome: A novel gene order in higher vertebrates. J Mol Biol 212:599 634 Desjardins P, Ramirez V, Morais R (1990) Gene organization of the Peking duck mitochondrial genome. Curr Genet 17:515 518 Devereux J, Haberli P, Smithies O (1984) A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Res 12: 387 395 Folmer O, Black M, Hoeh W, Lutz R, Vrijenhoek R (1994) DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol Mar Biol Biotech 3:294 299 Hatzoglou E, Rodakis GC, Lecanidou R (1995) Complete sequence and gene organization of the mitochondrial genome of the land snail Albinaria coerulea. Genetics 140:1353 1366 Hoffmann RJ, Boore JL, Brown WM (1992) A novel mitochondrial genome organization for the blue mussel, Mytilus edulis. Genetics 131:397 412 Jacobs HT, Elliott DJ, Math VB, Farquarson A (1988) Nucleotide sequence and gene organization of sea urchin mitochondrial DNA. J Mol Biol 202:185 217 Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105 132 Lowe TM and Eddy SR (1997) trnascan-se: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955 964 Lunt DH, Zhang D-X, Szymura JM, Hewitt GM (1996) The insect cytochrome oxidase I gene: evolutionary patterns and conserved primers for phylogenetic studies. Insect Mol Biol 5:153 165 Macey RJ, Larson A, Ananjeva NB, Papenfuss TJ (1997) Replication slippage may cause parallel evolution in the secondary structures of mitochondrial transfer RNAs. Mol Biol Evol 14:30 39 Ojala D, Montoya J, Attardi G (1981) trna punctuation model of RNA processing in human mitochondria. Nature 290:470 474 Pääbo S, Thomas WK, Whitfield KM, Kumazawa Y, Wilson AC (1991) Rearrangements of mitochondrial transfer RNA genes in marsupials. J Mol Evol 33:426 430 Ponder WF, Lindberg DR (1996) Gastropod phylogeny-challenges for the 90s. In: Taylor J (ed) Origin and evolutionary radiation of the mollusca. Malacological Society of London, London, pp 135 154 Ponder WF, Linberg DR (1997) Towards a phylogeny of gastropod molluscs: An analysis using morphological characters. Zool J Linn Soc 119:83 265 Staton JL, Daehler LL, Brown WM (1997) Mitochondrial gene rearrangement of the horseshoe crab Limulus polyphemus L.: Conservation of major features across arthropod classes. Mol Biol Evol 14:867 874 Terrett JA, Miles S, Thomas RH (1996) Complete DNA sequence of the mitochondrial genome of Cepaea nemoralis (Gastropoda: Pulmonata). J Mol Evol 42:160 168 Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673 4680 Winnepenninckx B, Steiner G, Backeljau T, De Wachter R (1998) Details of gastropod phylogeny inferred from 18S rrna sequences. Mol Phyl Evol 9:55 63 Wolstenholme DR (1992) Animal mitochondrial DNA: Structure and evolution. Int Rev Cytol 141:173 216 Yamazaki N, Ueshima R, Terrett JA, Yokobori S, Kaifu M, Segawa R, Kobayashi T, Numachi K, Ueda T, Nishikawa K, Watanabe K, Thomas RH (1997) Evolution of pulmonate gastropod mitochondrial genomes: Comparisons of gene organizations of Euhadra, Cepaea and Albinaria and implications of unusual trna secondary structures. Genetics 145:749 758