UNIT (12) MOLECULES OF LIFE: NUCLEIC ACIDS

Nucleic acids are extremely large molecules that were first isolated from the nuclei of cells. Two kinds of nucleic acids are found in cells:

RNA (ribonucleic acid) is found mainly in the cytoplasm of living cells.
DNA (deoxyribonucleic acid) is found primarily in the nucleus of cells.

Both RNA and DNA are large polymers containing repeating structural units, or monomers, called nucleotides.

12.1 Components of Nucleic Acids

A nucleotide is composed of three units: an organic base, a sugar, and a phosphate.

A) Organic Bases

The organic bases found in nucleic acids are derivatives of pyrimidine or purine. Pyrimidine is a six-membered heterocyclic ring. A heterocyclic ring is a ring compound whose ring atoms are not all the same element (here, carbon and nitrogen). Purine is a fused-ring compound containing a six-membered ring connected to a five-membered ring.

[Figure: structures of pyrimidine and purine]

The three pyrimidine derivatives found in nucleic acids are cytosine (C), thymine (T), and uracil (U). They are commonly identified by the first letter of their name, which is always capitalized.

[Figure: structures of cytosine (C) (DNA and RNA), thymine (T) (DNA only), and uracil (U) (RNA only)]

The two purine derivatives found in nucleic acids are adenine (A) and guanine (G).

[Figure: structures of adenine (A) (DNA and RNA) and guanine (G) (DNA and RNA)]

Adenine, guanine, and cytosine are found in both DNA and RNA. Thymine is found only in DNA, while uracil is found only in RNA. Thymine and uracil are often used to differentiate DNA from RNA.

B) Sugars

The five-carbon sugar in nucleic acids is ribose or a ribose derivative. In RNA the sugar is ribose; in DNA it is 2'-deoxyribose. The only difference between these two sugars is at the 2' carbon of the ring. Ribose has a hydroxyl group (-OH) bound to this carbon, while deoxyribose has a hydrogen atom ("deoxy" means "no oxygen").

[Figure: ribose in RNA and deoxyribose in DNA, with the ring carbons numbered 1' to 5'; in deoxyribose no oxygen is bonded to the 2' carbon]

Notice that the carbon atoms in five-carbon sugars are numbered with primes (1', 2', 3', 4', and 5'). This is done to differentiate them from the atoms in the nitrogenous bases (purines and pyrimidines).

C) Phosphate Group

The third component of a nucleotide is derived from phosphoric acid (H3PO4). Phosphoric acid contains three ionizable hydrogen atoms, and it can exist in one of the following four forms depending on the pH of the solution.

[Figure: the four forms of phosphoric acid: H3PO4 (present at low pH), H2PO4(-) and HPO4(2-) (present at physiological pH), and PO4(3-) (present at high pH)]

12.2 Nucleosides and Nucleotides

When ribose or 2'-deoxyribose is combined with a purine or pyrimidine base, a nucleoside is formed. A nucleoside is basically a nucleotide that is missing the phosphate portion.

Sugar + Base → Nucleoside

The formation of a nucleoside (in this case adenosine) can be shown as:

[Figure: ribose + adenine → adenosine]

Note that the name adenine changes to adenosine when the base is used to form a nucleoside. These subtle changes must be recognized because they identify different structures. Other nitrogenous bases (purines and pyrimidines) also have subtle name changes when used to form nucleosides. The table below lists the names.

Names of Nucleosides

       Base        Nucleoside
RNA    Adenine     Adenosine
       Guanine     Guanosine
       Cytosine    Cytidine
       Uracil      Uridine
DNA    Adenine     Deoxyadenosine
       Guanine     Deoxyguanosine
       Cytosine    Deoxycytidine
       Thymine     Deoxythymidine

A phosphate ion reacts with one of the -OH groups on the sugar residue of a nucleoside to form a phosphate monoester, and a nucleotide is produced. This commonly occurs at the -OH attached to the 5' carbon.

Phosphate(s) + Nucleoside → Nucleotide

The formation of a nucleotide from a nucleoside and a phosphate is shown below:

[Figure: phosphate + adenosine → adenosine-5'-monophosphate (AMP) + H2O]

Names of Nucleotides

       Base        Nucleoside        Nucleotide
RNA    Adenine     Adenosine         Adenosine-5'-monophosphate (AMP)
       Guanine     Guanosine         Guanosine-5'-monophosphate (GMP)
       Cytosine    Cytidine          Cytidine-5'-monophosphate (CMP)
       Uracil      Uridine           Uridine-5'-monophosphate (UMP)
DNA    Adenine     Deoxyadenosine    Deoxyadenosine-5'-monophosphate (dAMP)
       Guanine     Deoxyguanosine    Deoxyguanosine-5'-monophosphate (dGMP)
       Cytosine    Deoxycytidine     Deoxycytidine-5'-monophosphate (dCMP)
       Thymine     Deoxythymidine    Deoxythymidine-5'-monophosphate (dTMP)

12.3 Polynucleotides

A polynucleotide chain is formed by connecting several nucleotides in succession. RNA is a polynucleotide that, upon hydrolysis, yields D-ribose, phosphoric acid, and the four bases adenine, guanine, cytosine, and uracil. DNA is a polynucleotide that yields D-2'-deoxyribose, phosphoric acid, and the four bases adenine, guanine, cytosine, and thymine. Nucleotides can be connected to one another to form oligonucleotides (2 to 10 nucleotide residues) and polynucleotides (more than 10 nucleotides).

12.4 The Structure of DNA

Each cell in a particular living organism contains essentially the same DNA. In plant and animal cells, most of the DNA is found in the cell nucleus. The size of the DNA polymer is loosely related to the complexity of the organism; more complex organisms tend to have larger molecules of DNA, while less complex organisms tend to have smaller ones. The DNA in simple bacteria contains about 8 million nucleotides, whereas a single human DNA molecule contains up to about 500 million nucleotides.

In Unit 11, we learned that proteins have primary, secondary, and higher structures. Nucleic acids are also chains of monomeric units that have primary, secondary, and higher structures.

Primary Structure of DNA

The primary structure of DNA is simply the sequence of nucleotides. The sugar-phosphate chain is called the DNA backbone, and it is constant throughout the entire DNA molecule. The variable portion of DNA is the sequence of nitrogenous bases. A diagram of a nucleic acid is shown below.

[Figure: nucleic acid backbone of alternating phosphate and sugar units, with a base (B) attached to each sugar]

The phosphate groups link the 3' carbon of one sugar (deoxyribose or ribose) to the 5' carbon of the next sugar. The following illustrates the structure of ACGT (a tetranucleotide). It is an example of the structural formula of part of a DNA molecule (note the presence of thymine, therefore DNA).

A strand of DNA has two distinct ends: one is a 5'-phosphate end and the other is a 3'-hydroxyl end. By convention, a nucleic acid sequence is always read in the 5' to 3' direction, that is, from the sugar with the free 5'-phosphate to the sugar with the free 3'-hydroxyl group. The order of nucleotides is generally written using the capitalized first letter of the name of each base. As stated above, the following structure is written ACGT (in the 5' to 3' direction).

[Figure: structural formula of the tetranucleotide ACGT, showing adenine, cytosine, guanine, and thymine from the 5' end to the 3' end]

Secondary Structure of DNA: The DNA Double Helix

The secondary structure of DNA was proposed by James Watson and Francis Crick in 1953. This was perhaps the greatest discovery of modern biology and one of the most remarkable and profound events in the history of science. Watson and Crick concluded that DNA is a double helix containing two polynucleotide strands wound as if around a central axis. A good analogy is a rope ladder fixed at one end to the top of a pole and then wound downward around it without twisting the ladder.

The two polynucleotide strands are connected by hydrogen bonds formed between a purine on one strand and a pyrimidine on the other. In DNA, adenine is always paired with thymine and guanine is always paired with cytosine. The pairs A-T and G-C are called complementary base pairs. Revisiting the rope ladder analogy, the two pieces of rope (the two polynucleotide strands) are connected by the rungs of the ladder (hydrogen bonding between complementary base pairs).

According to the base-pairing rules discovered by Watson and Crick, each A is bound to T and each G is bound to C. Therefore, the total number of A's in any molecule of DNA must equal the total number of T's (the same is true of G and C). Thus, the % of A in DNA must equal the % of T (the same is true of G and C). The total percent of A, T, G, and C must, of course, equal 100%.

Always %A = %T and %C = %G

Human DNA contains about 30% adenine, 30% thymine, 20% cytosine, and 20% guanine.

[Figure: Base pairing. Hydrogen bonding between the complementary base pairs adenine/thymine and cytosine/guanine, with each base attached to a sugar]

Notice that A-T pairing has two hydrogen bonds (AT is a two-letter word) and G-C pairing has three hydrogen bonds.

One important feature of the DNA double helix is that the two strands run antiparallel to one another; that is, the two strands run in opposite directions, one in the 5' to 3' direction and the other in the 3' to 5' direction. Therefore, each end of the double helix contains the 5' end of one strand (5'-phosphate) and the 3' end of the other (3'-OH).
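The rule %A = %T and %C = %G means that a single measured percentage fixes the other three. The short Python sketch below illustrates that arithmetic; the function name `base_composition` is made up for this illustration, and it assumes ideal double-stranded DNA in which the pairing rule holds exactly.

```python
def base_composition(percent_a):
    """Return the percent of each base in double-stranded DNA, given %A.

    Assumes the pairing rule holds exactly: %A = %T and %G = %C.
    """
    if not 0 <= percent_a <= 50:
        raise ValueError("%A must lie between 0 and 50 in double-stranded DNA")
    percent_t = percent_a                          # every A is paired with a T
    percent_g = (100 - percent_a - percent_t) / 2  # the rest is split equally
    percent_c = percent_g                          # every G is paired with a C
    return {"A": percent_a, "T": percent_t, "G": percent_g, "C": percent_c}


# The human DNA figures quoted above: 30% A leaves 20% each for G and C.
print(base_composition(30))
```

With 30% adenine this reproduces the 30/30/20/20 split quoted for human DNA.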

[Figure: two antiparallel DNA strands, one running 5' to 3' and the other 3' to 5', joined by A-T and C-G base pairs]

DNA is responsible for the storage and transmission of hereditary information. A human cell normally contains 46 chromosomes. Each chromosome contains one molecule of DNA bound to a group of proteins called histones. A gene is a segment of DNA that carries a single, specific command, for example, "make a globin molecule."

Practice 12-1

Write the complementary strand of DNA to the following sequence.

5' A-C-T-C-G-G-T-A-A 3'

Answer

Remember, A pairs with T and G pairs with C. Go through the original 5' to 3' sequence, pairing each A with T, each T with A, each C with G, and each G with C. Keep in mind that the complementary strand will read from left to right in the 3' to 5' direction. Therefore, the complementary strand starts with 3' and ends with 5'.

Original strand         5' A-C-T-C-G-G-T-A-A 3'
Complementary strand    3' T-G-A-G-C-C-A-T-T 5'

12.5 DNA Replication

When a cell divides, each of the resulting daughter cells receives a copy of DNA that is nearly identical to the DNA of the parent cell. Replication is the biological process that duplicates the DNA molecule. In DNA replication, the parent double helix unzips, forming two separate strands called templates. These templates provide the base sequences used to synthesize new DNA (daughter) strands. Replication is a very complicated enzyme-catalyzed process. Enzymes are needed to unwind the DNA prior to replication and to repackage the DNA after synthesis.

12.6 Ribonucleic Acid (RNA)

One of the main functions of DNA is to direct the synthesis of RNA molecules. There are four major differences between RNA molecules and DNA molecules.

1) RNA contains ribose sugar units rather than deoxyribose.
2) RNA contains the base uracil instead of thymine.
3) RNA is single stranded, except in some viruses.
4) RNA molecules are much smaller than DNA molecules.

Types of RNA Molecules

There are three classes of RNA:

Messenger RNA (mRNA) carries genetic information from DNA to the ribosomes and serves as a template for protein synthesis.
Transfer RNA (tRNA) delivers individual amino acids to the site of protein synthesis.
Ribosomal RNA (rRNA) combines with a series of proteins to form ribosomes, the physical site of active protein synthesis.
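Returning to Practice 12-1, the pairing procedure used there can be checked mechanically. The following is a minimal Python sketch, not part of the original notes; `DNA_PAIRS` and `complementary_strand` are names chosen for this illustration, and the function assumes its input is written 5' to 3' with optional hyphens between bases.

```python
# Complementary base pairs in DNA: A with T, G with C.
DNA_PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complementary_strand(strand_5_to_3):
    """Pair each base of a 5'->3' DNA strand with its complement.

    The result is written left to right in the 3'->5' direction,
    matching the layout used in Practice 12-1.
    """
    bases = strand_5_to_3.replace("-", "").upper()
    return "-".join(DNA_PAIRS[base] for base in bases)


original = "A-C-T-C-G-G-T-A-A"
print("Original strand       5'", original, "3'")
print("Complementary strand  3'", complementary_strand(original), "5'")
```

Because the strands are antiparallel, the output is left in the 3' to 5' direction rather than reversed, exactly as in the written answer.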

12.7 Gene Expression and Protein Synthesis

The central dogma (something held as an established opinion) of molecular biology states that the information contained in DNA molecules is transferred to RNA molecules and is subsequently expressed in the structure of proteins. More simply stated: DNA produces RNA, which produces proteins. Gene expression is the activation ("turning on") of a gene to produce a specific protein. Two steps are involved in the flow of genetic information: transcription and translation.

Transcription: Synthesis of mRNA

Transcription is the process of mRNA synthesis from a single-stranded DNA template. The enzyme that catalyzes transcription is called RNA polymerase. Transcription begins when a portion of the DNA double helix unwinds near the gene to be expressed. Ribonucleotides assemble along the unwound DNA strand according to complementary base pairing. There is no change in G-C base pairing: G or C on DNA pairs with C or G on mRNA. There is a significant difference in A-T base pairing: T on DNA pairs with A on mRNA, but A on DNA pairs with U on mRNA. Recall that RNA contains no thymine (T); it has uracil (U) instead. Remember: NEVER write T in mRNA (see Worked Example 12-1).

When RNA polymerase reaches the termination site, transcription ends and the newly formed mRNA is released. The unwound portion of the DNA returns to its double helix configuration.

[Figure: hydrogen bonding between a DNA template and its RNA transcript, pairing G with C, A with U, C with G, and T with A]

Worked Example 12-1

Write the mRNA produced from the following DNA template: 3' G-A-A-C-T 5'.

Solution

The bases on the DNA template are paired with their complementary bases to form mRNA. Remember C with G, G with C, T (on DNA) with A (on RNA), and A (on DNA) with U (on RNA).

There are two ways to approach this problem. Some find it easier to simply memorize the base pairings above and apply them to mRNA synthesis. Others choose to associate mRNA synthesis with the procedure used previously to write complementary strands of DNA and simply replace all T's with U's.

Applying memorized base pairings:
DNA template                         3' G-A-A-C-T 5'
Complementary bases in mRNA          5' C-U-U-G-A 3'

Associating with DNA synthesis:
DNA template                         3' G-A-A-C-T 5'
Complementary bases in DNA           5' C-T-T-G-A 3'
Change all T's to U (no T in RNA)    5' C-U-U-G-A 3'

Practice 12-2

What is the DNA template that codes for the mRNA segment with the nucleotide sequence 5' G-C-U-A-G-U 3'?

Answer

Again, there are two ways to approach this problem.

Memorizing base-pair rules:
Bases in mRNA                        5' G-C-U-A-G-U 3'
Portion of DNA template              3' C-G-A-T-C-A 5'

Associating with DNA synthesis:
Bases in mRNA                        5' G-C-U-A-G-U 3'
Change all U's to T (no U in DNA)    5' G-C-T-A-G-T 3'
Follow normal base-pairing rules     3' C-G-A-T-C-A 5'
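The "memorized base pairings" approach amounts to a lookup table applied one base at a time. The Python sketch below shows this for both directions, template to mRNA and mRNA back to template. It is an illustration only; the names `TEMPLATE_TO_MRNA`, `MRNA_TO_TEMPLATE`, `transcribe`, and `template_for` are invented here, and templates are assumed to be written 3' to 5' so that the mRNA reads 5' to 3' from left to right.

```python
# DNA template base -> mRNA base (A on DNA pairs with U, never T, on mRNA).
TEMPLATE_TO_MRNA = {"G": "C", "C": "G", "T": "A", "A": "U"}
# mRNA base -> DNA template base (U on mRNA pairs with A on DNA).
MRNA_TO_TEMPLATE = {"G": "C", "C": "G", "U": "A", "A": "T"}


def _pair_bases(sequence, pairing):
    """Replace every base in a hyphenated sequence using the pairing table."""
    bases = sequence.replace("-", "").upper()
    return "-".join(pairing[base] for base in bases)


def transcribe(template_3_to_5):
    """Return the mRNA (written 5'->3') for a DNA template written 3'->5'."""
    return _pair_bases(template_3_to_5, TEMPLATE_TO_MRNA)


def template_for(mrna_5_to_3):
    """Return the DNA template (written 3'->5') for an mRNA written 5'->3'."""
    return _pair_bases(mrna_5_to_3, MRNA_TO_TEMPLATE)


print("mRNA      5'", transcribe("G-A-A-C-T"), "3'")      # Worked Example 12-1
print("template  3'", template_for("G-C-U-A-G-U"), "5'")  # Practice 12-2
```

Keeping two separate tables makes the T/U asymmetry explicit: a T passed to `template_for`, or a U passed to `transcribe`, raises a `KeyError` instead of silently producing a wrong strand.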

Post-transcription

The RNA produced from gene activation in transcription is a pre-mRNA. The pre-mRNA contains two kinds of segments: those that code for amino acids (exons) and those that carry no code for amino acids (introns). An exon is a gene segment that conveys (codes for) genetic information. An intron is a gene segment that does not convey (code for) genetic information. Splicing is the process of removing the introns from the pre-mRNA molecule and joining (splicing) the remaining exons together to form an mRNA molecule.

[Figure: splicing. Pre-mRNA: Exon-Intron-Exon-Intron-Exon-Intron; the introns are cut out and the exons are joined together to form the mRNA]

12.8 The Genetic Code

The information carried on the mRNA will be used to produce proteins. The mRNA sequence is read three bases (a triplet) at a time, and each segment of three bases is called a codon. Each codon specifies a particular amino acid in the primary structure of the protein (its sequence of amino acids). There are 64 different codons, and each could possibly appear on the mRNA molecule. A triplet arrangement of adenine (A), guanine (G), cytosine (C), or uracil (U) results in a total of 4 x 4 x 4 = 64 different combinations (64 different sets of 3 bases). It has been found that 61 of the 64 codons identify specific amino acids; the other three combinations are termination codons ("stop" signals) for protein synthesis. Codons have been determined for all 20 amino acids.

The genetic code is the assignment of the 64 mRNA codons to specific amino acids (or stop signals). One important characteristic of the genetic code is that it is almost universal. With minor exceptions, the triplet codons represent the same amino acids in every organism. Another interesting feature of the genetic code is that it is highly degenerate: many amino acids are designated by more than one codon. This allows for slight mutations in the code without changing the amino acid; for example, glycine is represented by four codons.
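The count of 64 codons can be confirmed by listing every ordered triplet of the four bases. The Python sketch below does this with the standard-library function `itertools.product`. The variable names are illustrative, and the statement that the four glycine codons all begin with GG is taken from the codon tables for this unit.

```python
from itertools import product

BASES = "UCAG"  # the four bases that can appear in mRNA

# Every ordered triplet of bases: 4 choices x 4 choices x 4 choices.
codons = ["".join(triplet) for triplet in product(BASES, repeat=3)]
print(len(codons))  # 64

# Degeneracy example: the glycine codons are the four triplets starting with GG,
# so a change in the third base still gives glycine.
glycine_codons = [codon for codon in codons if codon.startswith("GG")]
print(glycine_codons)
```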

The 64 possible codons for mRNA are given in Tables 1 and 2. It should be noted that the codons are always read in the 5' to 3' direction on the mRNA strand. The two tables contain the same information. The first table is used if an amino acid is given and the triplet code is asked for. The second table is used if the triplet code is given and the amino acid is asked for. You will NOT be required to memorize the tables.

Table (1) For a given amino acid, find the triplet codon.

Amino acid                Codons                           Number of codons
Alanine                   GCA, GCC, GCG, GCU               4
Arginine                  AGA, AGG, CGA, CGC, CGG, CGU     6
Asparagine                AAC, AAU                         2
Aspartic acid             GAC, GAU                         2
Cysteine                  UGC, UGU                         2
Glutamic acid             GAA, GAG                         2
Glutamine                 CAA, CAG                         2
Glycine                   GGA, GGC, GGG, GGU               4
Histidine                 CAC, CAU                         2
Isoleucine                AUA, AUC, AUU                    3
Leucine                   CUA, CUC, CUG, CUU, UUA, UUG     6
Lysine                    AAA, AAG                         2
Methionine, initiation    AUG                              1
Phenylalanine             UUC, UUU                         2
Proline                   CCA, CCC, CCG, CCU               4
Serine                    UCA, UCC, UCG, UCU, AGC, AGU     6
Threonine                 ACA, ACC, ACG, ACU               4
Tryptophan                UGG                              1
Tyrosine                  UAC, UAU                         2
Valine                    GUA, GUC, GUG, GUU               4
Stop signals              UAG, UAA, UGA                    3
Total number of codons                                     64

Table (2) Triplet codes to assigned amino acids

First Base    Second Base    Third Base
                             U       C       A       G
U             U              Phe     Phe     Leu     Leu
              C              Ser     Ser     Ser     Ser
              A              Tyr     Tyr     Stop    Stop
              G              Cys     Cys     Stop    Trp
C             U              Leu     Leu     Leu     Leu
              C              Pro     Pro     Pro     Pro
              A              His     His     Gln     Gln
              G              Arg     Arg     Arg     Arg
A             U              Ile     Ile     Ile     Met
              C              Thr     Thr     Thr     Thr
              A              Asn     Asn     Lys     Lys
              G              Ser     Ser     Arg     Arg
G             U              Val     Val     Val     Val
              C              Ala     Ala     Ala     Ala
              A              Asp     Asp     Glu     Glu
              G              Gly     Gly     Gly     Gly

Practice 12-3

Answer the following:
a) What codons specify tyrosine?
b) What amino acid is coded by CCG?

Answer

a) Table (1): UAC and UAU.
b) Table (2): Pro (proline).
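Both lookups in Practice 12-3 can be done against a single dictionary keyed by codon. The sketch below builds that dictionary in Python from a 64-character string of one-letter amino acid codes laid out in the same first/second/third base order (U, C, A, G) as Table (2). The string layout and all names (`CODON_TABLE`, `codons_for`, and so on) are conveniences of this sketch, not something defined in these notes.

```python
from itertools import product

BASES = "UCAG"
# One-letter amino acid codes for all 64 codons, in the order UUU, UUC, UUA,
# UUG, UCU, ... GGG (third base changes fastest). "*" marks a stop signal.
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
THREE_LETTER = {
    "F": "Phe", "L": "Leu", "S": "Ser", "Y": "Tyr", "C": "Cys", "W": "Trp",
    "P": "Pro", "H": "His", "Q": "Gln", "R": "Arg", "I": "Ile", "M": "Met",
    "T": "Thr", "N": "Asn", "K": "Lys", "V": "Val", "A": "Ala", "D": "Asp",
    "E": "Glu", "G": "Gly", "*": "Stop",
}

# Table (2) direction: codon -> amino acid.
CODON_TABLE = {
    "".join(triplet): THREE_LETTER[letter]
    for triplet, letter in zip(product(BASES, repeat=3), ONE_LETTER)
}


def codons_for(amino_acid):
    """Table (1) direction: amino acid -> alphabetical list of its codons."""
    return sorted(c for c, name in CODON_TABLE.items() if name == amino_acid)


print(codons_for("Tyr"))   # Practice 12-3 a)
print(CODON_TABLE["CCG"])  # Practice 12-3 b)
```

Storing only the codon-to-amino-acid direction and deriving the reverse lookup on demand keeps the two directions from drifting out of step.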

Translation: Protein Synthesis

The process of protein synthesis from mRNA is called translation. Proteins contain amino acids and mRNA contains nucleotides; we think of these as different languages, so we "translate" mRNA into proteins.

To direct the synthesis of a particular protein, the mRNA migrates out of the nucleus and into the cytoplasm, where it binds to structures called ribosomes. The transfer RNAs (tRNAs) deliver individual amino acids to the mRNA as each codon is read. In the simplest picture, there are 61 different tRNAs, one for each of the 61 codons that specify an amino acid. A typical tRNA is roughly the shape of a cloverleaf, as shown below.

[Figure: cloverleaf structure of a tRNA, showing the amino acid attachment site and the anticodon]

Each tRNA molecule carries a three-base sequence called an anticodon that specifies which amino acid it will deliver.

Anticodon: a sequence of three nucleotides on tRNA, complementary to the codon on mRNA.

For example, the codon sequence UGG on an mRNA is read by a tRNA having the complementary anticodon sequence ACC and carrying a tryptophan. Successive codons on the mRNA are read, and the appropriate tRNAs bring the correct amino acids into position for enzyme-mediated transfer to the growing peptide. When synthesis of the protein is complete, a stop codon signals the end of translation and the protein is released from the ribosome.
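Reading codons, matching anticodons, and stopping at a stop codon can be traced in a few lines of Python. This is a simplified sketch. The mRNA string is a made-up example, `PARTIAL_CODON_TABLE` contains only the handful of codons from Table (2) that the example needs, and the function names are invented for this illustration.

```python
# Only the codons used in the example below; see Table (2) for the full code.
PARTIAL_CODON_TABLE = {
    "AUG": "Met", "UGG": "Trp", "CCG": "Pro", "UAC": "Tyr",
    "UAA": "Stop", "UAG": "Stop", "UGA": "Stop",
}
RNA_PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def anticodon(codon):
    """Return the tRNA anticodon, base by base, that pairs with an mRNA codon."""
    return "".join(RNA_PAIRS[base] for base in codon)


def translate(mrna):
    """Read an mRNA (5'->3') three bases at a time until a stop codon."""
    peptide = []
    for start in range(0, len(mrna) - 2, 3):
        codon = mrna[start:start + 3]
        amino_acid = PARTIAL_CODON_TABLE[codon]
        if amino_acid == "Stop":
            break  # stop codon: the finished chain is released
        peptide.append(amino_acid)
    return peptide


print(anticodon("UGG"))                         # the tryptophan example above
print("-".join(translate("AUGUGGCCGUACUAA")))  # hypothetical mRNA
```

For the hypothetical mRNA the codons are AUG, UGG, CCG, UAC, and UAA, so the chain Met-Trp-Pro-Tyr is built before the stop codon ends translation.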

Homework Problems

12.1 Draw the structures of the following nucleosides:
a. uridine
b. deoxythymidine

12.2 Draw the structure of the dinucleotide CG that would be in RNA.

12.3 Draw a structure showing the hydrogen bonding between uracil and adenine, and compare it with that of adenine and thymine.

12.4 Write the base sequence in a new DNA segment if the original segment has the following base sequence:
a. 5' C T G T A T A C G T T A 3'
b. 5' A G T C C A G G T 3'

12.5 What is the difference between a codon and an anticodon?

12.6 A segment of a DNA strand consists of GCTTAGACCTGA.
a. What is the nucleotide order in the complementary mRNA?
b. What is the anticodon order in the tRNA?
c. What is the sequence of amino acids coded by the DNA?

12.7 Consider the following portion of mRNA produced by a normal order of DNA nucleotides: 5' ACC-AGU-AGG-GUU 3'
a. What is the amino acid order produced for normal DNA?
b. What is the amino acid order if a mutation changes AGU to ACA?
c. What is the amino acid order if a mutation changes AGG to GGG?
d. What happens to protein synthesis if a mutation changes AGU to UGA?