From DNA to Protein

DNA directs the cell’s activities by telling it what proteins to make and when. These proteins form structural elements in the cell and regulate the production of other cell products. By controlling protein synthesis, DNA is hugely important in directing life.

Protein synthesis is a two-step process. DNA resides in the nucleus, but proteins are made in the cytoplasm. The cell copies the information held in DNA onto RNA molecules in a process called transcription. Proteins are synthesized at the ribosomes from the codes in RNA in a process called translation.

Before getting into the way that the information on DNA can be transcribed and then translated into protein, we have to spend some time studying the major players in the process: DNA and RNA.

DNA and the Genetic Code

The sequence of nucleotides in DNA makes up a code that controls the functions of the cell by telling it what proteins to produce. Cells need to be able to produce 20 different amino acids in order to produce all the proteins necessary to function. DNA, however, has only four nitrogen bases. How can these four bases code for the 20 amino acids? If adenine, thymine, guanine, and cytosine each coded for one particular amino acid, DNA would only be able to code for four amino acids. If two bases were used to specify an amino acid, there would only be room to code for 16 () different amino acids.

In order to be able to code for 20 amino acids, it is necessary to use three bases (which offer a total of 64 coding combinations) to code for each amino acid. These triplets of nucleotides that make up a single coding group are called codons or genes. Two examples of codons are CAG, which codes for the amino acid glutamine, and CGA, which codes for arginine.

Codons are always read in a non-overlapping sequence. This means that any one nucleotide can only be a part of one codon. Given the code AUGCA, AUG could be a codon for the amino acid methionine, with CA starting a new codon. Alternatively, GCA could be a codon specifying alanine, while the initial AU was the last two letters of a previous codon. But AUG and GCA cannot both be codons at the same time.

Degeneracy of the Genetic Code

There are 64 codons but only 20 amino acids. What happens to the other 44 coding possibilities? It happens that some of the different codons call for the same amino acid. The genetic code is said to be degenerate because of its redundancy.

Experiments have shown that there are also three stop codons, which signal when a protein is fully formed, and one start codon, which signals the beginning of an amino acid sequence.

Mutations of the Genetic Code

Since the sequence of nucleotides in DNA determines the order of amino acids in proteins, a change or error in the DNA sequence can affect a protein’s function. These errors or changes in the DNA sequence are called mutations.

There are two basic types of mutations: substitution mutations and frameshift mutations.

Substitution Mutation

A substitution mutation occurs when a single nucleotide is replaced by a different nucleotide. The effects of substitution mutations can vary. Certain mutations might have no effect at all: these are called silent mutations. For instance, because the genetic code is degenerate, if the particular codon GAA becomes GAG, it will still code for the amino acid glutamate and the function of the cell will not change. Other substitution mutations can drastically affect cellular and organismal function. Sickle-cell anemia, which cripples human red blood cells, is caused by a substitution mutation. A person will suffer from sickle-cell anemia if he has the amino acid valine in his hemoglobin rather than glutamic acid. The codon for valine is GUA or GUG, while the codon for glutamic acid is GAA or GAG. A simple substitution of A for U results in the disease.

Frameshift Mutation

A frameshift mutation occurs when a nucleotide is wrongly inserted or deleted from a codon. Both types of frameshifts usually have debilitating or lethal effects. An insertion or deletion will affect every codon in a particular genetic sequence by throwing the entire three-by-three codon structure out of whack. For example, if the A in the GAU were to be deleted, the code:

GAU GAC UCC GCU AGG

would become:

GUG ACU CCG CUA GG

and code for an entirely different set of amino acids in translation. The results of such mutations on an organism are usually catastrophic.

The only sort of frameshift mutation that might not have dire effects is one in which an entire codon is inserted or deleted. This will result in the gain or loss of one amino acid but will not affect surrounding codons.

Chromosomes

Even the tiniest cells contain meters upon meters of DNA. With the aid of special proteins called histones, this DNA is coiled into an entangled fibrous mass called chromatin. When it comes time for the cell to replicate (a process covered later in this chapter), these masses gather into a number of discrete compact structures called chromosomes.

In eukaryotes, the chromosomes are located in the nucleus of the cell. Prokaryotes don’t have a nucleus: their DNA is located in a single chromosome that is joined together in a ring. This ring chromosome is found in the cytoplasm. In this chapter, when we talk about chromosomes, we will be referring to eukaryotic chromosomes.

Different eukaryotes have varying numbers of chromosomes. Humans, for example, have 46 chromosomes arranged in 23 pairs. (Dogs have 78 chromosomes in 39 pairs. A larger number of chromosomes is not a sign of greater biological sophistication.) The total number of distinct chromosomes in a cell is the cell’s diploid number.

The cells in a human body that are not passed down to offspring, called somatic cells, contain chromosomes in two closely related sets—one set of 23 each from a person’s mother and father—making up a total of 46 chromosomes. These sets pair up, and the pairs are known as homologous chromosomes. Each homologous pair consists of one maternal and one paternal chromosome. The haploid number of a cell refers to half of the total number of chromosomes in a cell (half the diploid number), or the number of homologous pairs in somatic cells.

In humans and other higher animals, only the sex cells that are passed on to offspring have the haploid number of chromosomes. These sex cells are also called gametes.

RNA

Ribonucleic acid (RNA) helps DNA turn stored genetic messages into proteins. As discussed in the Biochemistry chapter, RNA monomers (nucleotides) are similar to those of DNA, but with three crucial differences:

  • DNA’s five-carbon sugar is deoxyribose. RNA nucleotides contain a slightly different sugar, called ribose.
  • RNA uses the nitrogenous base uracil in place of DNA’s thymine.
  • The RNA molecule takes the form of a single helix—half a spiral ladder—as compared with the double helix structure of DNA.

Two different types of RNA play important roles in protein synthesis. During transcription, DNA is copied to make messenger RNA (mRNA), which then leaves the nucleus to bring its still encoded information to the ribosomes in the cytoplasm. In order to use the information contained in the transcribed mRNA to make a protein, a second type of RNA is used. Transfer RNA (tRNA) moves amino acids to the site of protein synthesis at the ribosome according to the code specified by the mRNA strand. There are many different tRNAs, each of which bond to a different amino acid and the mRNA sequence corresponding to that amino acid.


Protein Synthesis

Now that we’ve described DNA and RNA, it’s time to take a look at the process of protein synthesis. The synthesis of proteins takes two steps: transcription and translation. Transcription takes the information encoded in DNA and encodes it into mRNA, which heads out of the cell’s nucleus and into the cytoplasm. During translation, the mRNA works with a ribosome and tRNA to synthesize proteins.


Transcription

The first step in transcription is the partial unwinding of the DNA molecule so that the portion of DNA that codes for the needed protein can be transcribed. Once the DNA molecule is unwound at the correct location, an enzyme called RNA polymerase helps line up nucleotides to create a complementary strand of mRNA. Since mRNA is a single-stranded molecule, only one of the two strands of DNA is used as a template for the new RNA strand.


The new strand of RNA is made according to the rules of base pairing:

  • DNA cytosine pairs with RNA guanine
  • DNA guanine pairs with RNA cytosine
  • DNA thymine pairs with RNA adenine
  • DNA adenine pairs with RNA uracil

For example, the mRNA complement to the DNA sequence TTGCAC is AACGUG. The SAT II Biology frequently asks about the sequence of mRNA that will be produced from a given sequence of DNA. For these questions, don’t forget that RNA uses uracil in place of thymine.

After transcription, the new RNA strand is released and the two unzipped DNA strands bind together again to form the double helix. Because the DNA template remains unchanged after transcription, it is possible to transcribe another identical molecule of RNA immediately after the first one is complete. A single gene on a DNA strand can produce enough RNA to make thousands of copies of the same protein in a very short time.

Translation

In translation, mRNA is sent to the cytoplasm, where it bonds with ribosomes, the sites of protein synthesis. Ribosomes have three important binding sites: one for mRNA and two for tRNA. The two tRNA sites are labeled the A site and P site.


Once the mRNA is in place, tRNA molecules, each associated with specific amino acids, bind to the ribosome in a sequence defined by the mRNA code. tRNA molecules can perform this function because of their special structure. tRNA is made up of many nucleotides that bend into the shape of a cloverleaf. At its tail end, tRNA has an acceptor stem that attaches to a specific amino acid. At its head, tRNA has three nucleotides that make up an anticodon.


An anticodon pairs complementary nitrogenous bases with mRNA. For example if mRNA has a codon AUC, it will pair with tRNA’s anticodon sequence UAG. tRNA molecules with the same anticodon sequence will always carry the same amino acids, ensuring the consistency of the proteins coded for in DNA.

The Process of Translation

Translation begins with the binding of the mRNA chain to the ribosome. The first codon, which is always the start codon methionine, fills the P site and the second codon fills the A site. The tRNA molecule whose anticodon is complementary to the mRNA forms a temporary base pair with the mRNA in the A site. A peptide bond is formed between the amino acid attached to the tRNA in the A site and the methionine in the P site.


The ribosome now slides down the mRNA, so that the tRNA in the A site moves over to the P site, and a new codon fills the A site. (One way to remember this is that the A site brings new amino acids to the growing polypeptide at the P site.) The appropriate tRNA carrying the appropriate amino acid pairs bases with this new codon in the A site. A peptide bond is formed between the two adjacent amino acids held by tRNA molecules, forming the first two links of a chain.


The ribosome slides again. The tRNA that was in the P site is let go into the cytoplasm, where it will eventually bind with another amino acid. Another tRNA comes to bind with the new codon in the A site, and a peptide bond is formed between the new amino acid to the growing peptide chain.


The process continues until one of the three stop codons enters the A site. At that point, the protein chain connected to the tRNA in the P site is released. Translation is complete.


Make a Free Website with Yola.