RFLP

DNA Restriction Mapping

   Techniques based on the use of restriction endonucleases and chromatographic separation of DNA fragments by size allow very precise gene mapping. The same basic techniques - e.g. restriction endonuclease digestion, separation of nucleic acid fragments by size, using radioactivity or sometimes specially constructed analogs to label molecules of interest and to distinguish them from other similar molecules, comparison of DNA sequences by hybridization, etc. - in a variety of combinations and with different substrates, enable us to accomplish a wide range of goals. Whole chromosomes can be mapped, DNA sequence analysis allows intragenic mapping of actual base sequences within or surrounding a transcribed region, allelic variation can be determined at a molecular level, even in the absence of any measurable phenotypic difference. This lab exercise will serve as an introduction to some of these techniques and their possible uses, focusing primarily on molecular mapping methods.

   Restriction endonucleases are enzymes which recognize, bind, and hydrolyze DNA at specific nucleotide sequences. Each restriction endonuclease recognizes one specific nucleotide sequence, usually 4-6 base pairs (bp) long. Hydrolysis may occur at one or the other end of this sequence, or internally, depending on the enzyme. (see Fig 1.) Thus, complete digestion of DNA by a single restriction endonuclease results in a distinct set of fragments representing the intervals between restriction endonuclease recognition sites. When separated by size, this pattern is characteristic of that DNA - restriction endonuclease combination.

   The most common method for separating DNA fragments by size is gel electrophoresis (similar methods are used to separate RNA or protein). A mixture of DNA fragments is placed on a strip of gel (polyacrylamide and agarose are the most common gel substances). A voltage potential is established across the gel which draws charged particles through the gel. The movement of larger molecules is retarded more than that of smaller molecules, so that eventually distinct size classes may be distinguished in the gel. A similar method, called liquid chromatography, using a particulate matrix through which substances percolate at rates based on their size and shape is sometimes used instead - mostly for automated DNA sequencing.

   Before DNA can be sequenced it is necessary to generate a quantity of identical, fairly short pieces because the techniques can only accommodate a few hundred bp at a time. DNA fragments can be cloned by inserting them individually into a vector - usually a plasmid, episome, or viral DNA - and then the altered vector is placed into an appropriate host cell - usually bacteria but sometimes a eukaryotic cell line. The vector can replicate independently of the host cell cycle, so large numbers may be generated in each cell. Thus as the culture grows, the number of copies of the DNA of interest grows even faster. Each colony derived from a single altered cell consists entirely of cells genetically identical to each other and to the original cell. Thus, all contain the vector containing the inserted sequence. Large quantities of this DNA can be generated under controlled conditions. The vector can be constructed so that the DNA of interest can be recovered readily, in large quantities, with little contamination.

   A piece of DNA prepared in this way can be sequenced by one of two methods developed by Sanger, and Maxam and Gilbert, respectively.   Briefly, in the Sanger method the DNA to be sequenced is replicated using (a) a 5' end label and (b) 2',3'-dideoxy NTPs to terminate replication. Resulting fragments are analyzed by chromatographic separation. This is the method which has proven to be most easily automated through the use of fluorescent, rather than radioactive, labels. The Maxam and Gilbert method involves (a) radioactive label on the 5' end, and (b) partial digestion with a different sort of restriction endonuclease which cleaves after one or another nucleotide residue in a base-specific manner.

   Analysis of base sequences is very precise, but other methods are also very useful. Analysis of long pieces of DNA can be done using restriction endonucleases without the need to clone or sequence the DNA. Using one restriction endonuclease at a time, target DNA is hydrolysed. This generates a mixture of fragments which can be separated by gel electrophoresis and visualized as a characteristic pattern depending on the DNA and enzyme used. One restriction endonuclease cuts at one specific sequence which occurs at intervals along the DNA. These intervals are unique to a DNA sequence. Thus the fragments' sizes, representing these intervals between restriction sites, will characterize the particular DNA. Using several restriction endonucleases in separate experiments generates a restriction map of the DNA which is unique to that DNA sequence. These maps can be used to identify a particular species of DNA. One increasingly common use of such a map is called restriction endonuclease fragment length polymorphism (RFLP) analysis.

   RFLP analysis has been used successfully to identify different alleles in a population, and ultimately to locate the genes. Specifically, several human genes responsible for genetic abnormalities or diseases are being investigated using this methodology. The approach requires that an abnormality is identified as being inherited in a family. Samples of DNA from as many members of the family are prepared for analysis by cleavage with a series of endonucleases and separation of the resulting fragments by gel electrophoresis. From the pedigree it is possible to infer those who are homozygous normal, and at least those who express the trait, if not some heterozygotes. Thus, DNA from individuals known to carry at least one mutant allele is compared to DNA from individuals known to be homozygous normal. The restriction digest patterns are compared. Any variant DNA which either lacks one of the restriction sites, or has an extra one, compared to the normal DNA, will generate a different pattern of fragments when subjected to the same hydrolysis and electrophoresis. If there has been a sizable insertion or deletion between two restriction sites, that altered pattern can also be identified. Thus, for the family under study, a recognizable RFLP pattern can be identified which represents the normal allele, and a distinct pattern would represent the mutant allele. It is important to note that these patterns are valid within a family only, because it is necessary to reduce the likelihood of other genetic variation being superimposed on the relevant restriction pattern.

   One other step is necessary for analysis of RFLPs generated from total eukaryotic genomic digests. Eukaryotic genomes are so large that a typical restriction digest appears as a smear on a normal gel because there are so many restriction sites, and so many fragments. In order to generate an interpretable pattern it is necessary to limit the number of fragments visualized at any one time. The most convenient way to do this is to use a DNA probe which will hybridize to a specific DNA sequence. The probe is labeled, for example with radioactivity. After the restriction digest is electrophoresed, the DNA is transferred to a nitrocellulose membrane, and then exposed to the labeled probe. The labeled probe DNA will hybridize - form base pairs with - its complementary sequences, which are found, on only a subset of the restriction fragments. An X-ray film placed over the membrane will be exposed by the radioactive emission, so bands will be seen only where there was radioactivity. Thus a few bands in the smear will be visualized, while the rest remains unhybridized, unlabeled, and invisible.

   Whenever possible the probe is chosen because it contains sequences believed to be related to the gene of interest or close to the gene on the chromosome (from other sorts of analyses). So what is actually being mapped in these experiments is an RFLP which appears to be LINKED to the gene of interest. The presence or absence of the RFLP correlates with the presence or absence of the mutant trait. The RFLP, then, is an marker in its own right. For this analysis to be useful the RFLP need not be part of the gene under study. However, the closer the mutated restriction site to the gene of interest, the more closely linked will be the two traits. If RFLP analysis is being used to predict the presence of one or two mutant alleles, the closer the linkage the more accurate the prediction. If the goal is to map the gene, it is clear that the closer the two alleles, the more accurate the map.

   Restriction mapping is also used extensively in genetic engineering. There are several ways in which restriction endonucleases are used. One is to identify a DNA which one has or hopes to manipulate. Restriction endonucleases are used to accomplish the manipulation as well. Since they cleave DNA at specific sequences, they permit the combination of two pieces of DNA at a predetermined specific site. Today's lab exercise will illustrate some of these methods.

   OH - MY - GOODNESS !!! DISASTER !!!   We have four plasmids in the stockroom freezer and somehow the labels have all fallen off!! How can we tell which is which? Luckily we have, on file, partial restriction maps for each one. [ these must be obtained from the instructor ] So we quickly get out the three restriction endonucleases used for the maps, and digest samples of each unknown plasmid with each of the three enzymes (all in separate tubes). The resulting fragments from each digest are separated by electrophoresis and the results presented below. Match the digest data with the plasmid map by inferring the approximate relative sizes of fragments generated by each enzyme. Write the name of the plasmid on the line below each set of data (three lanes for each plasmid).

   Now that we have relabeled our plasmid stocks with indelible marker, we are ready to do experiments. Our research demands that we genetically modify one or another of these plasmids in different ways in order to test the function of certain DNA sequences. We must analyze the products to identify which samples contain the desired alterations. In order for the technician to identify the correct pattern, predict the patterns that would be generated by the following mutations. Sketch your predicted electrophoretic patterns in the empty gel lanes.

A. The region of pA1 indicated by the box and * is removed from pA1. What would digests of pA1 with the same 3 restriction endonucleases look like now?

B, C. The fragment removed from pA1 is inserted into a Pst I site in pR43. Draw both possible outcomes. Can you distinguish between them?

D. The Bgl II site indicated by an * in pX3 suffers a point mutation which renders it unrecognizable to Bgl II. How would we know?

1. Why are sites sensitive to some restriction endonucleases found more frequently than sites sensitive to others? Given an enzyme with known specificity, how would you predict whether it is likely to find many target sites or few?

2. Genetic manipulation can be accomplished using restriction endonucleases to cut DNA at specific points. How are these enzymes used to facilitate linking together DNA from different sources?

3. What is meant by "blunt end" cut? -- "sticky end"?

4. Why would an RFLP pattern shown to correlate with a genetic abnormality in one family be a poor indicator in another family? Under what circumstances would you expect an RFLP pattern to be a good indicator in most or all people?

5. What are some of the arguments for and against genetic testing?
•   Which sorts of diseases or predispositions would you want to be tested for? Which would you rather not know?
•   Should prospective employers be allowed to take a DNA sample for analysis? Why or why not?
   •   insurance companies?
   •   the government?
   •   an information clearinghouse that promises confidentiality?
   •   a sperm or ovum bank?
   •   an adoption agency - from the parents of the child so that the information is available in case of future need?
   •   etc.?