20.109(S09):Induce protein and evaluate DNA (Day5)

Introduction
Last time you transformed your mutant DNA into BL21(DE3) cells. The colonies that arose were moved to liquid cultures, and today you will add IPTG to these cultures to induce protein expression by the bacteria. Next time you will purify the resultant protein. I won’t shy away from telling you that there are many things that can go wrong at this stage! However, each one is certainly a learning experience.

As evidenced by Nagai’s work, wild-type inverse pericam is not toxic to BL21(DE3) cells. Although it is unlikely for your small mutation to dramatically change this fact, in general a novel protein may turn out to be toxic. If this is the case, only very small amounts of protein are produced before the bacteria die. Keep in mind that overexpressing a single protein may come at the expense of producing proteins needed for survival, and will most likely cause cell death eventually; however, toxic proteins hasten this demise. Aberrant toxicity can sometimes be alleviated by reducing the culture temperature (e.g., to 30 &deg;C).

Based on its fluorescence activity, wild-type inverse pericam allows proper folding of (cp)EYFP, and based on its response to calcium, it also allows calmodulin to fold. One problem you may encounter is that your mutant proteins will no longer fold correctly. Since you made mutations in the calcium sensor part of IPC, rather than the fluorescent part, it is unlikely that your protein will destroy EYFP fluorescence. However, a common problem with misfolded proteins is the formation of insoluble aggregates, due for instance to improperly exposed hydrophobic surfaces. Proteins can be purified from these aggregates – called inclusion bodies – but the process is more labour-intensive than for soluble proteins. (The proteins must be extracted under more harsh conditions than you will use next time, then purified under denaturing conditions, before finally attempting to renature the proteins.) Inclusion bodies sometimes form simply due to very high expression of the protein of interest, causing it to pass its solubility limit. This outcome can be prevented by lowering the culture temperature or time, the amount of IPTG, or the growth phase of the bacteria.

One final point to keep in mind is that not all proteins can be produced in bacteria. Eukaryotic proteins that require post-translational modifications (such as glycosylation) for activity require eukaryotic hosts (such as yeast, or the ubiquitous CHO – Chinese hamster ovary – cells). Sometimes eukaryote-derived proteins will be truncated or otherwise mistranslated by E. coli due to differential codon bias; errors in translation can be prevented by providing additional tRNAs to the culture or directly to the bacteria via plasmids. Despite all this complexity, prokaryotic hosts have been plenty good enough to produce proteins for certain therapies, notably the cytokine G-CSF for patients needing to replenish their white blood cells (e.g., after chemotherapy), sold as Neupogen by Amgen.

After you induce your cells with IPTG, you will let the resultant protein factories do their work for 2-3 hours. During this time, you will evaluate the DNA from your two X#Z candidates (and from the S101L mutant). First, you will run your diagnostic digests from last time out on a gel. The banding patterns will allow you to determine (or diagnose) whether either of your putative X#Z mutants actually contains the new restriction site that you introduced. Of course, there is a slim possibility that the silent mutation was incorporated but the non-silent mutation wasn’t. To get more direct evidence for whether the site-directed mutagenesis worked, you will analyze data from the sequencing reactions that you set up last time.



The invention of automated sequencing machines has made sequence determination a relatively fast and inexpensive endeavor. The method for sequencing DNA is not new but automation of the process is recent, developed in conjunction with the massive genome sequencing efforts of the 1990s. At the heart of sequencing reactions is chemistry worked out by Fred Sanger in the 1970s which uses dideoxynucleotides (see schematic above left). These chain-terminating bases can be added to a growing chain of DNA but cannot be further extended. Performing four reactions, each with a different chain-terminating base, generates fragments of different lengths ending at G, A, T, or C. The fragments, once separated by size, reflect the DNA’s sequence. In the “old days” (all of 10 years ago!) radioactive material was incorporated into the elongating DNA fragments so they could be visualized on X-ray film (image above center). More recently fluorescent dyes, one color linked to each dideoxy-base, have been used instead. The four colored fragments can be passed through capillaries to a computer that can read the output and trace the color intensities detected (image above right). Your sample was sequenced in this way on an ABI 3730 DNA Analyzer.

Analysis of sequence data is no small task. “Sequence gazing” can swallow hours of time with little or no results. There are also many web-based programs to decipher patterns. The nucleotide or its translated protein can be examined in this way. Thanks to the genome sequence information that is now available, a new verb, “to BLAST,” has been coined to describe the comparison of your own sequence to sequences from other organisms. BLAST is an acronym for Basic Local Alignment Search Tool, and can be accessed through the National Center for Biotechnology Information (NCBI) home page.

You might be wondering why you would ever go through the trouble of designing and performing diagnostic digests, when sequencing is relatively simple and yields more information. Here, the idea of scale becomes important. Sequencing costs $8 per reaction, which can add up if you need to examine, say, 10 or more candidates. Agarose gel electrophoresis, by comparison, costs perhaps $1 per candidate. Since both methods require DNA isolation, one is not dramatically more labour intensive than the other. (A method called colony PCR avoids this labour. Can you guess what it might entail?) Finally, banding patterns can give a quick readout of many candidate colonies compared to the time it takes for the individual sequencing analyses you will perform today. Of course, there's no reason one couldn't automate the analysis process with a bit of (computer, not DNA) code!

Part 1: Cell measurement and IPTG induction

 * 1) For each mutant protein (X#Z candidates 1 and 2, S101L), you will be given a 6 mL aliquot of BL21(DE3) cells carrying the mutant plasmid; you will also receive a tube of BL21(DE3) carrying wild-type inverse pericam. These cells should be in or close to the mid-log phase of growth for good induction, just as they were for transformation. Like last time, check the OD600 values of your cells until they fall between 04.and 06. (Better to overshoot a little than undershoot.)
 * 2) Once your cells have reached the appropriate growth phase, set aside - on ice - 1.5 mL of cells from each tube as a control (no IPTG) sample. Then take an aliquot of cold IPTG (0.1 M), and add to your remaining cells at a final concentration of 1 mM. You should prepare three mutant and one wild-type tube.
 * 3) Return your tubes to the rotary shaker in the 37 &deg;C incubator, and note down the time.
 * 4) While your IPTG-stimulated cells are producing protein, you will analyze the sequence data and digests of the plasmids they are carrying. At the end of the day, you will choose only one of your X#Z candidates to save, and give the other sample to the teaching faculty to be bleached and thrown away.

Part 2: Run diagnostic gel
The scheme below assumes that both Digest 1 (D1, used to analyze the S101L mutant) and Digest 2 (D2, used to analyze your X#Z mutant candidates) use only one enzyme. If you are doing double-digests and need to run single enzyme controls, hopefully you spoke to the teaching faculty about this last time. Load your samples on a 1% agarose gel in the following order:

Once all the samples are loaded, the power will be applied (100 V for 45 minutes) and the gel will be photographed. When the gel is ready, you will compare the band sizes in the photograph with the expected band sizes that you previously calculated. In the meantime, you can analyze your sequence data.

Part 3: Analyze sequence data
Your goal today is to analyze the sequencing data for three samples - two independent colonies from your X#Z mutant, and one S101L clone for practice - and then decide which colony to proceed with for the X#Z mutant. You will want to have [[Media:S09_IPC-for-sequencing.doc | this document]] handy, and to mark the expected location of your mutation with bold text before proceeding. (Just compare to your annotation of [[Media:S09_annotated-IPC.doc | the old IPC sequence document]], using Word Count or the Find tool.) The new file contains the inverse pericam sequence as before, but also the flanking sequences (from the pRSET vector) before and after IPC, which are separated from IPC by a blank line. The second page of the document contains the reverse complement of IPC/pRSET, which was gotten using this website. Be sure to bold your mutant codon in the reverse complement sequence as well.

The data from the MIT Biopolymers Facility is available at this link. Choose the "Login to dnaLIMS" link and then use "astachow" and "be109" to login. At the bottom of the left panel should be a link to download your sequencing results. First, select order #15626 and sample S101LSM, then click "submit." Later you can select the order # for your own mutant. The quickest way to start working with your data is to follow the "view" link. From this link you'll see the sequencing traces and can click on "sequence text" to view it.

Rather than look through the sequence to magically find the relevant portion, you can align the data you just got with the standard inverse pericam sequence and the differences will be quickly identified. There are several web-based programs for aligning sequences and still more programs that can be purchased. The steps for using one web-based tool are sketched below.

Align with "bl2seq" from NCBI
 * 1) The alignment program can be accessed through the NCBI BLAST page or directly from this link
 * 2) To allow for gaps in the sequence alignment, uncheck the "filter" box. All the other default settings should be fine.
 * 3) Paste the sequence text from your sequencing run into the "Sequence 1" box. This will now be the "query." If there were ambiguous areas of your sequencing results, these will be listed as "N" rather than "A" "T" "G" or "C" and it's fine to include Ns in the query.
 * 4) Paste the inverse pericam sequence into the "Sequence 2" box. For samples probed with the forward primer, use the regular IPC sequence; for those using a reverse primer, you should put in the reverse complement. Which alignment will be more useful depends on the location of your mutation.
 * 5) Click on Align. Matches will be shown by vertical lines between the aligned sequences. You should see a long stream of matches, followed by lots of errors in the last ~200bp of the sequence – ignore the error-ridden part of the data, as it may not accurately reflect your mutant plasmid. In this stream of matches, the 1-3 missing lines indicating your mutant codon should stand out. If they don’t, use the numbering or Find tool to locate the appropriate codon.
 * 6) You should print a screenshot of each alignment to pdf (and to paper if you desire). These will be used to prepare a figure showing what you found today. You might want to email yourself the alignment screen shots or post them to your wiki userpage.

If both colonies for your mutant have the correct sequence, flip a coin and proceed with one or the other; ditto if both are inconclusive. If one appears right and the other doesn’t, of course proceed with the former. Finally, if both are clearly wrong, talk to a member of the teaching faculty.

For your reference, another popular sequence alignment program is "CLUSTAL-W" from EMBL-EBI, located here.

Part 4: Observe mutant colonies
Last time you transformed BL21(DE3) cells with three different plasmids (two candidates for the X#Z mutant, and one S101L clone). Compare the relative colony formation of cells carrying the different plasmids. If all the plates have dense cell growth, there is no need for you to get an exact colony count; just do your best to get a relative estimate, and describe any findings in your notebook.

Part 5: Cell observation and collection

 * 1) After ~2-3 hours, you will pour 1.5 mL from each tube (from Part 1) into a labeled eppendorf, then spin for 1 min. at maximum speed. Save the other 3 mL!
 * 2) Aspirate the supernatant from each eppendorf, using a fresh yellow pipet tip on the end of the glass pipet each time.
 * 3) Observe the colour of each of your pellets, and compare to the examples on today's Talk page. If the wild-type and both mutant pellets all appear yellow-greenish to the eye, proceed as follows:
 * 4) *Do NOT toss the rest of the liquid cultures. First, measure their OD600 values, according to part 6 of today's protocol.
 * 5) *Next, pour 1.5 mL more of the relevant liquid culture on top of each pellet, spin again, and aspirate the supernatant.
 * 6) *The last 1.5 mL of culture may be aspirated in your vacuum flask, to be later bleached and discarded.
 * 7) If one or more of your pellets are white or only dimly coloured, please ask one of the teaching staff too show you the room temperature rotary shaker. You will continue to grow your bacteria overnight. Tomorrow morning, the teaching staff will collect your pellets for you and freeze them. As you can see above, the +IPTG pellets are from 3 mL of culture, while the -IPTG pellets come from 1.5 mL of culture.

Part 6: Preparation for next time
Next time, you will lyse your bacterial samples to release their proteins, and run these out on a protein gel. In order to compare the amount of protein in the -IPTG versus +IPTG samples, you would like to normalize by the number of cells. At this point, you may have only three samples ready (-IPTG only), or you may have all six. In either case, measure the OD600 of a 1:10 dilution of cells for each finished sample (for -IPTG you have done so already), and write this number down in your notebook and on today's Talk page. Then spin down the cells and aspirate the supernatant. Give the cell pellets to the teaching faculty; they will be stored frozen. (Be sure to make a 2X pellet for the +IPTG samples.)

For next time

 * 1) Prepare a figure depicting the results of your diagnostic digest. Keep in mind the best practices for figures that we have discussed, from content to presentation. For example, the content should include the expected and observed band sizes, and the presentation should include labeling of lanes as well as a few reference bands.
 * 2) Write a few sentences of results and discussion text (clearly delineating which is which) to accompany your diagnostic digest figure.
 * 3) Day 6 of this module will be an intense one, and has the potential to run long. You will do yourself a great service if you carefully read the text in advance, then consider what preparations you will need to do on that day, and organize your thoughts in your notebook and/or with your partner. This part will not be collected or evaluated.

Reagent List

 * IPTG (isopropyl &beta;-D-1-thiogalactoside), 0.1M


 * Loading Dye
 * 0.25% xylene cyanol
 * 30% glycerol
 * RNase


 * 1% and 1.2% agarose gels with 0.3 &mu;g/mL ethidium bromide


 * Gels made and run in 1X TAE buffer
 * 40 mM Tris
 * 20 mM Acetic Acid
 * 1 mM EDTA, pH 8.3