QRT-PCR: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
mNo edit summary
(34 intermediate revisions by 3 users not shown)
Line 1: Line 1:
'''Quantitative reverse transcriptase PCR''' (QRT-PCR) is a [[PCR techniques|PCR technique]] used to determine the amount of cDNA in a sample. It is the most commonly used form of [[quantitative PCR]] (qPCR). This technique is also called ''real-time reverse transcriptase PCR''.
{{back to protocols}}
'''Quantitative reverse transcriptase PCR''' (QRT-PCR or qRT-PCR) is a [[PCR techniques|PCR technique]] used to determine the amount of cDNA in a sample. It is the most commonly used form of [[quantitative PCR]] (qPCR). This technique is also called ''real-time reverse transcriptase PCR''.
 
== Comparison of normalisation methods ==
There is an ongoing debate what is the best way to normalise qPCR data. Reference genes are the most common method, although single unverified reference genes invalidate the qPCR data generated. Total RNA, ribosomal RNA, and genomic DNA have been suggested as alternative methods.
 
=== Reference genes ===
Most common method. Frequently, a panel is used for normalization, e.g. [http://www.bioline.com/h_prod_detail.asp?user_prodname=Human%20Endogenous%20Control%20Gene%20Panel] not just a single reference gene and including data on suitability as reference genes. Often ''housekeeping gene''  is used here instead of reference gene but the term is poorly defined and can be misleading. It is to be noted that panels are often composed of genes that are supposed to be stable based on their function. However, more than 100 peer-reviewed articles report problems related to genes chosen from a panel, because they were not suitable for a particular context. A recent approach is to select a reference gene based on its stability across microarrays done within one's condition of interest. There is a public tool called RefGenes that searches a microarray database of more than 50,000 arrays to identify genes that are stable across subsets of conditions. It is available at the Genevestigator website [http://www.genevestigator.com].
 
=== RNA ===
Total rRNA [http://scholar.google.com/scholar?hl=en&lr=&safe=off&cluster=12435126891737656303] [http://scholar.google.com/scholar?hl=en&lr=&safe=off&cluster=9547016096229453970], or total RNA. Drawback: rapidly dividing cells will have more rRNA and different rRNA/mRNA ratio which will complicate comparison; difference in cDNA synthesis not taken into account.
 
=== Genomic DNA ===
Genomic DNA or cell number. Drawbacks: RNA degrades faster than RNA which can distort the data; sample cannot be DNase treated; efficiency of cDNA synthesis not taken into account.


== Reference mRNAs ==
== Reference mRNAs ==
{{main|Choosing reference genes for qPCR normalisation}}


{{main|QRT-PCR reference genes}}
Picking reference genes will make or break your quantification via qPCR (real time PCR). If you pick only one reference gene and your pick is not constant across different conditions or samples, your results will be skewed. Choose several reference genes and check whether they satisfy the criteria for a good reference gene. Some commonly used reference genes, like 18S and GAPDH, are known to be problematic but continue to be used.


Picking reference genes will make or break your quantification via qPCR (real time PCR). If you pick only one reference gene and your pick is not constant across different conditions or samples, your results will be skewed. Choose several reference genes and check whether they satisfy the criteria for a good reference gene.
*'''[[User:Ajeffs|Ajeffs]]''' 06:55, 21 April 2007 (EDT): Screen a handful of ref genes, select the most stable using [http://medgen.ugent.be/~jvdesomp/genorm/ genorm], bestkeeper etc, use at least 2 reference genes for subsequent reactions and normalisation. Inlcude your genorm M values when publishing qPCR data.


Some commonly used reference genes are known to be problematic but continue to be used. 18S is a bad choice because it is typically much more abundant than the mRNA you are probing for (copy numbers should be in the same ballpark) and 18S is degraded differently than the average mRNA. Another common but bad choice is GAPDH. It's not just a metabolic enzyme but has many other functions [http://en.wikipedia.org/wiki/Glyceraldehyde_3-phosphate_dehydrogenase#Additional_functions] and is therefore regulated in many ways. Its levels are not constant [Zhu 2001 PMID 11237753] and problems using GAPDH as a qPCR reference gene have been published [Ke 2000 PMID 10799275, Suzuki 2000 PMID 10948434].
== Primer selection ==
{{main|Choosing primers for qPCR}}


== Primer selection ==
Choosing suitable primers is an early crucial step in your qPCR experiment. Reusing a tested primer pair from a repository or publication can save you some time. Otherwise primer selection from scratch is similar to that for a standard qualitative PCR experiment but the product size is typically much smaller (below 200nt) and the amplification characteristics of the primer have to be rigorously tested.
=== Primer repositories ===
 
*[http://medgen.ugent.be/rtprimerdb/ RTPrimerDB] is an excellent database storing known primer and probe sequences for popular techniques (SYBR Green I, Taqman, Hybridisation Probes, Molecular Beacon). This can help you save the time of designing and testing your own primers. It is also intended to facilitate standardisation among different laboratories. The database is hosted by the Center for Medical Genetics, Gent, Belgium. Please submit you tested primer pairs.
== Quantification methods ==
There are 3 common quantification methods. The standard curve method is the only one that gives you are absolute concentration. Both the Pfaffl method and the ΔΔC<sub>t</sub> method produce relative data with the Pfaffl method being superior.
 
=== Standard curve method ===
* requires template at known concentration (e.g. cDNA or TA cloned PCR product)
* requires dilution series of known template for standard curve (more wells)
* yields absolute concentrations by comparing unknown samples to known
 
=== Pfaffl method ===
* requires that primer efficiency be known but needs to be determined only once with a standard curve or a different method
* produces relative amount (e.g. treated is 2x untreated)
 
(named after the inventor; see Pfaffl 2001 PMID 11328886)
 
=== ΔΔC<sub>t</sub> (delta delta Ct) ===
 
* easiest, oldest, least reliable
* assumes that primers for unknown and reference gene have very similar efficiency
: or that v little correction is necessary (i.e. reference gene almost same level)
* yields relative amounts
 
(Ct = cycle threshold; point when fluorescence reading surpasses a set baseline)
 
== Primer efficiency estimation ==
 
A Ct difference of 1 between two samples has a different meaning depending on the efficiency of the primers used. If primers are 100% efficient, then ΔC<sub>t</sub> = 1 means one sample has twice the amount of template compared to the other. The simple ΔΔC<sub>t</sub> method, described above, often wrongly assumes perfect efficiency. It is better to experimentally verify the primer efficiency and use the Pfaffl method instead. The standard method takes the primer efficiency into account via the standard curve run with each sample. However, primer efficiencies in the standard curve dilutions and the actual samples are not necessarily the same.
 
=== Linear regression on dilution curve C<sub>t</sub> data ===
 
Primer efficiencies can be calculated by making a dilution series, calculating a linear regression based on the data points, and inferring the efficiency from the slope of the line. For a base 10 logarithm the formulae is:
 
:'''efficiency = 10^(-1/slope)'''
 
Slopes between -3.3 and -4 will thus give you estimated primer efficiencies between 100% and 78% respectively. It can happen that the calculated efficiencies are above 100% [http://www.protocol-online.org/forums/index.php?showtopic=7611] [http://www.protocol-online.org/forums/index.php?showtopic=22170] [http://www.protocol-online.org/forums/index.php?showtopic=28319]. This may be due to incorrect template concentrations, too concentrated template, inhibition of the PCR reaction, unspecific PCR amplification, mistakes in the calculation [http://www.protocol-online.org/forums/index.php?showtopic=29925], etc.


*[http://www.realtimeprimers.org/ RealTimePrimers.org] is another such database but it contains only few primers.
See a figure explaining the fitting process from the Hunts' qPCR tutorial [http://pathmicro.med.sc.edu/pcr/PCR_Standard_Curve2.htm].


*[http://pga.mgh.harvard.edu/primerbank/ PrimerBank]. From the website "''PrimerBank is a public resource for PCR primers. These primers are designed for gene expression detection or quantification (real-time PCR). PrimerBank contains over 306,800 primers covering most known human and mouse genes. ...The primer design algorithm has been extensively tested by real-time PCR experiments for PCR specificity and efficiency. We have tested 26,855 primer pairs that correspond to 27,681 mouse genes by Real Time PCR followed by agarose gel electrophoresis and sequencing of the PCR products. The design success rate is 82.6% (22,187 successful primer pairs) based on agarose gel electrophoresis''". Don't neglect to check the efficiency and specificity of the oligos yourself though. Link to [http://nar.oxfordjournals.org/cgi/content/abstract/31/24/e154 paper].
=== Efficiency estimation based on the kinetics of single PCR runs ===


*[http://primerdepot.nci.nih.gov/ qPrimerDepot]. From the website "''This database provides qRT PCR primers for 99.96% human RefSeq sequences. For 99% of intron-bearing genes, the PCR product will cross an exon-exon border which overlaps one of the largest introns. All primers have annealing temperatures approximately 60C''". Link to [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1635330 paper].
Efficiency (and C<sub>t</sub> values) can also be calculated from the fluorescence data of a single PCR run or preferably replicates of the same PCR. The '''Miner''' algorithm (PMID 16241897) is an example for this type of method and can be used online at [http://www.miner.ewindup.info/miner/].


=== Primer design ===
== qPCR data quality ==
An excellent and fast way to select primers is with the free online-tool [http://fokker.wi.mit.edu/primer3/input.htm Primer3], currently in v0.3. [http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi Primer3Plus], a variation of Primer3 has qPCR settings. Or just apply the following or similar settings to Primer3:
*pair towards 3' end (often more specific, some cDNAs don't contain)
*pair separated by an exon-exon boundary (reduces genomic background) e.g. last exon & penultimate
*amplified region must be no biger than 200 bp; usally 60-150 bp
*GC content: 50-60%
*min length: 18, max length 24 (best: 20 nt)
*melting temperature: min 60, max 63, best 60
*max Tm difference: 10 (shouldn't be more than 1 in final pair)
*max 3' self complementary: 1
*max poly-x: 3


Verify by blasting the primers sequences. Target gene should come out with the lowest E value. No other gene should be close. Also check whether possible isoforms will be detected by the candidate primer pair. See also: [[Designing primers]]
=== Sources of variability: Operator ===


== Sources of variability ==
Due to the small amount of liquid handled and the sensitivity of the technique, operator variability is high. Bustin  reports that the same qPCR experiment repeated by 3 people using the same reagents lead to very different copy number estimations [Bustin 2002 PMID 12200227, figure 3]:  
=== Operator ===
Due to the small amount of liquid handled and the sensitivity of the technique, operator variability is high. Bustin  reports that the same qPCR experiment repeated by 3 people using the same reagents lead to very different copy number estimations [Bustin 2002 PMID 12200227]:  
* person A: 8·7 × 10<sup>5</sup>  
* person A: 8·7 × 10<sup>5</sup>  
* person B: 2·8 × 10<sup>5</sup> B is 1/3 of A!
* person B: 2·8 × 10<sup>5</sup> different by a factor of 3!
* person C: 2·7 × 10<sup>3</sup> C is 3/1000 of A!!
* person C: 2·7 × 10<sup>3</sup> different by a factor of 300!!
 
=== Sources of variability: Reagent lots/age ===
 
Different lots of reagents can lead to different results. Experiment repeated by same operator 5 times, same RNA sample, different kits; values are copies/μg total RNA:
* kit 1: 13±32 × 10<sup>7</sup>
* kit 2: 5.4±1.6 × 10<sup>7</sup> - different by a factor of 2.4
 
Similar experiment with old (9 months 4°C) and new probe (3 months 4°C), values are copies/μg total RNA:
* old: (5.6 ± 1.3) x 10<sup>3</sup>
* new: (3.8 ± 0.6) x 10<sup>8</sup> - different by a factor of 100'000!!
 
both experiments above from [Bustin 2002 PMID 12200227, figure 4]
 
=== More information ===
* [http://www.dddmag.com/reliability-of-qPCR-data.aspx Partial transcript of a webcast discussing qPCR data quality]


== Notes ==
== Notes ==
Line 53: Line 104:
*#Ensure that the concentration of deoxynucleotides doesn't run out <cite>MeasuringGeneExpression</cite>.
*#Ensure that the concentration of deoxynucleotides doesn't run out <cite>MeasuringGeneExpression</cite>.


== Specific protocols ==
== Appendix ==
=== Specific protocols ===
*[[Endy:Real-time RT-PCR]] (in progress)
*[[Endy:Real-time RT-PCR]] (in progress)
*[[qRT-PCR/Two tubes]] (in progress)
*[[qRT-PCR/Two tubes]] (in progress)
*[[qRT-PCR/Single tube]]
*[[qRT-PCR/Single tube]]


== See also ==
=== See also ===
* [[Q-PCR]] and [[Real-time PCR]]
* [[Q-PCR]] and [[Real-time PCR]]
* [[PCR techniques]]
* [[PCR techniques]]


== External links ==
=== External links ===
* An excellent, detailed [http://pathmicro.med.sc.edu/pcr/realtime-home.htm Q-PCR tutorial] by Margaret and Richard Hunt, University of South Carolina
* venerable [http://tech.groups.yahoo.com/group/qpcrlistserver/ qpcrlistserv]. Anyone doing qPCR should be subscribed to this list.
* The venerable [http://tech.groups.yahoo.com/group/qpcrlistserver/ qpcrlistserv]. Anyone doing qPCR should be subscribed to this list.
 
* [[Image:3stars.png]] excellent, detailed [http://pathmicro.med.sc.edu/pcr/realtime-home.htm qPCR tutorial] with many figures by Margaret and Richard Hunt, University of South Carolina
* [[Image:3stars.png]] detailed [http://qcom.etsu.edu/mbcf/pcr.htm qPCR tutorial] by East Tennessee State University facility
* [[Image:3stars.png]] excellent, up-to-date [http://www.dddmag.com/reliability-of-qPCR-data.aspx discussion on how to do qPCR experiments] with Maurice Exner, Mark Anderson, and Stephen Bustin
* [[Image:2stars.png]] [http://www.gene-quantification.info/ gene-quantification.info - a site dedicated to qPCR] edited by M W Pfaffl, Technical University Munich (very busy design, lots of advertisement but a real trove of qPCR info)
* [[Image:2stars.png]] [http://dorakmt.tripod.com/genetics/realtime.html qPCR (real-time PCR) backgrounder] without figures but with many links by Tevfik Dorak
* [[Image:2stars.png]] [http://www.appliedbiosystems.com/support/tutorials/pdf/rtpcr_vs_tradpcr.pdf 15 page comparison of PCR vs qPCR] with several figures by Applied Biosystems
* [[Image:1star.png]] [http://www.ambion.com/techlib/basics/rtpcr/#10 short qRT-PCR summary] by Ambion


== References ==
=== References ===
<biblio>
<biblio>
#MeasuringGeneExpression isbn=0415374723
#MeasuringGeneExpression isbn=0415374723
</biblio>
</biblio>


[[Category:Protocol]] [[Category:In vitro]] [[Category:RNA]]
[[Category:Protocol]] [[Category:In vitro]] [[Category:RNA]] [[Category:DNA]]

Revision as of 23:46, 4 July 2010

back to protocols

Quantitative reverse transcriptase PCR (QRT-PCR or qRT-PCR) is a PCR technique used to determine the amount of cDNA in a sample. It is the most commonly used form of quantitative PCR (qPCR). This technique is also called real-time reverse transcriptase PCR.

Comparison of normalisation methods

There is an ongoing debate what is the best way to normalise qPCR data. Reference genes are the most common method, although single unverified reference genes invalidate the qPCR data generated. Total RNA, ribosomal RNA, and genomic DNA have been suggested as alternative methods.

Reference genes

Most common method. Frequently, a panel is used for normalization, e.g. [1] not just a single reference gene and including data on suitability as reference genes. Often housekeeping gene  is used here instead of reference gene but the term is poorly defined and can be misleading. It is to be noted that panels are often composed of genes that are supposed to be stable based on their function. However, more than 100 peer-reviewed articles report problems related to genes chosen from a panel, because they were not suitable for a particular context. A recent approach is to select a reference gene based on its stability across microarrays done within one's condition of interest. There is a public tool called RefGenes that searches a microarray database of more than 50,000 arrays to identify genes that are stable across subsets of conditions. It is available at the Genevestigator website [2].

RNA

Total rRNA [3] [4], or total RNA. Drawback: rapidly dividing cells will have more rRNA and different rRNA/mRNA ratio which will complicate comparison; difference in cDNA synthesis not taken into account.

Genomic DNA

Genomic DNA or cell number. Drawbacks: RNA degrades faster than RNA which can distort the data; sample cannot be DNase treated; efficiency of cDNA synthesis not taken into account.

Reference mRNAs

Main article: Choosing reference genes for qPCR normalisation

Picking reference genes will make or break your quantification via qPCR (real time PCR). If you pick only one reference gene and your pick is not constant across different conditions or samples, your results will be skewed. Choose several reference genes and check whether they satisfy the criteria for a good reference gene. Some commonly used reference genes, like 18S and GAPDH, are known to be problematic but continue to be used.

  • Ajeffs 06:55, 21 April 2007 (EDT): Screen a handful of ref genes, select the most stable using genorm, bestkeeper etc, use at least 2 reference genes for subsequent reactions and normalisation. Inlcude your genorm M values when publishing qPCR data.

Primer selection

Main article: Choosing primers for qPCR

Choosing suitable primers is an early crucial step in your qPCR experiment. Reusing a tested primer pair from a repository or publication can save you some time. Otherwise primer selection from scratch is similar to that for a standard qualitative PCR experiment but the product size is typically much smaller (below 200nt) and the amplification characteristics of the primer have to be rigorously tested.

Quantification methods

There are 3 common quantification methods. The standard curve method is the only one that gives you are absolute concentration. Both the Pfaffl method and the ΔΔCt method produce relative data with the Pfaffl method being superior.

Standard curve method

  • requires template at known concentration (e.g. cDNA or TA cloned PCR product)
  • requires dilution series of known template for standard curve (more wells)
  • yields absolute concentrations by comparing unknown samples to known

Pfaffl method

  • requires that primer efficiency be known but needs to be determined only once with a standard curve or a different method
  • produces relative amount (e.g. treated is 2x untreated)

(named after the inventor; see Pfaffl 2001 PMID 11328886)

ΔΔCt (delta delta Ct)

  • easiest, oldest, least reliable
  • assumes that primers for unknown and reference gene have very similar efficiency
or that v little correction is necessary (i.e. reference gene almost same level)
  • yields relative amounts

(Ct = cycle threshold; point when fluorescence reading surpasses a set baseline)

Primer efficiency estimation

A Ct difference of 1 between two samples has a different meaning depending on the efficiency of the primers used. If primers are 100% efficient, then ΔCt = 1 means one sample has twice the amount of template compared to the other. The simple ΔΔCt method, described above, often wrongly assumes perfect efficiency. It is better to experimentally verify the primer efficiency and use the Pfaffl method instead. The standard method takes the primer efficiency into account via the standard curve run with each sample. However, primer efficiencies in the standard curve dilutions and the actual samples are not necessarily the same.

Linear regression on dilution curve Ct data

Primer efficiencies can be calculated by making a dilution series, calculating a linear regression based on the data points, and inferring the efficiency from the slope of the line. For a base 10 logarithm the formulae is:

efficiency = 10^(-1/slope)

Slopes between -3.3 and -4 will thus give you estimated primer efficiencies between 100% and 78% respectively. It can happen that the calculated efficiencies are above 100% [5] [6] [7]. This may be due to incorrect template concentrations, too concentrated template, inhibition of the PCR reaction, unspecific PCR amplification, mistakes in the calculation [8], etc.

See a figure explaining the fitting process from the Hunts' qPCR tutorial [9].

Efficiency estimation based on the kinetics of single PCR runs

Efficiency (and Ct values) can also be calculated from the fluorescence data of a single PCR run or preferably replicates of the same PCR. The Miner algorithm (PMID 16241897) is an example for this type of method and can be used online at [10].

qPCR data quality

Sources of variability: Operator

Due to the small amount of liquid handled and the sensitivity of the technique, operator variability is high. Bustin reports that the same qPCR experiment repeated by 3 people using the same reagents lead to very different copy number estimations [Bustin 2002 PMID 12200227, figure 3]:

  • person A: 8·7 × 105
  • person B: 2·8 × 105 different by a factor of 3!
  • person C: 2·7 × 103 different by a factor of 300!!

Sources of variability: Reagent lots/age

Different lots of reagents can lead to different results. Experiment repeated by same operator 5 times, same RNA sample, different kits; values are copies/μg total RNA:

  • kit 1: 13±32 × 107
  • kit 2: 5.4±1.6 × 107 - different by a factor of 2.4

Similar experiment with old (9 months 4°C) and new probe (3 months 4°C), values are copies/μg total RNA:

  • old: (5.6 ± 1.3) x 103
  • new: (3.8 ± 0.6) x 108 - different by a factor of 100'000!!

both experiments above from [Bustin 2002 PMID 12200227, figure 4]

More information

Notes

  • The most commonly used specialist reverse transcriptase enzyme for cDNA production is AMV reverse transcriptase. It has RNase H activity (so that RNA molecules are only transcribed once) and has a high temperature stability (to reduce RNA secondary structure and nonspecific primer annealing) [1].
  • Since RNA can degrade with repeated freeze-thaw steps, experimental variability is often seen during successive reverse transcription reactions of the same RNA sample [1].
  • Reverse transcriptase enzymes are notorious for their thermal instability. Repeated removals from the freezer can degrade the efficiency of the enzyme [1].
  • Producing total cDNA from total RNA can be advantageous because
    1. cDNA is more stable than RNA so making total cDNA allows you to make multiple sequence-specific RNA measurements [1].
    2. This approach could reduce experimental variability stemming from RNA degradation [1].
  • To make total cDNA
    1. Use a polyT primer (most but not all eukaryotic mRNA) or random decamers (prokaryotic mRNA) [1].
    2. Random decamers give longer cDNAs on average than random hexamer primers [1].
    3. Use longer reverse transcription reaction times [1].
    4. Ensure that the concentration of deoxynucleotides doesn't run out [1].

Appendix

Specific protocols

See also

External links

  • venerable qpcrlistserv. Anyone doing qPCR should be subscribed to this list.

References

  1. ISBN:0415374723 [MeasuringGeneExpression]