Choosing reference genes for qPCR normalisation: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
m (QRT-PCR reference genes moved to QRT-PCR normalisation: not all normalisation is w reference genes)
No edit summary
 
(12 intermediate revisions by 2 users not shown)
Line 1: Line 1:
Quantifying mRNA via cDNA levels as in a [[QRT-PCR]] hinges on the references you choose. If you pick only one reference gene and your pick is not constant across different conditions or samples, your results will be skewed. Pick several and check whether they satisfy the criteria for a good reference gene.
'''Which reference genes should I use for my qPCR experiment?''' Quantifying mRNA via cDNA levels as in a quantitative reverse transcriptase PCR ('''[[QRT-PCR]]''') hinges on the references you choose. If you pick only one reference gene and your pick is not constant across different conditions or samples, your results will be skewed. Pick several and check whether they satisfy the criteria for a good reference gene.
 
== Normalisation against "housekeeping" genes, rRNA, total RNA? ==
 
There is an ongoing debate what is the best reference to normalise against. Many studies use a single :-( or a set of so called housekeeping genes, which despite their label are often not constant or comparable. Other studies have suggested the use of cell number, total rRNA [http://scholar.google.com/scholar?hl=en&lr=&safe=off&cluster=12435126891737656303] [http://scholar.google.com/scholar?hl=en&lr=&safe=off&cluster=9547016096229453970], or total RNA.  


== The ideal reference gene ==
== The ideal reference gene ==
A mRNA used as reference or standard of a [[QRT-PCR]] (and other experiments) should have the following properties:
A mRNA used as reference or standard of a [[QRT-PCR]] (and other experiments) should have the following properties:
* expressed in all cells
* expressed in all cells
Line 13: Line 8:


== Common reference genes ==
== Common reference genes ==
Common reference mRNAs linked to known mouse primer pairs:
* β-actin (common cytoskeletal enzyme) [http://medgen.ugent.be/rtprimerdb/search_results.php?id=&application=-1&organism=2&gene=actin&string_type=substring&detection=-1&primer=&locuslink=&snp=&last_name=&pubmed=&order=0&search=Search+Primers&first_result=0], [http://www.realtimeprimers.org/SYBR%20Green/Mouse%20SYBR%20Green.html]
* ribosomal proteins (e.g. RPLP0)
* ribosomal proteins (e.g. RPLP0)
* cyclophilin mRNA
* PPIA peptidylprolyl isomerase A = cyclophilin A
* MHC I (major histocompatibility complex I)
* MHC I (major histocompatibility complex I)
* TUBB β-tubulin (common cytoskeletal enzyme)
* ACTB β-actin (common cytoskeletal enzyme) [http://medgen.ugent.be/rtprimerdb/search_results.php?id=&application=-1&organism=2&gene=actin&string_type=substring&detection=-1&primer=&locuslink=&snp=&last_name=&pubmed=&order=0&search=Search+Primers&first_result=0], [http://www.realtimeprimers.org/SYBR%20Green/Mouse%20SYBR%20Green.html]
* YMHAZ tyrosine 3/tryptophan 5 -monooxygenase activation protein, zeta polypeptide
* B2M β2-microglobulin
* UBC ubiquitin C
* TBP TATAA-box binding protein
* GUSB β-glucuronidase
* HPRT1 hypoxanthine-guanine phosphoribosyltransferase
==Common but problematic references==
* glyceraldehyde-3-phosphate dehydrogenase '''GAPDH''' (common metabolic enzyme) [http://medgen.ugent.be/rtprimerdb/search_results.php?primer=&application=-1&detection=-1&locuslink=&snp=&last_name=&order=0&search=Search+Primers&first_result=0&pubmed=&string_type=substring&gene=gapdh&organism=2&Search=Search&id=], [http://www.realtimeprimers.org/SYBR%20Green/Mouse%20SYBR%20Green.html] - see [[QRT-PCR#Reference_mRNAs]]
:GAPDH has many functions besides the most well known in the glycolytic pathway [http://en.wikipedia.org/wiki/Glyceraldehyde_3-phosphate_dehydrogenase#Additional_functions]. Its levels are not constant [Zhu 2001 PMID 11237753] and vary more than for other genes across different tissues [Radonic 2004 PMID 14706621]. Problems using GAPDH as a qPCR reference gene have been published previously [Ke 2000 PMID 10799275, Suzuki 2000 PMID 10948434].
* ribosomal RNAs ('''28S or 18S''')
:'''[[User:Ajeffs|Ajeffs]]''' 06:55, 21 April 2007 (EDT): 18S is generally a terrible choice for a reference gene thanks to the combination of (i) high abundance (creating a 1:100 dilution of template to run in parallel with neat template just for 18S is a complete drag); and (ii) having different degradation characteristics to mRNAs (it appears to be more resistant to degradation). However, if you can show that you have screened 5-10 reference genes, and 18S is still the best for your specific situation then so be it (but do try 28S if you or you PI is hung-up on 18S).


Common but not recommended references
* Panels of "housekeeping genes"
* glyceraldehyde-3-phosphate dehydrogenase GAPDH (common metabolic enzyme) [http://medgen.ugent.be/rtprimerdb/search_results.php?primer=&application=-1&detection=-1&locuslink=&snp=&last_name=&order=0&search=Search+Primers&first_result=0&pubmed=&string_type=substring&gene=gapdh&organism=2&Search=Search&id=], [http://www.realtimeprimers.org/SYBR%20Green/Mouse%20SYBR%20Green.html] - see [[QRT-PCR#Reference_mRNAs]]
:All genes used for normalization can show problems in one or the other condition. There are always conditions in which their expression differs significantly from their general level of expression. That is why reference genes should be validated for one's condition of interest. Ideally, one should choose from the complete genome rather than from a gene panel. Genevestigator RefGenes [http://www.refgenes.org] is an open access online tool that uses a very large microarray database to identify genes that are most stable in conditions similar to that of your own experiment.
* ribosomal RNAs (28S or 18S) - see below


== Reference genes across tissues ==
== Reference genes across tissues ==
Line 31: Line 37:
* genes with the largest range (unsuitable for cross-tissue comparison): HPRT, Alb, PBGD, GAPDH, β2M
* genes with the largest range (unsuitable for cross-tissue comparison): HPRT, Alb, PBGD, GAPDH, β2M


* genes undetectable in tissue: Alb - colon; PPIA - ovaries; HPRT - prostate, testis, ovary, small intestine, colon; PBL, skeletal muscle; Tub - ovaries, PBGD - skeletal muscle; TBP - lung, prostate, colon; G6PDH - colon
* genes undetectable in tissue: Alb - colon; PPIA - ovaries; HPRT - prostate, testis, ovary, small intestine, colon; PBL, skeletal muscle; PBGD - skeletal muscle; TBP - lung, prostate, colon; G6PDH - colon
* genes detected in all tissues: GAPDH, Act, β2M, L13, PLA, RP2
* genes detected in all tissues: GAPDH, Act, Tub, β2M, L13, PLA, RP2
(note the source Fig 2 is sometimes impossible to read and the describing text is incomplete; that might have lead to some errors above)
(note the source Fig 2 is sometimes impossible to read and the describing text is incomplete; that might have lead to some errors above)
== Primer collections ==
Search primer repositories like [http://medgen.ugent.be/rtprimerdb/ RTPrimerDB] (see also below) and check the literature before doing it from scratch. <br>
Check out the Eccles Lab collection of human and mouse [[Eccles:QPCR_reference_genes| qPCR reference genes]] on OWW.


== Stability ==
== Stability ==
*'''[[User:Ajeffs|Ajeffs]] 06:55, 21 April 2007 (EDT):''' In addition to the given requirements of good (well, acceptable) specificity and efficiency of the reference gene primers, the next most important aspect of reference gene selection is stability. I don't care if the CT value of my reference genes (yes, genes, not gene) is close to the target genes/s or not - as long as the efficiency of all the primers is similar, and they are all working within their respective limits of detection i.e. linear range, then the stability of the reference genes between samples, treatments, etc. is the most crucial aspect of generating believable qPCR results.
*'''[[User:Ajeffs|Ajeffs]]''' 06:55, 21 April 2007 (EDT): In addition to the given requirements of good (well, acceptable) specificity and efficiency of the reference gene primers, the next most important aspect of reference gene selection is stability. I don't care if the CT value of my reference genes (yes, genes, not gene) is close to the target genes/s or not - as long as the efficiency of all the primers is similar, and they are all working within their respective limits of detection i.e. linear range, then the stability of the reference genes between samples, treatments, etc. is the most crucial aspect of generating believable qPCR results.
 
== Selection ==
*'''[[User:Ajeffs|Ajeffs]] 06:55, 21 April 2007 (EDT):''' Screen a handful of ref genes, select the most stable using genorm, bestkeeper etc, use at least 2 reference genes for subsequent reactions and normalisation. Inlcude your genorm M values when publishing qPCR data.
 
== Use of 18S ==
*'''[[User:Ajeffs|Ajeffs]] 06:55, 21 April 2007 (EDT):''' 18S is generally a terrible choice for a reference gene thanks to the combination of (i) high abundance (creating a 1:100 dilution of template to run in parallel with neat template just for 18S is a complete drag); and (ii) having different degradation characteristics to mRNAs (it appears to be more resistant to degradation). However, if you can show that you have screened 5-10 reference genes, and 18S is still the best for your specific situation then so be it (but do try 28S if you or you PI is hung-up on 18S).

Latest revision as of 12:25, 5 October 2011

Which reference genes should I use for my qPCR experiment? Quantifying mRNA via cDNA levels as in a quantitative reverse transcriptase PCR (QRT-PCR) hinges on the references you choose. If you pick only one reference gene and your pick is not constant across different conditions or samples, your results will be skewed. Pick several and check whether they satisfy the criteria for a good reference gene.

The ideal reference gene

A mRNA used as reference or standard of a QRT-PCR (and other experiments) should have the following properties:

  • expressed in all cells
  • constant copy number in all cells
  • medium copy number for more accuracy (or similar copy number to gene of interest)

Common reference genes

  • ribosomal proteins (e.g. RPLP0)
  • PPIA peptidylprolyl isomerase A = cyclophilin A
  • MHC I (major histocompatibility complex I)
  • TUBB β-tubulin (common cytoskeletal enzyme)
  • ACTB β-actin (common cytoskeletal enzyme) [1], [2]
  • YMHAZ tyrosine 3/tryptophan 5 -monooxygenase activation protein, zeta polypeptide
  • B2M β2-microglobulin
  • UBC ubiquitin C
  • TBP TATAA-box binding protein
  • GUSB β-glucuronidase
  • HPRT1 hypoxanthine-guanine phosphoribosyltransferase

Common but problematic references

GAPDH has many functions besides the most well known in the glycolytic pathway [5]. Its levels are not constant [Zhu 2001 PMID 11237753] and vary more than for other genes across different tissues [Radonic 2004 PMID 14706621]. Problems using GAPDH as a qPCR reference gene have been published previously [Ke 2000 PMID 10799275, Suzuki 2000 PMID 10948434].
  • ribosomal RNAs (28S or 18S)
Ajeffs 06:55, 21 April 2007 (EDT): 18S is generally a terrible choice for a reference gene thanks to the combination of (i) high abundance (creating a 1:100 dilution of template to run in parallel with neat template just for 18S is a complete drag); and (ii) having different degradation characteristics to mRNAs (it appears to be more resistant to degradation). However, if you can show that you have screened 5-10 reference genes, and 18S is still the best for your specific situation then so be it (but do try 28S if you or you PI is hung-up on 18S).
  • Panels of "housekeeping genes"
All genes used for normalization can show problems in one or the other condition. There are always conditions in which their expression differs significantly from their general level of expression. That is why reference genes should be validated for one's condition of interest. Ideally, one should choose from the complete genome rather than from a gene panel. Genevestigator RefGenes [6] is an open access online tool that uses a very large microarray database to identify genes that are most stable in conditions similar to that of your own experiment.

Reference genes across tissues

If you are comparing mRNA/cDNA levels from different tissue it is especially important that reference gene levels are close to constant across different tissues. Radonić et al compared 13 putative reference gene levels in 13 different human tissues [PMID 14706621]. The results are summarised below:

  • genes with the smallest range (most constant levels): TBP, RP2, Act, Tub, PLA
  • genes with the largest range (unsuitable for cross-tissue comparison): HPRT, Alb, PBGD, GAPDH, β2M
  • genes undetectable in tissue: Alb - colon; PPIA - ovaries; HPRT - prostate, testis, ovary, small intestine, colon; PBL, skeletal muscle; PBGD - skeletal muscle; TBP - lung, prostate, colon; G6PDH - colon
  • genes detected in all tissues: GAPDH, Act, Tub, β2M, L13, PLA, RP2

(note the source Fig 2 is sometimes impossible to read and the describing text is incomplete; that might have lead to some errors above)

Stability

  • Ajeffs 06:55, 21 April 2007 (EDT): In addition to the given requirements of good (well, acceptable) specificity and efficiency of the reference gene primers, the next most important aspect of reference gene selection is stability. I don't care if the CT value of my reference genes (yes, genes, not gene) is close to the target genes/s or not - as long as the efficiency of all the primers is similar, and they are all working within their respective limits of detection i.e. linear range, then the stability of the reference genes between samples, treatments, etc. is the most crucial aspect of generating believable qPCR results.