BioMicroCenter:Sequencing Quality Control: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
No edit summary
 
(14 intermediate revisions by 3 users not shown)
Line 5: Line 5:
It is very important to have a reliable measurement of the amount of starting material so that the sample can be prepared for hybridization and amplification on the flow cell. The ideal sample is a library with 10nM of successfully ligated DNA. A 1.3ng/ul, 200bp sample is approximately 10nM.  
It is very important to have a reliable measurement of the amount of starting material so that the sample can be prepared for hybridization and amplification on the flow cell. The ideal sample is a library with 10nM of successfully ligated DNA. A 1.3ng/ul, 200bp sample is approximately 10nM.  


As of March 2011 the optimum cluster range is from about 250,000 to 350,000 per tile on the GAIIx and 600,000-800,000 on the HiSeq. It is crucial that we have accurate concentrations on hand to prevent under- and over-clustering. If a sample is too concentrated or the fragment size is too variable the sequencers will not be able to distinguish between clusters properly, resulting in the loss of reads. If the sample is too dilute the optimum number of reads per lane will not be achieved. Having reliable concentration measurements allows us to optimize the number of reads per lane and maximize the quality of data produced.
As of July 2012 the optimum cluster range is from about 250,000 to 350,000 per tile on the GAIIx and 700,000-900,000 on the HiSeq. It is crucial that we have accurate concentrations on hand to prevent under- and over-clustering. If a sample is loaded at too high of a concentration or the fragment size is highly variable the sequencers will not be able to distinguish between clusters properly, resulting in the loss of reads. If the sample is too dilute the optimum number of reads per lane will not be achieved. Having reliable concentration measurements allows us to optimize the number of reads per lane and maximize the quality of data produced.


[[Image:HiSeqStats.png|thumb|right|HiSeq QC]]
[[Image:HiSeqStats.png|thumb|right|HiSeq QC]]


For results generated with our choice of QC methods, please view the following poster:
For sample results generated with BioMicro Center's QC methods, please view our[[Media:ABRF2011_poster_final.pdf| QC POSTER]]  
* [[BioMicroCenter:Sequencing|ILLUMINA QC]] - [[Media:ABRF2011_poster_final.pdf| DOWNLOAD]]  


Much effort and time has been spent on optimizing loading concentrations and accounting for variations in various library prep techniques to generate optimal cluster densities on the HiSeq. The following plot shows the optimal cluster range we aim for in each experiment to efficiently generate reads with a high pass filter percentage:
Much effort and time has been spent on optimizing loading concentrations and accounting for variations in various library prep techniques to generate optimal cluster densities on the HiSeq. The following plot shows the optimal cluster range we aim for in each experiment to efficiently generate reads with a high pass filter percentage:


== Possible Sample QC Techniques ==
== BioMicro Center Methods ==
 
All samples run on the BioMicro Center sequencers are tested using both the [[Advanced Analytical Fragment Aanlyzer]] and [[BioMicroCenter:RTPCR_Protocol_for_Sample_QC|qPCR]]. The AA, a high throughput version of the [[BioMicroCenter:2100BioAnalyzer|Agilent BioAnalyzer]], is used to confirm the size range of the library while the qPCR is used to determine absolute concentration and confirm the presence of the Illumina anchor sequences. All qPCRs are run on using the BioMicro Center's [[BioMicroCenter:Tecan Freedom Evo|Tecan robot "B9"]], allowing 4 point titrations of each sample in order to correct for differences in amplification efficiency.
 
== Other Sample QC techniques not recommended ==


* [[BioMicroCenter:NanoDrop ND-1000|NanoDrop ND-1000]]- The NanoDrop is one of the most commonly used tools to measure the concentration of DNA in solution. The NanoDrop has a detection limit of about 5ng/ul. Unfortunately, due to noise at the lower detection limit, samples speced on the Nanodrop have not shown reliable results on the Solexa sequencer. We do not recommend using the NanoDrop as the primary method of determining concentration for samples on the Sequencer.  
* [[BioMicroCenter:NanoDrop ND-1000|NanoDrop ND-1000]]- The NanoDrop is one of the most commonly used tools to measure the concentration of DNA in solution. The NanoDrop has a detection limit of about 5ng/ul. Unfortunately, due to noise at the lower detection limit, samples speced on the Nanodrop have not shown reliable results on the Solexa sequencer. We do not recommend using the NanoDrop as the primary method of determining concentration for samples on the Sequencer.  
* [[BioMicroCenter:2100BioAnalyzer|2100 BioAnalyzer]] - The Bioanalyzer produces data similar to that of gel electrophoresis, although it requires much less sample input (1uL) and provides quantification data in addition to valuable distribution information. Due to the amount of error introduced in low concentration or widely distributed samples the Bioanalyzer is not recommended as the primary method of determining concentration.  Samples are run on the Bioanalyzer during sample prep at the outset, and at the conclusion of size selection.  Agilent recommends samples within a 5-500pg/ul range for accurate determination of concentration on the High Sensitivity assay.  Internal testing is underway to determine the accurate range of quantification, however, the Bioanalyzer's primary function is determination of distribution.


* Qubit - The Qubit is a selective flurometer that uses fluorescent dyes specific for dsDNA, RNA or protein to quantify samples. The Qubit is used in the Fraenkel lab to quantify their samples for Solexa sequencing. Information about this application can be found at: http://www.invitrogen.com/site/us/en/home/brands/Product-Brand/Qubit.html
* Qubit - The Qubit is a selective flurometer that uses fluorescent dyes specific for dsDNA, RNA or protein to quantify samples. The Qubit is used in the Fraenkel lab to quantify their samples for Solexa sequencing. Information about this application can be found at: http://www.invitrogen.com/site/us/en/home/brands/Product-Brand/Qubit.html
Line 24: Line 25:
* PicoGreen - PicoGreen is a fluorescent dye that binds specifically to dsDNA and allows for quantification. PicoGreen can be measured in a few different ways, the Boyer lab uses a photospectrometer and  Invitrogen states that the Qubit can also be used. More information can be found at: http://probes.invitrogen.com/media/pis/mp07581.pdf  
* PicoGreen - PicoGreen is a fluorescent dye that binds specifically to dsDNA and allows for quantification. PicoGreen can be measured in a few different ways, the Boyer lab uses a photospectrometer and  Invitrogen states that the Qubit can also be used. More information can be found at: http://probes.invitrogen.com/media/pis/mp07581.pdf  


* [[BioMicroCenter:RTPCR_Protocol_for_Sample_QC|RT-PCR, SYBERgreen assay]] - This assay uses primers that are specific for the adapters used during the ligation step of sample preparation.  This allows for the amount of DNA that will actually bind to the flowcell to be quantified. The RT-PCR assay is recommended in addition to the techniques explained above and provides additional and more precise concentration information.


== Flow Cell QC ==
== Flow Cell QC ==


[[BioMicroCenter:Cluster_Visualization_Protocol_for_Flowcell_QC|SYBR Green Cluster Visualization]]- This protocol allows for the visualization of clusters on a flowcell after amplification and before it is put on the Genome Analyzer.  It is especially helpful to ensure proper amplification has occurred if there has been a clog or an error on the cluster station or if an older Cluster Generation kit that may be expired is being used.
[[BioMicroCenter:Cluster_Visualization_Protocol_for_Flowcell_QC|SYBR Green Cluster Visualization]]- This protocol allows for the visualization of clusters on a flowcell after amplification and before it is put on the Genome Analyzer.  It is especially helpful to ensure proper amplification has occurred if there has been a clog or an error on the cluster station or if an older Cluster Generation kit that may be expired is being used.

Latest revision as of 07:59, 13 June 2013

HOME -- SEQUENCING -- LIBRARY PREP -- HIGH-THROUGHPUT -- COMPUTING -- OTHER TECHNOLOGY

Why is QC Important?

It is very important to have a reliable measurement of the amount of starting material so that the sample can be prepared for hybridization and amplification on the flow cell. The ideal sample is a library with 10nM of successfully ligated DNA. A 1.3ng/ul, 200bp sample is approximately 10nM.

As of July 2012 the optimum cluster range is from about 250,000 to 350,000 per tile on the GAIIx and 700,000-900,000 on the HiSeq. It is crucial that we have accurate concentrations on hand to prevent under- and over-clustering. If a sample is loaded at too high of a concentration or the fragment size is highly variable the sequencers will not be able to distinguish between clusters properly, resulting in the loss of reads. If the sample is too dilute the optimum number of reads per lane will not be achieved. Having reliable concentration measurements allows us to optimize the number of reads per lane and maximize the quality of data produced.

HiSeq QC

For sample results generated with BioMicro Center's QC methods, please view our QC POSTER

Much effort and time has been spent on optimizing loading concentrations and accounting for variations in various library prep techniques to generate optimal cluster densities on the HiSeq. The following plot shows the optimal cluster range we aim for in each experiment to efficiently generate reads with a high pass filter percentage:

BioMicro Center Methods

All samples run on the BioMicro Center sequencers are tested using both the Advanced Analytical Fragment Aanlyzer and qPCR. The AA, a high throughput version of the Agilent BioAnalyzer, is used to confirm the size range of the library while the qPCR is used to determine absolute concentration and confirm the presence of the Illumina anchor sequences. All qPCRs are run on using the BioMicro Center's Tecan robot "B9", allowing 4 point titrations of each sample in order to correct for differences in amplification efficiency.

Other Sample QC techniques not recommended

  • NanoDrop ND-1000- The NanoDrop is one of the most commonly used tools to measure the concentration of DNA in solution. The NanoDrop has a detection limit of about 5ng/ul. Unfortunately, due to noise at the lower detection limit, samples speced on the Nanodrop have not shown reliable results on the Solexa sequencer. We do not recommend using the NanoDrop as the primary method of determining concentration for samples on the Sequencer.
  • PicoGreen - PicoGreen is a fluorescent dye that binds specifically to dsDNA and allows for quantification. PicoGreen can be measured in a few different ways, the Boyer lab uses a photospectrometer and Invitrogen states that the Qubit can also be used. More information can be found at: http://probes.invitrogen.com/media/pis/mp07581.pdf


Flow Cell QC

SYBR Green Cluster Visualization- This protocol allows for the visualization of clusters on a flowcell after amplification and before it is put on the Genome Analyzer. It is especially helpful to ensure proper amplification has occurred if there has been a clog or an error on the cluster station or if an older Cluster Generation kit that may be expired is being used.