After determining a list of genes involved in a given biological process the next step is to map these genes to known pathways/Gene Ontology terms and determine i.e. which pathways are overrepresented in a given set of genes.
Recent review (Jan 2008 !): Nam, Dougu, and Seon-Young Kim. “Gene-set approach for expression pattern analysis.” Brief Bioinform (17, 2008): bbn001. HTML See table 1 for complete list of tools.
- g:Profiler a web-based toolset for functional profiling of gene lists from large-scale experiments. Easy to use web server
- KOBAS server used for i.e. elucidating pathways in addiction
- takes both FASTA files and lists of genes
- excise gi| from typical FASTA NCBI entry to get unique IDs
- only about 1/3 of genes will get annotated in the first step
- Li, Chuan-Yun, Xizeng Mao, and Liping Wei. “Genes and (Common) Pathways Underlying Drug Addiction.” PLoS Computational Biology 4, no. 1 (1, 2008) HTML
- GSEA withMSigDB "Gene Set Enrichment Analysis (GSEA) is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states"
objections (Damian D, Gorfine M. Statistical concerns about the GSEA procedure): http://www.nature.com/ng/journal/v36/n7/full/ng0704-663a.html and reply: http://www.nature.com/ng/journal/v36/n7/full/ng0704-663b.html
- ErmineJ "ErmineJ performs analyses of gene sets in expression microarray data. A typical goal is to determine whether particular biological pathways are "doing something interesting" in the data. The software is designed to be used by biologists with little or no informatics background."
- GAGE is applicable independent of sample sizes, experimental design, assay platforms, and other types of heterogeneity (paper). This Biocondutor package also provides functions and data for pathway, GO and gene set analysis in general. Tutorials describe both RNA-Seq and microarray data analysis workflows.
Other tools to check
- GEPAT Genome Expression Pathway Analysis Tool. Performs standard microarray analyzes plus "Ensembl database and provides information about gene names, chromosomal location, GO categories and enzymatic activity for each probe on the chip.". Complex installation of java jars/MySQL etc.
- PAGE Parametric Analysis of Gene Set Enrichment
- CPath database and software suite for storing, visualizing, and analyzing biological pathways demo page
- EASE (old but highly cited) http://www.pubmedcentral.gov/articlerender.fcgi?tool=pubmed&pubmedid=14519205
- nonparametric multivariate analysis Nettleton et al. HTML. R code available from author.
- Cytoscape leader in the field
- ONDEX HTML "enables data from diverse biological data sets to be linked, integrated and visualised through graph analysis techniques"
- Pathview R/Bioconductor tool for pathway based data integration and visualization, easy to integrate in pathway analysis workflows. R-Forgehas an overview with some nice example plots. The work has been published in Bioinformatics.
- BIANA biological database integration and network management framework, successor of PIANA
- MATISSE Modular Analysis for Topology of Interactions and Similarity SEts
- automating the analysis of protein-protein interactions networks.
- KEGG first choice for scope
- Reactome human + model organisms pathways. Expert annotations from literature.
- PID Pathway Interaction Database @NIH
- Cyclone - provides an open source Java API for easier access to BioCyc.
- RegulonDB E.coli K12 DB (operons/genes/regulatory elements)
Pathway specific languages
- BioPAX Biological Pathway Exchange Language
Stuff 2 check
- GenMapp, Pathway Processor GeneXpress see:
Cavalieri D, De Filippo C. Bioinformatic methods for integrating whole-genome expression results into cellular networks. Drug Discov Today. 2005;10:727–734. doi: 10.1016/S1359-6446(05)03433-1
Related pages on OpenWetWare
- Luo W, Friedman M, Shedden K, Hankenson KD, Woolf JP (2009). "GAGE: generally applicable gene set enrichment for pathway analysis". BMC Bioinformatics 10: 161: http://www.biomedcentral.com/1471-2105/10/161.
- Aittokallio, Tero, and Benno Schwikowski. “Graph-based methods for analysing networks in cell biology.” Brief Bioinform 7, no. 3 (September 1, 2006): 243-255.
- Li, Chuan-Yun, Xizeng Mao, and Liping Wei. “Genes and (Common) Pathways Underlying Drug Addiction.” PLoS Computational Biology 4, no. 1 (1, 2008): e2 EP -.
- Nam, Dougu, and Seon-Young Kim. “Gene-set approach for expression pattern analysis.” Brief Bioinform (17, 2008): bbn001.
- Resources for integrative systems biology: from data through databases to networks and dynamic system models -- Ng et al. 7 (4): 318 -- Briefings in Bioinformatics.” http://bib.oxfordjournals.org/cgi/content/full/7/4/318.
- Stromback, Lena, Vaida Jakoniene, He Tan, and Patrick Lambrix. “Representing, storing and accessing molecular interaction data: a review of models and tools.” Brief Bioinform 7, no. 4 (December 1, 2006): 331-338.
- “Tools for visually exploring biological networks -- Suderman and Hallett 23 (20): 2651 -- Bioinformatics.” http://bioinformatics.oxfordjournals.org/cgi/content/full/23/20/2651.
- "Pathways to the analysis of microarray data",Trends in Biotechnology, Volume 23, Issue 8, August 2005, Pages 429-435 R.Keira Curtis, Matej Oresic, Antonio Vidal-Puig
- "Bioinformatics applications for pathway analysis of microarray data",Current Opinion in Biotechnology, Volume 19, Issue 1, February 2008, Pages 50-54,Thomas Werner