From OpenWetWare
Jump to navigationJump to search



SERVICE Bioinformatics Support
External - contact
UNIT Hourly or as FTE
SUPPORT Informatics staff are subsidized by Departments of Biology and Biological Engineering, MIT CEHS and the Koch Institute.

As part of our mission statement, the BioMicro Center is designed to assist users in computational challenges. One key aspect of this is providing our users with the bioinformatics support to interpret their data readily and to assist them in analyzing data for publications and grants. To accomplish this, the BioMicro Center has a team of informatics scientists on staff able to assist labs with experience in a broad number of methodologies. Bioinformatics consultations are available by appointment for CORE lab members. Bioinformatic projects are undertaken by the BioMicro Center on a collaborative basis.

If you are looking for informatics support, the easiest way to begin is with an ilabs request with a brief description of your experiment. One of the members of our informatics staff will reach out to you to schedule a one on one meeting. This meeting is free for CORE lab members. Once an experimental plan is approved we will begin work on your project, checking in with you at regular intervals to be sure the project is on track. The project can be paused at any time by the researcher.

Significant portions of the informatics staff salaries are paid by direct support from faculty members. As such, these "sponsors" of the informatics groups have priority access on their projects. Additional blocks of time are reserved for each informaticist to work on projects that are billed hourly with priority given to CORE labs. A small number of hours may remain to assist non-MIT labs with data analysis but such projects are always considered low priority. Labs interested in sponsoring the informatics staff at the BioMicro Center should contact Stuart Levine

Informatics Offices


Informatics projects through the BioMicro Center are considered collaborative efforts. Publications resulting from statistic or bioinformatics work frequently merit co-authorship: the order of authorship is not of concern. Placing BioMicro Center staff on publications significantly increases our ability to obtain financial support through grants and is helpful in the renewal process for CEHS and KI core grants.


Vincent Butty, MD, PhD

Informatics Scientist

Dr. Vincent Butty joined the core in April 2012. He is an MD/PhD with a strong background in RNA sequencing methodologies and immunology. Vincent obtained his MD at the University of Geneva and received a PhD in Immunology from Harvard where he studied the population genetics of autoimmune diseases such as Type-I diabetes in the Benoist/Mathis lab. Vincent did his postdoctoral training here at MIT in Dr. Chris Burge's group, where he investigated the regulation of RNA processing across a broad variety of disease contexts using RNA sequencing. In the core, Vincent is currently involved in the analysis of long non-coding RNAs, the role of chromatin in development and the response to genotoxic and infectious stresses. Specialties

  • RNAseq
  • Immunology & Immunogenetics

Duan Ma, PhD

Informatics Scientist

Huiming Ding, PhD

Informatics Scientist

Huiming is the longest serving member of the team. He received his PhD in Physics from Jilin University in China before working for nearly a decade as a senior research associate in bioinformatics at the University of Toronto where he supported the work of Dr. Charlie Boone in studying cellular networks and pathways using synthetic genetic array (SGA) and synthetic dosage lethal (SDL) screens. Huiming has significant experience in deriving interaction networks and in creating statistical scoring algorithms and databases. His current collaborative projects involve the study of functional interaction networks from time course data and generation of genome-scale metabolic pathway models. Specialties

  • Statistical Analysis
  • Network Analysis

Charlie Whittaker, PhD

Stuart Levine, PhD

Core Director

Stuart Levine's primary responsibility is to direct the BioMicro Center but in a previous life Stuart was a bioinformaticist and is still available to assist on data analysis as time is available. Stuart received his BS in Biology from MIT (where he UROPed with Dr. Peter Sorger) then did his graduate work with Dr. Bob Kingston and Dr. William Forrester at Harvard Medical School where he studied the biochemical activities of the polycomb group of gene regulators. Stuart then did his post-doctoral work with Dr. Richard Young where he switched from biochemistry to bioinformatics, studying gene regulation on a genome wide scale using expression and chromatin immunoprecipitation data. Stuart has numerous publications in the areas of regulation of transcription, genomic architecture and cell fate determination.

  • Chromatin IP
  • expression analysis
  • Transcription mechanism


Gene Expression Isoform Visualization and Quantitation Functional Analyses
Gene expression analysis can be done using RNA-Seq and microarrays. (A) The RNA-Seq analysis pipeline has been implemented and the method is gaining popularity as sequencing becomes more affordable. Benefits include flexibility with level of detail collected and a lack of platform-dependent biases. (B) Microarrays are still a viable alternative to genome-wide mRNA analysis. They are generally less expensive and benefit from routine and well-developed processing and analysis methods. RNA-Seq reads compatible with different transcript isoforms are quantitated using a Bayesian analytical framework (MISO), generating posterior distributions of isoform abundances. Read densities are visualized using Sashimi plots. Core Network of Genes regulated by Braveheart, which drive Cardiac Differentiation, generated using the Spring-embedded algorithm in Cytoscape. A sequence of cardiac transcription factors and genes involved in myofibril organization were not induced during EB differentiation in Bvht-depleted cells.

From Klattenhoff et al. (Boyer, Burge labs), Cell 152:570. / Hierarchical clustering of genes identified from a time series microarray experiment. Clustering was performed using Cluster and the corresponding heatmap was generated with TreeView. (collaboration with Walker Lab)

SNPs and Mutations ChIPseq Analysis and Motif Finding Phylogenetic Analyses
NGS variant detection procedure: (A) Input material can either be whole-genome or transcriptome isolate, or a selected subset such as the exome (exon capture) or specific genome regions. Raw sequences are pre-processed and aligned using various options (BWA or Bowtie). The GeZnome Analysis Took Kit (GATK) and SAMtools are used to call variants and snpeff is used to annotate the consequences of those variations. (B) IGV is used to visualize the alignments in the context of the genome and its associated annotations. Characterizations of the significant Chip-Seq peaks. Figures describing proportions of binding sites located around annotated genic locations. Sequence logos from binding sites identified through MAST. (Horvitz Lab) Large-scale phylogenetic analyses of sequences can be performed using combinations of multiple sequence alignment tools and data visualization packages.

In this example phylogenetic relationships of the hits were visualized using iTOL. (Boyden lab)