Moore Notes 9 2 09
Group Call
- INFERNAL/OTU analysis (Tom will post update on wiki)
- have 16S reads from GOS
- how to align?
- INFERNAL aligns reads to probabilistic model
- has an option for shotgun data (--sub) that does best local alignment if read is short
- what profile to use? makes a big difference
- native (just 5 sequences)
- from your alignment
- RDP hand-curated bacterial and archaeal alignments
- Sam: how different from hmmalign (from AMPHORA)?
- hmmalign uses an HMM
- INFERNAL uses an SCFG (stochastic context-free grammar; better for RNA)
- even short (<100bp) reads align well
- to do:
- like STAP, classify first (by domain) - need euk 18S profile, have 100 seqs from STAP
- size threshold may not be needed
- post-processing (OK to drop reads since they aligned independently): big gaps, amount aligned to profile (big indels)
- next: phylogenetic analysis
- Steve: run through RAxML, NJ (?)
- MOTHUR
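
The post-processing step noted above (dropping reads with big gaps or with little aligned to the profile) could be sketched roughly as follows. This is only an illustration, not the group's actual pipeline: the function names, the gap characters (`-` and `.`), and both thresholds are assumptions, and the input is taken to be a plain mapping of read name to aligned row rather than any particular INFERNAL output format.

```python
def alignment_stats(row, gap_chars="-."):
    """Return (number of aligned residues, longest internal gap run) for one row.

    Leading and trailing gaps are ignored, since a short shotgun read
    only covers part of the profile.
    """
    residue_positions = [i for i, c in enumerate(row) if c not in gap_chars]
    if not residue_positions:
        return 0, 0
    start, end = residue_positions[0], residue_positions[-1]
    longest = run = 0
    for c in row[start:end + 1]:
        if c in gap_chars:
            run += 1
            longest = max(longest, run)
        else:
            run = 0
    return len(residue_positions), longest


def keep_read(row, min_aligned=50, max_internal_gap=20):
    """Hypothetical filter: enough residues aligned and no big internal gap."""
    n_aligned, longest_gap = alignment_stats(row)
    return n_aligned >= min_aligned and longest_gap <= max_internal_gap


def filter_alignment(rows, **thresholds):
    """Keep only reads passing keep_read; rows maps read name -> aligned row.

    Dropping rows is safe here because each read was aligned to the
    profile independently of the others.
    """
    return {name: row for name, row in rows.items() if keep_read(row, **thresholds)}
```

For example, `filter_alignment(rows, min_aligned=4, max_internal_gap=2)` would keep a row such as `--ACGT--ACGT--` and drop `--AC------GT--`. The thresholds would need tuning against real reads, and a minimum-size cutoff may turn out to be unnecessary.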
- Tom: looking for a dataset of all sequenced microbial genomes to do gene family analyses on genes that are not in AMPHORA
- Srijak: ComboDb
- 6 months old
- coding regions only: what protein, NCBI accession
- Martin's personal version of db may have the flat files
- Morgan: microdb
- genome project
- chromosomes
- gene level: protein and DNA sequences
- updated monthly with cron job
- Sam: simulation update
- posting data and code on edhar
- 20 runs for RPOB, doing a few other genes (SMPB)
- what else do people need?
- problem with private html directory (Morgan too)
- Russell will work on it
- version control on edhar
- git and svn are installed (can run over ssh)
- to share with people who don't have edhar accounts, use public html for now
- have group iseem directory - Russell will put an svn directory there with code and data subdirectories
- Authorship discussion
- Jessica did a literature review
- She and Katie discussed it on the PI call last week, plus some email with Jonathan
- issues:
- who is an author?
- what is author order?
- other issues: e.g. contributions
- idea: on next call, discuss 3-4 papers together to set the plan, e.g.
- simulation paper
- SAR data paper
- depth gradient paper
- Sam: need a model for how things change and evolve during the project