Moore Notes 2 9 11
From OpenWetWare
Jump to navigationJump to search
Group Call
- James: SAR for niche mapping project
- Plots with different average numbers of reads per cell: plot1(0.4), plot2 (4), plot3 (40), plot4 (400)
- At very high sampling depth, all areas will have all genera
- This could be real biologically since bacteria genera are big and we're not looking at particularly niche driven taxa (like hyperthermophiles)
- Filtering out things that only occur in <10 samples
- There is a linear pattern on the log-log scale at shallower depths
- Is the non-zero probability output by GLM a realistic framework for this?
- They are driving the flattening
- The minimum p-hat depends on the number of samples/reads going into the model fitting
- Should we look at model mis-specification and the dependence of results on sampling depth in general?
- Can probably just acknowledge limitations for now
- Josh: additional niche mapping updates
- AUC>0.7 as a criteria for including taxa omits 6 common genera (including Pelagibacter and Prochlorococcus)
- Draft manuscript is being edited, will send out to group soon (1 week turnaround OK for every one)
- Alex: working on phylum specific richness maps
- Need to adjust for numbers of genera in each phylum
- Waiting for new AUC values from Josh