Tessa A. Morris Week 11
From OpenWetWare
Electronic Lab Notebook
Date:
3/19/2015
Assignment:
Partner:
Purpose:
Analyze microarray data, comparing the wild type and a mutant strain, in this experiment Δgln3.
Methods:
Statistical Analysis Part 1: ANOVA
- Download microarray data from Lionshare and save to desktop, changing the name of the file to include initials (TM)
- Record number of replicates (change the color of the fill for each different time point to make it easier to see)
- Create a new worksheet, label it "stats"
- Copy the first two columns from sheet 1, "data"
- In Row 1:
- Columns C-G label: GLN3_xbar_(TIME)
- Columns H and I label: GLN3_xbar_grand and (STRAIN)_ss_HO.
- Columns J-N label: GLN3_ss_(TIME)
- Columns O, P, and Q label: GLN3_SS_full, Fstat and p-value.
- For C2 type
=AVERAGE(
then in the "data" sheet, highlight the data in Row 2 that is associated with the GLN3 and t15- Copy this formula down the row in the stats cell by double clicking the black plus sign in the bottom right hand corner of C2
- Repeat for all of the time points
- Record total number of data points
- For H2 labeled GLN3_xbar_grand take the average of C2-G2 and copy this formula down the column (Note Step 4)
- For I2 type
=SUMSQ(
then in the "data" sheet, highlight the data in Row 2 that is associated with the GLN3 and t15 and copy this formula down the column (Note Step 4) - Repeat for all of the time points
- For J2 type
=SUMSQ(data!C2:F2)-4*stats!C2^2
and copy this formula down the column (Note Step 4)- "data!C2:F2" is the data associated with t15 // The number "4" is the number of data points // "stats!c2" gets the average from Step 4 for t15 // "^2" squares thevalue
- Repeat for Cells K-N
- To save time, take note of which columns the data for each time points is in, so the formula from J2 can be copied and then adjusted slightly
- For O2 type
=sum(j2:n2)
and copy down the column (Note Step 4) - For P2 type
=((n-5)/5)*(i2-o2)/o2
, where n is the total number of data points - For Q2 type
=FDIST(P2,5,n-5)
, where n is the total number of data points - To adjust the p-value to correct for the multiple testing problem
- Label R2 "GLN3_Bonferroni_p-value"
- In R2 type
=q2*6189
and copy down the column (Note Step 4)
- To see how many of the p-values are less than 0.05
- Sort & Filter >> Filter >> on drop down arrow for Q1 (p-value) Number Filter >> less than >> 0.05 >> OK
- The number of values less than 0.05 will appear in the bottom left hand of the screen
- Record this value
- To correct p-values that are greater than 1 by the number 1
- In S2 type
=IF(r2>1,1,r2)
- In S2 type
- Save the data set (Upload to Lionshareand share with professors, Dr. Dahlquist and Dr. Fitzpatrick, and lab partner, Alyssa N Gomes)
Data & Observations:
- Alyssa and I were assigned to analyze Wild type vs. Δgln3
- While Alyssa analyzes the wild type, I am going to analyze Δgln3
- Note about excel: ID: gene id; standard name: gene symbol (more user friendly); each column represents one microarray
- Time 15: 4 replicates // Time 30: 4 replicates // Time 60: 4 replicates // Time 90: 4 replicates // Time 120: 4 replicates
- Total Number of data points: 20
- 15: C-F // 30: G-J // 60: K-N // 90: O-R // 120: S-V
- 1864 out of 6189 genes have a p-value of less than 0.05
- Data was shared with Dr. Dahlquist, Dr. Fitzpatrick, and Alyssa N Gomes through Lionshare
User Page: Tessa A. Morris
Course Page: Biomathematical Modeling
- Week 1
- Week 2
- Week 3
- Week 4
- Week 5
- Week 6: No Assignment
- Week 7
- Week 8: No Assignment
- Week 9
- Week 10
- Week 11
- Week 12
- Week 13
- Week 14
- Week 15