Andrew Forney Week 8: Difference between revisions
Revision as of 09:02, 24 October 2010
Author: Andrew Forney
Assignment: Individual Journal 8
Electronic Lab Notebook
Normalization of Log Ratios
This portion was fairly straightforward and mostly a matter of rote repetition. Being a fan of algorithms, and therefore of efficiency, I found that the fastest method was to set up all of the needed columns first (insert, then title). Because the titles were so similar, the clipboard could be reused effectively: only a few characters changed between many copies and pastes.
Once the columns were set, copying and pasting the pseudo-dynamic formula was efficient for the same reason: the formulas were nearly identical in syntax, so each pasted copy needed only a small change.
Statistical Analysis
This part of the analysis had a couple of rough patches but was still pretty straightforward. The first hiccup was the instruction: "Go to a new column on the right of your worksheet. Type the header "Avg_LogFC_A", "Avg_LogFC_B", and "Avg_LogFC_C" into the top cell of the next three columns." I was unsure what "on the right of your worksheet" really meant--did it mean to the right of what was currently visible on screen, or just the first unoccupied column on the right? I read a little further and decided that, since each Avg_LogFC column corresponded to its own four individual samples, I would place each one next to the samples it averaged. This seemed the most logical choice because it mimicked the format of the first part of the instructions, but I then realized that the remainder of the statistical analysis section assumed that the Avg_LogFC columns were directly adjacent to one another. As such, I needed to adapt the cell references in a few of the formulas to fit my differing layout, but in the end the results were the same--the difference should have made no change to the "forGenMapp" sheet.
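The layout point above can be sketched in Python. This is only an illustration, not the spreadsheet itself: the header names such as "A_LogFC_1", the helper `avg_log_fc`, and the numbers are all made up. It shows that an average looked up by header does not depend on where the columns sit in the sheet.

```python
from statistics import mean

def avg_log_fc(row, sample_headers):
    """Average the log fold changes found under the given headers.

    Missing or blank (None) cells are skipped, much as Excel's AVERAGE
    ignores empty cells.
    """
    values = [row[h] for h in sample_headers if row.get(h) is not None]
    return mean(values) if values else None

# Hypothetical replicate headers for group A (four samples, as in the entry).
samples_a = ["A_LogFC_1", "A_LogFC_2", "A_LogFC_3", "A_LogFC_4"]

# The same made-up gene in two layouts: one column order, then the reverse.
row_original = {"ID": "gene1", "A_LogFC_1": 0.5, "A_LogFC_2": 1.5,
                "A_LogFC_3": 1.0, "A_LogFC_4": 1.0}
row_reordered = dict(reversed(list(row_original.items())))

print(avg_log_fc(row_original, samples_a))   # 1.0
print(avg_log_fc(row_reordered, samples_a))  # 1.0
```

In the spreadsheet the equivalent adjustment is to point each formula at the cells that actually hold the samples, which is why moving the Avg_LogFC columns changes the references but not the results.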
The second minor hiccup was the formula "=TDIST(ABS(R2),degrees of freedom,2)": I got a #NAME? error when I tried to enter it as written. I looked the error up in the help documentation and facepalmed when I saw that "degrees of freedom" was a placeholder to be replaced with an integer value--in our case, 2.
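For reference, here is a minimal Python sketch of what a call like =TDIST(ABS(R2),2,2) computes: the two-tailed p-value of a t statistic for a given number of degrees of freedom. The names `t_pdf` and `tdist_two_tailed` are hypothetical, and the numerical integration is just one simple way to get the value; it is not how Excel implements TDIST, and the input 2.0 is only an example.

```python
import math

def t_pdf(x, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    coeff = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coeff * (1 + x * x / df) ** (-(df + 1) / 2)

def tdist_two_tailed(t, df, steps=10000):
    """Two-tailed p-value P(|T| > |t|), integrating the density with Simpson's rule.

    `steps` must be even. abs() plays the role of ABS() in the spreadsheet
    formula, since TDIST itself does not accept a negative x.
    """
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3  # probability mass between 0 and |t|
    return 1 - 2 * area

# Example: t statistic of 2.0 with 2 degrees of freedom.
print(round(tdist_two_tailed(2.0, 2), 4))  # 0.1835
```

The second argument has to be a number, which is why typing the words "degrees of freedom" into the cell produced a #NAME? error: Excel tried to read the text as a name it did not recognize.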
As far as tips and tricks went for this section, I discovered that the copy-paste process could be sped up by CTRL-selecting columns that were not necessarily adjacent, which then allowed me to paste them in their proper order where needed. Additionally, I had always known that "Paste Special..." existed but had never really used it. This exercise, pasting the values of the copied cells over the formulas that produced them, gave me the gist of its purpose. Other than these two points, along with those I noted about the normalization step, there wasn't a whole lot to say about the process. The instructions were easy to follow aside from the couple of small issues I had initially, and I'm confident that my end result is accurate.