GRNmap Testing Report 16 Test Files from Dahlquist-data 2015-05-26 TM: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
(add 6 is running)
(→‎Results: added 2-genes_47-edges_Dahlquist-data_MM_estimation_fixP-0_no-graph)
 
(151 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==Sigmoidal==
==Test Conditions==
====Estimate + Forward====
*Date started: 2015-05-26
*'''Estimate b and Estimate p'''
*Test Performed by: [[User:Tessa A. Morris| Tessa A. Morris]], [[Tessa A. Morris Electronic Lab Notebook| Electronic Notebook]]
** Graph (1)
*Code Version: [https://github.com/kdahlquist/GRNmap/archive/v1.0.6.zip GRNmap] version 1.0.6
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate estimate-b estimate-p graph.xls| Input]]
*MATLAB Version: 2014b
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate estimate-b estimate-p graph output.xlsx| Output]]
*Computer on which the model was run: Row 1 #5, #6; Row 2 #4, #5, #6; Row 3 #1
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate estimate-b estimate-p.zip| Images]]
** No graph (2)
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate estimate-b estimate-p nograph.xlsx| Input]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate estimate-b estimate-p nograph output.xlsx| Output]]
*** No images
*'''Estimate b and Fix p'''
** Graph (3)
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation estimate-b fix-P graph.xlsx| Input]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation estimate-b fix-P graph output.xlsx| Output]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation estimate-b fix-P graph.zip| Images]]
** No graph (4)
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation estimate-b fix-P no-graph.xlsx | Input]]
*'''Fix b and Estimate p'''
** Graph (5)
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate fix-b estimate-p graph.xls| Input]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate fix-b estimate-p graph output.xlsx| Output]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimate fix-b estimate-p.zip| Images]]
** No graph (6) ---currently running
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation fix-b estimate-P no-graph.xlsx | Input]]
*'''Fix b and Fix p'''
** Graph (7)
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation fix-b fix-P graph.xlsx| Input]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation fix-b fix-P graph output.xlsx| Output]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation fix-b fix-P graph.zip| Images]]
**No graph (8)
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation fix-b fix-P no-graph.xlsx | Input]]
***[[Media:22-genes 47-edges Dahlquist-data Sigmoid estimation fix-b fix-P no-graph output.xlsx| Output]]
*** No images


===Forward only===
''Redone on 2015-06-03 to fix the naming convention and the input sheet''
*Graph (9)
*Date started: 2015-06-08
** [[Media:22-genes 47-edges Dahlquist-data Sigmoid forward graph.xlsx | Input]]
*Test Performed by: [[User:Tessa A. Morris| Tessa A. Morris]], [[Tessa A. Morris Electronic Lab Notebook| Electronic Notebook]]
** [[Media:22-genes 47-edges Dahlquist-data Sigmoid forward graph output.xlsx| Output]]
*Code Version: GRNmap-beta (9:17 am 2015-06-08)
** [[Media:22-genes 47-edges Dahlquist-data Sigmoid forward graph.zip| Images]]
*MATLAB Version: 2014b
*No graph (10)
*Computer on which the model was run: Row 2 #4
** [[Media:22-genes 47-edges Dahlquist-data Sigmoid forward no-graph.xlsx | Input]]
** [[Media:22-genes 47-edges Dahlquist-data Sigmoid forward no-graph output.xlsx| Output]]
** No images


==Michaelis Menten==
''Update alpha and optimization parameters''
====Estimate + Forward====
*Date started: 2015-06-22
*'''Fix p'''
*Test Performed by: [[User:Tessa A. Morris| Tessa A. Morris]], [[Tessa A. Morris Electronic Lab Notebook| Electronic Notebook]]
** Graph (11)
*Code Version: GRNmap-beta (10:28 am 2015-06-22)
***[[Media:22-genes 47-edges Dahlquist-data MM estimation fix-p graph.xlsx| Input]]
*MATLAB Version: 2014b
** No graph (12)
*The computer each input sheet was run on is noted in the results section.
***[[Media:22-genes 47-edges Dahlquist-data MM estimation fix-p no-graph.xlsx| Input]]
*'''Estimate p'''
** Graph (13)
***[[Media:22-genes 47-edges Dahlquist-data MM estimation estimate-p graph.xlsx| Input]]
** No graph (14)
***[[Media:22-genes 47-edges Dahlquist-data MM estimation estimate-p no-graph.xlsx|Input]]


====Forward only====
==Purpose==
*Graph (15)
*The purpose was to create the sixteen different variations of the input sheet created from dCIN5 Dahlquist-data to test various versions of GRNmap.  
** [[Media:22-genes 47-edges Dahlquist-data MM forward graph.xlsx| Input]]
*[https://github.com/kdahlquist/GRNmap/issues/74#issuecomment-108097323 Issue 74]
* No graph (16)
** [[Media:22-genes 47-edges Dahlquist-data MM forward no-graph.xlsx| Input]]


==Testing each GRNmap input==
==Results==  
*Follow the [[Dahlquist:Microarray Data Analysis Workflow| protocol]] described by Dr. Dahlquist to prepare the data and select a list of candidate transcription factors to test.  
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-0_graph (Run on Row 2 #4 at 11:51 am 2015-06-22)
=== Create the Input Excel Workbook for the Model ===
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 graph output.mat]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 graph plots.zip]]
**[[Media:22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-0_graph_optimization_diagnostic.jpg]]
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-0_no-graph (Run on Row 2 #3 at 1:22 pm 2015-06-22)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 no-graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-0 no-graph output.mat]]
**No graph
**[[Media:22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-0_no-graph_optimization_diagnostic.jpg]]
***Had to save manually '''BUG'''
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-1_graph (Run on Row 2 #2 at 1:37 pm 2015-06-22)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 graph output.mat]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 graph plots.zip]]
**[[Media:22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-1_graph_optimization_diagnostic.jpg]]
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-1_no-graph (Run on Row 2 #1 at 1:34 pm 2015-06-22)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 no-graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-0 fixP-1 no-graph output.mat]]
**No graph
**[[Media:22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-0_fixP-1_no-graph_optimization_diagnostic.jpg]]
***Had to save manually '''BUG'''
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-1_fixP-0_graph (Run on Row 1 #6 at 1:34 pm 2015-06-22)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 graph output.mat]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 graph plots.zip]]
**[[Media:22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-1_fixP-0_graph_optimization_diagnostic.jpg]]
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-1_fixP-0_no-graph (Run on Row 1 #5 at 1:45 pm 2015-06-22)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 no-graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-0 no-graph output.mat]]
**[[Media:22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-1_fixP-0_no-graph_optimization_diagnostic.jpg]]
***Had to save manually '''BUG'''
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-1_fixP-1_graph  (Run on Row 1 #4 at 1:50 pm 2015-06-22 CPU 7)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 graph output.mat]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 graph plots.zip]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 graph optimization diagnostic.jpg]]
*22-genes_47-edges_Dahlquist-data_Sigmoidal_estimation_fixb-1_fixP-1_no-graph (Run on Row 1 #4 at 2:07 pm 2015-06-22 CPU 6)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 no-graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal estimation fixb-1 fixP-1 no-graph output.mat]]
**No graph 
**optimization_diagnostic.jpeg did not save '''BUG'''
*22-genes_47-edges_Dahlquist-data_Sigmoidal_forward_graph (Run on Row 1 #4 at 2:19 pm 2015-06-22 CPU 7)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal forward graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal forward graph output.xlsx]]
**No output .mat file '''BUG'''
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal forward graph plots.zip]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal forward graph optimization diagnostic.jpg]]
**MATLAB error
  Error in output (line 140)
  outputDiag{3,2} = GRNstruct.GRNOutput.reg_out;
  Error in GRNmodel (line 34)
  GRNstruct = output(GRNstruct);
:*Repeated and got same error
*22-genes_47-edges_Dahlquist-data_Sigmoidal_forward_no-graph (Run on Row 1 #4 at 2:23 pm 2015-06-22 CPU 7)
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal forward no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data Sigmoidal forward no-graph output.xlsx]]
**No output .mat file '''BUG''' (''same MATLAB error as above'')
**No graph
**optimization_diagnostic.jpeg did not save '''BUG'''
*22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-1_graph (Run on Row 1 #4 2015-06-22 CPU 7)
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 graph output.mat]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 graph plots.zip]]
**[[Media:22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-1_graph_optimization_diagnostic.jpg]]
*22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-1_no-graph (Run on Row 1 #4 2015-06-22 CPU 6)
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 no-graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-1 no-graph output.mat]]
**No graph
**optimization_diagnostic.jpeg did not save '''BUG'''
*22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-0_graph (Run on Row 1 #4 2015-06-22 CPU 5)
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 graph output.mat]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 graph plots.zip]]
**[[Media:22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-0_graph_optimization_diagnostic.jpg]]
*22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-0_no-graph (Run on Row 1 #4 2015-06-22 CPU 4)
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 no-graph output.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM estimation fixP-0 no-graph output.mat]]
**[[Media:22-genes_47-edges_Dahlquist-data_MM_estimation_fixP-0_no-graph_optimization_diagnostic.jpg]]
***Had to save manually '''BUG'''
*22-genes_47-edges_Dahlquist-data_MM_forward_graph (Run on Row 1 #4 2015-06-22 CPU 3)
**[[Media:22-genes 47-edges Dahlquist-data MM forward graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM forward graph output.xlsx]]
**No output .mat file '''BUG''' (''same MATLAB error as above'')
**[[Media:22-genes 47-edges Dahlquist-data MM forward graph plots.zip]]
**optimization_diagnostic.jpg did not save correctly. It saved the plot of ACE2 instead.
*22-genes_47-edges_Dahlquist-data_MM_forward_no-graph (Run on Row 1 #4 2015-06-22 CPU 2)
**[[Media:22-genes 47-edges Dahlquist-data MM forward no-graph.xlsx]]
**[[Media:22-genes 47-edges Dahlquist-data MM forward no-graph output.xlsx]]
**No output .mat file '''BUG''' (''same MATLAB error as above'')
**No graph
**optimization_diagnostic.jpeg did not save '''BUG'''


# Your file will be similar to the file "21-genes_50-edges_Dahlquist-data_Sigmoid_estimation.xls", but with your expression data and network.  You should download this file, change the name, and edit it to include your data.  Make sure to give it a meaningful filename that includes your last name or initials.  [https://github.com/kdahlquist/GRNmap/blob/master/test_files/data_samples/21-genes_50-edges_Dahlquist-data_Sigmoid_estimation.xls?raw=true Click this link to download the sample file from the GRNmap GitHub repository.)]
==Discussion==
# The first thing you need to do is determine the transcription factors that you are including in your network.  You are going to use the "transposed" Regulation Matrix that you generated from YEASTRACT in the previous section.
====Explanation of the intended effect of changing each parameter====
#* Copy the transposed matrix from your "network" sheet and paste it into the worksheets called "network" and "network_weights".
* Sigmoid: =1 if sigmoidal model, =0 if Michaelis-Menten model
#* Note that the transcription factor names have to be in the same order and same format across the top row and first column.  CIN5 does not match Cin5p, so the latter will need to be changed to CIN5 if you have not already done so.
* estimateParams =1 if want to estimate parameters and =0 if the user wants to do just one forward run
#* It may be easier for you if you put the transcription factors in alphabetical order (using the sort feature in Excel), but whether you leave your list the same as it is from the YEASTRACT assignment or in alphabetical order, make sure it is the same order for all of the worksheets.
* makeGraphs =1 to output graphs; =0 to not output graphs
# The next worksheet to edit is the one called "degradation_rates".
* fix_P: =1 if the user does not want to estimate the production rate, P, parameter, use initial guess and never change; =0 to estimate
#* Paste your list of transcription factors from your "network" sheet into the column named "StandardName".  You will need to look up the "SystematicName" of your genes.  YEASTRACT has a feature that will allow you to paste your list of standard names in to retrieve the systematic names [http://www.yeastract.com/formorftogene.php here].
* fix_b: =1 if the user does not want to estimate the b parameter, use initial guess and never change; =0 to estimate
#* Next, you will need to look up the degradation rates for your list of transcription factors.  These rates have been calculated from protein half-life data from a paper by Belle et al. (2006).  Look up the rates for your transcription factors from [[Media:Belle_PNAS_06_degradation_rates_203_TFs.xls | this file]] and include them in your "degradation_rates" worksheet.
#* If a transcription factor does not appear in the file above, use the value "0.027182242" for the degradation rate.
# The next worksheet to edit is the one called "production_rates".
#* Paste the "SystematicName" and "StandardName" columns from your "degradation_rates" sheet into the "production_rates" sheet.
#* The initial guesses for the production rates we are using for the model are two times the degradation rate.  Compute these values from your degradation rates and paste the values into the column titled "ProductionRate".
# Next you will input the expression data for the wild type strain and one other strain (dcin5, dgln3, dhap4, dhmo1, dzap1, or spar; note that we can't use dswi4 because it only has 2 cold shock timepoints).  You need to include only the data for the genes in your network, in the same order as they appear in the other worksheets.
#* Put the wild type data in the sheet called "wt".
#* The sample spreadsheet has a worksheet named "dcin5".  Change this name to match the strain you are using (listed above).  The instructions below should be followed for each strain sheet.
#* Paste the SystematicName and StandardName columns from one of your previous sheets into this one.
#* This data in this sheet is the Log Fold Changes for each replicate and each timepoint from the "Rounded_Normalized_Data" worksheet from the big Excel workbook in which you computed the [[Dahlquist:Microarray_Data_Analysis_Workflow#Step_6:_Statistical_Analysis | statistics]].  We are only going to use the cold shock timepoints for the modeling.  Thus your column headings for the data should be "15", "30", and "60". There will be multiple columns for each timepoint (typically 4) to represent the replicate data, but they will all have the same name.  For example, you may have four columns with the header "15".
#* Copy and paste the data from your spreadsheet into this one.  You need to include only the data for the genes in your network.  Make sure that the genes are in the same order as in the other sheets.
# The "optimization_parameters" worksheet should have the following values:
#* alpha should be 0.01
#* kk_max should be 1
#* MaxIter should be 1e08 (one hundred million in plain English)
#* TolFun should be 1e-6
#* MaxFunEval should be 1e08 (one hundred million in plain English)
#* TolX should be 1e-6
#* Sigmoid should be 1
#* estimateParams should be 1
#* makeGraphs should be 1
#* fix_P should be 0
#* fix_b should be 1
#* For the parameter "time" (Cell A13), we should have "15", "30", and "60", since these are the timepoints we have in our data.
#* For the parameter "Strain" (Cell A14), replace "dcin5" with the name of the second strain you are using, making sure that the capitalizaiton and spelling is the same as what you named the worksheet containing that strain's expression data.  We are only going to compare two strains, so you can delete the other strain information.
#* For the parameter "Sheet" (Cell A15), give the number of the worksheet from left to right that your "Strain" log2 expression data is in.  Delete any extra numbers because we are only comparing two strains.
# For the parameter "Deletion", leave the zero in cell B15 (corresponding to wt).  In cell C15, put a number corresponding to the position in the list of gene names that the gene that was deleted appears.  In the sample file, CIN5 is number 3 in the list.  Note, disregard the column header in this count and only consider the actual gene names themselves.
#* For the parameter, "simtime", you perform the forward simulation of the expression in five minute increments from 0 to 60 minutes.  Thus, this row should read: simtime should be 0, 5, <...fill by steps of 5...>, 60, each number in a different cell.
# The last sheet you will need to modify is called "network_b".
#* Paste in the list of standard names for your transcription factors from one of your previous sheets.  Note that this sheet does not have a column for the Systematic Name.
#* Cell A1 in the sample files has the text "rows genes affected/cols genes controlling".  I believe you can either have this text in cell A1 or "StandardName".
#* The "threshold" value for each gene should be "0".
# When you have completed the modifications to your file, upload it to [http://lionshare.lmu.edu LionShare] and send Dr. Dahlquist an e-mail with a link to the file.


Running GRNmap
====Observations from test====
'''May 26, 2015'''
*Setting "makeGraphs" to 1 yielded graphs and when it was set to 0 it did not produce graphs, indicating that the "makeGraphs" command is working properly.
*I did not complete the comparison yet, but so far it looks like there is no change in the production rates for any of the sixteen.
*I only completed comparing the weights between (1) and (2), but they looked identical, which further shows that the "makeGraphs" command is working correctly.
*Tomorrow finish adding in the production rates and comparing the weights.
*Comparing the weights proves to be tedious. The easiest method I found was to copy the matrix and transpose it, copy each column into one master column, create a "master list", sort the weights column by number (smallest to largest), delete the zeros, then sort back into the order of the master list. Then make the list of the Controller ---> target, which is also tedious, however, it does make it easy to double check that the previous step was done correctly. There may be a more efficient way of performing this step.
'''May 27, 2015'''
*[[Media:20150527 GRNmap Test TAR TM.pptx| Preliminary Results]] are shown in the presentation.
*[[Media:20150527 GRNmap Test TAR TM.xlsx| Excel document with analysis]]
*The difference was taken between "Graph" and "No graph" values for the b, production rates, and weights were calculated. Overall there was little to no difference in the values between "Graph" and "No graph." The "Graph" function produced plots each time and the "No graph" did not indicating it was working properly.
*The differences between Sigmoidal and Michaelis Menten could be seen when looking at a bar graph comparing their weights. They seemed to follow the overall shape with the exception of MIG2-->CIN5, MIG2-->MSN2, MSN2-->GCR2, AND MSN2-->HMO1, where there was a change in sign.
**Small magnitude sign changes: ARG80-->MSN2, FKH2-->HMO1, PDR1-->MSN2,SFP1--> MSN2, YHP1-->MSN2
*It was worth noting that there was primarily positive weights for the Michaelis Menten, with the exception of a few negative weights with very small magnitude ( in between 0 and -0.5)
*When b was fixed (b=1), there was a difference in the network_b values for GLN3 and ZAP1 for the Sigmoidal estimation fixb-1 & fixP-0, Sigmoidal fixb-1 & fixP-1, and Sigmoidal forward only. There was no output for setting b=0, which indicates that fixb-1 is estimating and fixb-0 is fixing b.
*The production rates for "fixP-1" were the same as the production rates in the input sheet, meaning that P was fixed and not estimated, which was the intended function.
*There was no production rate produced for "fixP-0" which should have produced results where the production rates were estimated.
'''May 28, 2015'''
*Format plots of the weight to only show one controller. Make sure that the scale is from -3 to 3 when the min and max and less than the absolute value of 3 and -6 to 6 when not.
*Generate GRNsight maps of each output. Make sure that the nodes are in approximately the same position so the maps can be easily compared.
*[[Media:20150527 GRNmap Test TAR TM.pptx| Final Presentation]]
*[[Media:20150527 GRNmap Test TAR TM.xlsx| Final Excel document]]
*The MM_estimation_fixP-1_graph had no repression (cyan lines) only activation (magenta) and no significance (gray)
*The MM_estimation_fixP-0_graph had four cyan lines, which is still less than all of the Sigmoidal GRNsight maps 
*The CIN5 to MIG2 relationship went from MIG2 being strongly repressed (Sigmoid_estimation_fixb-1_fixP-1_graph) to strongly activated (MM_estimation_fixP-0_graph), and was also shown to be insignificant
*The MSN2 to MIG2 relationship: MIG 2 was strongly activated in all models except for the Sigmoidal_estimation_fixb-0_fixP-0_graph_output GRNsight where it was strongly repressed
*HMO1 to MSN2 also changed from MSN2 being repressed (Sigmoidal_estimation_fixb-0_fixP-0_graph, Sigmoidal_estimation_fixb-1_fixP-1_graph, Sigmoid_estimation_fixb-1_fixP-1_graph) to activated (Sigmoidal_estimation_fixb-0_fixP-0_graph).
*GCR2 to MSN2: MSN2 was activated in all cases except Sigmoidal_estimation_fixb-0_fixP-0_graph


You will now finally run the GRNmap model on the input workbook you created above.  You will run the optimization twice; once where the threshold parameters, b, are '''not''' estimated and once where the threshold parameters ''''are''' estimated.  You will compare the estimated weight and production rate parameters outputted by these two runs with each other.
[[Category:Dahlquist Lab]]
 
[[Category: GRNmap]]
# Download the current version of GRNmap from GitHub.  Version 1.0.6 can be downloaded by following this [https://github.com/kdahlquist/GRNmap/archive/v1.0.6.zip link]
#* For the sake of organization, save it into a new folder called "GRNmap" either on your Desktop or within your "Microarray Analysis" folder.
#* Unzip the file by right-clicking on it and choosing 7-zip > Extract here.
# Open the "GRNmap-1.0.6" folder and open the "matlab" subfolder.  Double-click on the file "GRNmodel.m" to open GRNmap in MATLAB 2014b.
# Click on the green triangle "Run" button to run the model.
#* You will be prompted by an Open dialog to find your input file that you created in the previous section.  Browse and select this input file and click OK.
#* Note that the Open dialog will default to show files of <code>*.xlsx</code> only.  If your file is saved as <code>*.xls</code>, you will need to select the drop-down menu to show all files.
#* A window called "Figure 1" will appear.  The counter is showing the number of iterations of the least squares optimization algorithm.  The top plot is showing the values of all the parameters being estimated.  You should see some movement of the diamonds each time the counter iterates.
# Once the model has completed its run, plots showing the expression over time for all of the genes in the network will appear.  Since we selected "makeGraphs = 1" these will automatically be saved as <code>*.jpg</code> files in the same folder as your input file.  Compile the figures into a single PowerPoint file. Please label things clearly, placing an appropriate number of graphs on each page for a readable visual.  Take some care to make sure that the graphs are the same size and the aspect ratio has not been changed. <!--maybe suggest to put graphs for the same gene side by side-->
# Create a new workbook for analyzing the weight data.  In this workbook, create a new sheet: call it estimated_weights. In this new worksheet, create a column of labels of the form ControllerGeneA -> TargetGeneB, replacing these generic names with the standard gene names for each regulatory pair in your network. Remember that columns represent Controllers and rows represent Targets in your network and network_weights sheets.
# Extract the non-zero optimized weights from their worksheet and put them in a single column next to the corresponding ControllerGeneA -> TargetGeneB label.
# Now we will run the model a second time, this time estimating the threshold parameters, b.  Save the input workbook that you previously created as a new file with a meaningful name (e.g. append "estimate-b" to the previous filename), and change fix_b to 0 in the "optimization_parameters" worksheet, so that the thresholds will be estimated. Rerun GRNmodel with the new input sheet.
# Repeat Parts (4) through (6) with the new output.
# Create an empty excel workbook, and copy both sets of weights into a worksheet.
# Create a bar chart in order to compare the "fixed b" and "estimated b" weights.
# Create bar charts to compare the production rates from each run.
# Copy the two bar charts into your powerpoint.
# Visualize the output of each of your model runs with GRNsight.
#* In order for this to work, you need to alter your output workbook slightly.  You need to change the name of the sheet called "out_network_optimized_weights" to "network_optimized_weights"; i.e., delete the "out_" from that sheet name.
#* Arrange the genes in the same order you used to display them previously when you visualized the networks from YEASTRACT for both of your model output runs.  Take a screenshot of each of the results and paste it into your PowerPoint presentation.  Clearly label which screenshot belongs to which run.
#* Note that GRNsight will display differently now that you have estimated the weights.  For positive weights > 0, the edge will be given a regular (pointy) arrowhead to indicate an activation relationship between the two nodes. For negative weights < 0, the edge will be given a blunt arrowhead (a line segment perpendicular to the edge direction) to indicate a repression relationship between the two nodes. The thickness of the edge will vary based on the magnitude of the absolute value of the weight. Larger magnitudes will have thicker edges and smaller magnitudes will have thinner edges. The way that GRNsight determines the edge thickness is as follows. GRNsight divides all weight values by the absolute value of the maximum weight in the matrix to normalize all the values to between zero and 1. GRNsight then adjusts the thickness of the lines to vary continuously from the minimum thickness (for normalized weights near zero) to maximum thickness (normalized weights of 1). The color of the edge also imparts information about the regulatory relationship. Edges with positive normalized weight values from 0.05 to 1 are colored magenta; edges with negative normalized weight values from -0.05 to -1 are colored cyan. Edges with normalized weight values between -0.05 and 0.05 are colored grey to emphasize that their normalized magnitude is near zero and that they have a weak influence on the target gene.
# Upload your PowerPoint, your two input workbooks, and your two output workbooks and link to them in your individual journal.  Also upload the workbook where you made the bar charts comparing the weights from both runs.
#* Interpret the results of the model simulation. 
#** Examine the graphs that were output by each of the runs.  Which genes in the model have the closest fit between the model data and actual data?  Which genes have the worst fit between the model and actual data?  Why do you think that is?  (Hint: how many inputs do these genes have?)  How does this help you to interpret the microarray data? 
#** Which genes showed the largest dynamics over the timecourse?  In other words, which genes had a log fold change that is different than zero at one or more timepoints.  The  p values from the [[BIOL398-04/S15:Week 11 | Week 11]] ANOVA analysis are informative here.  Does this seem to have an effect on the goodness of fit (see question above)?
#** Which genes showed differences in dynamics between the wild type and the other strain your group is using? Does the model adequately capture these differences?  Given the connections in your network (see the visualization in GRNsight), does this make sense? Why or why not?
#** Examine the bar charts comparing the weights and production rates between the two runs. Were there any major differences between the two runs? Why do you think that was? Given the connections in your network (see the visualization in GRNsight), does this make sense? Why or why not?
#** Finally, based on the results of your entire project, which transcription factors are most likely to regulate the cold shock response and why?
#* Based on these results, what future directions do you want to take?
 
 
{{Template:Tessa A. Morris ELN}}

Latest revision as of 09:19, 23 June 2015

Test Conditions

  • Date started: 2015-05-26
  • Test Performed by: Tessa A. Morris, Electronic Notebook
  • Code Version: GRNmap version 1.0.6
  • MATLAB Version: 2014b
  • Computer on which the model was run: Row 1 #5, #6; Row 2 #4, #5, #6; Row 3 #1

Redone on 2015-06-03 to fix the naming convention and the input sheet

  • Date started: 2015-06-08
  • Test Performed by: Tessa A. Morris, Electronic Notebook
  • Code Version: GRNmap-beta (9:17 am 2015-06-08)
  • MATLAB Version: 2014b
  • Computer on which the model was run: Row 2 #4

Update alpha and optimization parameters

  • Date started: 2015-06-22
  • Test Performed by: Tessa A. Morris, Electronic Notebook
  • Code Version: GRNmap-beta (10:28 am 2015-06-22)
  • MATLAB Version: 2014b
  • The computer each input sheet was run on is noted in the results section.

Purpose

  • The purpose was to create the sixteen different variations of the input sheet created from dCIN5 Dahlquist-data to test various versions of GRNmap.
  • Issue 74

Results

 Error in output (line 140)
 outputDiag{3,2} = GRNstruct.GRNOutput.reg_out;
 Error in GRNmodel (line 34)
 GRNstruct = output(GRNstruct);
  • Repeated and got same error

Discussion

Explanation of the intended effect of changing each parameter

  • Sigmoid: =1 if sigmoidal model, =0 if Michaelis-Menten model
  • estimateParams =1 if want to estimate parameters and =0 if the user wants to do just one forward run
  • makeGraphs =1 to output graphs; =0 to not output graphs
  • fix_P: =1 if the user does not want to estimate the production rate, P, parameter, use initial guess and never change; =0 to estimate
  • fix_b: =1 if the user does not want to estimate the b parameter, use initial guess and never change; =0 to estimate

Observations from test

May 26, 2015

  • Setting "makeGraphs" to 1 yielded graphs and when it was set to 0 it did not produce graphs, indicating that the "makeGraphs" command is working properly.
  • I did not complete the comparison yet, but so far it looks like there is no change in the production rates for any of the sixteen.
  • I only completed comparing the weights between (1) and (2), but they looked identical, which further shows that the "makeGraphs" command is working correctly.
  • Tomorrow finish adding in the production rates and comparing the weights.
  • Comparing the weights proves to be tedious. The easiest method I found was to copy the matrix and transpose it, copy each column into one master column, create a "master list", sort the weights column by number (smallest to largest), delete the zeros, then sort back into the order of the master list. Then make the list of the Controller ---> target, which is also tedious, however, it does make it easy to double check that the previous step was done correctly. There may be a more efficient way of performing this step.

May 27, 2015

  • Preliminary Results are shown in the presentation.
  • Excel document with analysis
  • The difference was taken between "Graph" and "No graph" values for the b, production rates, and weights were calculated. Overall there was little to no difference in the values between "Graph" and "No graph." The "Graph" function produced plots each time and the "No graph" did not indicating it was working properly.
  • The differences between Sigmoidal and Michaelis Menten could be seen when looking at a bar graph comparing their weights. They seemed to follow the overall shape with the exception of MIG2-->CIN5, MIG2-->MSN2, MSN2-->GCR2, AND MSN2-->HMO1, where there was a change in sign.
    • Small magnitude sign changes: ARG80-->MSN2, FKH2-->HMO1, PDR1-->MSN2,SFP1--> MSN2, YHP1-->MSN2
  • It was worth noting that there was primarily positive weights for the Michaelis Menten, with the exception of a few negative weights with very small magnitude ( in between 0 and -0.5)
  • When b was fixed (b=1), there was a difference in the network_b values for GLN3 and ZAP1 for the Sigmoidal estimation fixb-1 & fixP-0, Sigmoidal fixb-1 & fixP-1, and Sigmoidal forward only. There was no output for setting b=0, which indicates that fixb-1 is estimating and fixb-0 is fixing b.
  • The production rates for "fixP-1" were the same as the production rates in the input sheet, meaning that P was fixed and not estimated, which was the intended function.
  • There was no production rate produced for "fixP-0" which should have produced results where the production rates were estimated.

May 28, 2015

  • Format plots of the weight to only show one controller. Make sure that the scale is from -3 to 3 when the min and max and less than the absolute value of 3 and -6 to 6 when not.
  • Generate GRNsight maps of each output. Make sure that the nodes are in approximately the same position so the maps can be easily compared.
  • Final Presentation
  • Final Excel document
  • The MM_estimation_fixP-1_graph had no repression (cyan lines) only activation (magenta) and no significance (gray)
  • The MM_estimation_fixP-0_graph had four cyan lines, which is still less than all of the Sigmoidal GRNsight maps
  • The CIN5 to MIG2 relationship went from MIG2 being strongly repressed (Sigmoid_estimation_fixb-1_fixP-1_graph) to strongly activated (MM_estimation_fixP-0_graph), and was also shown to be insignificant
  • The MSN2 to MIG2 relationship: MIG 2 was strongly activated in all models except for the Sigmoidal_estimation_fixb-0_fixP-0_graph_output GRNsight where it was strongly repressed
  • HMO1 to MSN2 also changed from MSN2 being repressed (Sigmoidal_estimation_fixb-0_fixP-0_graph, Sigmoidal_estimation_fixb-1_fixP-1_graph, Sigmoid_estimation_fixb-1_fixP-1_graph) to activated (Sigmoidal_estimation_fixb-0_fixP-0_graph).
  • GCR2 to MSN2: MSN2 was activated in all cases except Sigmoidal_estimation_fixb-0_fixP-0_graph