Biomod/2011/Caltech/DeoxyriboNucleicAwesome/Random Walk Formula
From OpenWetWare
(→General Modeling Idea) 
Current revision (09:20, 2 November 2011) (view source) (→Cumulative Function) 

(42 intermediate revisions not shown.)  
Line 1:  Line 1:  
{{Template:DeoxyriboNucleicAwesomeHeader}}  {{Template:DeoxyriboNucleicAwesomeHeader}}  
  
__TOC__  __TOC__  
  ==  +  =Random Walk Formula= 
  The random walk on DNA origami can be modeled as one dimensional random walk with a reflecting and an absorbing barrier (Figure 1). Tracks in the same column are grouped into rectangles, and each step is defined as walking from one rectangle to an adjacent one.  +  
+  ==Modeling Idea==  
+  The random walk on DNA origami can be modeled as one dimensional random walk with a reflecting and an absorbing barrier (Figure 1). Tracks in the same column are grouped into rectangles, and each step is defined as walking from one rectangle to an adjacent one. We are interested in expressing the probability of reaching the walker goal as a function of the number of steps taken.  
Line 12:  Line 13:  
[[Image:RW.bmpthumbcenter800pxFigure 2. A line segment with N+1 distinct sites. The walker starts the random walk at i and is reflected at N with a probability of β. The random walk ends once it reaches site 0. ]]  [[Image:RW.bmpthumbcenter800pxFigure 2. A line segment with N+1 distinct sites. The walker starts the random walk at i and is reflected at N with a probability of β. The random walk ends once it reaches site 0. ]]  
+  ==General Approach==  
+  Two assumptions are made in our case. 1) The DNA origami is immune to any free floating walkers in solution, meaning that free floating walkers cannot bind to an origami and starts random walking; 2) walkers are immediately absorbed when reaching the rectangles with WGs, despite the presence of two TR2 in the same rectangle.  
+  
+  Let <math> h(t;i)\!</math> be the probability that the walker reaches 0 for the first time after <math>t\!</math> steps given its starting position being <math>i\!</math>. <math>h(t;i)\!</math> obeys the following difference equation  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>h(t;i) = q·h(t1;i1) + r·h(t1;i) + p·h(t1; i + 1)\!</math></div>  
+  
+  for <math>t = 1,2,3,...\!</math> and <math>i = 1,2,3,...,N1\!</math>. We define <math>h(t;0)= 1\!</math> <math>if\!</math> <math>t = 0\!</math>; <math>h(t;0)= 0\!</math> <math>if\!</math> <math>t > 0\!</math>. Also, <math>h(t;i) = 0\!</math> <math>for\!</math> <math>t < i\!</math>.  
+  
+  When <math>i = N\!</math> we have  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>h(t;N) = αh(t1;N)+ βh(t1; N1) \!</math></div>  
+  
+  The generating function for <math>h(t;i)\!</math> can be expressed as  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>H_i (s) = \sum_{t=0}^\infty h(t;1)s^t,s <1 \!</math></div>  
+  
+  Following Netus (1963), the explicit expression of the generating function is  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>H_i (s) =\begin{cases} \frac {q^i s^i T_i (s)} {T_0 (s)}, 0\leqslant i < N\\  
+  \frac {\beta q^{N1} s^N (\lambda_1  \lambda_2)} {T_0 (s)},i=N\end{cases}\!</math></div>  
+  
+  where  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>T_i (s) = (1\alpha s)(\lambda_1 ^{Ni}\lambda_2 ^{Ni})\beta p s^2 (\lambda_1 ^{Ni1}\lambda_2 ^{Ni1})\!</math></div>  
+  
+  and  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>\lambda_{1,2} = \frac {1} {2} ( \pm \sqrt{{(1rs)}^24 p q s^2} +1rs ).\!</math></div>  
+  
+  The explicit expression of <math>h(t;i)\!</math> can thus be deduced from <math>H_i (s)\!</math> using partial fraction expansion (Feller, 1971).  
+  
+  Assume that <math>T_0 (s)\!</math> has <math>k\!</math> distinct roots <math>s_1,s_2,..., s_k . H_i (s)\!</math> can then be decomposed into partial fractions  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math> H_i (s) = \frac {\rho_1} {ss_1} + \frac{\rho_2}{ss_2} +...+ \frac {\rho_k}{ss_k} \!</math></div>  
+  
+  where  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math> \rho_k = \frac {q^i {s_m}^i T_i (S_m)}{{T_0}^' (s_m)}, m\leqslant k \!</math></div>  
+  
+  It follows that  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>h(t;i) = \sum_{m=1}^k \frac {\rho_m}{{s_m}^{t+1}} \!</math></div>  
+  
+  <math>h(t;N)\!</math> can be similarly deduced from <math>H_N (s)\!</math> using the same method.  
+  
+  ==Parameter Estimation==  
+  Possible positions of walkers on the origami can be grouped into two categories, either in the center of rectangles (Figure 1, Line 2) or at the sides (Figure 1, Lines 1&3), and the values of <math>p\!</math>, <math>q\!</math> and <math>r\!</math> are different in each category. Hence there is a need to find out the distribution of walkers in these two categories in order to estimate <math>p\!</math>, <math>q\!</math> and <math>r\!</math>. Assuming no reflecting boundaries, when the walker is in the center (Figure 1, Line 2), the probability of staying in Line 2 after one branch migration is <math>\frac {1}{2}\!</math>, while the probability of going to Lines 1 or 3 is <math>\frac{1}{2}\!</math>. Similarly, when it is in Lines 1 or 3, the probability of staying in the same line after one branch migration is <math>\frac{2}{3}\!</math> while the probability of going to Line 2 is <math>\frac{1}{3}\!</math>. Here we assume no reflecting barrier in the system. The distribution of relative positions of walkers can be modeled using Markov chain as follows.  
+  
+  Let <math>v(t) = (a, 1a)\!</math> denote the proportion of walkers in Line 2 <math>(a)</math> and Lines 1&3 <math>(1a)</math> after <math>t\!</math> branch migrations. Since all the walkers are initially planted in the center, <math>v(0) = (1, 0)\!</math>. Define the transition matrix  
+  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math> M = \begin{pmatrix}  
+  1/2 & 1/2 \\  
+  1/3 & 2/3  
+  \end{pmatrix}. \!</math></div>  
+  
+  It follows that  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math> v(t)=v(0). \underbrace{M.M.M...M}_{t} \!</math></div>  
+  
+  It was found that <math>\lim_{t \to \infty} v(t)= (0.4,0.6).</math>Hence at the stable state, 40% of the walkers are in Line 2 while 60% are in Lines 1 or 3. Since <math>v(4) = v(4) = (0.40007,0.59993)\!</math>, the distribution of relative positions of walkers on the origami stabilizes after approximately 4 branch migrations.  
+  
+  The overall probabilities <math>p, q, r, \alpha,\beta\!</math> can thus be calculated (Table 1).  
+  
+  
+  { class="wikitable" border="1" cellpadding="2" style="textalign: center"  
+  + Table 1. Calculation of overall probabilities used in random walk modeling.  
+  !width="160"  
+  !width="120"<div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>p\!</math></div>  
+  !width="120"<math>q\!</math>  
+  !width="120"<math>r\!</math>  
+  !width="120"<math>\alpha\!</math>  
+  !width="120"<math>\beta\!</math>  
+    
+   Walkers at the center (Line 2) (40%)  <math>\frac{1}{4}</math> <math>\frac{1}{4}</math> <math>\frac{1}{2}</math> <math>\frac{2}{3}</math> <math>\frac{1}{3}</math>  
+  
+    
+   Walkers at the sides (Lines 1 and 3) (60%)  <math>\frac{1}{3}</math> <math>\frac{1}{3}</math> <math>\frac{1}{3}</math> <math>\frac{1}{2}</math> <math>\frac{1}{2}</math>  
+  
+    
+   Overall Probability <math>\frac{3}{10}</math> <math>\frac{3}{10}</math> <math>\frac{2}{5}</math> <math>\frac{17}{30}</math> <math>\frac{13}{30}</math>  
+  
+  }  
+  
+  
+  Determination of the overall probabilities allow us to derive the explicit probability distribution functions for SP10, 22 and 34.  
+  
+  ==Explicit Expression of Formulae==  
+  The explicit expression for the probability of reaching the walker goal (h(t;i)) as a function of the number of steps taken (t) can then be deduced using the general approach and the estimated parameters.  
+  
+  <math>SP10: h(t;12)= \begin{cases} 0, 0\leqslant t < 12\\  
+  \frac{8.4287}{19.6975^{t+1}}  \frac{0.6389}{7.5119^{t+1}} + \frac{0.0938}{5.4569^{t+1}} + \frac{0.0064}{1.0050^{t+1}}  \frac{0.0202}{1.0463^{t+1}} + \frac{0.0378}{1.1365^{t+1}} \\ \quad  \frac{0.0634}{1.2945^{t+1}} + \frac{0.1058}{1.5602^{t+1}}  \frac{0.1876}{2.0242^{t+1}} + \frac{0.3811}{2.9273^{t+1}}  \frac{1.0555}{5.1593^{t+1}} + \frac{8.7388}{16.5916^{t+1}}  
+  ,t \geqslant 12\end{cases}</math>  
+  
+  
+  <math>SP22: h(t;8)= \begin{cases} 0, 0\leqslant t < 8\\  
+  \frac{7.6854}{16.0673^{t+1}} + \frac{0.3265}{6.0382^{t+1}} +\frac{0.0143}{1.0111^{t+1}} \frac{0.0484}{1.1067^{t+1}} +\frac{0.1051}{1.3413^{t+1}} \frac{0.2323}{1.8564^{t+1}} +\frac{0.6577}{3.1862^{t+1}} \frac{4.7287}{9.6237^{t+1}}  
+  ,t \geqslant 8\end{cases}</math>  
+  
+  
+  <math>SP34: h(t;4)= \begin{cases} 0, 0\leqslant t < 4\\  
+  
+  \frac{5.4999}{11.0280^{t+1}} + \frac{0.05677}{1.0434^{t+1}}  \frac{0.2816}{1.5107^{t+1}} + \frac{2.0738}{4.2196^{t+1}}  
+  ,t \geqslant 4\end{cases}</math>  
+  
+  == Cumulative Function==  
+  The cumulative density function of <math>h(t;i)\!</math> can be computed and used to fit with SPEX experimental data.  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math> C(t;i)= \sum_{t=0}^\infty h(t;i)\!</math></div>  
+  
+  A plot of <math>C(t;i)\!</math> verses <math>t\!</math> can be used as a good model to fit the experimental data (Figure 3), where <math>C(t;i)</math> is the proportion of walkers that reach the walker goal and <math>t\!</math> is the number of steps needed. It can be found that the number of steps needed for half of the walkers to reach WG decreases as we move SP nearer to the WG.  
+  
+  [[Image:Cumulative_RW.PNGthumbcenter800pxFigure 3. A plot of the cumulative proportion of walkers reaching the walker goal versus number of steps. Blue, SP10. Green, SP22. Red, SP34. SP10 is the longest track, followed by SP22, while SP34 is the shortest.]]  
+  
+  ==Number of Branch Migrations per Step==  
+  To further study the validity of this model, the number of branch migrations per step was calculated in order to obtain the average rate of branch migration.  
+  Let <math>P(x)\!</math> denote the probability of walking to the adjacent column after <math>x\!</math> branch migrations.  
+  
+  For walkers at the center line(Line 2),  
+  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>P(x) =\begin{cases} \frac {1} {2} \times ({\frac{2}{3}})^{\frac{x}{2}} \times ({\frac{1}{4}})^{\frac{x2}{2}}, x=2,4,6,8,...\\  
+  \frac {1}{2} \times ({\frac {2}{3}})^{\frac{x1}{2}} \times ({\frac{1}{4}})^{\frac {x1}{2}},x=1,3,5,7,...\end{cases} \!</math></div>  
+  
+  
+  For walkers at the sides (Lines 1 or 3),  
+  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>P(x) =\begin{cases} \frac {2} {3} \times ({\frac{1}{4}})^{\frac{x}{2}} \times ({\frac{2}{3}})^{\frac{x2}{2}}, x=2,4,6,8,...\\  
+  \frac {2}{3} \times ({\frac {1}{4}})^{\frac{x1}{2}} \times ({\frac{2}{3}})^{\frac {x1}{2}},x=1,3,5,7,...\end{cases} \!</math></div>  
+  
+  
+  The expected number of branch migrations in each case per step can hence be calculated as  
+  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>\sum_{x=1}^\infty xP(x),x=1,2,3,4,... \!</math></div>  
+  
+  It follows that the expected number of branch migrations per step is <math>\frac{9}{5}\!</math> and <math>\frac{8}{5}</math> for walkers at the center and sides, respectively.  
+  Taking into account that 40% of walkers are in Line 2 while 60% are in Lines 1 and 3, the overall number of branch migrations per step is thus  
+  <div class="center" style="width:auto; marginleft:auto; marginright:auto;"><math>40% \times \frac{9}{5} + 60% \times \frac{8}{5} = 1.68</math></div>  
+  =References=  
+  *Ahmed ElShehawy (1992). On absorption probabilities for a random walk between two different barriers. ''Annals De La Faculte Des Sciences De Toulouse, 1''(1), 95103.  
+  *Feller, W. (1971). ''An introduction to probability theory and its applications''.  
+  *Netus, M. (1963). Absorption probabilities for a random walk between a reflecting and an absorbing barrier. ''Bull. Soc. Math. Belgique, 15'', 253258.  
{{Template:DeoxyriboNucleicAwesomeFooter}}  {{Template:DeoxyriboNucleicAwesomeFooter}} 
Current revision
Sunday, October 23, 2016

Random Walk FormulaModeling IdeaThe random walk on DNA origami can be modeled as one dimensional random walk with a reflecting and an absorbing barrier (Figure 1). Tracks in the same column are grouped into rectangles, and each step is defined as walking from one rectangle to an adjacent one. We are interested in expressing the probability of reaching the walker goal as a function of the number of steps taken.
Consider a random walk on a line segment with N+1 sites denoted by integers (0,1,2, … , N) (Figure 2). The walker starts random walk at site i, 0 < i ≤ N. Let p be the probability for the walker to move one segment to the left, q be the probability for the walker to move one segment to the right. The probability for the walker to stay at a particular site for the next unit time is thus r = 1 – p – q. When the walker reaches site N, the partially reflecting barrier, it has a probability of β to be reflected back to site N – 1, and a probability of α = 1 – β to stay at site N in the next unit time. When reaching site 0, the absorbing barrier, it stays there for 100% probability and the random walk ends. General ApproachTwo assumptions are made in our case. 1) The DNA origami is immune to any free floating walkers in solution, meaning that free floating walkers cannot bind to an origami and starts random walking; 2) walkers are immediately absorbed when reaching the rectangles with WGs, despite the presence of two TR2 in the same rectangle. Let be the probability that the walker reaches 0 for the first time after steps given its starting position being . obeys the following difference equation for and . We define ; . Also, . When we have The generating function for can be expressed as Following Netus (1963), the explicit expression of the generating function is where and The explicit expression of can thus be deduced from using partial fraction expansion (Feller, 1971). Assume that has distinct roots can then be decomposed into partial fractions where It follows that can be similarly deduced from using the same method. Parameter EstimationPossible positions of walkers on the origami can be grouped into two categories, either in the center of rectangles (Figure 1, Line 2) or at the sides (Figure 1, Lines 1&3), and the values of , and are different in each category. Hence there is a need to find out the distribution of walkers in these two categories in order to estimate , and . Assuming no reflecting boundaries, when the walker is in the center (Figure 1, Line 2), the probability of staying in Line 2 after one branch migration is , while the probability of going to Lines 1 or 3 is . Similarly, when it is in Lines 1 or 3, the probability of staying in the same line after one branch migration is while the probability of going to Line 2 is . Here we assume no reflecting barrier in the system. The distribution of relative positions of walkers can be modeled using Markov chain as follows. Let denote the proportion of walkers in Line 2 (a) and Lines 1&3 (1 − a) after branch migrations. Since all the walkers are initially planted in the center, . Define the transition matrix It follows that It was found that Hence at the stable state, 40% of the walkers are in Line 2 while 60% are in Lines 1 or 3. Since , the distribution of relative positions of walkers on the origami stabilizes after approximately 4 branch migrations. The overall probabilities can thus be calculated (Table 1).
Explicit Expression of FormulaeThe explicit expression for the probability of reaching the walker goal (h(t;i)) as a function of the number of steps taken (t) can then be deduced using the general approach and the estimated parameters.
Cumulative FunctionThe cumulative density function of can be computed and used to fit with SPEX experimental data. A plot of verses can be used as a good model to fit the experimental data (Figure 3), where C(t;i) is the proportion of walkers that reach the walker goal and is the number of steps needed. It can be found that the number of steps needed for half of the walkers to reach WG decreases as we move SP nearer to the WG. Number of Branch Migrations per StepTo further study the validity of this model, the number of branch migrations per step was calculated in order to obtain the average rate of branch migration. Let denote the probability of walking to the adjacent column after branch migrations. For walkers at the center line(Line 2),
It follows that the expected number of branch migrations per step is and for walkers at the center and sides, respectively. Taking into account that 40% of walkers are in Line 2 while 60% are in Lines 1 and 3, the overall number of branch migrations per step is thus References
