Previous Chapter: THE ENERGETICS OF A STATE
Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.

Page 187

identity of the base pair. In practice, a >> bGC> bAT.Because the initiation free energy a is large, low-energy states tend to have only a small number of runs of strand separation. Because bGC is larger than bAT, these runs tend to be in A+T-rich regions. The free energy of interstrand twisting within separated regions is quadratic in the local helicity of the deformation, with coefficient denoted by C. The free energy of residual superhelicity has been measured experimentally to be quadratic in that deformation, with coefficient K. Combining these contributions (and allowing the interstrand twisting to equilibrate with the residual superhelicity), the free energy G of a state is found to depend on three parameters: the number n of separated base pairs, the number nATof these that are A·Ts, and the number r of runs of separation:

image                   (7.6)

The energy parameters in this expression, a, bAT, bGC, C, and K, all depend on environmental conditions such as salt concentration and temperature. The values of the b's are known experimentally under a wide variety of conditions (Marmur and Doty, 1962; Schildkraut and Lifson, 1968). However, values for the other parameters are not so well understood. These parameters must be evaluated before the methods can yield quantitatively accurate results. We will do this by fitting these parameters to actual experimental data.

Analysis of Superhelical Equilibria

To calculate the equilibrium strand separation behavior of superhelical DNA molecules, we proceed as follows. First, the DNA sequence is analyzed and key information needed for later stages is stored. This step need be done only once per sequence. Next, the linking difference a and environmental conditions are specified, which sets the energy parameters and determines the free energy associated with each state. The state having minimum free energy under the given conditions

Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.

Page 188

is found from the free energy expression and the sequence data. Then an energy threshold q is specified, and all states i are found that have free energy exceeding the minimum Gmin by no more than this threshold amount. Three inequalities occur, one each for n, nAT, and r. Together the satisfaction of all three inequalities provides necessary and sufficient conditions that a state satisfy the energy threshold condition. For every set of values n, nAT, and r satisfying these inequalities, all states with these values are found from the sequence information. This is a very complex computational task. The number of states involved grows approximately exponentially with the threshold q. In cases where r > 1, care must be taken to verify that a collection of r runs having the requisite total length and A+T-richness neither overlap nor abut, but rather are distinct. An approximate partition function Zcal is computed from this collection of low-energy states to be

image                                         (7.7)

By focusing only on the low-energy states, approximate ensemble average (that is, equilibrium) values are computed for all parameters of interest. These may include the expected torsional deformation of the strand-separated regions, expected numbers of separated base pairs, of separated A·T pairs, and of runs of separation, the ensemble average free energy image, and the residual superhelicity.

The most informative quantities regarding the behavior of the molecule are its destabilization and transition profiles. The transition profile displays the probability of separation of each base pair in the molecule. The separation probability p(x) of the base pair at position x is calculated from equation (7.5) using parameter zx, where zx = 1 in states where base pair x is separated and zx = 0 in all other states. This calculation is performed for every base pair in the sequence. The transition profile displays p(x) as a function of x.

The destabilization profile is the incremental free energy needed to induce separation at each base pair. To calculate this quantity, let i(x) index the states in which the base pair at position x is separated. Then the average free energy of all such states is

Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.

Page 189

image                                (7.8)

To determine the destabilization free energy G(x), we normalize by subtracting the calculated equilibrium free energy:

image                                                     (7.9)

Base pairs that require incremental free energy to separate at equilibrium have G(x) > 0, while base pairs that are energetically favored to separate at equilibrium have G(x) £ 0. This calculation is performed for each base pair in the molecule, and the destabilization profile plots G(x) versus x. Examples of these profiles are given in Figure 7.1.

Although individual high-energy states are exponentially less populated than low-energy states at equilibrium, they are so numerous that their cumulative contribution to the equilibrium still may be significant. The next step in this calculation requires estimating the aggregate influence of the states that were excluded from the above analysis because their free energies exceeded the threshold. This involves estimating the contribution Z(n,nAT,r) to the partition function from all states whose values n,nAT, and r do not satisfy the threshold condition. Here

image                 (7.10)

where M(n,r) is the number of states with n separated base pairs in r runs, which for a circular domain is

image                                          (7.11)

The only part of Z(n,nAT,r) not amenable to exact determination is Pn,r(nAT), the fraction of (n,r)-states that have exactly nATseparated A·T base pairs.

Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.

Page 190

image

Figure 7.1
The transition (top) and the helix destabilization (bottom) profiles are shown for the
circular pBR322 DNA molecule at  a = -26 turns. The promoter (P) and terminator
(T) (the sites that control the start and end, respectively, of gene expression) of the
beta-lactamase gene are indicated. These results were calculated using the
energetics found by the method described in the text. Reprinted (bottom
panel), by permission, from Benham (1993). Copyright © 1993 by the
National Academy of Sciences.

The estimation of the influence of the high-energy states is done in two steps. First, growth conditions are found for Z(n,r), the contribution to the partition function from all states having n total separated base pairs in r runs. Assuming that the distribution of A+T-richness among (n,r)-states is approximately the same as that for (n,r + 1)-states, then the ratio

image                              (7.12)

Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.

Page 191

is monotonically decreasing with r. One can find an image such that Timage=p<1. Then for all r ³ image we have Tr £r, so that

image                           (7.13)

By a similar line of reasoning one also can find a value image, above which the aggregate contribution to the partition function again is bounded above by a convergent geometric series:

image                              (7.14)

In practice, low values ( » 0.1) for the series ratios r and s occur at reasonably small cutoffs (image » 8, image » 150 for a molecule of N = 5,000 base pairs under reasonable environmental conditions).

The contribution of the intermediate states having n £ image and r £ image but not satisfying the threshold requires estimating pn,rnAT, the fraction of n,r-states that have exactly nATseparated A·T base pairs. Although in principle one can compute this quantity from the base sequence, for molecules of kilobase lengths it is feasible to compute only the exact distribution of A+T-richness in r=I run states having n £ nmax » 200. Experience has shown that high accuracy is obtained by calculating pn,2nAT exactly, and using pn,2nAT as an estimate of pn,rnAT for r> 2. Once the sequence information needed in this step has been found (a calculation that need be performed only once per molecule), the performance of the rest of this refinement is computationally very fast.

These results are used to estimate the contribution imageneg to the partition function from the neglected, high-energy states:

image                                       (7.15)

(Here the carat marks denote approximate values.) Any parameter z that depends only on n, nAT, or r also can have its previously calculated

Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
Page 187
Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
Page 188
Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
Page 189
Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
Page 190
Suggested Citation: "ANALYSIS OF SUPERHELICAL EQUILIBRIA." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
Page 191
Next Chapter: Evaluation of Free-Energy Parameters
Subscribe to Emails from the National Academies
Stay up to date on activities, publications, and events by subscribing to email updates.