Maximum expected accuracy structural neighbors of an RNA secondary structure
Background Since RNA molecules regulate genes and control alternative splicing by allostery , it is important to develop algorithms to predict RNA conformational switches . Some tools, such as paRNAss, RNAshapes and RNAbor , can be used to predict potential conformational switches; nevertheless, no...
        Saved in:
      
    
          | Published in | BMC bioinformatics Vol. 13; no. Suppl 5; p. S6 | 
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        London
          BioMed Central
    
        12.04.2012
     Springer Nature B.V  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1471-2105 1471-2105  | 
| DOI | 10.1186/1471-2105-13-S5-S6 | 
Cover
| Summary: | Background
Since RNA molecules regulate genes and control alternative splicing by
allostery
, it is important to develop algorithms to predict RNA
conformational switches
. Some tools, such as
paRNAss, RNAshapes and RNAbor
, can be used to predict potential conformational switches; nevertheless, no existent tool can detect general (i.e., not family specific)
entire
riboswitches (both aptamer and expression platform) with accuracy. Thus, the development of additional algorithms to detect conformational switches seems important, especially since the difference in free energy between the two metastable secondary structures may be as large as 15-20 kcal/mol. It has recently emerged that RNA secondary structure can be more accurately predicted by computing the
maximum expected accuracy
(MEA) structure, rather than the
minimum free energy
(MFE) structure.
Results
Given an arbitrary RNA secondary structure
S
0
for an RNA nucleotide sequence
a
=
a
1
,...,
a
n
, we say that another secondary structure
S
of
a
is a
k
-neighbor of
S
0
, if the base pair distance between
S
0
and
S
is
k
. In this paper, we prove that the Boltzmann probability of all
k
-neighbors of the minimum free energy structure
S
0
can be approximated with accuracy
ε
and confidence 1 -
p
, simultaneously for all 0 ≤
k < K
, by a relative frequency count over
N
sampled structures, provided that
N
>
N
(
ε
,
p
,
K
)
=
Φ
-
1
p
2
K
2
4
ε
2
, where Φ(
z
) is the cumulative distribution function (CDF) for the standard normal distribution. We go on to describe the algorithm
RNAborMEA
, which for an arbitrary initial structure
S
0
and for all values 0 ≤
k < K
, computes the secondary structure
MEA
(
k
), having
maximum expected accuracy
over all
k
-neighbors of
S
0
. Computation time is
O
(
n
3
·
K
2
), and memory requirements are
O
(
n
2
·
K
). We analyze a sample TPP riboswitch, and apply our algorithm to the class of
purine riboswitches
.
Conclusions
The approximation of
RNAbor
by sampling, with rigorous bound on accuracy, together with the computation of maximum expected accuracy
k
-neighbors by
RNAborMEA
, provide additional tools toward conformational switch detection. Results from
RNAborMEA
are quite distinct from other tools, such as
RNAbor, RNAshapes and paRNAss
, hence may provide orthogonal information when looking for suboptimal structures or conformational switches. Source code for
RNAborMEA
can be downloaded from
http://sourceforge.net/projects/rnabormea/
or
http://bioinformatics.bc.edu/clotelab/RNAborMEA/
. | 
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 PMCID: PMC3358666  | 
| ISSN: | 1471-2105 1471-2105  | 
| DOI: | 10.1186/1471-2105-13-S5-S6 |