Semiautomated improvement of RNA alignments
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
Semiautomated improvement of RNA alignments. / Andersen, Ebbe Sloth; Lind-Thomsen, Allan; Knudsen, Bjarne; Kristensen, Susie E.; Havgaard, Jakob Hull; Torarinsson, Elfar; Larsen, Niels; Zwieb, Christian; Sestoft, Peter; Kjems, Jørgen; Gorodkin, Jan.
In: RNA: A publication of the RNA Society, Vol. 13, No. 11, 2007, p. 1850-1859.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Semiautomated improvement of RNA alignments
AU - Andersen, Ebbe Sloth
AU - Lind-Thomsen, Allan
AU - Knudsen, Bjarne
AU - Kristensen, Susie E.
AU - Havgaard, Jakob Hull
AU - Torarinsson, Elfar
AU - Larsen, Niels
AU - Zwieb, Christian
AU - Sestoft, Peter
AU - Kjems, Jørgen
AU - Gorodkin, Jan
PY - 2007
Y1 - 2007
N2 - We have developed a semiautomated RNA sequence editor (SARSE) that integrates tools for analyzing RNA alignments. The editor highlights different properties of the alignment by color, and its integrated analysis tools prevent the introduction of errors when doing alignment editing. SARSE readily connects to external tools to provide a flexible semiautomatic editing environment. A new method, Pcluster, is introduced for dividing the sequences of an RNA alignment into subgroups with secondary structure differences. Pcluster was used to evaluate 574 seed alignments obtained from the Rfam database and we identified 71 alignments with significant prediction of inconsistent base pairs and 102 alignments with significant prediction of novel base pairs. Four RNA families were used to illustrate how SARSE can be used to manually or automatically correct the inconsistent base pairs detected by Pcluster: the mir-399 RNA, vertebrate telomase RNA (vert-TR), bacterial transfer-messenger RNA (tmRNA), and the signal recognition particle (SRP) RNA. The general use of the method is illustrated by the ability to accommodate pseudoknots and handle even large and divergent RNA families. The open architecture of the SARSE editor makes it a flexible tool to improve all RNA alignments with relatively little human intervention. Online documentation and software are available at (http://sarse.ku.dk).
AB - We have developed a semiautomated RNA sequence editor (SARSE) that integrates tools for analyzing RNA alignments. The editor highlights different properties of the alignment by color, and its integrated analysis tools prevent the introduction of errors when doing alignment editing. SARSE readily connects to external tools to provide a flexible semiautomatic editing environment. A new method, Pcluster, is introduced for dividing the sequences of an RNA alignment into subgroups with secondary structure differences. Pcluster was used to evaluate 574 seed alignments obtained from the Rfam database and we identified 71 alignments with significant prediction of inconsistent base pairs and 102 alignments with significant prediction of novel base pairs. Four RNA families were used to illustrate how SARSE can be used to manually or automatically correct the inconsistent base pairs detected by Pcluster: the mir-399 RNA, vertebrate telomase RNA (vert-TR), bacterial transfer-messenger RNA (tmRNA), and the signal recognition particle (SRP) RNA. The general use of the method is illustrated by the ability to accommodate pseudoknots and handle even large and divergent RNA families. The open architecture of the SARSE editor makes it a flexible tool to improve all RNA alignments with relatively little human intervention. Online documentation and software are available at (http://sarse.ku.dk).
KW - Former LIFE faculty
KW - RNA structural alignment
KW - RNA secondary structure
KW - SARSE
U2 - 10.1261/rna.215407
DO - 10.1261/rna.215407
M3 - Journal article
C2 - 17804647
VL - 13
SP - 1850
EP - 1859
JO - RNA
JF - RNA
SN - 1355-8382
IS - 11
ER -
ID: 8096836