Computational RNomics
Computational RNomics
Disciplines
Biology (25%); Mathematics (75%)
Keywords
-
RNA Genes,
Bioinformatics,
Secondary Structure,
Non-coding RNA,
Alignments,
Motif search
Structural genomics, the systematic determination of all macro-molecular structures represented in a genome, is at present focused almost exclusively on proteins. Although it is commonplace to speak of ``genes and their encoded protein products``, thousands of human genes produce transcripts that exert their function without ever producing proteins. Furthermore, even though the sequence of the human DNA is known by now, the contents of about half of it remains unknown. It is quite likely that a large class of genes has gone relatively undetected so far because they do not make proteins. The list of functional non-coding RNAs includes key players in the biochemistry of the cell, such as transfer RNAs, ribosomal RNAs, tmRNA, and the RNA components of RNase P and signal recoginition particles. Another level of RNA function is presented by functional motifs within protein-coding genes, located mostly in the non- translated 5` or 3` regions of the immature messenger RNA. It is not hard to argue therefore that "RNomics", i.e., the understanding of functional RNAs and their interactions at a genomic level, is of utmost practical and theoretical importance in modern life sciences: The comprehensive understanding of the biology of a cell obviously requires the knowledge of identity of all encoded RNAs, the molecules with which they interact, and the molecular structures of these complexes. The first step toward this goal is the development of versatile and reliable computational methods that can detect and classify functional RNAs, preferably within a single genome, or in case this proves impossible, from a very small set of related genomes. We propose here to develop a suite of bioinformatics methods that are specifically geared toward detecting, verifying, and classifying functional RNAs. Our comprehensive approach to "Computational RNomics" will provide * improved algorithms for RNA secondary structure prediction * improved alignment algorithms for nucleic acid sequences * novel approaches to compare and align RNA structures * extensions of existing RNA algorithms to deal with genome-size data sets * a database system specifically designed for RNA structures
RNomics stands for the holistic approach to look at all RNAs existing in living organisms. Prediction, detection and classification of these RNAs requires tools using comparative methods for identification of similarities between an unknown and a described RNA species. In the case of DNA, classification is performed simply by sequence comparison. Proteins in turn require additional steps since their function is determined by their three dimensional structure. The function of an RNA is represented by only two constraints: the sequence, which folds into a distinct secondary structure. Folding follows a distinct set of rules, only three kinds of base pairs are allowed: A-U, G-C, G-U. Programs like Mfold or RNAfold calculate such secondary structures starting from a single input sequence. When comparing an RNA from two different organisms, say snoRNA E1 in mouse and man, the sequence will be different while the secondary structure is more or less the same. This phenomenon is a result of compensatory mutations, which are changes in the sequence such that a base pair remains preserved. Only those base pairs taking part in a functional structure will be under such selection pressure. Thus, detection of compensatory mutations allows us to decide wether two ore more similar sequences belong to the same class of functional RNAs. Aim of this project was the development of a variety of tools supporting the quest for RNA genes. We were able to find solutions to all problems formulated in the grant proposal and apply them on selected model organisms of viruses, bacteria, und animals. Thus we were able to proof the general applicability of our RNA tools. We developed two algorithms for aligning RNA sequences taking into account protein coding capacity (codaln) and secondary structure information (pmmulti), respectively. The programs hxmatch and alidot use different methods to predict conserved structures, including computationally expensive pseudoknots, from alignments. Both Alifoldz and RNAz also follow the comparative approach and represent methods for genome wide surveys in eukaryote genomes. Handling of such huge amounts of data requires a specifically designed MySQL database. RNALfold provides a sliding window variant of RNAfold for prediction of locally stable RNA secondary structures and is also applicable on whole genomes. In addition to the proposed goals, we developed methods for designing bistable RNAs, so called RNAswitches and small modifier RNAs, which inhibit formation of functional mRNA structures. The molecular evolution of a group of microRNAs, a natural class of small non- coding RNAs, was reconstructed based on sequences found by means of tools developed during this project.
- Universität Wien - 100%
- Robert Giegerich, Universität Bielefeld - Germany
Research Output
- 2779 Citations
- 16 Publications
-
2006
Title The expansion of the metazoan microRNA repertoire DOI 10.1186/1471-2164-7-25 Type Journal Article Author Hertel J Journal BMC Genomics Pages 25 Link Publication -
2006
Title Algebraic comparison of metabolic networks, phylogenetic inference, and metabolic innovation DOI 10.1186/1471-2105-7-67 Type Journal Article Author Forst C Journal BMC Bioinformatics Pages 67 Link Publication -
2006
Title Memory efficient folding algorithms for circular RNA secondary structures DOI 10.1093/bioinformatics/btl023 Type Journal Article Author Hofacker I Journal Bioinformatics Pages 1172-1176 Link Publication -
2006
Title Thermodynamics of RNA–RNA binding DOI 10.1093/bioinformatics/btl024 Type Journal Article Author Mückstein U Journal Bioinformatics Pages 1177-1182 Link Publication -
2005
Title Fast and reliable prediction of noncoding RNAs DOI 10.1073/pnas.0409169102 Type Journal Article Author Washietl S Journal Proceedings of the National Academy of Sciences Pages 2454-2459 Link Publication -
2005
Title Mapping of conserved RNA secondary structures predicts thousands of functional noncoding RNAs in the human genome DOI 10.1038/nbt1144 Type Journal Article Author Washietl S Journal Nature Biotechnology Pages 1383-1390 -
2005
Title The effect of RNA secondary structures on RNA-ligand binding and the modifier RNA mechanism: a quantitative model DOI 10.1016/j.gene.2004.11.043 Type Journal Article Author Hackermüller J Journal Gene Pages 3-12 Link Publication -
2005
Title Unorthodox mRNA start site to extend the highly structured leader of retrotransposon Tto1 mRNA increases transposition rate DOI 10.1261/rna.2640105 Type Journal Article Author Böhmdorfer G Journal RNA Pages 1181-1191 Link Publication -
2005
Title Multiple sequence alignments of partially coding nucleic acid sequences DOI 10.1186/1471-2105-6-160 Type Journal Article Author Stocsits R Journal BMC Bioinformatics Pages 160 Link Publication -
2004
Title Structural and evolutionary analysis of the transcribed sequence of Boudicca, a Schistosoma mansoni retrotransposon DOI 10.1016/j.gene.2003.12.023 Type Journal Article Author Copeland C Journal Gene Pages 103-114 Link Publication -
2004
Title Bichir HoxA Cluster Sequence Reveals Surprising Trends in Ray-Finned Fish Genomic Evolution DOI 10.1101/gr.1712904 Type Journal Article Author Chiu C Journal Genome Research Pages 11-17 Link Publication -
2004
Title Comparative Genomics, cis-Regulatory Elements, and Gene Duplication DOI 10.1016/s0091-679x(04)77029-6 Type Book Chapter Author Force A Publisher Elsevier Pages 545-561 -
2004
Title Consensus Folding of Aligned Sequences as a New Measure for the Detection of Functional RNAs by Comparative Genomics DOI 10.1016/j.jmb.2004.07.018 Type Journal Article Author Washietl S Journal Journal of Molecular Biology Pages 19-30 Link Publication -
2004
Title Prediction of Consensus RNA Secondary Structures Including Pseudoknots DOI 10.1109/tcbb.2004.22 Type Journal Article Author Witwer C Journal IEEE Transactions on Computational Biology and Bioinformatics Pages 66-77 Link Publication -
2004
Title Efficient computation of RNA folding dynamics DOI 10.1088/0305-4470/37/17/005 Type Journal Article Author Wolfinger M Journal Journal of Physics A: Mathematical and General Pages 4731 -
2004
Title Molecular Evolution of a MicroRNA Cluster DOI 10.1016/j.jmb.2004.03.065 Type Journal Article Author Tanzer A Journal Journal of Molecular Biology Pages 327-335 Link Publication