We are still far from understanding all the rules that govern RNA splicing, and from identifying the sequence information that determines how introns are recognized. In the September 25
They chose transcripts from five eukaryote genomes (Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, and human) whose exon-intron structures were well defined. They analysed 5' and 3' splice signal motifs in short introns and used mathematical methods (Markov models and Monte Carlo simulations) to determine the amount of information required for intron recognition. While 5' and 3' splice signal sequences were sufficient to predict short introns in the fly and worm genomes (>90% accuracy), human and plant introns required additional transcript features, such as specific pentamer sequences ...
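
To make the notion of the "amount of information" in a splice signal concrete, the sketch below computes the Shannon information content, in bits, at each position of a set of aligned splice-site sequences. This is a simpler calculation than the Markov models and Monte Carlo simulations mentioned in the text and is not the authors' method. The function name `information_content`, the uniform background over A, C, G and T, the absence of any small-sample correction, and the toy donor-site sequences are all assumptions made for illustration only.

```python
import math
from collections import Counter


def information_content(sites, alphabet="ACGT"):
    """Return per-position information (bits) for equal-length aligned sequences.

    Assumes a uniform background over `alphabet` and applies no
    small-sample correction, so values are upper-biased for few sequences.
    """
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sequences must have the same length")

    max_bits = math.log2(len(alphabet))  # 2 bits for a 4-letter alphabet
    bits = []
    for pos in range(length):
        counts = Counter(site[pos] for site in sites)
        total = sum(counts.values())
        # Shannon entropy of the observed base frequencies at this position
        entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
        bits.append(max_bits - entropy)
    return bits


# Hypothetical 5' splice-site (donor) sequences, invented for illustration
toy_donor_sites = ["GTAAGT", "GTGAGT", "GTAAGA", "GTAAGT"]

per_position = information_content(toy_donor_sites)
total_bits = sum(per_position)

for index, value in enumerate(per_position, start=1):
    print(f"position {index}: {value:.2f} bits")
print(f"total: {total_bits:.2f} bits")
```

Invariant positions, such as the GT at the start of these toy sequences, contribute the full 2 bits each, while variable positions contribute less. Summing over positions gives a rough measure of how much a motif alone can narrow down candidate sites in a transcript.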