Uploaded image for project: 'Jalview'
  1. Jalview
  2. JAL-2534

visualise infernal's 'full output' WUSS annotations

    XMLWordPrintable

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      Infernal (used by Rfam, etc) annotates RNA structure to distinguish simple hairpins from other kinds of loops and stems. Although Jalview preserves these symbols in the annotation row, it could do more with them.

      From Page 104 of Infernal's manual (http://eddylab.org/infernal/Userguide.pdf):

      Full (output) WUSS notation

      In detail, symbols used by WUSS notation in output structure annotation strings are as follows:

      Base pairs
      Base pairs are annotated by nested matching pairs of symbols <>, (), [], or {}. The different symbols indicate the “depth” of the helix in the RNA structure as follows: <> are used for simple terminal stems; () are used for “internal” helices enclosing a multifurcation of all terminal stems; [] are used for internal helices enclosing a multifurcation that includes at least one annotated () stem already; and {} are used for all internal helices enclosing deeper multifurcations.

      Hairpin loops
      Hairpin loop residues are indicated by underscores, _. Simple stem loops stand out as, e.g.
      <<<<____>>>>.

      Bulge, interior loops
      Bulge and interior loop residues are indicated by dashes, -.

      Multifurcation loops
      Multifurcation loop residues are indicated by commas, ,. The mnemonic is “stem 1, stem2”, e.g.
      <<<___>>>,,<<<___>>>.

      External residues
      Unstructured single stranded residues completely outside the structure (unenclosed by any base pairs) are annotated by colons, :.

      Insertions
      Insertions relative to a known structure are indicated by periods, .. Regions where local structural alignment was invoked, leaving regions of both target and query sequence unaligned, are indicated by tildes, ˜. These symbols only appear in alignments of a known (query) structure annotation to a target sequence of unknown structure.

      Pseudoknots
      WUSS notation allows pseudoknots to be annotated as pairs of upper case/lower case letters: for example, <<<<_AAAA____>>>>aaaa annotates a simple pseudoknot; additional pseudoknottedstems could be annotated by Bb, Cc, etc. Infernal cannot handle pseudoknots, however; pseudoknot notation never appears in Infernal output; it is accepted in input files, but ignored.

        Attachments

          Activity

            People

            Assignee:
            jprocter James Procter
            Reporter:
            jprocter James Procter
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Dates

              Created:
              Updated: