Uploaded image for project: 'Jalview'
  1. Jalview
  2. JAL-4100

ncRNA and genes without product attributes are not imported from Genbank and EMBL format files

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: 2.11.2, 2.11.3.0, 2.11.3.1, 2.11.3.2, 2.11.3.3
    • Fix Version/s: 2.11.3.4
    • Component/s: file format issue
    • Labels:
      None

      Description

      Try to import the EMBL file at https://dag.compbio.dundee.ac.uk/daffodils/data/22TAempr.embl

      Whilst the sequence is imported, and Calculate->Get Cross-References -> EMBLCDS shows a CDS/Protein alignment with 44 sequences, there were errors shown on import like:
        ERROR - Ignoring CDS feature with no protein_id for EMBL:22TAempr

      Opening the overview window shows the sequence has several light grey features corresponding to exons on the DNA.

      However, the same data is also available as a fasta file and adding a GFF3 file:

      https://dag.compbio.dundee.ac.uk/daffodils/data/22TAempr.fasta
      https://dag.compbio.dundee.ac.uk/daffodils/data/22TAempr.gff3

      Loading 22TAempr.fasta and adding 22TAempr.gff3 works without errors. This time, Calculate->Get Cross-References isn't available, but opening the overview shows there are many more CDS annotations (light grey), and also some dark blue rRNA features marked.

      The .embl file is produced from a DNA gene finding/annotation transfer pipeline. It might not be right - but it would be useful if Jalview could extract more data from it if it is. At least show the location of the genes and display the additional metadata included in the .embl file.

        Attachments

          Activity

            People

            Assignee:
            Unassigned Unassigned
            Reporter:
            jprocter James Procter
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Dates

              Created:
              Updated: