[JAL-886] fetching EMBL reference for an RNA sequence fails with a sequence mismatch - Jalview

Details

Type: Bug
Status: In Progress
Priority: Major
Resolution: Unresolved
Affects Version/s: 2.7_gsoc11, 2.8, 2.8.1, 2.8.2, 2.9
Fix Version/s: None
Component/s: data retrieval services, na
Labels:
- hmmcuration
- knowndefect

Description

Looking up database entries for RNA sequences assumes that the database holds an exact match for the sequence, when it may instead hold the genomic (DNA) version.

E.g.

>ABJT01000033.1/33739-33825
AACACAUCAGAUUUCCUGGUGUAACGAAUUUUUUAAGUGCUUCUUGCUUAAGCAAG-UUUC-AUCC-CGACC
CCCUCA-------GGG-UCGGGAUUU

Fetching [EMBL database references] for the above sequence (by importing the FASTA file into Jalview, then using fetch db refs->EMBL) results in a 'Sequence not 100% match' error, and no identifier is added to the sequence. The database ref fetcher needs to be taught how to translate between DNA and RNA nucleotide sequences.
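
To illustrate the kind of comparison being asked for, here is a minimal sketch in Python. It is not Jalview's code: the function names (`parse_region_id`, `normalise_nucleotides`, `matches_region`) are hypothetical. It also assumes that the ID follows the `accession/start-end` pattern seen above, that coordinates are 1-based and inclusive, and that the region lies on the forward strand. Gaps are stripped and U is mapped to T before comparing, so an aligned RNA sequence can match its genomic DNA record.

```python
import re

# Characters treated as alignment gaps (an assumption for this sketch).
GAP_CHARS = "-. "


def parse_region_id(seq_id):
    """Split an ID such as 'ACC.1/10-20' into (accession, start, end)."""
    match = re.fullmatch(r"([^/\s]+)/(\d+)-(\d+)", seq_id)
    if match is None:
        raise ValueError(f"not an accession/start-end ID: {seq_id!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def normalise_nucleotides(seq):
    """Drop gap characters, upper-case, and map RNA 'U' to DNA 'T'."""
    ungapped = "".join(ch for ch in seq if ch not in GAP_CHARS)
    return ungapped.upper().replace("U", "T")


def matches_region(query, reference, start, end):
    """Compare a (possibly gapped, RNA) query with reference[start..end].

    start and end are 1-based and inclusive; forward strand only.
    """
    region = reference[start - 1:end]
    return normalise_nucleotides(query) == normalise_nucleotides(region)
```

For example, `parse_region_id("ABJT01000033.1/33739-33825")` yields the accession and the two coordinates, and `matches_region("ACAU-CAG", "TTACATCAGGG", 3, 9)` is true because the gapped RNA query equals positions 3 to 9 of the DNA reference once U is read as T. Reverse-strand regions would additionally need a reverse complement, which this sketch leaves out.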

Issue Links

blocks
- JAL-1597 more efficient resolution of IDs when querying contig databases (Open)

depends on
- JAL-1000 translate between DNA and RNA versions of nucleotide sequence (Open)

related with
- JAL-851 sequence database accessions not imported when fetching alignments from Rfam (Resolved)
- JAL-1708 resolve cross-references between RNA, DNA, RFAM and 3D coordinates via RNACentral (Open)

People

Assignee: Mungo Carstairs

Reporter: James Procter

Votes: 0

Watchers: 1

Dates

Created: 29/Jul/11 3:15 PM

Updated: 15/Jun/16 2:34 PM