  1. Jalview
  2. JAL-791

PID calculation for tree calculation includes gapped columns in the alignment length used for the percentage calculation

    Details

    • Type: Bug
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      The PID calculated for a pair of sequences always uses the length of the longer sequence string as the denominator for the percentage identity, regardless of the presence of affine gaps (i.e., gaps in all rows at a particular column of the alignment). As a result, the dissimilarity values calculated for an alignment that includes affine gaps may be lower than expected, since these gapped columns are counted as matches in the PID calculation.
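
      The effect described above can be illustrated with a minimal sketch. This is not Jalview's code: the function names, the set of gap characters, and the choice to skip columns that are gapped in both sequences are all assumptions made for illustration only.

```python
# Illustrative sketch only; names and gap characters are hypothetical,
# not taken from the Jalview source.
GAP_CHARS = set("-. ")


def is_gap(ch):
    """Return True if the character is treated as a gap symbol."""
    return ch in GAP_CHARS


def _pad(seq_a, seq_b):
    """Pad the shorter string with gaps so both have the same width."""
    width = max(len(seq_a), len(seq_b))
    return seq_a.ljust(width, "-"), seq_b.ljust(width, "-"), width


def pid_longest_length(seq_a, seq_b):
    """Behaviour as reported: the denominator is the longer string's length
    and columns gapped in both sequences count as matches."""
    a, b, width = _pad(seq_a, seq_b)
    matches = sum(
        1
        for x, y in zip(a, b)
        if (is_gap(x) and is_gap(y)) or x.upper() == y.upper()
    )
    return 100.0 * matches / width if width else 0.0


def pid_ignoring_shared_gaps(seq_a, seq_b):
    """One possible alternative (an assumption, not the agreed fix):
    columns gapped in both sequences are left out of both the match
    count and the denominator."""
    a, b, _ = _pad(seq_a, seq_b)
    matches = 0
    compared = 0
    for x, y in zip(a, b):
        if is_gap(x) and is_gap(y):
            continue  # shared gap column: not compared at all
        compared += 1
        if not is_gap(x) and not is_gap(y) and x.upper() == y.upper():
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    a, b = "AC--GT", "AG--GA"
    print(round(pid_longest_length(a, b), 2))        # 66.67
    print(round(pid_ignoring_shared_gaps(a, b), 2))  # 50.0
```

      For the pair `AC--GT` / `AG--GA`, two of the four residue columns match. Counting the two shared gap columns as matches over a length of six gives about 66.7% identity (33.3% dissimilarity), whereas excluding them gives 50% identity (50% dissimilarity), which is the kind of underestimated dissimilarity the report describes.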

              People

              Assignee:
              gmungoc Mungo Carstairs
              Reporter:
              jprocter James Procter
              Votes:
              0
              Watchers:
              1

                Dates

                Created:
                Updated: