Skip to content

Excessive clipping when mapping to a genomic fragment #65

@limepoutine

Description

@limepoutine

I have four genomic fragments from a tetraploid potato and want to identify splicing junctions computationally using NM_001247643.3 as a query. MSA suggests the sequences match well except for a few indels. However, magicblast seems to be clipping the query sequence excessively.

Image

Running magicblast -query NM_001247643.fasta -subject Fragment1.fasta -out Fragment1.sam -limit_lookup F -score 20 generates CIGAR 510S531M108N51M. The expected result is that the start codon matches.

The sequences are attached here as a text file: sequences.txt.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions