Skip to content

Ignore gap sequences and reporting of repeats in canonical form #90

@olekto

Description

@olekto

When running ULTRA I sometimes get Ns as a tandem repeat, or included as a larger pattern. Is this intentional?

I would prefer that all gap bases (Ns) are ignored in the output. Is this possible?

Also, it is possible to report the repeats in canonical form? For instance, GA, AG,TC, CT are all represented by the repeat type AG. This is from Chambers, G. K., and E. S. MacAvoy. 2000. (https://www.sciencedirect.com/science/article/pii/S0305049100002339#APPA) which states: "That microsatellite satellite repeat sequence be represented by the simplest and most alphabetical formula possible, e.g. –(CA)15– rather than –(TG)15– or alternatives."

Is this possible?

Thank you.

Sincerely,
Ole

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions