Hello,
We've been encountering an unexpectedly high read loss (approximately 40–60%) when running the detect step in DNAscent 4.0.3, and we’re trying to determine the cause. This dropout rate is significantly higher than anticipated, and it's impacting our ability to assess BrdU incorporation with confidence. A significant number of reads are filtered out, making it difficult to distinguish true from false positive BrdU calls due to limited retained data.
We've ruled out several possible issues:
Read length and mapping quality: We relaxed thresholds for both but observed no improvement.
Basecalling: We've tested both fast and sup models using Dorado with version 0.8.3.
Data quality: Libraries were prepared by pulsing yeast (expressing hENT1 and hsvTK) with 160 µM BrdU for 5 minutes. DNA was extracted post-sodium azide treatment. Basecalling, demultiplexing, and alignment were all done with Dorado, and we then ran index and detect with DNAscent 4.0.3.
BrdU incorporation: Confirmed effective by slot blotting with anti-BrdU antibodies.
Cross-validation: We tested a BrdU-negative dataset from a collaborator (prepared using the same kit and flowcell version) and observed a similarly high dropout rate during detect.
Read ID consistency: Only 184 reads in the BAM file were missing from the index.dnascent file, so this discrepancy doesn't explain the large read loss.
Given this, we’re wondering if there are other internal filtering steps within detect that could be contributing to this issue. Is there any publicly available test dataset we could use to verify that we’re running the pipeline correctly?
Thank you for your time and support,
Sam
Hello,
We've been encountering an unexpectedly high read loss (approximately 40–60%) when running the detect step in DNAscent 4.0.3, and we’re trying to determine the cause. This dropout rate is significantly higher than anticipated, and it's impacting our ability to assess BrdU incorporation with confidence. A significant number of reads are filtered out, making it difficult to distinguish true from false positive BrdU calls due to limited retained data.
We've ruled out several possible issues:
Read length and mapping quality: We relaxed thresholds for both but observed no improvement.
Basecalling: We've tested both fast and sup models using Dorado with version 0.8.3.
Data quality: Libraries were prepared by pulsing yeast (expressing hENT1 and hsvTK) with 160 µM BrdU for 5 minutes. DNA was extracted post-sodium azide treatment. Basecalling, demultiplexing, and alignment were all done with Dorado, and we then ran index and detect with DNAscent 4.0.3.
BrdU incorporation: Confirmed effective by slot blotting with anti-BrdU antibodies.
Cross-validation: We tested a BrdU-negative dataset from a collaborator (prepared using the same kit and flowcell version) and observed a similarly high dropout rate during detect.
Read ID consistency: Only 184 reads in the BAM file were missing from the index.dnascent file, so this discrepancy doesn't explain the large read loss.
Given this, we’re wondering if there are other internal filtering steps within detect that could be contributing to this issue. Is there any publicly available test dataset we could use to verify that we’re running the pipeline correctly?
Thank you for your time and support,
Sam