-
Notifications
You must be signed in to change notification settings - Fork 124
Description
Hi,
I am running cactus-pangenome and encountering a failure during the make_giraffe_indexes step. The job crashes while running vg index to generate the .dist index.
The process exits with signal 15 (SIGTERM) and not SIGKILL. The logged peak memory usage is approximately 1.47 TB.
I would like to understand whether this is expected behavior (e.g. memory underestimation or scheduler-related termination) or if there may be an issue with resource handling in this step.
cactus-pangenome ./jobstorepath ./sequenceFile.tsv \
--binariesMode local \
--workDir /Data/jwshin0727/toilwork \
--outDir ${PREFIX} --outName ${PREFIX} \
--reference ARS_UCD --haplo \
--giraffe clip filter \
--vcf --vcfReference ARS_UCD \
--viz --odgi \
--chrom-vg clip filter --chrom-og \
--gbz clip filter full --gfa clip full \
--maxMemory 1400G \
--logFile 260227_3rd.cactus.log
Error log
Running the command:
vg index -t 143 -j clip.Hanwoo_27ea.dist clip.Hanwoo_27ea.gbz -P ARS_UCD
RuntimeError:
Command /usr/bin/time -f "CACTUS-LOGGED-MEMORY-IN-KB: %M" vg index ...
exited 143: stderr=Command terminated by signal 15
CACTUS-LOGGED-MEMORY-IN-KB: 1469793116
Memory information
MemTotal: 1584984504 kB
MemFree: 144034300 kB
MemAvailable: 1434507540 kB
Buffers: 1634528 kB
Cached: 1289645148 kB
SwapCached: 108096 kB
Active: 236804144 kB
Inactive: 1191347396 kB
Active(anon): 134147196 kB
Inactive(anon): 2736100 kB
Questions
Is it expected that vg index -j (dist index) may require >1.3 TB RAM for large pangenomes?
It consists of 55 assemblies for the same species (3G).
and how can i solve this memory issue?
Any guidance on stabilizing the make_giraffe_indexes step would be greatly appreciated.
Thank you.