The most time-consuming part of DynaCLR prediction is the PHATE computation for very large datasets, like 20X multiwell data. The prediction runs in half the time required for PHATE computation. In case the job is interrupted, no output is received, as with the current implementation, predictions are written after finishing the PHATE computations. Should we start writing the output zar before the PHATE computation so that it doesn't interfere with result generation, and then add the PHATE and other observations later, after the PHATE computation step completes? Also, is there any way to reduce PHATE computation time for large datasets?
@edyoshikun , @srivarra , thoughts?
The most time-consuming part of DynaCLR prediction is the PHATE computation for very large datasets, like 20X multiwell data. The prediction runs in half the time required for PHATE computation. In case the job is interrupted, no output is received, as with the current implementation, predictions are written after finishing the PHATE computations. Should we start writing the output zar before the PHATE computation so that it doesn't interfere with result generation, and then add the PHATE and other observations later, after the PHATE computation step completes? Also, is there any way to reduce PHATE computation time for large datasets?
@edyoshikun , @srivarra , thoughts?