Currently to calculate an HVP we use first-order AD on a supplied gradient. Right now two things are preventing the use of second order AD in this package:
- Batching isn't available in DI
- I haven't been able to get second order AD on the GPU working with any package (haven't tried a few so this is non-exhaustive).
Issue 2 is obviously a bigger block on this integration so, I will make some MWEs and file them upstream where necessary.
Currently to calculate an HVP we use first-order AD on a supplied gradient. Right now two things are preventing the use of second order AD in this package:
Issue 2 is obviously a bigger block on this integration so, I will make some MWEs and file them upstream where necessary.