Optimise Davidson Hv: precompute connected pairs, vmap element kernel, batched MGS by Copilot · Pull Request #8 · jphaupt/CIPSIpy

Copilot · 2026-04-03T10:14:42Z

Profiling showed hamiltonian_vector_product (three Python loops over connected pairs, called ~14×/diagonalisation) dominated runtime at 72% of total wall time. This PR eliminates those loops by precomputing the pair structure once and replacing the inner loop with a single JIT+vmap scatter-add.

Changes

`determinants.py` — JAX-native bitwise helpers + pair precomputation

jax_popcount, excitation_level_jax, first_set_bit_pos_jax, two_set_bit_pos_jax, phase_single_jax, phase_double_jax: pure-JAX implementations of the scalar kernels using only bitwise ops and jnp.where — no Python if/else, fully vmappable.
precompute_connections(dets_alpha, dets_beta, norb) → (row_idx, col_idx): runs the existing sort/generator logic once per diagonalisation and returns all three connection types (β singles/doubles, opposite-spin doubles, α singles/doubles) as flat JAX int32 index arrays with no duplicates.

`hamiltonian.py` — vmappable element kernel + JIT-compiled matvec

_single_excitation_element_spin_jax, _double_same_spin_element_jax, _double_opposite_spin_element_jax: branch-free sub-kernels safe for jax.vmap.
hamiltonian_element_batch: top-level vmappable element function; selects the correct sub-kernel via nested jnp.where (all five excitation types evaluated; wrong branches produce finite values masked out, no NaN propagation).
precompute_h_vals(dets_alpha, dets_beta, row_idx, col_idx, norb, h_core, eri): JIT-compiled, computes all H_ij in one jax.vmap call — called once before Davidson starts.

scatter_add_matvec(coeffs, diag_h, h_vals, row_idx, col_idx): JIT-compiled matvec using scatter-add, zero Python loops:

v_off = jnp.zeros(ndet).at[row_idx].add(h_vals * coeffs[col_idx])
v_off = v_off.at[col_idx].add(h_vals * coeffs[row_idx])
return diag_h * coeffs + v_off

`cipsi.py` — wire up fast path in `_diagonalise_variational_space`

Replaced the hamiltonian_vector_product closure with:

row_idx, col_idx = precompute_connections(dets_alpha, dets_beta, norb)
h_vals = precompute_h_vals(dets_alpha, dets_beta, row_idx, col_idx, norb, h_core, eri)
diag.davidson(lambda v: scatter_add_matvec(v, h_diag, h_vals, row_idx, col_idx))

h_vals is computed once; all ~14 Davidson matvec calls reuse it.

`diagonaliser.py` — batched MGS projection

Replaced the for k in range(m) Gram-Schmidt loop (O(m) Python iterations growing with subspace size) with a single batched projection, valid because V_m maintains orthonormal columns throughout:

# before
for k in range(m):
    vk = Vmat[:, k:k+1]
    new_vecs = new_vecs - vk * (vk.T @ new_vecs)

# after
new_vecs = corrections - V_m @ (V_m.T @ corrections)

Agent-Logs-Url: https://github.com/jphaupt/CIPSIpy/sessions/041213a4-3359-43a6-833d-411eaa4193d8 Co-authored-by: jphaupt <10662835+jphaupt@users.noreply.github.com>

jphaupt · 2026-04-08T12:42:01Z

@copilot If you're going to use acronyms, please always define it at least once (when first using it). For example, I am not sure what MGS is in this context.

Agent-Logs-Url: https://github.com/jphaupt/CIPSIpy/sessions/7362fbc9-cf5d-4c0b-8db7-788edf8585c1 Co-authored-by: jphaupt <10662835+jphaupt@users.noreply.github.com>

Copilot · 2026-04-08T12:43:27Z

@copilot If you're going to use acronyms, please always define it at least once (when first using it). For example, I am not sure what MGS is in this context.

Done — "MGS" is now written out as "modified Gram-Schmidt (MGS)" on its first use in diagonaliser.py (commit cec1750). The PR title keeps the short form since it already appears after the definition.

Copilot AI and others added 2 commits April 3, 2026 10:10

optimise Hv: precompute pairs, vmap element kernel, batched MGS

e55e3e1

Agent-Logs-Url: https://github.com/jphaupt/CIPSIpy/sessions/041213a4-3359-43a6-833d-411eaa4193d8 Co-authored-by: jphaupt <10662835+jphaupt@users.noreply.github.com>

address code review: clarify comments and rename dummy_c -> dummy_coeffs

367f445

Agent-Logs-Url: https://github.com/jphaupt/CIPSIpy/sessions/041213a4-3359-43a6-833d-411eaa4193d8 Co-authored-by: jphaupt <10662835+jphaupt@users.noreply.github.com>

Copilot AI assigned Copilot and jphaupt Apr 3, 2026

Copilot created this pull request from a session on behalf of jphaupt April 3, 2026 10:14 View session

Copilot started work on behalf of jphaupt April 8, 2026 12:42 View session

expand MGS acronym to "modified Gram-Schmidt (MGS)" on first use

cec1750

Agent-Logs-Url: https://github.com/jphaupt/CIPSIpy/sessions/7362fbc9-cf5d-4c0b-8db7-788edf8585c1 Co-authored-by: jphaupt <10662835+jphaupt@users.noreply.github.com>

Copilot finished work on behalf of jphaupt April 8, 2026 12:45

Copilot AI requested a review from jphaupt April 8, 2026 12:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimise Davidson Hv: precompute connected pairs, vmap element kernel, batched MGS#8

Optimise Davidson Hv: precompute connected pairs, vmap element kernel, batched MGS#8
Copilot wants to merge 3 commits into
mainfrom
copilot/optimize-hamiltonian-vector-product

Copilot AI commented Apr 3, 2026

Uh oh!

jphaupt commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Apr 3, 2026

Changes

determinants.py — JAX-native bitwise helpers + pair precomputation

hamiltonian.py — vmappable element kernel + JIT-compiled matvec

cipsi.py — wire up fast path in _diagonalise_variational_space

diagonaliser.py — batched MGS projection

Uh oh!

jphaupt commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

`determinants.py` — JAX-native bitwise helpers + pair precomputation

`hamiltonian.py` — vmappable element kernel + JIT-compiled matvec

`cipsi.py` — wire up fast path in `_diagonalise_variational_space`

`diagonaliser.py` — batched MGS projection