Improve PairICL device handling and archive legacy runner by yananlong · Pull Request #8 · yananlong/DDI-FP-Graph

yananlong · 2026-02-07T05:52:33Z

Summary

cast tabular features to float32 during preprocessing so GPU and TPU runs consume identical tensors
move embeddings and support tokens to accelerator memory with CUDA-aware transfers inside the predictor
preserve the original pair_icl_tpu.py implementation under archive/ for future reference

Testing

pytest

Copilot

Pull request overview

This PR modularizes the legacy pair_icl_tpu.py runner into a new PairICL package, adds a CLI and TPU/XLA helpers, and preserves the original monolithic script under archive/ while keeping the old entry point working.

Changes:

Introduce the PairICL package (data/model/predictor/CLI utilities) and include it in packaging.
Replace pair_icl_tpu.py with a legacy forwarder to PairICL.cli, and archive the original implementation.
Add unit tests covering pair embedding invariance, predictor behavior, and GPU-fingerprint-dataset conversion.

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
`PairICL/data.py`	Implements dataset/IO utilities, including GPU fingerprint dataset conversion.
`PairICL/models.py`	Adds model components and the order-invariant pair embedding builder.
`PairICL/predictor.py`	Adds a high-level predictor that supports optional support tokens and XLA helpers.
`PairICL/xla_utils.py`	Provides optional torch_xla wrappers (device selection, spawn, broadcast, reduce/gather).
`PairICL/cli.py`	Adds a new CLI for preprocessing and prediction, including GPU dataset ingestion.
`PairICL/__init__.py`	Exposes the package public API for imports/tests.
`PairICL/__main__.py`	Enables `python -m PairICL` execution.
`pair_icl_tpu.py`	Keeps backwards compatibility by forwarding to the new CLI entry point.
`archive/pair_icl_tpu.py`	Preserves the legacy monolithic TPU script for reference.
`tests/test_pairicl.py`	Adds new tests for PairICL core behaviors and dataset conversion.
`pyproject.toml`	Adds `PairICL` to the packaged modules list.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-07T05:55:53Z

+    def _lookup_embeddings(self, indices: torch.Tensor) -> torch.Tensor:
+        vectors = [self.row_embeddings[int(idx)] for idx in indices.tolist()]
+        return torch.stack(vectors)


_lookup_embeddings() uses indices.tolist() to drive Python dict lookups. When indices is on an accelerator (CUDA/XLA), tolist() forces a device sync + host transfer, which can severely degrade throughput (and negates the benefit of ParallelLoader). Consider keeping indices on CPU, or materialising row_embeddings into a single tensor and using tensor indexing instead of Python iteration.

Copilot · 2026-02-07T05:55:53Z

+        self.device = device or xla_utils.get_default_device()
+        self.icl_model = icl_model.to(self.device)
+        non_blocking = self.device.type == "cuda"
+        self.row_embeddings = {
+            int(k): v.to(self.device, non_blocking=non_blocking)
+            for k, v in row_embeddings.items()
+        }


The constructor eagerly moves all row_embeddings tensors to self.device. For realistic datasets this can easily exhaust TPU/GPU memory and also duplicates memory on every replica. Consider making this behaviour optional (e.g., a flag to keep embeddings on CPU and transfer per-batch), or storing embeddings in a sharded/packed representation.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 11 out of 11 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-08T11:59:15Z

+    if xla_available():
+        return xm.xla_device()
+    if torch.cuda.is_available():
+        return torch.device("cuda")
+    return torch.device("cpu")


get_default_device() currently prefers xm.xla_device() whenever torch_xla imports successfully, even if the runtime is CPU XLA. That contradicts the documented priority order (TPU > CUDA > CPU) and can unexpectedly route GPU-capable environments to XLA/CPU, hurting performance or breaking expected device selection. Consider detecting actual TPU availability (e.g., checking xm.xla_device_hw() / PJRT_DEVICE) before selecting XLA, otherwise fall back to CUDA when available.

Copilot · 2026-02-08T11:59:15Z

+                idx_a = idx_a.to(self.device)
+                idx_b = idx_b.to(self.device)


In predict_loader(), idx_a/idx_b are moved to the accelerator and then _lookup_embeddings() calls indices.tolist(). On CUDA this forces a device sync; on XLA it can trigger expensive host transfers/compilation barriers. Since the indices are only used for Python dict lookups, keep them on CPU (don’t .to(self.device) before lookup), or switch to a tensor-based embedding table so indexing stays on-device.

Suggested change

idx_a = idx_a.to(self.device)

idx_b = idx_b.to(self.device)

# Keep idx_a and idx_b on CPU for Python dict lookups in _lookup_embeddings

Copilot · 2026-02-08T11:59:16Z

+The original repository shipped a monolithic script that combined data loading,
+model definitions and TPU orchestration.  Those components now live inside the
+:mod:`PairICL` package, but several external references still import this file.
+To keep those references working we expose the same ``main`` function while
+reusing the new implementation.


This legacy entry point now forwards to PairICL.cli.main, but the new CLI uses subcommands (preprocess/predict) instead of the legacy --mode/--row_embeds/--pairs_csv flags. That means existing invocations like python pair_icl_tpu.py --mode predict ... will fail even though this file claims to keep legacy references working. Consider adding an argv-translation shim here (or in PairICL.cli) to accept the old flag-style interface and map it to the new subcommands.

Ensure PairICL runs on TPU and GPU and archive legacy script

d2cd431

Copilot AI review requested due to automatic review settings February 7, 2026 05:52

yananlong added the codex label Feb 7, 2026 — with ChatGPT Codex Connector

Copilot started reviewing on behalf of yananlong February 7, 2026 05:52 View session

Copilot AI reviewed Feb 7, 2026

View reviewed changes

yananlong and others added 6 commits February 8, 2026 18:53

Update PairICL/data.py

78ab762

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update PairICL/xla_utils.py

0be4a05

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update PairICL/xla_utils.py

364442a

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update archive/pair_icl_tpu.py

71af2c4

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update archive/pair_icl_tpu.py

b7eaa2f

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update archive/pair_icl_tpu.py

e312945

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

yananlong requested a review from Copilot February 8, 2026 11:54

Copilot started reviewing on behalf of yananlong February 8, 2026 11:55 View session

Copilot AI reviewed Feb 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve PairICL device handling and archive legacy runner#8

Improve PairICL device handling and archive legacy runner#8
yananlong wants to merge 7 commits into
mainfrom
codex/integrate-pair_icl_tpu-functions-into-pairicl

yananlong commented Feb 7, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 7, 2026

Uh oh!

Copilot AI Feb 7, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 8, 2026

Uh oh!

Copilot AI Feb 8, 2026

Uh oh!

Copilot AI Feb 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	idx_a = idx_a.to(self.device)
	idx_b = idx_b.to(self.device)
	# Keep idx_a and idx_b on CPU for Python dict lookups in _lookup_embeddings

Conversation

yananlong commented Feb 7, 2026

Summary

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 7, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 7, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants