Skip to content

perf: use NativePartition wrapper and @transient rddPartitions to red…#19

Open
merrily01 wants to merge 1 commit into
masterfrom
auron#2307_perf-add-transient-reduce-serialization
Open

perf: use NativePartition wrapper and @transient rddPartitions to red…#19
merrily01 wants to merge 1 commit into
masterfrom
auron#2307_perf-add-transient-reduce-serialization

Conversation

@merrily01

Copy link
Copy Markdown
Owner

…uce serialization overhead

Introduce NativePartition[P] wrapper to carry original partition payloads through NativeRDD without re-indexing into the partitions array at execution time, which avoids serializing the full partitions array.

Key changes:

  • Add @transient to rddPartitions in NativeRDD
  • Add NativePartition case class for type-safe partition payload access
  • Wrap input partitions in NativePartition before NativeRDD creation
  • Extract original partition via .asInstanceOf[NativePartition[P]].payload

Which issue does this PR close?

Closes #<issue_number>

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

How was this patch tested?

…uce serialization overhead

Introduce NativePartition[P] wrapper to carry original partition payloads
through NativeRDD without re-indexing into the partitions array at
execution time, which avoids serializing the full partitions array.

Key changes:
- Add @transient to rddPartitions in NativeRDD
- Add NativePartition case class for type-safe partition payload access
- Wrap input partitions in NativePartition before NativeRDD creation
- Extract original partition via .asInstanceOf[NativePartition[P]].payload
@github-actions github-actions Bot added the spark label Jun 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant