feat: add LatLngToCellBatch for lower cgo overhead by zoobst · Pull Request #119 · uber/h3-go

zoobst · 2026-05-27T14:39:24Z

Resolves #113

Adds LatLngToCellBatch so a slice of LatLng -> []Cell costs one cgo transaction for the whole batch instead of one per row. The motivating use case is pipelines transforming millions of coordinate rows per cycle, where cgo overhead currently dominates actual H3 work.

Per @jogly's suggestion in the issue, the implementation lives in a new C extension (h3_latLngBatch.{c,h}) that supplements the cloned H3 core. If H3 core later exposes and equivalent latLngToCellBatch, swapping the Go wrapper to call it directly is a one-line change.

Changes

File	Change
`h3_latLngBatch.h` (new)	Declaration of `latLngToCellBatch`
`h3_latLngBatch.c` (new)	C-side loop calling `latLngToCell`
`h3.go`	`#include <h3_latLngBatch.h>` in cgo preamble; new `LatLngToCellBatch` wrapper
`h3_test.go`	Parity tests against per-call across all resolutions, empty input, single element, invalid resolution
`bench_test.go`	`BenchmarkLatLngToCellBatch` plus a `BenchmarkLatLngToCellBaseline` for comparing results

Benchmark

goos: darwin
goarch: arm64
pkg: github.com/uber/h3-go/v4
cpu: Apple M1
BenchmarkLatLngToCellsBatch/1-8                  7131451           319.4 ns/op
BenchmarkLatLngToCellsBatch/64-8                  144639           16473 ns/op
BenchmarkLatLngToCellsBatch/1024-8                  8794           263703 ns/op
BenchmarkLatLngToCellsBatch/16384-8                  564           4244748 ns/op
BenchmarkLatLngToCellsBatch/1000000-8                  8           259712438 ns/op
BenchmarkLatLngToCellsBatch/10000000-8                 1           2623521042 ns/op
BenchmarkLatLngToCellsBaseline/1-8               7072687           309.8 ns/op
BenchmarkLatLngToCellsBaseline/64-8               120739           19834 ns/op
BenchmarkLatLngToCellsBaseline/1024-8               7389           319792 ns/op
BenchmarkLatLngToCellsBaseline/16384-8               465           5118101 ns/op
BenchmarkLatLngToCellsBaseline/1000000-8               7           312783863 ns/op
BenchmarkLatLngToCellsBaseline/10000000-8              1           3151075250 ns/op

At small n (n < 32) the per-call path wins. The crossover is around n=16-32; from there the batched version grows roughly linearly while the loop pays the cgo cost N times. At n=16384 the batched version is ~1.21x faster

Conventions

Returns first error encountered, no partial results.
Conversion of degrees -> radians happens in Go via the existing LatLng.toC(), so the C code sees radians-LatLng identical to the single-call path.
No changes to public types or existing function signatures.

… Adds a batched LatLng -> Cell API in a new C file (h3_latLngBatch.{c,h}) that supplements the cloned H3 core.

CLAassistant · 2026-05-27T14:39:38Z

All committers have signed the CLA.

coveralls · 2026-05-28T18:58:20Z

Coverage Report for CI Build 26683177239

Coverage remained the same at 100.0%

Details

Coverage remained the same as the base build.
Patch coverage: 24 of 24 lines across 1 file are fully covered (100%).
No coverage regressions found.

Uncovered Changes

No uncovered changes found.

Coverage Regressions

No coverage regressions found.

Coverage Stats


Relevant Lines:	815
Covered Lines:	815
Line Coverage:	100.0%
Coverage Strength:	298.19 hits per line

💛 - Coveralls

zachcoleman

lgtm and clean

justinhwang · 2026-06-01T23:38:44Z

Actually had a thought on pure Go implementations (as we can see in #120) - I'm wondering if its a pure Go implementation, then would we also get rid of the Cgo overhead and wouldn't need a batched version?

Before (CGo):

BenchmarkLatLngToCell-16         4155702               270.4 ns/op            24 B/op          2 allocs/op

After (pure Go):

BenchmarkLatLngToCell-16         4177394               288.9 ns/op             0 B/op          0 allocs/op

benchstat (n=10):

                │ /tmp/latlng_old.txt │        /tmp/latlng_new3.txt        │
                │       sec/op        │   sec/op     vs base               │
LatLngToCell-16           273.6n ± 1%   291.2n ± 1%  +6.43% (p=0.000 n=10)

                │ /tmp/latlng_old.txt │        /tmp/latlng_new3.txt        │
                │        B/op         │   B/op     vs base                 │
LatLngToCell-16            24.00 ± 0%   0.00 ± 0%  -100.00% (p=0.000 n=10)

                │ /tmp/latlng_old.txt │        /tmp/latlng_new3.txt         │
                │      allocs/op      │ allocs/op   vs base                 │
LatLngToCell-16            2.000 ± 0%   0.000 ± 0%  -100.00% (p=0.000 n=10)

zoobst · 2026-06-02T13:33:25Z

Actually had a thought on pure Go implementations (as we can see in #120) - I'm wondering if its a pure Go implementation, then would we also get rid of the Cgo overhead and wouldn't need a batched version?

As a user and h3 & Go enthusiast, I'm all for pure Go optimizations.
However, the codebase in this repo seems to avoid becoming a pure Go h3 implementation and tries to stay as aligned as possible to core h3.
Not my call here, but would be happy to contribute to a pure Go h3 lib in the future.

zoobst added 2 commits May 27, 2026 09:35

feat: add LatLngToCellsBatch for lower cgo overhead. Resolves uber#113.…

c4df3c8

… Adds a batched LatLng -> Cell API in a new C file (h3_latLngBatch.{c,h}) that supplements the cloned H3 core.

updated benchmarks to align with benchmark format

c977b0c

jogly previously approved these changes May 28, 2026

View reviewed changes

justinhwang reviewed May 29, 2026

View reviewed changes

Comment thread h3.go Outdated

zachcoleman previously approved these changes May 29, 2026

View reviewed changes

updated naming to singular LatLngToCellBatch

b9716ae

zoobst dismissed stale reviews from zachcoleman and jogly via b9716ae May 30, 2026 11:54

updated naming to singular LatLngToCellBatch

8254f3f

zoobst requested review from justinhwang May 30, 2026 11:59

zoobst changed the title ~~feat: add LatLngToCellsBatch for lower cgo overhead~~ feat: add LatLngToCellBatch for lower cgo overhead May 30, 2026

justinhwang approved these changes Jun 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add LatLngToCellBatch for lower cgo overhead#119

feat: add LatLngToCellBatch for lower cgo overhead#119
zoobst wants to merge 4 commits into
uber:masterfrom
zoobst:feat/latlng-to-cells-batch

zoobst commented May 27, 2026 •

edited

Loading

Uh oh!

CLAassistant commented May 27, 2026 •

edited

Loading

Uh oh!

coveralls commented May 28, 2026 •

edited

Loading

Uh oh!

Uh oh!

zachcoleman left a comment

Uh oh!

justinhwang commented Jun 1, 2026

Uh oh!

zoobst commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

zoobst commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Resolves #113

Changes

Benchmark

Conventions

Uh oh!

CLAassistant commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage Report for CI Build 26683177239

Coverage remained the same at 100.0%

Details

Uncovered Changes

Coverage Regressions

Coverage Stats

💛 - Coveralls

Uh oh!

Uh oh!

zachcoleman left a comment

Choose a reason for hiding this comment

Uh oh!

justinhwang commented Jun 1, 2026

Uh oh!

zoobst commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

zoobst commented May 27, 2026 •

edited

Loading

CLAassistant commented May 27, 2026 •

edited

Loading

coveralls commented May 28, 2026 •

edited

Loading