bench: compare message content parsing by jarvis9443 · Pull Request #41 · api7/lua-qjson

jarvis9443 · 2026-05-18T02:10:06Z

Update the Lua benchmark to match the AI request-body access pattern more closely.

The benchmark now:

uses OpenResty's LuaJIT by default
runs the benchmark through OpenResty resty, so lua-resty-simdjson can run in its normal ngx environment
compares lua-cjson, lua-resty-simdjson, quickdecode.parse, qd.decode + content access, and qd.decode + qd.encode
reads model, temperature, and every messages[*].content
generates multimodal requests where every message has one base64 image; the 10 MB case has roughly 10 messages with ~1 MB image content each
refreshes README/docs benchmark results from the OpenResty run

Validation:

make bench
make test
make lint

Summary by CodeRabbit

Behavior
- Default multimodal access now iterates and reads every message's content instead of sampling only the first message.
Documentation
- Benchmarks, README, and guidance updated to compare quickdecode, cjson, and simdjson across 2 KB–10 MB multimodal workloads with refreshed results, observations, and reproduction steps.
Tests / Benchmarks
- Benchmark scenarios and naming updated to measure content-access and interleaved runs; throughput and memory tables refreshed.
Chores
- Bench runner and environment defaults updated to use an OpenResty-based invocation for local benchmarking.

coderabbitai · 2026-05-18T02:10:18Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

This PR changes the benchmark fixture to a multi-message chat shape, updates accessors to read every message.content, rewires scenario labels/runs to call those accessors, adjusts Makefile to run via OpenResty, and refreshes docs and README with new workload, results, and reproduction instructions.

Changes

Benchmark measurement and documentation update

Layer / File(s)	Summary
Bench: simdjson require and payload comment `benches/lua_bench.lua`	Add `resty.simdjson` require and update payload comment to describe a multi-message history.
Payload generation: message-count and image sizing `benches/lua_bench.lua`	`make_payload` computes `message_count` from `target_bytes`, alternates `user`/`assistant` roles, embeds per-message text and base64 `image_url` parts, and assembles the `messages` array.
Accessor functions and content path helper `benches/lua_bench.lua`	Default accessors iterate all `messages` and read each `msg.content`; adds cached `content_paths(n)` helper for quickdecode path reads; table accessor touches each `msg.content`.
Benchmark wiring, labels, and interleaved scenarios `benches/lua_bench.lua`	Per-scenario benches renamed/relabelled (e.g., `cjson.decode`/`qd.parse` -> “access fields”, `qd.decode` -> “access content”) and routed through the content-based default accessors; interleaved 400-iteration loop rewritten to call defaults.
Makefile: OpenResty env and bench runner `Makefile`	Introduce overridable `OPENRESTY` and derive `LUAJIT`/`RESTY`/`LUA_PATH`/`LUA_CPATH`; export `LUA_PATH` in `LUA_ENV`; run `bench` via `$(RESTY)` under `$(LUA_ENV)`.
Docs: intro, payload model, scenarios/results `docs/benchmarks.md`	Refocus intro/environment on quickdecode parse+read use-case; change multimodal payload model to scale message count with payload size; overhaul workload scenarios, throughput/speed-up/memory tables, and reproduce steps to `make bench`.
Docs: observations, guidance, caveats `docs/benchmarks.md`	Replace Observations/When-to-pick with new takeaways about quickdecode scaling and message content access costs; adjust Caveats to emphasize structural work vs memcpy framing.
README: benchmarks section update `README.md`	Replace Benchmarks subsection with new workload and result framing (including `qd.decode + content`) and update `make bench` comment to include `lua-resty-simdjson`.
CLAUDE.md: bench guidance and layout `CLAUDE.md`	Update `make bench` description to OpenResty LuaJIT benchmark comparing `lua-cjson` and `simdjson`; expand `benches/` layout entry and name fixtures.

🎯 2 (Simple) | ⏱️ ~12 minutes

🚥 Pre-merge checks | ✅ 4

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The pull request title clearly summarizes the main change: updating benchmarks to compare message content parsing across implementations with a focus on the multimodal chat-completion workload.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

benches/lua_bench.lua (1)

335-339: ⚡ Quick win

Reduce accessor drift by reusing the default accessor helpers in interleaved benches.

The interleaved block now duplicates content-access logic already defined in default_cjson_access, default_qd_access, and default_table_access. Reusing those helpers keeps workload semantics aligned across benchmark modes and avoids future divergence.

♻️ Proposed refactor

 bench("cjson.decode + access fields", 400, function()
     local p = next_p()
     local obj = cjson.decode(p)
-    local _ = obj.model
-    local _ = obj.temperature
-    if obj.messages then
-        for _, msg in ipairs(obj.messages) do
-            local _ = msg.content
-        end
-    end
+    default_cjson_access(obj)
 end)

 ...

 bench("quickdecode.parse + access fields", 400, function()
     local p = next_p()
     local d = qd.parse(p)
-    local _ = d:get_str("model")
-    local _ = d:get_f64("temperature")
-    local n = d:len("messages") or 0
-    for i = 0, n - 1 do
-        local _ = d:typeof("messages[" .. i .. "].content")
-    end
+    default_qd_access(d)
 end)

 ...

 bench("quickdecode pooled :parse + access fields", 400, function()
     local p = next_p()
     local d = pooled_decoder:parse(p)
-    local _ = d:get_str("model")
-    local _ = d:get_f64("temperature")
-    local n = d:len("messages") or 0
-    for i = 0, n - 1 do
-        local _ = d:typeof("messages[" .. i .. "].content")
-    end
+    default_qd_access(d)
 end)

 ...

 bench("qd.decode + access content", 400, function()
     local p = next_p()
     local t = qd.decode(p)
-    local _ = t.model
-    local _ = t.temperature
-    if t.messages then
-        for i = 1, qd.len(t.messages) do
-            local msg = t.messages[i]
-            local _ = msg.content
-        end
-    end
+    default_table_access(t)
 end)

Also applies to: 348-351, 361-364, 374-379

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@benches/lua_bench.lua` around lines 335 - 339, The interleaved bench block
duplicates message content-access logic that already exists in the helper
functions default_cjson_access, default_qd_access, and default_table_access;
replace the manual loop over obj.messages and the local _ = msg.content access
by invoking the appropriate helper (e.g., call default_cjson_access(obj) or
default_qd_access(obj) or default_table_access(obj) depending on the active
accessor) so the interleaved path reuses the shared accessor logic, and remove
the duplicated per-message accessor code to avoid drift.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@docs/benchmarks.md`:
- Line 53: Update the benchmark table entry for "quickdecode.parse + access
fields" so the workload description removes the stray word "type" and reads
explicitly that it "reads every `messages[i].content`" (the cell that currently
contains "`qd.parse(s)`, read `model` / `temperature`, then read every
`messages[i].content` type" should be changed to drop "type"); edit the
`qd.parse(s)` / `messages[i].content` text in the same table row to match this
corrected phrasing.

---

Nitpick comments:
In `@benches/lua_bench.lua`:
- Around line 335-339: The interleaved bench block duplicates message
content-access logic that already exists in the helper functions
default_cjson_access, default_qd_access, and default_table_access; replace the
manual loop over obj.messages and the local _ = msg.content access by invoking
the appropriate helper (e.g., call default_cjson_access(obj) or
default_qd_access(obj) or default_table_access(obj) depending on the active
accessor) so the interleaved path reuses the shared accessor logic, and remove
the duplicated per-message accessor code to avoid drift.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: b43f62f9-cf27-4dd1-98e6-5e10ac8a0487

📥 Commits

Reviewing files that changed from the base of the PR and between 062a1cd and d1f7457.

📒 Files selected for processing (2)

benches/lua_bench.lua
docs/benchmarks.md

Copilot

Pull request overview

Updates the Lua benchmark workload to touch all chat messages’ content fields (instead of only the first message’s role) so benchmark access patterns better reflect how request bodies are inspected in the proxy path.

Changes:

Updated benches/lua_bench.lua accessors to iterate through messages[*].content for both cjson and qd.decode, and to typeof each messages[i].content for qd.parse.
Refreshed docs/benchmarks.md to match the new benchmark workload and updated result tables/observations accordingly.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
docs/benchmarks.md	Updates benchmark narrative, workload definitions, and published results for the new “touch message content” access pattern.
benches/lua_bench.lua	Changes benchmark access functions to read/inspect each message’s `content` rather than only `messages[1].role`.

Comments suppressed due to low confidence (2)

docs/benchmarks.md:91

The speed-up table header uses qd.decode + access, but the workload row/bench output uses qd.decode + access content. For consistency and easier cross-referencing, align the label here as well.

| Scenario | `qd.parse` / cjson | `qd.decode + access` / cjson |
|---|---:|---:|

docs/benchmarks.md:110

The memory delta table header uses qd.decode + access, but the workload/bench label is qd.decode + access content. Consider renaming this column header to match the actual benchmark row name.

| Scenario | cjson | `qd.parse` | `qd.decode + access` | `qd.decode + qd.encode` |
|---|---:|---:|---:|---:|

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

Comments suppressed due to low confidence (1)

benches/lua_bench.lua:206

default_qd_access builds a new path string ("messages[" .. i .. "].content") on every iteration, so the benchmark is partly measuring Lua string concatenation/allocation rather than quickdecode field access. Consider switching to cursor-based access (e.g., d:open("messages") + :index(i) + :typeof("content")/get_*) or precomputing paths to keep the workload focused on JSON access costs.

    local n = d:len("messages") or 0
    for i = 0, n - 1 do
        local _ = d:typeof("messages[" .. i .. "].content")
    end

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

benches/lua_bench.lua (1)

247-249: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Fix qd.decode label for scenarios that override table_access.

For github-100k, the benchmark does not access messages[*].content, but the printed label says it does.

Proposed fix

--- a/benches/lua_bench.lua
+++ b/benches/lua_bench.lua
@@
-    bench("qd.decode + access content", s.iters, function()
+    local qd_decode_label = s.table_access and "qd.decode + access fields" or "qd.decode + access content"
+    bench(qd_decode_label, s.iters, function()
         local t = qd.decode(s.payload)
         table_access(t)
     end)

Also applies to: 293-296

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@benches/lua_bench.lua` around lines 247 - 249, The benchmark label for
qd.decode incorrectly claims it accesses messages[*].content for scenarios that
provide a custom table_access override (e.g., the scenario entry with name
"github-100k" which sets table_access = github_table_access and qd_access =
github_qd_access). Update the label-generation logic that prints the qd.decode
description so it checks whether a scenario supplies table_access and, if so,
omits the "messages[*].content" part of the printed qd.decode label (i.e.,
conditionally build the qd.decode label based on the presence of table_access
for entries like github-100k).

🧹 Nitpick comments (1)

benches/lua_bench.lua (1)

14-27: ⚡ Quick win

Update stale size-accuracy commentary to match the current generator.

The comment block still references the removed min(500K, remaining) branch logic, which no longer reflects make_payload.

Proposed comment-only fix

--- a/benches/lua_bench.lua
+++ b/benches/lua_bench.lua
@@
--- Size accuracy: the normal-branch upper is `min(500K, remaining)` so the
--- loop cannot overshoot during steady state. When fewer than 50 KB remain
--- the final image falls through to `math.max(1024, remaining)` — undershoot
--- is at most a few hundred bytes; worst-case overshoot is ~1 KB (only when
--- `remaining < 1024`, which the seed=42 walk does not hit for our ladder).
+-- Size accuracy: payload sizing is approximate. Message separators and
+-- role-string variation introduce only small drift from `target_bytes`.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@benches/lua_bench.lua` around lines 14 - 27, The comment above make_payload
is stale because it still mentions the removed `min(500K, remaining)` branch;
update that comment to remove the obsolete phrase and accurately describe the
current generator: explain that image sizes are drawn deterministically from the
Park‑Miller LCG, each chosen so it does not exceed the remaining bytes, and that
when fewer than 50 KB remain the final image uses math.max(1024, remaining)
(with undershoot/overshoot behavior as before). Locate the block near the
make_payload function and replace the outdated sentence with this corrected
description.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Outside diff comments:
In `@benches/lua_bench.lua`:
- Around line 247-249: The benchmark label for qd.decode incorrectly claims it
accesses messages[*].content for scenarios that provide a custom table_access
override (e.g., the scenario entry with name "github-100k" which sets
table_access = github_table_access and qd_access = github_qd_access). Update the
label-generation logic that prints the qd.decode description so it checks
whether a scenario supplies table_access and, if so, omits the
"messages[*].content" part of the printed qd.decode label (i.e., conditionally
build the qd.decode label based on the presence of table_access for entries like
github-100k).

---

Nitpick comments:
In `@benches/lua_bench.lua`:
- Around line 14-27: The comment above make_payload is stale because it still
mentions the removed `min(500K, remaining)` branch; update that comment to
remove the obsolete phrase and accurately describe the current generator:
explain that image sizes are drawn deterministically from the Park‑Miller LCG,
each chosen so it does not exceed the remaining bytes, and that when fewer than
50 KB remain the final image uses math.max(1024, remaining) (with
undershoot/overshoot behavior as before). Locate the block near the make_payload
function and replace the outdated sentence with this corrected description.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 4ce8e27c-5276-4ee4-a28f-6509255a2816

📥 Commits

Reviewing files that changed from the base of the PR and between 15ae933 and 6f3d93f.

📒 Files selected for processing (2)

benches/lua_bench.lua
docs/benchmarks.md

✅ Files skipped from review due to trivial changes (1)

docs/benchmarks.md

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Comments suppressed due to low confidence (1)

benches/lua_bench.lua:22

The payload generator comments still describe the old Park–Miller LCG-driven image sizing, but make_payload no longer uses any RNG and instead computes a fixed image_size per message. Update/remove these lines so the benchmark description matches the actual payload construction.

-- Image sizes are drawn from a deterministic Park-Miller LCG (not math.random,
-- which delegates to libc rand() and varies across machines) so the same
-- target_bytes produces byte-identical output on any LuaJIT 2.1 host.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

README.md (1)
86-87: ⚡ Quick win

Tighten workload wording for accuracy.

Consider changing “every messages[*].content path” to “every messages[*].content paths” (or “all messages[*].content entries”) to avoid singular/plural ambiguity.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@README.md` around lines 86 - 87, Update the README wording to remove
singular/plural ambiguity by replacing the phrase "every messages[*].content
path" with a clearer form such as "every `messages[*].content` paths" or "all
`messages[*].content` entries"; locate the text containing the literal snippet
messages[*].content in the README and make the replacement so the sentence reads
unambiguous and grammatically correct.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@README.md`:
- Line 7: The README currently references a nonexistent "Roadmap / Deferred"
section ("— see `Roadmap / Deferred` below."); either add that section header
"Roadmap / Deferred" with the deferred decisions/notes (tracking tuning
decisions such as benchmark tuning) or remove/replace the inline reference
string so it no longer points to a missing section; update the sentence that
contains "— see `Roadmap / Deferred` below." and ensure the new "Roadmap /
Deferred" header contains the deferred items you want tracked.

---

Nitpick comments:
In `@README.md`:
- Around line 86-87: Update the README wording to remove singular/plural
ambiguity by replacing the phrase "every messages[*].content path" with a
clearer form such as "every `messages[*].content` paths" or "all
`messages[*].content` entries"; locate the text containing the literal snippet
messages[*].content in the README and make the replacement so the sentence reads
unambiguous and grammatically correct.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: ffd4b74e-0cd0-4ce9-bcc9-89442506aa06

📥 Commits

Reviewing files that changed from the base of the PR and between 4c03716 and b2c414b.

📒 Files selected for processing (3)

README.md
benches/lua_bench.lua
docs/benchmarks.md

✅ Files skipped from review due to trivial changes (1)

docs/benchmarks.md

🚧 Files skipped from review as they are similar to previous changes (1)

benches/lua_bench.lua

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Comments suppressed due to low confidence (1)

docs/benchmarks.md:72

The reproduction instructions imply make bench will “run the full comparison”, but the repo doesn’t install/vendor lua-resty-simdjson and the bench script requires resty.simdjson unconditionally. Please document this dependency explicitly (where to get it / how to add it to LUA_PATH/OpenResty) or adjust the bench to skip the simdjson rows when unavailable so make bench works out-of-the-box.

Run the full comparison with one command:

```sh
make bench

This builds quickdecode, builds the vendored lua-cjson against OpenResty's
LuaJIT, then invokes benches/lua_bench.lua through OpenResty's resty so
lua-resty-simdjson runs in its normal ngx environment.

</details>

bench: read message content in lua benchmark

d1f7457

Copilot AI review requested due to automatic review settings May 18, 2026 02:10

Copilot started reviewing on behalf of jarvis9443 May 18, 2026 02:10 View session

coderabbitai Bot reviewed May 18, 2026

View reviewed changes

Comment thread docs/benchmarks.md Outdated

Copilot AI reviewed May 18, 2026

View reviewed changes

Comment thread docs/benchmarks.md Outdated

Comment thread docs/benchmarks.md Outdated

jarvis9443 added 2 commits May 18, 2026 10:15

bench: reuse content access helpers

99f0875

docs: clarify benchmark access labels

15ae933

Copilot AI review requested due to automatic review settings May 18, 2026 02:16

Copilot started reviewing on behalf of jarvis9443 May 18, 2026 02:16 View session

Copilot AI reviewed May 18, 2026

View reviewed changes

Comment thread benches/lua_bench.lua

Comment thread docs/benchmarks.md Outdated

Comment thread docs/benchmarks.md Outdated

bench: model image content across messages

6f3d93f

coderabbitai Bot reviewed May 18, 2026

View reviewed changes

bench: compare simdjson under openresty

4c03716

Copilot AI review requested due to automatic review settings May 18, 2026 02:41

jarvis9443 changed the title ~~bench: read message content in lua benchmark~~ bench: compare message content parsing May 18, 2026

Copilot started reviewing on behalf of jarvis9443 May 18, 2026 02:41 View session

Copilot AI reviewed May 18, 2026

View reviewed changes

Comment thread benches/lua_bench.lua Outdated

Comment thread docs/benchmarks.md Outdated

Comment thread README.md Outdated

Comment thread README.md Outdated

docs: align benchmark environment and comments

b2c414b

coderabbitai Bot reviewed May 18, 2026

View reviewed changes

Comment thread README.md Outdated

docs: remove stale roadmap reference

1864716

Copilot AI review requested due to automatic review settings May 18, 2026 06:44

Copilot started reviewing on behalf of jarvis9443 May 18, 2026 06:45 View session

docs: clarify content path wording

18bc50b

Copilot AI reviewed May 18, 2026

View reviewed changes

Comment thread Makefile Outdated

Comment thread benches/lua_bench.lua Outdated

Comment thread README.md

Comment thread docs/benchmarks.md Outdated

bench: handle optional simdjson setup

9718a79

nic-6443 merged commit 0c32d51 into main May 18, 2026
4 checks passed

nic-6443 deleted the bench-message-content-20260518 branch May 18, 2026 08:10

coderabbitai Bot mentioned this pull request May 18, 2026

chore: prepare qjson public naming #42

Merged

Conversation

jarvis9443 commented May 18, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jarvis9443 commented May 18, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 18, 2026 •

edited

Loading