genMetadata augment PoC (local llama.cpp) #1215
Draft
eschultink wants to merge 53 commits into
…ents/Augment.java Co-authored-by: aperez-worklytics <75276364+aperez-worklytics@users.noreply.github.com>
Add constructor injection so Dagger can provide AugmentProcessor, and suppress SLF4J/JUL warnings during gateway-core tests. Co-authored-by: Cursor <cursoragent@cursor.com>
Ignore computed functionName during Jackson round-trips and align sanitized fixtures with augment sibling output. Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Wire lazy model resolution through instance and shared remote resource paths at runtime, address review feedback on model loading and docs, and add provider tests. Co-authored-by: Cursor <cursoragent@cursor.com>
Move ResourceService to gateway-core, drop redundant NLP wiring helpers, and return SentenceMetadataResult instead of ad-hoc maps. Co-authored-by: Cursor <cursoragent@cursor.com>
…Maven lifecycle. Move OpenNLP download to generate-resources so models are on the test classpath, and replace skipped assumptions with explicit assertions. Co-authored-by: Cursor <cursoragent@cursor.com>
Introduces genMetadata augment processing, Dagger-wired config/backend, non-fatal warning headers, Terraform enable_gen_metadata (4096MB floor + remote resources), and MS Copilot PoC rules/docs. Co-authored-by: Cursor <cursoragent@cursor.com>
Extract ResourceServiceBindingsModule and wire NoOp remote buckets in CmdLineModule so bulk CLI compiles without FunctionRuntimeModule. Co-authored-by: Cursor <cursoragent@cursor.com>
Use HttpEventRequestDto in WebhookSanitizerAugmentsTest, validate zip entry paths on extract, and add explicit canEqual override on GenMetadata. Co-authored-by: Cursor <cursoragent@cursor.com>
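The zip entry path validation mentioned in this commit is commonly done by resolving each entry name against the target directory and rejecting anything that escapes it. The following is a minimal sketch of that general technique, not the code from this PR; the class name `SafeZipExtractor` and both method names are hypothetical.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper; illustrates path validation on extract ("zip slip" guard).
final class SafeZipExtractor {

    // Resolves an entry name against targetDir and rejects names that
    // normalize to a location outside of it (e.g. "../evil" or absolute paths).
    static Path resolveEntry(Path targetDir, String entryName) throws IOException {
        Path root = targetDir.toAbsolutePath().normalize();
        Path resolved = root.resolve(entryName).normalize();
        if (!resolved.startsWith(root)) {
            throw new IOException("Zip entry escapes target directory: " + entryName);
        }
        return resolved;
    }

    // Extracts every entry, validating each path before anything is written.
    static void extract(InputStream zipStream, Path targetDir) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(zipStream)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                Path out = resolveEntry(targetDir, entry.getName());
                if (entry.isDirectory()) {
                    Files.createDirectories(out);
                } else {
                    Files.createDirectories(out.getParent());
                    // Files.copy reads the current entry only and leaves the stream open.
                    Files.copy(zip, out, StandardCopyOption.REPLACE_EXISTING);
                }
            }
        }
    }
}
```

Comparing normalized absolute paths with `Path.startsWith` checks whole path components, so a sibling directory that merely shares a name prefix with the target is still rejected.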
* hotfix nondeterministic apply
* lambda timeout cannot exceed queue timeout
Jlama treats maxTokens as total KV ceiling, not max output; stop passing PSOXY_GEN_MAX_TOKENS to JlamaChatModel and truncate prompts via Dagger-wired GenMetadataPromptBudget instead. Co-authored-by: Cursor <cursoragent@cursor.com>
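Because the ceiling covers prompt plus output, the prompt has to be cut so that room remains for generation. The sketch below illustrates that arithmetic only; it is not the `GenMetadataPromptBudget` from this PR. The class name `PromptBudgetSketch`, its constructor parameters, and the characters-per-token estimate are all assumptions made for illustration (a real implementation would more likely count tokens with the model's tokenizer).

```java
// Hypothetical sketch: reserve output tokens out of a total context ceiling,
// then truncate the variable part of the prompt to fit what is left.
final class PromptBudgetSketch {
    private final int contextCeilingTokens; // total ceiling: prompt + output
    private final int reservedOutputTokens; // tokens kept free for the model's answer
    private final int charsPerToken;        // crude estimate; an assumption, not a tokenizer

    PromptBudgetSketch(int contextCeilingTokens, int reservedOutputTokens, int charsPerToken) {
        if (contextCeilingTokens <= 0 || charsPerToken <= 0
                || reservedOutputTokens < 0 || reservedOutputTokens >= contextCeilingTokens) {
            throw new IllegalArgumentException("reserved output must leave room for the prompt");
        }
        this.contextCeilingTokens = contextCeilingTokens;
        this.reservedOutputTokens = reservedOutputTokens;
        this.charsPerToken = charsPerToken;
    }

    int maxPromptTokens() {
        return contextCeilingTokens - reservedOutputTokens;
    }

    // Keeps the instructions intact and truncates only the content being described.
    String truncate(String instructions, String content) {
        long maxChars = (long) maxPromptTokens() * charsPerToken;
        long room = maxChars - instructions.length();
        if (room <= 0) {
            throw new IllegalArgumentException("instructions alone exceed the prompt budget");
        }
        if (content.length() <= room) {
            return instructions + content;
        }
        int end = (int) room;
        // Avoid cutting a surrogate pair in half.
        if (Character.isHighSurrogate(content.charAt(end - 1))) {
            end--;
        }
        return instructions + content.substring(0, end);
    }
}
```

With a ceiling of 100 tokens, 20 reserved for output, and 4 characters per token, the prompt budget is 80 tokens, or roughly 320 characters including the instructions.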
An error occurred while trying to automatically change base from rc-v0.6.3 to main (June 8, 2026 17:34)
Resolve conflicts keeping gen-metadata augments alongside rc-v0.6.6 features (output schema filtering, FOLLOW_REDIRECTS, IAM consolidation). Co-authored-by: Cursor <cursoragent@cursor.com>
…hema failure. Add PSOXY_GEN_META_RETRIES (default 2 attempts) so intermittent schema-echo outputs can recover without failing the augment immediately.
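A bounded retry on schema failure can be sketched as below. This is an illustration of the idea rather than the PR's implementation: the class `SchemaRetrySketch`, its method names, and the decision to read the setting as a total attempt count (minimum 1, falling back to 2 on missing or invalid input) are assumptions.

```java
import java.util.Optional;
import java.util.function.Predicate;
import java.util.function.Supplier;

// Hypothetical sketch: re-run inference until the output passes a schema check
// or the configured number of attempts is used up.
final class SchemaRetrySketch {
    static final int DEFAULT_ATTEMPTS = 2;

    // Parses a raw setting (e.g. the value of PSOXY_GEN_META_RETRIES);
    // missing or invalid values fall back to the default.
    static int parseAttempts(String raw) {
        if (raw == null || raw.isBlank()) {
            return DEFAULT_ATTEMPTS;
        }
        try {
            return Math.max(1, Integer.parseInt(raw.trim()));
        } catch (NumberFormatException e) {
            return DEFAULT_ATTEMPTS;
        }
    }

    // Returns the first output that matches the schema, or empty if every attempt fails.
    static Optional<String> generate(Supplier<String> inference,
                                     Predicate<String> matchesSchema,
                                     int attempts) {
        for (int i = 0; i < attempts; i++) {
            String output = inference.get();
            if (output != null && matchesSchema.test(output)) {
                return Optional.of(output);
            }
        }
        return Optional.empty();
    }
}
```

Returning an empty `Optional` rather than throwing leaves it to the caller to decide how to report the failure, for example as a non-fatal warning.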
PoC for LLM-generated metadata alongside sanitized fields: `genMetadata` augment with local GGUF via llama.cpp, building on the augments framework and `sentenceMetadata` (OpenNLP). Latest commit adds genMetadata processing, warning headers, Dagger wiring, and Terraform `enable_gen_metadata`.

Features

- `+{field}:genMetadata` from deployment-configured backend (BETA: `local` / llama.cpp)
- `enable_gen_metadata` on AWS/GCP host modules: `ENABLE_GEN_METADATA`, 4096 MB memory floor, remote resources for GGUF weights
- MS Copilot PoC rules (`MS_COPILOT_GEN_METADATA_AUGMENT`) and commented YAML example (11 categories, no confidence)
- Non-fatal `X-Psoxy-Warning` headers (`augment-gen-unavailable`, `augment-gen-inference-failed`, `augment-output-schema-mismatch`)

Logistics

Change implications

- `de.kherud:llama` (java-llama.cpp JNI) in `psoxy-core` only; see gen-metadata-augment.md
- `enable_gen_metadata` (host + per-connector); floors memory and enables remote resource paths when set
- `PSOXY_GEN_*`, `ENABLE_GEN_METADATA`
- `{SHARED_RESOURCE_PATH}/llm/{model}.gguf` when using local backend

Test plan

- `mvn test -pl gateway-core,core -am` (local)
- `terraform validate` in `infra/examples-dev/aws` and `infra/examples-dev/gcp`
- `enable_gen_metadata = true`, upload GGUF, exercise MS Copilot connector with genMetadata rules enabled
- `X-Psoxy-Warning` when model missing (no 5xx)

Made with Cursor
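The missing-model case from the test plan can be illustrated roughly as follows. This is a sketch under assumptions, not the PR's code: the class `GenWarningSketch` and its methods are hypothetical, and mapping a missing GGUF file to the `augment-gen-unavailable` code is a guess based on the warning names listed in the description.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: resolve the local model file and, if it is absent,
// attach a warning header instead of failing the request.
final class GenWarningSketch {
    static final String WARNING_HEADER = "X-Psoxy-Warning";
    // Assumed mapping: a missing model is reported as "unavailable".
    static final String GEN_UNAVAILABLE = "augment-gen-unavailable";

    // Builds {SHARED_RESOURCE_PATH}/llm/{model}.gguf from its two parts.
    static Path modelPath(Path sharedResourcePath, String model) {
        return sharedResourcePath.resolve("llm").resolve(model + ".gguf");
    }

    // Returns extra response headers; empty when the model file is present.
    static Map<String, String> warningHeaders(Path sharedResourcePath, String model) {
        Map<String, String> headers = new LinkedHashMap<>();
        if (!Files.isRegularFile(modelPath(sharedResourcePath, model))) {
            headers.put(WARNING_HEADER, GEN_UNAVAILABLE);
        }
        return headers;
    }
}
```

A caller would merge these headers into an otherwise normal response, so the sanitized payload is still returned and only the generated metadata is absent.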