fix: cap embedding provider input length by TurboTheTurtle · Pull Request #899 · CortexReach/memory-lancedb-pro

TurboTheTurtle · 2026-06-17T05:27:20Z

Refs #834.

Adds an explicit embedding.maxInputChars cap for embedding provider inputs, with a conservative default for nomic-embed-text local servers. The cap is applied before provider requests and cache lookup so cached embeddings match the actual submitted text.

Also exposes the config in the plugin schema/UI hints and adds focused regression coverage.

rwmjhb

Requesting changes. The input cap direction is useful, but this head has two issues that should be fixed before merge.

Must fix

The input cap truncates long documents before the existing chunking path can preserve later content. In embedSingle, the code computes inputText = this.prepareInput(text) before the provider request and then passes that already-capped inputText into smartChunk() on context errors. embedMany similarly stores prepareInput(text) in validTexts before batch fallback chunking. For nomic-embed-text, the default cap is 1400 chars, so a long memory can be embedded from only the first 1400 characters and never reach the chunk-and-average path. That prevents provider overload, but it regresses recall for facts after the capped prefix. Please apply maxInputChars per provider request/chunk rather than truncating the whole document before chunking, or make this explicit truncation mode with clear tests/docs for the recall tradeoff.
The checked-in generated output is stale. I verified locally on 9c900f5:

npm run build --if-present passes
node --test test/embedder-max-input-chars.test.mjs passes
node test/plugin-manifest-regression.mjs passes

After build, the worktree is dirty:

M dist/src/retriever.js

The generated change is the FLEET-PATCH(#884) retriever comment expansion. Please run the build and commit the updated generated dist output, or remove unrelated generated/comment drift from this PR.

Also worth considering: truncation is silent when content is dropped before embedding, so users may not realize recall quality changed.

rwmjhb

Requesting changes. The latest head fixes the earlier whole-document pre-truncation problem and the generated output is now clean, but there is still a blocking fallback bug in the batch path.

Independent verification on dcf1bfa passed:

npm run build --if-present
node --test test/embedder-max-input-chars.test.mjs
node test/plugin-manifest-regression.mjs
build left source/dist/package/manifest/docs/tests clean

Must fix

embedMany() handles any batch context error by sending every valid text through embedChunkedText(). That helper is appropriate for a single input that is itself too large, but it also applies the single-chunk strict-reduction guard and retries with inputText.slice(0, safeLimit).

A local embedding server can reject a batch because the aggregate request is too large while each item would succeed individually. In that case, the fallback can truncate otherwise valid memories and silently drop trailing content. This recreates the recall-quality regression the PR is meant to avoid, just in the batch aggregate-limit path.

Please separate these two cases: for a batch context-limit failure, retry items individually first, or chunk per item by maxInputChars, but only apply the strict single-input truncation fallback after that individual request also fails with a context/input-size error. Add a regression test where the initial batch fails but each individual item fits and verify the provider receives the full individual inputs, not truncated prefixes.

Also worth addressing

The full npm test run timed out in orchestrator (npm test exceeded 180s); targeted tests are useful, but broad regression coverage did not complete in that run.
The new normal long-input chunking paths emit unconditional console output, which can be noisy during routine ingestion.
If one item exceeds maxInputChars, the whole batch currently falls back to per-item calls; that changes throughput and timeout behavior for mixed batches.

TurboTheTurtle · 2026-06-19T18:04:40Z

Fixed the batch fallback case: if a provider rejects the aggregate batch for context length, the fallback now retries each item individually first, so valid per-item inputs are not chunked or reduced just because the batch was too large.\n\nI added a regression for that path and reran:\n- npm run build --if-present\n- node --test test/embedder-max-input-chars.test.mjs\n- node test/plugin-manifest-regression.mjs

rwmjhb

Approved. Orchestrator verdict: approve, and I also ran an independent verification on head c66f8fd.

Verification run:

npm ci --silent
npm run build --if-present
node --test test/embedder-max-input-chars.test.mjs
node test/plugin-manifest-regression.mjs
git diff --exit-code -- dist src index.ts package.json package-lock.json openclaw.plugin.json README.md scripts test

All passed. Remaining items are non-blocking: consider routing the new routine chunking/fallback output through debug/diagnostics instead of direct console output, and improving per-item retry error attribution in the batch fallback.

fix: cap embedding provider input length

9c900f5

TurboTheTurtle marked this pull request as ready for review June 17, 2026 07:34

rwmjhb requested changes Jun 18, 2026

View reviewed changes

fix: chunk embedding inputs before request caps

dcf1bfa

TurboTheTurtle requested a review from rwmjhb June 19, 2026 03:02

rwmjhb requested changes Jun 19, 2026

View reviewed changes

fix: retry batch context fallback individually

c66f8fd

TurboTheTurtle requested a review from rwmjhb June 20, 2026 20:52

rwmjhb approved these changes Jun 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: cap embedding provider input length#899

fix: cap embedding provider input length#899
TurboTheTurtle wants to merge 3 commits into
CortexReach:masterfrom
TurboTheTurtle:codex/embedding-max-input-chars

TurboTheTurtle commented Jun 17, 2026

Uh oh!

rwmjhb left a comment

Uh oh!

rwmjhb left a comment

Uh oh!

TurboTheTurtle commented Jun 19, 2026

Uh oh!

rwmjhb left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

TurboTheTurtle commented Jun 17, 2026

Uh oh!

rwmjhb left a comment

Choose a reason for hiding this comment

Must fix

Uh oh!

rwmjhb left a comment

Choose a reason for hiding this comment

Must fix

Also worth addressing

Uh oh!

TurboTheTurtle commented Jun 19, 2026

Uh oh!

rwmjhb left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants