
fix: align and persist remote embedding result metadata#6

Open
andinux wants to merge 15 commits into main from
fix-remote-embedding-result-metadata

Conversation


@andinux andinux commented May 7, 2026

Summary

  • Embedding result metadata, unified and persisted. Replace
    n_tokens_truncated with a shared truncated boolean across local,
    remote, and custom embedding engines. Add n_tokens and truncated
    columns to dbmem_vault and dbmem_cache, with schema versioning and
    migration so existing databases pick them up cleanly.

  • Remote parser fixed. Read the documented vectors.space envelope
    (output_dimension, data[0].embedding, data[0].truncated, nested
    usage token fields) instead of relying on a flat key scan. Prefer the
    exact / estimated prompt-token counts when present so n_tokens is
    meaningful for cache and reporting use.

  • End-to-end coverage for the long-text path. New e2e tests cover
    multi-chunk ingestion + per-section retrieval, single-chunk ingestion
    under and over the provider's per-request token ceiling, and a single
    chunk near the model context window. Each search test prints the
    per-chunk n_tokens / truncated values it just round-tripped through
    the API, so regressions in the metadata path show up in the test log.

  • Build/CI hygiene. Run the SQLite extension unit tests in CI, fix
    them under local-only builds, use portable temp paths in the sync
    tests, hash the sync fixture from disk, and isolate curl's ./configure
    from inherited shell LDFLAGS/CPPFLAGS/CFLAGS/LIBS so curl
    rebuilds work in environments with Homebrew-style env exports.

  • Docs aligned. Update the public C API reference for the new
    truncated flag on custom provider results.
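The versioned-migration step described in the first bullet could be sketched as follows. This is a minimal illustration, not the project's actual implementation: `dbmem_migrate` and the `exec_fn` callback are invented names (a real build would run the statements through `sqlite3_exec()`); only the table names (`dbmem_vault`, `dbmem_cache`) and the new columns (`n_tokens`, `truncated`) come from this PR.

```c
/* Hypothetical sketch of a versioned schema migration.
 * exec is a stand-in for sqlite3_exec(); ctx for the db handle. */
typedef int (*exec_fn)(const char *sql, void *ctx);

static int dbmem_migrate(int from_version, exec_fn exec, void *ctx) {
    if (from_version < 2) {
        /* Schema v2: persist per-row embedding metadata. */
        if (exec("ALTER TABLE dbmem_vault ADD COLUMN n_tokens INTEGER;", ctx)) return -1;
        if (exec("ALTER TABLE dbmem_vault ADD COLUMN truncated INTEGER DEFAULT 0;", ctx)) return -1;
        if (exec("ALTER TABLE dbmem_cache ADD COLUMN n_tokens INTEGER;", ctx)) return -1;
        if (exec("ALTER TABLE dbmem_cache ADD COLUMN truncated INTEGER DEFAULT 0;", ctx)) return -1;
    }
    /* Record the new schema version so the migration runs only once. */
    return exec("PRAGMA user_version = 2;", ctx);
}
```

Gating on a stored schema version (SQLite's `PRAGMA user_version` is one common place to keep it) is what lets pre-existing databases pick the columns up exactly once.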

Test plan

  • make unittest passes — covers schema versioning, migration of
    pre-existing vault/cache rows, and persistence of n_tokens /
    truncated end-to-end.
  • make e2e APIKEY=<vectors.space key> passes — exercises the
    remote parser and the persisted metadata against the live API.
  • CI green on the SQLite extension unit-test job, both full and
    local-only build variants.
  • Manual: open an existing pre-migration database, run any
    memory_add_text call, and confirm the new columns are populated
    without errors.
  • Manual: with vectors.space's recently-fixed batch ceiling, the
    three boundary tests (memory_search_under_token_limit,
    _truncation_signature, _truncation_near_model_context) print
    trunc=0 for the under-limit case and trunc=1 for the two
    oversized cases — i.e. truncation is now observable as data
    rather than as a 500.

andinux and others added 15 commits April 27, 2026 17:19
Replace n_tokens_truncated with a shared truncated flag across local, remote,
and custom embedding results.

Keep remote token counts useful by preferring exact and estimated prompt
token metadata when present.

This avoids carrying truncation semantics that were exact for the local engine
but misleading for remote vectors.space responses.
Read output_dimension, data[0].embedding, data[0].truncated, and nested usage token fields from the documented vectors.space response shape.

This makes the remote embedding parser match the current API envelope instead of relying on flat key scans across the whole JSON payload.
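The "prefer exact, then estimated, prompt-token counts" rule could look roughly like this. Everything here is an illustrative assumption except the preference order itself: `usage_fields_t`, `pick_n_tokens`, and the `-1` absent-field sentinel are invented for the sketch, not the project's parser.

```c
/* Hypothetical fields pulled from a parsed usage object;
 * -1 marks a field that was absent from the response. */
typedef struct {
    int exact_prompt_tokens;
    int estimated_prompt_tokens;
} usage_fields_t;

/* Prefer the exact count, then the estimate, then a caller-supplied
 * fallback, so n_tokens stays meaningful for cache and reporting use. */
static int pick_n_tokens(const usage_fields_t *u, int fallback) {
    if (u->exact_prompt_tokens >= 0) return u->exact_prompt_tokens;
    if (u->estimated_prompt_tokens >= 0) return u->estimated_prompt_tokens;
    return fallback;
}
```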
Align the public C API reference with the current dbmem_embedding_result_t shape by documenting the truncated boolean flag instead of n_tokens_truncated.

This avoids publishing stale custom-provider docs after the shared embedding result metadata was simplified across local, remote, and custom engines.
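The documented change amounts to a result struct carrying a plain boolean instead of a truncated-token count. The sketch below is illustrative only: field names other than `n_tokens` and `truncated` are guesses, not the actual `dbmem_embedding_result_t` layout.

```c
#include <stdbool.h>
#include <stddef.h>

/* Illustrative shape only -- not the project's real struct layout. */
typedef struct {
    float  *vector;     /* embedding values, `dimension` entries        */
    size_t  dimension;  /* output_dimension reported by the provider    */
    int     n_tokens;   /* prompt tokens consumed (exact or estimated)  */
    bool    truncated;  /* input was cut to fit the provider's ceiling  */
} embedding_result_t;
```

A boolean is the honest common denominator: the local engine can say exactly how many tokens were dropped, but a remote provider may only report *that* truncation happened.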
- add schema versioning and migrate vault/cache metadata columns

- store n_tokens and truncated on vault rows and cache entries

- assert metadata persistence in unittest coverage

- document the remote parser refactor plan for later implementation
…gure

Add four e2e tests covering long-text behaviour and provider boundaries:
- memory_add_long_text_chunking + memory_search_long_text_sections:
  structural and retrieval coverage for multi-chunk inputs.
- memory_search_under_token_limit (~5KB, single chunk; passes) and
  memory_search_truncation_signature / _near_model_context (~10KB /
  ~19.5KB, rejected by provider): document vectors.space's 1024-token
  batch ceiling.

Also: RUN_TEST no longer prints PASSED after a test fails, and Makefile
strips LDFLAGS/CPPFLAGS/CFLAGS/LIBS from curl's ./configure so rebuilds
work in shells with Homebrew-style env exports.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
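The actual fix lives in the Makefile, but the idea behind it -- drop inherited toolchain variables so a child `./configure` sees a clean environment -- can be shown in a few lines of C. `scrub_toolchain_env` is an invented name for illustration.

```c
#include <stdlib.h>
#include <stddef.h>

/* Illustrative: unset the flags Homebrew-style shells export, so a
 * subsequently spawned ./configure is not poisoned by them. */
static void scrub_toolchain_env(void) {
    const char *vars[] = { "LDFLAGS", "CPPFLAGS", "CFLAGS", "LIBS" };
    for (size_t i = 0; i < sizeof vars / sizeof vars[0]; i++)
        unsetenv(vars[i]);
}
```

In a Makefile the equivalent is typically done by overriding or unexporting the variables on the recipe line rather than in code.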
Surface n_tokens and truncated from dbmem_vault in the test output for
the four search tests:
- memory_search_long_text_sections: aggregate (chunk count, min n_tokens,
  any-truncated) for the chunked corpus.
- memory_search_under_token_limit, _truncation_signature,
  _truncation_near_model_context: per-chunk values for both the short
  reference and the long chunk alongside ranking/score.

Also append a per-test trailing sentence to the SHORT_REF in the two
truncation tests, so that memory_add_text's content-hash idempotency
doesn't collapse their adds into no-ops against the identical SHORT_REF
already stored by the under-limit test.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
remote-embedding-parser-refactor-plan.md was committed by mistake — it
was a local-only planning note. Remove it from tracking via git rm
--cached so the working copy is preserved for the author while the file
is dropped from the published tree.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Add example queries for the schema-version-2 columns on dbmem_vault so
readers can discover token-usage aggregation and truncation diagnostics
without having to read the source.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@andinux andinux requested a review from marcobambini May 7, 2026 17:24
