#337 - feat(api): LLM provider adapters — Anthropic + OpenAI-compatible - james/carol

james commented

2026-06-29 00:00:47 +00:00

Owner

Adds the thin non-streaming inference layer (ADR-0029 §1 + §4, #336): an LlmClient interface with an Anthropic adapter (official @anthropic-ai/sdk) and an OpenAI-compatible adapter (plain fetch — covers OpenAI, OpenRouter, and local Ollama). It consumes #333's per-user config/key and the #51 tool registry.

Rebased onto main after #334 merged, so the diff is just the adapters.

What's in it (`apps/api/lib/llm/`)

- client.ts: LlmClient interface (generate(req): Promise<LlmResult>), normalized types (LlmMessage/LlmToolDef/LlmGenerateRequest/LlmResult/LlmStopReason), a typed LlmError (not_configured/key_required/auth/rate_limited/overloaded/provider_error; it carries only kind/message/status, never the key or raw error), and getLlmClientForUser(userId, db).
- tools.ts: zod → JSON Schema (z.toJSONSchema, allOf-flattened, mirroring the MCP endpoint) + llmToolDefsFromRegistry() over #51's allTools().
- anthropic.ts: translates normalized ↔ Anthropic Messages API (system, tool_use/tool_result blocks, tools); injectable client/fetch for tests; model-agnostic (no thinking/sampling params in v1). 529/overload detected via status === 529 || type === "overloaded_error" (the brief's OverloadedError class doesn't exist in SDK 0.106).
- openai-compatible.ts: fetch to ${baseUrl}/chat/completions, Authorization omitted when keyless; maps function-tools + tool_calls/tool role; injectable fetch.
- getLlmClientForUser selects the adapter from the #333 config + decrypted key (anthropic → key required; openai_compatible → key optional).
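
A minimal sketch of the selection rule and the typed error described in the list above. It is not the code from this PR: the field names on `UserLlmConfig` and `AdapterChoice`, the helper name `selectAdapter`, and the choice to report a missing `baseUrl` as `not_configured` are all assumptions made for illustration. Only the error kinds and the "anthropic requires a key, openai_compatible does not" rule come from the description.

```typescript
// Error kinds as listed in the PR description.
type LlmErrorKind =
  | "not_configured"
  | "key_required"
  | "auth"
  | "rate_limited"
  | "overloaded"
  | "provider_error";

// Carries only kind/message/status. The API key and the raw provider
// error are deliberately never stored on the object.
class LlmError extends Error {
  readonly kind: LlmErrorKind;
  readonly status: number | undefined;

  constructor(kind: LlmErrorKind, message: string, status?: number) {
    super(message);
    this.name = "LlmError";
    this.kind = kind;
    this.status = status;
  }
}

// Hypothetical shape of the per-user config row (the real one lives in #333).
interface UserLlmConfig {
  provider: "anthropic" | "openai_compatible";
  model: string;
  baseUrl?: string;
}

// Hypothetical result type: which adapter to build and with what inputs.
type AdapterChoice =
  | { adapter: "anthropic"; model: string; apiKey: string }
  | { adapter: "openai_compatible"; model: string; baseUrl: string; apiKey: string | null };

function selectAdapter(config: UserLlmConfig | null, apiKey: string | null): AdapterChoice {
  if (config === null) {
    throw new LlmError("not_configured", "No LLM provider is configured for this user.");
  }
  if (config.provider === "anthropic") {
    // anthropic: key required
    if (!apiKey) {
      throw new LlmError("key_required", "The Anthropic provider requires an API key.");
    }
    return { adapter: "anthropic", model: config.model, apiKey };
  }
  // openai_compatible: key optional (keyless local servers are allowed)
  if (!config.baseUrl) {
    // Assumption: a missing base URL is treated as "not configured".
    throw new LlmError("not_configured", "The OpenAI-compatible provider requires a base URL.");
  }
  return { adapter: "openai_compatible", model: config.model, baseUrl: config.baseUrl, apiKey };
}
```

Returning a plain discriminated union keeps the selection step free of network or SDK dependencies, so it can be tested without any transport.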

Bounded to inference + provider selection — streaming, the agent loop, conversations/messages, the SSE endpoint, and the chat UI are later tickets that consume this.
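
For the Anthropic adapter in the list above, response normalization and overload detection might look roughly like the following. This is a sketch under stated assumptions, not the adapter in this PR: `AnthropicMessageLike` is a hand-written subset of the Messages API response, the `LlmStopReason` values and the `LlmToolCall` fields are hypothetical (the description names the types but not their members), and `normalizeAnthropic` and `isOverloaded` are illustrative names. The `{ text, toolCalls, stopReason }` result shape and the `status === 529 || type === "overloaded_error"` check are taken from the description.

```typescript
// Hand-written subset of an Anthropic Messages API response.
type AnthropicBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; id: string; name: string; input: unknown };

interface AnthropicMessageLike {
  content: AnthropicBlock[];
  stop_reason: string | null;
}

// Hypothetical normalized shapes; the real definitions live in client.ts.
type LlmStopReason = "end" | "tool_use" | "max_tokens" | "other";

interface LlmToolCall {
  id: string;
  name: string;
  input: unknown;
}

interface LlmResult {
  text: string;
  toolCalls: LlmToolCall[];
  stopReason: LlmStopReason;
}

function normalizeAnthropic(msg: AnthropicMessageLike): LlmResult {
  let text = "";
  const toolCalls: LlmToolCall[] = [];

  for (const block of msg.content) {
    if (block.type === "text") {
      // Concatenate all text blocks in order.
      text += block.text;
    } else {
      // tool_use blocks become normalized tool calls.
      toolCalls.push({ id: block.id, name: block.name, input: block.input });
    }
  }

  let stopReason: LlmStopReason = "other";
  if (msg.stop_reason === "tool_use") stopReason = "tool_use";
  else if (msg.stop_reason === "max_tokens") stopReason = "max_tokens";
  else if (msg.stop_reason === "end_turn" || msg.stop_reason === "stop_sequence") stopReason = "end";

  return { text, toolCalls, stopReason };
}

// Overload check as stated in the description: HTTP 529 or an
// "overloaded_error" type on the error payload.
function isOverloaded(err: { status?: number; type?: string }): boolean {
  return err.status === 529 || err.type === "overloaded_error";
}
```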

Dependency

@anthropic-ai/sdk@^0.106.0 (latest stable; the requested ^0.69.0 is superseded). Established package (won't trip package-age); no install lifecycle script, so no onlyBuiltDependencies entry. OpenAI-compatible uses fetch — no openai dep. Lockfile updated; pnpm install --frozen-lockfile clean.

Verification

- typecheck + lint clean; semgrep (full CI pack set): 0 findings (incl. the user-baseUrl fetch); openapi:check up to date (lib-only, no contract change).
- Full suite on BOTH engines: 1150 passed / 0 skipped (ephemeral Postgres; the provider-selection test touches #333's table via describePerEngine). Tests inject transport (fake Anthropic client / fake fetch), so there are no live API calls. Covered: request translation, response normalization (text + tool_calls + stop reason), tool-use round, error mapping, keyless Ollama, selection + not_configured/key_required, and an assertion that no tool def carries user_id.
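
The injected-transport approach for the keyless case can be sketched as below. This is an illustration rather than the test code in this PR: `FetchLike`, `chatOnce`, and `makeFakeFetch` are hypothetical names, the transport type is a hand-written subset of `fetch` so the block needs no DOM or Node typings, and the request carries only a single user message. The `${baseUrl}/chat/completions` path and the omission of `Authorization` when there is no key follow the description.

```typescript
// Minimal transport type: just the parts of fetch this sketch uses.
type FetchLike = (
  url: string,
  init: { method: string; headers: Record<string, string>; body: string },
) => Promise<{ ok: boolean; status: number; json(): Promise<unknown> }>;

interface ChatOptions {
  baseUrl: string;
  apiKey: string | null; // null for keyless local servers
  model: string;
  prompt: string;
  fetchImpl: FetchLike; // injected so tests never hit a live API
}

async function chatOnce(opts: ChatOptions): Promise<string> {
  const headers: Record<string, string> = { "Content-Type": "application/json" };
  // Authorization is omitted entirely when there is no key.
  if (opts.apiKey) headers["Authorization"] = `Bearer ${opts.apiKey}`;

  const res = await opts.fetchImpl(`${opts.baseUrl}/chat/completions`, {
    method: "POST",
    headers,
    body: JSON.stringify({
      model: opts.model,
      messages: [{ role: "user", content: opts.prompt }],
    }),
  });
  if (!res.ok) throw new Error(`provider returned HTTP ${res.status}`);

  const data = (await res.json()) as {
    choices?: { message?: { content?: string | null } }[];
  };
  return data.choices?.[0]?.message?.content ?? "";
}

interface RecordedCall {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Fake transport: records each request and returns a canned completion.
function makeFakeFetch(reply: string): { fetchImpl: FetchLike; calls: RecordedCall[] } {
  const calls: RecordedCall[] = [];
  const fetchImpl: FetchLike = async (url, init) => {
    calls.push({ url, headers: init.headers, body: init.body });
    return {
      ok: true,
      status: 200,
      json: async () => ({
        choices: [{ message: { role: "assistant", content: reply }, finish_reason: "stop" }],
      }),
    };
  };
  return { fetchImpl, calls };
}
```

A test can then call `chatOnce` with `apiKey: null` and a base URL such as `http://localhost:11434/v1` (an example value for a local Ollama server), and assert on the recorded call that no `Authorization` header was sent.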

Closes #336

🤖 Generated with Claude Code


james added 1 commit

2026-06-29 00:00:47 +00:00

feat(api): LLM provider adapters — Anthropic + OpenAI-compatible

Commits / Conventional Commits (pull_request) Successful in 11s

PR / Static analysis (pull_request) Successful in 1m50s

PR / OpenAPI (pull_request) Successful in 2m22s

PR / OSV-Scanner (pull_request) Successful in 30s

PR / Test (sqlite) (pull_request) Successful in 3m41s

PR / Build (pull_request) Successful in 3m51s

PR / Client (web export smoke) (pull_request) Successful in 4m0s

PR / Test (postgres) (pull_request) Failing after 4m8s

PR / Lint (pull_request) Successful in 4m16s

PR / Typecheck (pull_request) Successful in 4m24s

PR / pnpm audit (pull_request) Successful in 4m17s

PR / Package age policy (soft) (pull_request) Successful in 47s

Secrets / gitleaks (pull_request) Successful in 40s

PR / Coverage (soft) (pull_request) Successful in 4m11s

PR / Trivy (image) (pull_request) Successful in 4m22s

PR / E2E (Playwright) (pull_request) Successful in 7m53s

e95bdbac11

Adds the thin non-streaming inference layer (ADR-0029 §1 + §4, #336): an
LlmClient interface with an Anthropic adapter (official @anthropic-ai/sdk)
and an OpenAI-compatible adapter (plain fetch — covers OpenAI, OpenRouter,
and local Ollama). Normalized messages + the #51 tool registry (zod →
JSON Schema) translate to each provider and back to
{ text, toolCalls, stopReason }.

getLlmClientForUser selects the adapter from the #333 per-user config +
decrypted key (anthropic → key required; openai_compatible → key optional
for keyless local). Typed LlmError never carries the key or the raw
provider error. Model-agnostic v1: no thinking/sampling params.

Bounded to inference + provider selection — streaming, the agent loop,
conversations/messages, the SSE endpoint, and the chat UI are later
tickets that consume this. New dep @anthropic-ai/sdk@^0.106.0 (no install
lifecycle script). Tests use injected transport (no live API calls).

Closes #336

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

forgejo-actions commented

2026-06-29 00:07:20 +00:00

📊 Test coverage

Patch coverage: no testable lines changed.

Overall (app/, lib/, db/, excluding UI per ADR-0019):

| Metric | Value | Soft target |
|---|---|---|
| Lines | 78.9% ✅ | ≥ 50% |
| Branches | 70.8% ⚠️ | ≥ 75% |
| Functions | 79.3% | informational |

Soft thresholds per ADR-0019. Coverage is informational and does not block merge.


james force-pushed 336-llm-provider-adapters from e95bdbac11

Commits / Conventional Commits (pull_request) Successful in 11s

PR / Static analysis (pull_request) Successful in 1m50s

PR / OpenAPI (pull_request) Successful in 2m22s

PR / OSV-Scanner (pull_request) Successful in 30s

PR / Test (sqlite) (pull_request) Successful in 3m41s

PR / Build (pull_request) Successful in 3m51s