gnoma

Owlibou/gnoma

Commit Graph

Author	SHA1	Message	Date

vikingowl

1606d19366

feat(subprocess/codex): account for cached and reasoning tokens

codex 0.133.0 emits two token-accounting fields at top level that
we previously dropped:

  cached_input_tokens   — subset of input_tokens that hit the prompt
                          cache (cheaper, but still counted in
                          input_tokens per OpenAI Responses API
                          semantics)
  reasoning_output_tokens — separately reported billable thinking
                          tokens on reasoning-capable models

Map cached_input_tokens to message.Usage.CacheReadTokens and subtract
it from InputTokens. message.Usage.Add() sums InputTokens and
CacheReadTokens as peers, so the uncached residual goes in
InputTokens — matches the anthropic provider's convention and keeps
cumulative usage tracking arithmetically correct.
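
The peer-summing argument above can be illustrated with a minimal sketch. The `Usage` type below is a hypothetical stand-in for `message.Usage` that models only the fields named in this message, and `splitInput` and `TotalInput` are illustrative helpers, not functions from the repository. The zero clamp is left out to keep the arithmetic visible.

```go
package main

import "fmt"

// Usage is a hypothetical stand-in for message.Usage; only the fields
// named in the commit message are modelled.
type Usage struct {
	InputTokens     int64
	OutputTokens    int64
	CacheReadTokens int64
}

// Add accumulates another turn's usage field by field, treating
// InputTokens and CacheReadTokens as independent peers.
func (u *Usage) Add(o Usage) {
	u.InputTokens += o.InputTokens
	u.OutputTokens += o.OutputTokens
	u.CacheReadTokens += o.CacheReadTokens
}

// TotalInput is an illustrative helper: all prompt tokens, cached or not.
func (u Usage) TotalInput() int64 {
	return u.InputTokens + u.CacheReadTokens
}

// splitInput turns codex's inclusive input count into an uncached
// residual plus a cached part, so the two fields never overlap.
func splitInput(inputTokens, cachedInputTokens int64) Usage {
	return Usage{
		InputTokens:     inputTokens - cachedInputTokens,
		CacheReadTokens: cachedInputTokens,
	}
}

func main() {
	var total Usage
	total.Add(splitInput(1000, 800))  // turn 1: 1000 prompt tokens, 800 cached
	total.Add(splitInput(1200, 1000)) // turn 2: 1200 prompt tokens, 1000 cached

	// The peers sum back to the true prompt total (1000 + 1200).
	fmt.Println(total.InputTokens, total.CacheReadTokens, total.TotalInput())
	// prints: 400 1800 2200
}
```

Had the inclusive `input_tokens` value been stored unchanged, the same two turns would report 2200 + 1800 = 4000 prompt tokens, counting every cached token twice.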

Fold reasoning_output_tokens into OutputTokens for accurate cost
tracking. The top-level peer positioning (vs nested in
output_tokens_details) implies a separately counted billable
quantity, not a subset of output_tokens.

Defensive clamp at zero in case a future codex build reports
cached > input due to schema drift. Includes a verbatim regression
guard against the live 2026-05-22 codex 0.133.0 output to catch
schema changes early.
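
As a rough sketch of the whole mapping described in this message: the `codexUsage` and `Usage` types and the `toUsage` function are hypothetical names, and the JSON payload is an invented example rather than the verbatim codex output used in the regression guard. Only the four field names come from the message itself. The real event envelope around the usage object may differ.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// codexUsage models the top-level token fields named in the commit
// message. The surrounding event shape is an assumption.
type codexUsage struct {
	InputTokens           int64 `json:"input_tokens"`
	CachedInputTokens     int64 `json:"cached_input_tokens"`
	OutputTokens          int64 `json:"output_tokens"`
	ReasoningOutputTokens int64 `json:"reasoning_output_tokens"`
}

// Usage is a hypothetical stand-in for message.Usage.
type Usage struct {
	InputTokens     int64
	OutputTokens    int64
	CacheReadTokens int64
}

// toUsage applies the three rules from the commit message.
func toUsage(c codexUsage) Usage {
	// Rule 1: store only the uncached residual in InputTokens.
	uncached := c.InputTokens - c.CachedInputTokens
	// Rule 2: clamp at zero in case cached > input after schema drift.
	if uncached < 0 {
		uncached = 0
	}
	return Usage{
		InputTokens:     uncached,
		CacheReadTokens: c.CachedInputTokens,
		// Rule 3: reasoning tokens are billed on top of output tokens.
		OutputTokens: c.OutputTokens + c.ReasoningOutputTokens,
	}
}

func main() {
	// Invented payload for illustration only.
	raw := []byte(`{"input_tokens":1000,"cached_input_tokens":800,` +
		`"output_tokens":50,"reasoning_output_tokens":30}`)

	var c codexUsage
	if err := json.Unmarshal(raw, &c); err != nil {
		panic(err)
	}
	u := toUsage(c)
	fmt.Println(u.InputTokens, u.CacheReadTokens, u.OutputTokens)
}
```

With the invented payload, the uncached residual is 200, the cache read count is 800, and the output count is 80. A payload reporting more cached tokens than input tokens yields an `InputTokens` of 0 instead of a negative number.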

2026-05-22 13:35:57 +02:00

vikingowl

1717f9f567

fix(subprocess/codex): tolerate non-JSON stdout, max-of-token-paths

Codex emits banner / debug / "starting turn" lines to stdout
interleaved with the JSON event stream. The parser previously
returned an error on any line that wasn't a JSON object, which
subprocessStream.Next treats as terminal — one stray banner
aborted the whole turn. Skip lines that don't start with `{`
after whitespace trim, and downgrade unparseable JSON-looking
lines to a slog.Debug so they don't kill the stream either.

Token accounting: usage payloads from newer codex builds
occasionally carry both input_tokens and prompt_tokens (and
likewise output / completion) with slightly different values.
Always use the larger of the two so we can't silently undercount.
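
The max-of-two rule might look roughly like the sketch below. `rawUsage`, `tokenCounts`, and `maxInt64` are hypothetical names. The key `completion_tokens` is inferred from "output / completion" above and is an assumption about the payload.

```go
package main

import "fmt"

// rawUsage holds both spellings a usage payload may carry.
type rawUsage struct {
	InputTokens      int64 `json:"input_tokens"`
	PromptTokens     int64 `json:"prompt_tokens"`
	OutputTokens     int64 `json:"output_tokens"`
	CompletionTokens int64 `json:"completion_tokens"` // assumed key name
}

// maxInt64 returns the larger of two values.
func maxInt64(a, b int64) int64 {
	if a > b {
		return a
	}
	return b
}

// tokenCounts picks the larger value of each pair, so a missing
// (zero) or slightly lower field can never cause an undercount.
func tokenCounts(u rawUsage) (in, out int64) {
	return maxInt64(u.InputTokens, u.PromptTokens),
		maxInt64(u.OutputTokens, u.CompletionTokens)
}

func main() {
	in, out := tokenCounts(rawUsage{
		InputTokens:      1200,
		PromptTokens:     1187,
		OutputTokens:     40,
		CompletionTokens: 42,
	})
	fmt.Println(in, out) // prints: 1200 42
}
```

When only one field of a pair is present, the other decodes as zero and the populated one wins, so the same function covers older payloads too.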

Tests cover non-JSON banner skipping, malformed-JSON
non-fatal-skip, and the max() behavior with both token
fields populated.

2026-05-22 12:08:32 +02:00

vikingowl

12a6b83cc9

feat: implement Google auth precedence and Codex integration

2026-05-22 00:21:32 +02:00

3 Commits