Files

vikingowl efcb5a2901 docs: add project essentials (12/12 complete)

Vision, domain model, architecture, patterns, process flows,
UML diagrams, API contracts, tech stack, constraints, milestones
(M1-M11), decision log (6 ADRs), and risk register.

Key decisions: single binary, pull-based streaming, Mistral as M1
reference provider, discriminated unions, multi-provider collaboration
as core identity.

2026-04-02 18:09:07 +02:00

6.3 KiB

Raw Blame History

ADR-001: Single Binary with Goroutines

Status: Accepted Date: 2026-04-02

Context

gnoma needs to decouple the UI from the engine to support multiple frontends (TUI, CLI, future IDE extensions). Options were: (a) single binary with goroutines + channels, (b) client-server with gRPC IPC, (c) embedded library.

Decision

Single Go binary. Engine runs as goroutines within the same process. UI communicates with engine via the Session interface over channels. Future serve mode adds a Unix socket listener for external clients — still the same process.

Alternatives Considered

Alternative A: gRPC IPC (gnoma + gnomad)

Pros: Process isolation, true sandboxing, multiple clients to one daemon
Cons: Protobuf codegen dependency, daemon lifecycle management, two binaries to distribute

Alternative B: Embedded library

Pros: Maximum flexibility for embedders
Cons: No standalone binary, API stability burden, harder to ship

Consequences

Positive: Simple deployment, no daemon, no codegen, Go's goroutine model provides sufficient isolation. Negative: No process-level sandboxing for tools. Multi-client scenarios require serve mode (future).

ADR-002: Pull-Based Stream Interface

Status: Accepted Date: 2026-04-02

Context

Need a unified streaming abstraction across 4 SDKs with different patterns: Anthropic/OpenAI/Mistral use pull-based Next()/Current(), Google uses range-based for chunk, err := range iter.

Decision

Pull-based Stream interface: Next() bool, Current() Event, Err() error, Close() error. Google adapter bridges via goroutine + buffered channel.

Alternatives Considered

Alternative A: Channel-based

Pros: Go-idiomatic, works with select
Cons: Requires goroutine per stream, less control over backpressure, no Close() for cleanup

Alternative B: iter.Seq (range-over-func)

Pros: Modern Go pattern, clean for event := range stream
Cons: No Close() for resource cleanup, no separate error retrieval, doesn't match SDK patterns

Consequences

Positive: Zero-overhead adapter for 3 of 4 SDKs. Explicit resource cleanup. Consumer controls pace. Negative: Google needs a goroutine bridge. Slightly more verbose than range-based.

ADR-003: Mistral as M1 Reference Provider

Status: Accepted Date: 2026-04-02

Context

Need to pick one provider to implement first as the reference adapter. Candidates: Anthropic (richest model), OpenAI (most popular), Mistral (user maintains SDK).

Decision

Mistral first. The user maintains somegit.dev/vikingowl/mistral-go-sdk and knows its internals. The API shape is similar to OpenAI, making it a good baseline. Anthropic's unique features (thinking blocks, cache tokens) are better tested as M2 extensions.

Alternatives Considered

Alternative A: Anthropic first

Pros: Richest content model, most features to test
Cons: Anthropic-specific features (thinking, caching) could bias the abstraction

Alternative B: OpenAI first

Pros: Most widely used, well-documented
Cons: No special insight into SDK internals

Consequences

Positive: Fast iteration on reference adapter. SDK bugs fixed directly. Negative: Thinking block support tested later (M2).

ADR-004: Discriminated Union Structs

Status: Accepted Date: 2026-04-02

Context

Go lacks sum types. Need to represent Content variants (text, tool call, tool result, thinking) and Event variants.

Decision

Struct with Type discriminant field. Exactly one payload field is set per type value. Consumer switches on Type.

Alternatives Considered

Alternative A: Interface hierarchy

Pros: Extensible, familiar OOP pattern
Cons: Heap allocation per variant, type assertion overhead, no exhaustive switch checking

Alternative B: Generics-based enum

Pros: Type-safe, compile-time checked
Cons: Complex, unfamiliar, Go's generics don't support sum types well

Consequences

Positive: Zero allocation, cache-friendly, fast switch dispatch, simple. Negative: New variants require updating all switch statements. Acceptable for small, stable sets.

ADR-005: json.RawMessage for Tool Schemas

Status: Accepted Date: 2026-04-02

Context

Tool parameters (JSON Schema) and tool call arguments need to flow between providers and tools. Options: typed schema library, code generation, or raw JSON passthrough.

Decision

json.RawMessage for both tool parameter schemas and tool call arguments. Zero-cost passthrough between provider and tool. Tools parse their own arguments.

Alternatives Considered

Alternative A: JSON Schema library

Pros: Centralized validation, type-safe schema construction
Cons: Core dependency, serialization overhead, schema library selection lock-in

Alternative B: Code generation from schemas

Pros: Full type safety, compile-time checks
Cons: Build complexity, generated code maintenance, rigid

Consequences

Positive: No JSON Schema dependency. Providers and tools speak JSON natively. Minimal overhead. Negative: Validation at tool boundary only, not centralized.

ADR-006: Multi-Provider Collaboration as Core Identity

Status: Accepted Date: 2026-04-02

Context

Most AI coding assistants are single-provider. gnoma already supports multiple providers, but the question is whether multi-provider collaboration (elfs on different providers working together) is a nice-to-have or a core architectural feature.

Decision

Multi-provider collaboration is a core feature and part of gnoma's identity. The architecture must support elfs running on different providers simultaneously, with routing rules directing tasks by capability, cost, or latency. This is not an afterthought — it shapes how we design the elf system, provider registry, and session management.

Alternatives Considered

Alternative A: Multi-provider as optional extension

Pros: Simpler MVP, routing added later
Cons: Architectural decisions made without routing in mind may need rework

Consequences

Positive: Clear differentiator from all existing tools. Shapes architecture from day one. Negative: Elf system design must account for per-elf provider config from the start.

Changelog

2026-04-02: Initial decisions from architecture planning session

6.3 KiB Raw Blame History

ADR-001: Single Binary with Goroutines

Context

Decision

Alternatives Considered

Alternative A: gRPC IPC (gnoma + gnomad)

Alternative B: Embedded library

Consequences

ADR-002: Pull-Based Stream Interface

Context

Decision

Alternatives Considered

Alternative A: Channel-based

Alternative B: iter.Seq (range-over-func)

Consequences

ADR-003: Mistral as M1 Reference Provider

Context

Decision

Alternatives Considered

Alternative A: Anthropic first

Alternative B: OpenAI first

Consequences

ADR-004: Discriminated Union Structs

Context

Decision

Alternatives Considered

Alternative A: Interface hierarchy

Alternative B: Generics-based enum

Consequences

ADR-005: json.RawMessage for Tool Schemas

Context

Decision

Alternatives Considered

Alternative A: JSON Schema library

Alternative B: Code generation from schemas

Consequences

ADR-006: Multi-Provider Collaboration as Core Identity

Context

Decision

Alternatives Considered

Alternative A: Multi-provider as optional extension

Consequences

Changelog

6.3 KiB

Raw Blame History