Skip to content
This repository was archived by the owner on Jul 4, 2025. It is now read-only.
This repository was archived by the owner on Jul 4, 2025. It is now read-only.

planning: Cortex Hardware API #1165

@dan-menlo

Description

@dan-menlo

Goal

  • We should have a very clear Eng Spec for Cortex Hardware API for Sprint 23

Key Functionality

  • Hardware Detection
    • Cortex can list all available hardware
  • Hardware Activation
    • Cortex has a clear CLI and API to select active hardware
    • Cortex can activate specific hardware (e.g. CPU-only, or specific GPU)
  • Hardware -> Engines
    • Engines initialize using activated Hardware
    • List of active hardware is passed down to the engine (e.g. llama.cpp or TensorRT-LLM)
    • How does this interact with ngl settings?
  • Hardware Usage Detection
    • Cortex can detect free RAM or VRAM
  • Hardware Fallback

Tasklist

  • Design API (e.g. GET /hardware)
  • Design CLI (e.g. cortex hardware list?)

Functionality

  • Hardware Detection
  • Hardware Activation
  • Hardware Usage Detection (e.g. RAM, VRAM)

Cortex & Jan Integration

Previous Issues

Appendix

UX Goal

Cortex.cpp's Hardware API should enable us to do this in Jan
Image

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions