Linux: cbm_system_info / cbm_default_worker_count don't respect cgroup CPU/memory limits

### Summary

When CBM runs inside a Linux container with cgroup CPU and/or memory
limits (the normal Kubernetes / Docker / Nomad / systemd-slice case),
`cbm_system_info()` and `cbm_default_worker_count()` report
*node-level* CPU count and RAM, not the cgroup's effective limits.
This produces three concrete operator-visible problems:

1. `cbm_default_worker_count(initial=true)` returns
   `sysconf(_SC_NPROCESSORS_ONLN)`, which on Linux is the number of
   *online host CPUs*. A 1-vCPU container scheduled onto a 16-core
   node spawns ~16 indexing workers, each with its own per-worker
   buffers (AST stacks, tree-sitter parsers, slab allocators).
   In a memory-constrained container this is the dominant OOMKill
   driver — far more so than `g_budget` (which is observability-only;
   `cbm_mem_over_budget()` and `cbm_vmem_over_budget()` are declared
   but never called).
2. `mem.init budget_mb=…` log line is computed from the host's
   `sysinfo(2).totalram`, so a 2 GiB-limited pod on a 62 GB node
   logs `budget_mb=31000`, which is alarming and confuses incident
   response. (Cosmetic, but compounds 1.)
3. The over-budget worker count also magnifies the blast radius of
   #317 (dump-phase OOM crash on large TS monorepo) and #334 (silent
   index corruption after rapid kill/restart) — both of which we have
   reproduced in the wild on Kubernetes pods.

### Where in the source

Verified against `main` at HEAD `22153563cd1072b4f79e0e27113f6e0dea3abc1a`:

- [`src/foundation/system_info.c` `detect_system_linux()`](https://github.com/DeusData/codebase-memory-mcp/blob/22153563cd1072b4f79e0e27113f6e0dea3abc1a/src/foundation/system_info.c#L108-L122) uses `sysconf(_SC_NPROCESSORS_ONLN)` for cores and `sysinfo(2)` for RAM, both host-scoped.
- [`src/foundation/system_info.c` `cbm_default_worker_count()`](https://github.com/DeusData/codebase-memory-mcp/blob/22153563cd1072b4f79e0e27113f6e0dea3abc1a/src/foundation/system_info.c#L162-L171) consumes the host-scoped value with no cgroup adjustment.
- `src/foundation/mem.c` and `src/foundation/vmem.c` — `g_budget` is set from `info.total_ram * 0.5`, so the misleading log line in (2) above propagates from the same source.

### Suggested fix shape

Two reads in `detect_system_linux()`, both with safe fallbacks to
the existing host-scoped values:

1. **CPU count** — read cgroup v2 `cpu.max` (preferred) or v1
   `cpu.cfs_quota_us` / `cpu.cfs_period_us`. If the result is
   `max` / unlimited / parse error, fall back to
   `sysconf(_SC_NPROCESSORS_ONLN)`.
2. **Memory** — read cgroup v2 `memory.max` (preferred) or v1
   `memory.limit_in_bytes`. If the result is `max` / unlimited or
   exceeds host total, fall back to `sysinfo(2).totalram *
   mem_unit`.

Both files live at well-known paths (`/sys/fs/cgroup/...` for v2,
`/sys/fs/cgroup/<controller>/...` for v1) and are simple text
reads. No new dependencies.

A reasonable cap for safety: `min(cgroup_cpu, host_cpu)` and
`min(cgroup_mem, host_mem)`. This is robust against
mis-mounted-cgroups edge cases.

### Compatible with existing env knobs

Once cgroup awareness lands, an explicit env-override remains useful
for ops who want to tune below cgroup limits (e.g. leave headroom
for sibling processes in the same container). A companion PR (filed
alongside this issue) adds `CBM_WORKERS` for that case. The two
together give: env > cgroup > host as the precedence chain,
matching the `CBM_SQLITE_MMAP_SIZE` precedent from commit `093707c`.

### Out of scope

- macOS, BSD, Windows containerization stories (none of them have
  the same sysconf/sysinfo problem in practice).
- mimalloc tuning. mimalloc respects its own arena settings; the
  request here is just to compute *the inputs* CBM passes to its
  worker scheduler and budget logger correctly.

### Related

- Related crash issues that are made worse by overspawn in
  containers: #317, #334, #336, #340.
- Existing precedent for env-knob tuning: commit `093707c`
  (`CBM_SQLITE_MMAP_SIZE`).
- Downstream context: this is being filed while integrating CBM as
  a long-lived MCP-server process inside a 2 GiB Kubernetes pod.
  Happy to iterate on or contribute the patch — let us know which
  form you prefer.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Linux: cbm_system_info / cbm_default_worker_count don't respect cgroup CPU/memory limits #363

Summary

Where in the source

Suggested fix shape

Compatible with existing env knobs

Out of scope

Related

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Linux: cbm_system_info / cbm_default_worker_count don't respect cgroup CPU/memory limits #363

Description

Summary

Where in the source

Suggested fix shape

Compatible with existing env knobs

Out of scope

Related

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions