commits
replaces the previous model (one thread per job, all fighting over the
embedder mutex) with a single worker thread draining a FIFO queue.
- jobs run one at a time: CAR download + embed, then next job
- status response includes queue_position and queue_depth
- frontend shows "N jobs ahead of you" while waiting
- predictable throughput, honest wait times
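roughly this shape, as a minimal sketch rather than the real state.zig
types (Job, the fixed 64-slot ring, and workerLoop here are stand-ins):

    const std = @import("std");

    const Job = struct { did: []const u8 };

    const JobQueue = struct {
        mutex: std.Thread.Mutex = .{},
        cond: std.Thread.Condition = .{},
        jobs: [64]Job = undefined, // capacity/overflow handling elided
        head: usize = 0,
        len: usize = 0,

        fn push(q: *JobQueue, job: Job) void {
            q.mutex.lock();
            defer q.mutex.unlock();
            q.jobs[(q.head + q.len) % q.jobs.len] = job;
            q.len += 1;
            q.cond.signal(); // wake the single worker
        }

        fn pop(q: *JobQueue) Job {
            q.mutex.lock();
            defer q.mutex.unlock();
            while (q.len == 0) q.cond.wait(&q.mutex);
            const job = q.jobs[q.head];
            q.head = (q.head + 1) % q.jobs.len;
            q.len -= 1;
            return job;
        }
    };

    // the one worker thread: jobs never overlap, so the embedder sees no contention
    fn workerLoop(q: *JobQueue) void {
        while (true) {
            const job = q.pop();
            _ = job; // CAR download + embed happens here, then loop for the next job
        }
    }

queue_position and queue_depth fall straight out of head/len, read under
the same mutex.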
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
untracks ~150MB of binaries that are only needed for local dev (macos
dylibs) or were already extracted (tarball). the linux .so files in llama-bin/
stay tracked since the Docker build copies them into the runtime image.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- tech.waow.ken.profile lexicon (rkey self, createdAt) — presence
signal so indexers can discover ken users
- oauth.putRecord: idempotent create-or-update for fixed-rkey records
- pds.getRecord: unauthenticated single-record lookup
- server.ensureProfile: best-effort write on first /api/me per session,
skips if record already exists on the user's PDS
- scope updated to include repo:tech.waow.ken.profile
no auth or frontend changes — existing single-cookie flow untouched.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
notes/multi-account-and-settings.md has the full design writeup
with reference implementations (pdsls scope mgmt, plyr.fm group_id
session linking). TODO.md at repo root is the statement of work for
the next engineer picking this up.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
README: cut from 54 lines to 40. dropped the verbose how-it-works
walkthrough, data-propagation section, and sharing explanation. what
remains: one-liner hook, lexicon link, stack diagram, dev/deploy
commands, pointer to notes/. matches the pollz/typeahead pattern.
disclosure popover: the description paragraph was restating what the
about modal already says. trimmed to one line of state context ("on
your PDS. ken reloads it next sign-in." / "in memory only. save to
keep it across sessions.") — the about modal is the single source
for the full explanation.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the previous commit rejected repos over 50k records entirely. wrong —
we should index what we can and tell the user the rest was dropped.
now: if post-filter records > ABSOLUTE_MAX_RECORDS (50k), truncate to
the cap and set pack.truncated_from to the original count. the UI
shows the cap message inline in the pack-meta line with a DM prompt.
the user gets 50k searchable records instead of an error page.
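the cap itself is a one-branch sketch (Record and the out-param stand in
for the real indexer types; the real field is pack.truncated_from):

    const ABSOLUTE_MAX_RECORDS = 50_000;

    // hypothetical record shape; the real one lives in the indexer
    const Record = struct { uri: []const u8, cid: []const u8 };

    /// keep at most ABSOLUTE_MAX_RECORDS; report the original count when
    /// truncation happened so the UI can say "truncated from N", else null.
    fn truncateToCap(records: []const Record, truncated_from: *?usize) []const Record {
        if (records.len <= ABSOLUTE_MAX_RECORDS) {
            truncated_from.* = null;
            return records;
        }
        truncated_from.* = records.len;
        return records[0..ABSOLUTE_MAX_RECORDS];
    }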
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
if even the 2-year time cutoff leaves more than 50k records, bail
before embedding rather than OOMing the 4GB fly machine. the error
message is user-facing and tells them to DM @zzstoatzz.io.
also: runJob now preserves a pre-set error_msg instead of blindly
overwriting it with the error code, so custom messages from
openAndWalkRepo actually reach the UI.
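the preserve-don't-overwrite part, sketched with a hypothetical Job shape:

    // openAndWalkRepo sets job.error_msg itself for user-facing failures
    // (like the >50k bail); runJob only fills it in when nothing more
    // specific was already set.
    const Job = struct {
        state: enum { queued, running, failed, done } = .queued,
        error_msg: ?[]const u8 = null,
    };

    fn recordFailure(job: *Job, err: anyerror) void {
        if (job.error_msg == null) {
            job.error_msg = @errorName(err); // fall back to the bare error code
        }
        job.state = .failed;
    }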
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the session cookie was literally the user's DID (a public identifier).
anyone who knew the DID could set the cookie and impersonate the user,
gaining the ability to save/delete packs on their PDS. DIDs are public
by design (they're in every AT-URI, in plc.directory, on profiles), so
this was a real vulnerability, not a theoretical one.
fix: generate 32 cryptographically random bytes (hex-encoded, 64 chars)
as the session token. store a token → DID mapping in state.zig. the
cookie contains only the opaque token; getSessionDid resolves it back
to the DID via the map. logout deletes both the token mapping and the
session data.
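token generation, sketched (the map type and ownership details in
state.zig differ):

    const std = @import("std");

    /// 32 random bytes, hex-encoded into a 64-char opaque session token
    fn newSessionToken() [64]u8 {
        var raw: [32]u8 = undefined;
        std.crypto.random.bytes(&raw);
        return std.fmt.bytesToHex(raw, .lower);
    }

    // lookup side, in outline: the cookie carries only the token and a
    //   std.StringHashMap([]const u8)        (token -> DID)
    // resolves it back; getSessionDid is a map lookup, logout a remove.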
follows the same pattern as plyr.fm's session handling
(secrets.token_urlsafe(32) → DB mapping → cookie). pollz has the same
bug and should
be fixed separately.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- "built locally" wrongly implied the user's machine; it runs on
the server. rewritten to say ken fetches from your PDS and builds
the index on the server.
- added "experimental" upfront.
- clarified the save is opt-in: nothing is written to your repo
unless you click save. the result is stored as a record with
vector blobs, not just "a record."
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the previous commit rewrote the entire style.css and dropped ~220
lines of rules (search form, typeahead, progress bar, results,
modals, etc.), nuking the site layout. this restores the known-good
CSS from 39daa31 and applies only the targeted pack-menu changes:
kill the pill border/padding, make the trigger inline text, add
margin-left: auto on the share button.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the bordered pill wrapped awkwardly on mobile and buried share behind
2 extra clicks. share is about the query, not the pack — it should be
one click from the results page.
- pack state ("saved" / "not saved") is inline muted text in the
stats line, clickable for the disclosure (save/delete/view actions)
- share is back as its own inline button, pushed right with
margin-left: auto, visible whenever there's a query
- no more bordered pill, no "search index" label, no wrapping issues
- trimmed popover description copy
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- "nothing lives anywhere else" was wrong — writing to a public PDS
is a broadcast. replaced with a data-propagation section that says
so plainly and gives the actual opt-out (don't click save).
- "nothing new is exposed" was technically defensible but misleading.
semantic search is a new discoverability surface even when the
underlying records were already public. the sharing section now
says both halves.
- added the collection filter + auto time cutoff to the how-it-works
list since they're now a material part of the pipeline.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the about modal's "example pack on pdsls.dev" link started life as
href="#" and only got overwritten inside renderPackActions when a
signed-in user had a persisted pack. on mobile, if you opened the
about modal before the status poll had fired (or before signing in
at all), the "#" placeholder was still there — and mobile browsers
interpret <a href="#" target="_blank"> as a fresh navigation to
"/#", dumping the user on the same page with a dangling hash. on
desktop the issue was invisible because the page tended to have
already polled by the time you opened the modal.
fix: bake in a real fallback href pointing at the ken maintainer's
own pack collection via handle. pdsls resolves the handle to a DID
and renders the collection view regardless of which specific rkey
is current. signed-in users still get their own pack URL from
renderPackActions as before; this just guarantees the link is never
broken for the signed-out flow or the pre-poll window.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
three holistic fixes addressing user frustration with the pack-meta
line:
1. the resting state now reads "search index: saved" / "search index:
not saved" inside a real pill button. the earlier version was bare
muted text that said "saved" with no subject — users legitimately
asked "saved what?". the subject is explicit now and survives
zero-context viewing.
2. the trigger is a visible bordered pill with padding, hover, and a
0.9em chevron. the earlier 0.75em chevron on muted text was nearly
invisible — several users couldn't even tell it was a dropdown.
3. removed the orphaned "·" separator that dangled after "no text
content" because margin-left: auto pushed the menu far right. no
separator span anymore; the pill's border gives it enough visual
weight to not need one. flex gap handles spacing.
also renamed "pack" → "search index" in all user-facing copy for
consistency: the description, view-on-pds link, save button, delete
button, and delete confirm prompt. "pack" stays as the internal
record name (tech.waow.ken.pack) but isn't user-facing anymore.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the pack-meta line had the state label ("saved") sitting directly next
to a "delete" button and a "share" button, which read as the nonsense
phrase "saved delete" and made it unclear what any of the verbs acted
on. confusing enough that a user literally asked "delete what?".
rework: the state label IS the menu trigger now. one click on
"saved" / "not saved" opens a small popover anchored to the trigger
with:
- a one-sentence description of what the pack is and where it lives
- view pack on PDS (if saved)
- delete saved pack (if saved)
- save pack to my PDS (if not saved)
- share this search (if there's a query to share)
every action closes the menu on click so the user gets immediate
visual confirmation. mobile gets a left-anchored panel so it doesn't
clip off the right edge of the viewport.
shared-view mode swaps the entire disclosure for a static "shared
view" label — there's no auth and nothing to save/delete, so a menu
would be empty.
no backend changes, just frontend. deploy picks it up because assets
are embedded at build time.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
new just recipes: just deploy (fly deploy from backend/), just fmt /
just fmt-check (zig fmt). the fmt baseline picks up pre-existing drift
in display.zig and oauth.zig so fmt-check passes clean on main going
forward.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
review followups on the filter commit:
- the listRecords fallback was skipping denylisted collections but
never running the count pass or applying the TID cutoff, so any repo
that hit the fallback (old/flaky PDS, CAR walker failure) regressed
to the old unbounded behavior. now the fallback paginates non-skipped
collections, then post-hoc applies the same 2-year TID cutoff if the
total crosses LARGE_REPO_THRESHOLD. post-hoc is less efficient than
the pre-filter count pass but listRecords doesn't report per-collection
totals, so we can't decide before fetching. fallback is
rare enough that the extra transient memory is acceptable; revisit
if it starts firing routinely.
- made decodeTidMicros public so the fallback can reuse it without
copy-pasting the TID alphabet.
- derived per_collection from collection_of post-cutoff in the
fallback, matching the CAR path (previously inlined during
pagination, which meant cutoff-dropped records still counted).
- skipped_by_collection stays 0 on the fallback path (documented) —
listRecords can't cheaply report what we chose not to fetch.
- fixed pack-meta copy to say "records with no text content" instead
of "likes/follows/reposts" — the denylist covers blocks, listitems,
threadgates, actor status, chat declarations, and tangled graph
follows/stars too. the old copy was misleading for any repo with
those record types.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the streaming CAR walker unblocked very large repos at fetch/parse time,
but the embed pipeline still choked: pfrazee's 196k records (mostly
likes/follows/reposts) burned transient memory + embed time on records
with no semantic text. this was never going to scale beyond me.
two transparent concessions, surfaced honestly in the UI:
1. collection-level filter. records in DEFAULT_SKIP_COLLECTIONS (likes,
follows, reposts, blocks, listitems, threadgate, postgate, actor
status, chat declaration, sh.tangled graph follow/star) are dropped
before CBOR value decode — skipped records cost only the MST entry
iteration. applied in both the CAR walker and the listRecords
fallback for consistency.
2. auto time cutoff. if post-collection-filter count still exceeds
LARGE_REPO_THRESHOLD (30k), enable a 2-year TID cutoff. implemented
as a cheap count-only MST walk before the full walk — we learn the
post-filter size without decoding record values, then decide. TIDs
decode from base32-sortable rkeys in ~15 lines; non-TID rkeys (self,
etc.) are always kept.
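the decode, sketched in full (the in-tree decodeTidMicros may differ in
details):

    const std = @import("std");

    // base32-sortable alphabet used by atproto TIDs
    const TID_ALPHABET = "234567abcdefghijklmnopqrstuvwxyz";

    /// decode a 13-char TID rkey to microseconds since the unix epoch.
    /// returns null for non-TID rkeys ("self", literal keys, wrong length),
    /// which the cutoff always keeps.
    fn decodeTidMicros(rkey: []const u8) ?u64 {
        if (rkey.len != 13) return null;
        var n: u64 = 0;
        for (rkey) |c| {
            const idx = std.mem.indexOfScalar(u8, TID_ALPHABET, c) orelse return null;
            n = (n << 5) | @as(u64, idx);
        }
        return n >> 10; // low 10 bits are a clock id, the rest is the timestamp
    }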
pipeline shape becomes: openRepo → countOpened → decide filter →
walkOpened → close. the open/walk split keeps the mmap alive across
both passes so the count pass is essentially free.
pfrazee smoke: 195,908 total → 37,611 kept post collection filter →
cutoff kicks in → 35,682 final. zzstoatzz.io regression-clean: 17,350
total → 5,145 kept, 12,205 skipped, no cutoff.
status response gains skipped_by_collection, skipped_by_time,
applied_tid_cutoff_ms. pack-meta line in the UI shows the honest
breakdown: "5,145 records · 190 collections · skipped 12,205
likes/follows/reposts" for normal repos; "35,682 records · 30
collections · skipped 158,297 likes/follows/reposts · indexed records
after 2023-04-01 (1,929 older records skipped)" for pfrazee.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
ken's old walker buffered the entire sync.getRepo response body on the heap via
zat.HttpTransport.fetch (which always dupes into std.Io.Writer.Allocating),
then handed it to zat.car.readWithOptions which eagerly materialized a
StringHashMap of every CID → block content. that combination capped ken at
repos with <200k blocks and kept the whole CAR resident for the duration of
the walk. pfrazee.com (196k records, 72 MB CAR, 248k blocks counting MST
internals) sat just past the cliff.
this path now:
1. talks to std.http.Client directly for the one call that needs it,
streaming the response body straight into /tmp/ken-car-{seq}-{did}.car
via std.Io.File.Writer.initStreaming — no heap staging
2. mmaps the temp file read-only via std.Io.File.MemoryMap (kernel pages
in what we touch, evicts what we don't)
3. feeds the mmap slice to zat.car.streamBlocks (v0.3.0-alpha.24) and
builds a CID → {offset, len} index into the buffer via pointer
arithmetic — no block content duplication, ~16 bytes of value per
entry instead of 48
4. walks the MST through that index, delete-on-destroy cleans the
temp file whether the walk succeeds or errors out
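the index math from step 3, sketched (BlockRef and indexBlock are
stand-ins; the real per-block callback comes from zat.car.streamBlocks):

    const std = @import("std");

    // each streamed block's bytes are a sub-slice of the mmapped CAR, so an
    // {offset, len} pair into that buffer is enough to find it again later.
    const BlockRef = struct { offset: usize, len: u32 }; // ~16 bytes of value per entry

    fn indexBlock(
        mapped: []const u8,
        cid_key: []const u8,
        block_bytes: []const u8,
        index: *std.StringHashMap(BlockRef),
    ) !void {
        const offset = @intFromPtr(block_bytes.ptr) - @intFromPtr(mapped.ptr);
        try index.put(cid_key, .{ .offset = offset, .len = @intCast(block_bytes.len) });
    }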
every other ken call still uses zat.HttpTransport — only this one endpoint
needs streaming. bumps zat to v0.3.0-alpha.24 for car.streamBlocks.
smoke tested against two real repos via a standalone /tmp/ken_smoke.zig that
imports repo_walk.zig directly:
zzstoatzz.io: 17,348 records, 200 collections, 8.2 MB CAR
pfrazee.com: 195,904 records, 39 collections, 72.0 MB CAR
pfrazee walks end-to-end in ~11.5s on my laptop. 0 lingering /tmp/ken-car-*
files after either run. verified fly's /tmp is on the rootfs overlay (7.4G
free on the current machine), not tmpfs, so streaming to disk does not
compete with the 4 GB memory budget.
not yet addressed: indexer.zig still holds the full records[] + extracted
text + 384-dim vectors for every record simultaneously during embedding,
which for pfrazee would be ~300 MB of vectors on top of the mmap. walking
pfrazee is unblocked; embedding pfrazee needs a separate, record-at-a-time
pipeline that's planned as a follow-up.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
phase A-F was an internal numbering from the python spike → zig backend
migration. nothing has been "phase F" in weeks; it just reads as nonsense
in the health endpoint. /health now returns {"status":"ok"}, and the
main.zig module comment describes what ken actually does.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
share view recipients can no longer type new queries. the search box is
pre-populated with the shared query and set readonly, so the recipient
sees the specific result the sharer meant to share rather than
accidentally getting a browseable interface over the target's whole
repo. the underlying PDS records are public either way (and the sharer
accepted that when they clicked save), but the share modal copy
promises "this query" — the UI should honor that. a recipient who wants
to search freely can sign in with their own handle.
visual: dashed border + muted text + transparent fill so it reads as
"fixed, not editable" without looking disabled. readonly (not disabled)
keeps select + copy working.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
carousel: 8 static lines describing what the backend is doing right now
(CAR walk, zat parsing, bge-small batching, cosine search cost, reuse by
(uri, cid), etc), rotating every 4.5s during the load phase. when
pollStatus sees prior_build_ms / prior_count come back, the pool is
extended in place with two calibrated lines that report the user's real
last-run numbers — they join the rotation alongside the static facts
instead of pinning. the carousel was killed during the branding pass on a
"link, don't explain" principle that overshot; it does a different job from
the about modal (occupying the wait with something useful vs. answering
"what is this" when asked).
ios zoom: safari auto-zooms into any text input rendered smaller than
16px on tap. the body inputs were at clamp(13px, 1.6vmin, 15px) which
tripped it on every phone. switched both the signin/search inputs and
the share-modal url input to font-size: max(16px, var(--text-*)) — a 16px
floor on mobile, clamp-scaled on desktop.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
MIT, verbatim from the SPDX canonical template
(raw.githubusercontent.com/spdx/license-list-data/main/text/MIT.txt) with
only the <year> and <copyright holders> placeholders filled in.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
semantic search over an atproto repo. sign in and the backend walks your PDS
via com.atproto.sync.getRepo (one CAR, parsed locally via zat), embeds
records with bge-small through llama.cpp, and writes the resulting vector
pack back to your own PDS as a tech.waow.ken.pack record + blobs. nothing
lives anywhere else — delete the record and the pack is gone.
- zig backend, std.http.Server, zat for atproto primitives
- llama.cpp batched inference, 16 records per encode
- incremental re-index: unchanged records are reused by (uri, cid)
- partial search works from the moment the first batch finishes
- opt-in save + delete with explicit consent
- auth-first UI — no drive-by indexing of other people's repos
- public share URLs with per-query OG tags
- running at https://ken.waow.tech on fly.io
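search is cosine between the query vector and the stored record vectors.
a minimal sketch (384-dim bge-small; whether ken pre-normalizes at embed
time is not shown here):

    const std = @import("std");

    /// cosine similarity between two equal-length embedding vectors; if both
    /// are L2-normalized at embed time this reduces to a dot product.
    fn cosine(a: []const f32, b: []const f32) f32 {
        std.debug.assert(a.len == b.len); // 384 for bge-small
        var dot: f32 = 0;
        var na: f32 = 0;
        var nb: f32 = 0;
        for (a, b) |x, y| {
            dot += x * y;
            na += x * x;
            nb += y * y;
        }
        return dot / (@sqrt(na) * @sqrt(nb) + 1e-9);
    }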
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>