search for standard sites pub-search.waow.tech
search zig blog atproto

docs: update for offprint support and local sqlite replica

- README.md: add offprint to platform list, mention "other" category
- CLAUDE.md: add local SQLite read replica, document platforms
- standard-search-planning.md: mark offprint as now indexed

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

zzstoatzz 3ed08c73 d69f0682

+13 -9
+5 -1
CLAUDE.md
@@ -14,7 +14,11 @@
 - **backend** (Zig): HTTP API, FTS5 search, vector similarity
 - **tap**: firehose sync via bluesky-social/indigo tap
 - **site**: static frontend on Cloudflare Pages
-- **db**: Turso (SQLite) - FTS5 + embeddings
+- **db**: Turso (source of truth) + local SQLite read replica (FTS queries)
+
+## platforms
+- leaflet, pckt, offprint: known platforms (detected via basePath)
+- other: site.standard.* documents not from a known platform

 ## search ranking
 - hybrid BM25 + recency: `ORDER BY rank + (days_old / 30)`
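
For readers skimming this diff, the `ORDER BY rank + (days_old / 30)` rule in the CLAUDE.md context can be illustrated with a small sketch. This is not the backend's code (the backend is Zig); it only mimics the ordering in Python. It assumes `rank` behaves like SQLite FTS5's BM25 rank, where a more negative value means a better text match and results sort ascending. The `Hit` type and the sample values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    title: str
    rank: float      # assumed FTS5-style BM25 rank: more negative = better match
    days_old: float  # document age in days


def sort_key(hit: Hit) -> float:
    # Mirrors `rank + (days_old / 30)`: every 30 days of age adds one point,
    # and lower totals sort first.
    return hit.rank + hit.days_old / 30


def order_hits(hits: list[Hit]) -> list[Hit]:
    # Ascending sort, like ORDER BY without DESC.
    return sorted(hits, key=sort_key)


hits = [
    Hit("old, strong match", rank=-5.0, days_old=120),  # -5.0 + 4.0 = -1.0
    Hit("new, weaker match", rank=-3.0, days_old=15),   # -3.0 + 0.5 = -2.5
    Hit("new, weakest match", rank=-0.5, days_old=0),   # -0.5 + 0.0 = -0.5
]
ordered = [h.title for h in order_hits(hits)]
```

Under these sample values the 15-day-old document overtakes the stronger but 120-day-old match, which is the recency boost the docs describe.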
+2 -2
README.md
@@ -2,7 +2,7 @@

 by [@zzstoatzz.io](https://bsky.app/profile/zzstoatzz.io)

-search ATProto publishing platforms ([leaflet](https://leaflet.pub), [pckt](https://pckt.blog), and others using [standard.site](https://standard.site)).
+search ATProto publishing platforms ([leaflet](https://leaflet.pub), [pckt](https://pckt.blog), [offprint](https://offprint.app), and others using [standard.site](https://standard.site)).

 **live:** [pub-search.waow.tech](https://pub-search.waow.tech)

@@ -35,7 +35,7 @@
 GET /health # health check
 ```

-search returns three entity types: `article` (document in a publication), `looseleaf` (standalone document), `publication` (newsletter itself). each result includes a `platform` field (leaflet, pckt, etc). tag and platform filtering apply to documents only.
+search returns three entity types: `article` (document in a publication), `looseleaf` (standalone document), `publication` (newsletter itself). each result includes a `platform` field (leaflet, pckt, offprint, or other). tag and platform filtering apply to documents only.

 **ranking**: results use hybrid BM25 + recency scoring. text relevance is primary, but recent documents get a boost (~1 point per 30 days). the `since` parameter filters to documents created after the given ISO date (e.g., `since=2025-01-01`).

+6 -6
docs/standard-search-planning.md
@@ -7,7 +7,7 @@
 - [standard.site](https://standard.site/) - shared lexicons for long-form publishing on ATProto
 - [leaflet.pub](https://leaflet.pub/) - implements `pub.leaflet.*` lexicons
 - [pckt.blog](https://pckt.blog/) - implements `blog.pckt.*` lexicons
-- [offprint.app](https://offprint.app/) - implements `app.offprint.*` lexicons (early beta)
+- [offprint.app](https://offprint.app/) - implements `app.offprint.*` lexicons
 - [ATProto docs](https://atproto.com/docs) - protocol documentation

 ## context
@@ -173,11 +173,11 @@
 - no `textContent` field - content is in nested `pages[].blocks[]`
 - will need to continue parsing blocks OR wait for migration

-### offprint.app - LIKELY EARLY BETA
-- no `site.standard.document` records found on offprint.app account
-- no `app.offprint.document` collection visible
-- website shows no example users/content
-- probably in early/private beta - no public records yet
+### offprint.app - NOW INDEXED (2026-01-22)
+- writes `site.standard.document` records with `app.offprint.content` blocks
+- has `textContent` field (pre-flattened)
+- platform detected via basePath (`*.offprint.app`, `*.offprint.test`)
+- now fully supported alongside leaflet and pckt

 ### implication for architecture

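
The basePath-based platform detection described in the planning notes could look roughly like the sketch below. It is an illustration in Python, not the project's Zig implementation. Only the two offprint patterns come from the diff; the leaflet and pckt patterns, the `host_of` helper, and the assumption that a basePath may carry a scheme or trailing path are all hypothetical.

```python
from fnmatch import fnmatch

# Only the offprint patterns appear in the planning notes; the leaflet and
# pckt patterns are assumed placeholders for illustration.
PLATFORM_PATTERNS = {
    "offprint": ["*.offprint.app", "*.offprint.test"],
    "leaflet": ["leaflet.pub", "*.leaflet.pub"],  # assumed
    "pckt": ["pckt.blog", "*.pckt.blog"],         # assumed
}


def host_of(base_path: str) -> str:
    # Drop an optional scheme and any trailing path, keeping only the host.
    without_scheme = base_path.split("://", 1)[-1]
    return without_scheme.split("/", 1)[0].lower()


def detect_platform(base_path: str) -> str:
    host = host_of(base_path)
    for platform, patterns in PLATFORM_PATTERNS.items():
        if any(fnmatch(host, pattern) for pattern in patterns):
            return platform
    # site.standard.* documents that match no known platform.
    return "other"
```

For example, `detect_platform("alice.offprint.app")` yields `"offprint"`, while a basePath on an unrelated domain falls through to `"other"`, matching the "other" category added in this change.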