commits
flo.by uploaded his catalog; AuDD identified each track's dominant
match as "Floby IV" (his stage name). every scan returned
is_flagged=true, which:
- showed a red "potential copyright violation" badge to the
artist on his own /portal page
- fired an admin DM ("copyright flag on plyr.fm / primary: X
by Floby IV") — admin received ~30 DMs in one session
`sync_copyright_resolutions` flipped is_flagged=false within 5min,
but only after the artist had already seen the flag and the DM
spam had landed.
fix: in `_store_scan_result`, look up the uploader's artist record
when is_flagged=true and compare slugified forms of the dominant
match artist to the uploader's handle and display name. on a
self-match, demote is_flagged to false at write time so the UI
flag and the DM never fire. a `copyright self-match suppressed` line is
logged for observability.
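roughly the shape of that check — a sketch only; the artist lookup and
most field names are assumptions, `_store_scan_result` and `is_flagged`
are the real names from this commit:

```python
import re

def slugify(s: str) -> str:
    return re.sub(r"[^a-z0-9]+", "-", s.lower()).strip("-")

async def _store_scan_result(db, track, scan) -> None:
    if scan.is_flagged:
        uploader = await get_artist_by_did(db, track.artist_did)  # assumed lookup
        match = slugify(scan.dominant_match_artist or "")
        if match and match in {slugify(uploader.handle), slugify(uploader.display_name or "")}:
            logger.info("copyright self-match suppressed", extra={"track_id": track.id})
            scan.is_flagged = False  # demoted at write time: no badge, no DM
    db.add(scan)  # persist as before
```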
separate semantic bug (sync flipping flags whose URI was never
labelled, not just negated) is unaddressed here — this is the
short-term fix to stop creator-visible flags + DM spam.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Empirical finding from iOS lock-screen testing: embed surfaces
(CollectionEmbed.svelte for albums/playlists, embed/track/[id]/+page.svelte
for single tracks) set NOTHING on navigator.mediaSession. Result on iOS
Safari and Android Chrome lock-screen controls: generic placeholder title,
no cover art, next/previous buttons either greyed out or routing to nothing.
The main app's Player.svelte has the right behavior; the embeds were
just missing it.
Adds `lib/media-session.ts` — small helper module that wraps the four
MediaSession APIs we use (metadata, playbackState, positionState,
action handlers) with no-op fallbacks on platforms without the API and
a try/catch around setPositionState (which throws on stale
duration/position during track transitions).
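The wrapper shape, sketched — exact exports and signatures in
`lib/media-session.ts` may differ:

```ts
// lib/media-session.ts (sketch)
const ms = () => ('mediaSession' in navigator ? navigator.mediaSession : undefined);

export function setMetadata(init: MediaMetadataInit | null): void {
  const session = ms();
  if (session) session.metadata = init ? new MediaMetadata(init) : null;
}

export function setPlaybackState(state: MediaSessionPlaybackState): void {
  const session = ms();
  if (session) session.playbackState = state;
}

export function setPositionState(state?: MediaPositionState): void {
  try {
    ms()?.setPositionState(state);
  } catch {
    // throws on stale duration/position during track transitions — drop it
  }
}

export function setActionHandler(
  action: MediaSessionAction,
  handler: MediaSessionActionHandler | null
): void {
  try {
    ms()?.setActionHandler(action, handler);
  } catch {
    // some platforms reject unsupported actions — treat as a no-op
  }
}
```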
Wires the helpers into both embed surfaces:
- Metadata effect: re-runs on track change. Pulls title/artist from
the track and falls back through track image → collection image
for artwork (single-track embed uses trackCoverUrl directly).
- PlaybackState effect: re-runs on paused change.
- PositionState effect: re-runs on time/duration change.
- Action handlers: registered ONCE on mount with cleanup on unmount.
Single-track embed explicitly nulls previoustrack/nexttrack so the
OS greys them out instead of inheriting stale handlers.
- Cleanup on unmount: clears metadata, sets playbackState to 'none',
nulls all handlers. Prevents stale lock-screen entries when the
user navigates away from an embed mid-playback.
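In an embed, the wiring above reduces to something like this (helper
names from the sketch above; track/cover prop shapes assumed):

```svelte
<script lang="ts">
  import { onMount } from 'svelte';
  import { setMetadata, setActionHandler } from '$lib/media-session';

  let { track, coverUrl } = $props(); // shapes assumed

  // metadata effect — re-runs on track (or artwork) change
  $effect(() => {
    setMetadata({
      title: track.title,
      artist: track.artist,
      artwork: coverUrl ? [{ src: coverUrl }] : [],
    });
  });

  onMount(() => {
    // single-track embed: null handlers so the OS greys the buttons out
    setActionHandler('previoustrack', null);
    setActionHandler('nexttrack', null);
    return () => setMetadata(null); // no stale lock-screen entry after unmount
  });
</script>
```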
Does NOT touch Player.svelte — it has its own (older, inline)
MediaSession setup that works. Refactoring it to use these helpers
is a separate dedup concern.
Validated via svelte:svelte-file-editor agent: zero autofixer issues,
reactivity correct (each effect reads only its deps), unmount cleanup
fires correctly, and `$state` closures inside the action handlers
read the current value at handler-call time (not a mount-time
snapshot).
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(player): synchronous fast path for auto-advance to survive locked-screen autoplay
Reported in zzstoatzz.io/plyr.fm#1: on Android with the screen locked,
album / playlist playback stops at the end of each track instead of
advancing to the next. The reporter notes it worked in early February.
## Root cause
The chain from `<audio onended>` to `audio.play()` on the next track
goes through ~5 microtask boundaries plus an `await getAudioSource(...)`:
ended → handleTrackEnded → queue.next()
→ $effect: queue → player.currentTrack
→ $effect: load new src (await getCachedAudioUrl, fetch HEAD if gated)
→ audio.src = src; audio.load(); wait for loadeddata
→ $effect: shouldAutoPlay && !isLoadingTrack → player.paused = false
→ $effect: paused-sync → audio.play()
On a foregrounded tab this is milliseconds and works fine. On Android
with the screen locked, Chrome aggressively throttles non-foreground JS
and treats the page as "no longer audible" the moment the previous
track ends. By the time `audio.play()` finally runs, the implicit-
playback grace is gone and the call rejects with NotAllowedError. The
only way to resume is via a Media Session action handler (an explicit
lock-screen button press), which is exactly the workaround the
reporter was using.
This is not a regression from any one commit — the chain has had this
shape since before February. Most likely Chrome on Android tightened
locked-screen autoplay/freeze behavior between then and now, exposing
a long-standing fragility.
## Fix
Three coordinated changes:
1. **`queue.autoAdvanceTrack` getter** — single seam for "what should
natural end-of-track continuation play next". Today returns
`tracks[currentIndex + 1]`. Future continuation strategies (album
tail, feed continuation, recommendations) plug in here.
2. **Next-track prefetcher** — `resolveAudioSource` (extracted to
`lib/audio-source.ts`) returns a structured `ResolvedSource`
discriminator (ready / gated-denied / failed). A `$effect`
opportunistically resolves `queue.autoAdvanceTrack` while the
current track plays and stores the result in `preloadedNext`.
IndexedDB cache lookup and gated HEAD check move out of the
critical path.
3. **Synchronous fast path in `handleTrackEnded`** — when the
prefetcher has a ready source for the next track and we're not in
jam mode, swap `audio.src` and call `audio.play()` in the same tick
as the `ended` event. Reactivity (queue.next, player.currentTrack)
updates AFTER, so the autoplay grace is preserved. Pre-bumping
previousTrackId/previousFileId/previousQueueIndex before
`player.currentTrack = next; queue.next()` keeps downstream
effects no-ops; without it the queue→player sync effect's
`indexChanged` branch would seek the just-started audio back to 0.
When the preload isn't ready (race, jam active, gated denial), we
fall back to the existing reactive chain — same behavior as today.
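The fast path, sketched — state and helper names beyond the ones named
above are assumptions:

```ts
function handleTrackEnded(): void {
  const next = queue.autoAdvanceTrack;
  if (next && preloadedNext?.track === next &&
      preloadedNext.source.kind === 'ready' && !jamActive) {
    // same tick as `ended`: swap src and play before any microtask boundary,
    // so the implicit-playback grace from the finished track still applies
    audio.src = preloadedNext.source.url;
    void audio.play();
    // pre-bump what the queue→player sync effect compares against, so the
    // reactive updates below become no-ops instead of seeking back to 0
    previousTrackId = next.id;
    previousFileId = next.file_id;
    previousQueueIndex = queue.currentIndex + 1;
    player.currentTrack = next;
    queue.next();
    return;
  }
  queue.next(); // slow path: existing reactive chain, unchanged
}
```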
Plus structured telemetry (`recordPlaybackRejection`) logging
errorName, visibilityState, audio.readyState, fast-path flag, and
preload state so we can confirm in production whether the fast path
actually dodges the autoplay block per browser bucket.
## What this PR does NOT do
- Does not change collection needle-drop semantics. Album/playlist
row clicks still call `queue.playNow(track)` and discard collection
context — separate problem. The new `autoAdvanceTrack` getter is
the seam where a future "soft context" continuation strategy plugs in.
- Does not refactor `TrackItem.svelte`'s `$effect.pre` reset block
or other pre-existing patterns. Scoped to the auto-advance chain.
## Validation
- `just frontend check`: 0 errors / 0 warnings.
- Reviewed via `svelte:svelte-file-editor` agent — confirmed prefetch
effect's reactivity (correct), fast-path state-write ordering
(correct, with comment-strengthening applied), and blob-URL
accounting (correct across both paths).
- `lib/audio-source.ts` extracted out so Player.svelte's growth is
justified by the actual fast-path/prefetch substance, not pure
helpers that could live elsewhere.
## Test plan
- [x] svelte-check clean.
- [ ] After deploy: reproduce on Android (screen locked) with an album
that has 3+ tracks; confirm auto-advance works end-to-end.
- [ ] Confirm desktop foreground playback unchanged.
- [ ] Confirm gated-track skipping still works (denial via prefetch
consumes the cached entry; active gated denial still triggers
the toast).
- [ ] After 24h on prod: query logfire for `audio play() rejected`
events; analyze fast-path vs slow-path rejection rates per
`error.name` and `document.visibility_state` bucket.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(player): preserve auto-advance through gated tracks; fix telemetry pollution
review feedback on #1339:
1. **auto-advance into a gated track stopped playback instead of
   skipping.** my parameterized
`handleGatedDenial(err, fromAutoAdvance)` was ALWAYS called with
`false`, including from the loader effect when consuming a cached
`gated-denied` preload. so after `handleTrackEnded` set
`shouldAutoPlay = true` and `queue.next()` advanced into the
gated-denied track, `handleGatedDenial` clobbered shouldAutoPlay
back to false before `queue.goTo(nextPlayable)` — playback
stopped instead of skipping the gated track and continuing.
pre-fast-path code unconditionally set `shouldAutoPlay = true` in
this branch.
fix: drop the `fromAutoAdvance` parameter; always intend to
auto-play after a gated skip. matches pre-PR behavior. whether
the user clicked a gated track or auto-advance landed on one,
the user wants the next playable track to start.
2. **fallback telemetry was polluting the rejection metric.**
`recordAutoAdvanceFallback` emitted via `recordPlaybackRejection`,
whose event name is `audio play() rejected`, even though no
`play()` had been attempted on the slow path at that point. any
dashboard query filtering on that event name would have counted
slow-path-fallback markers as play rejections.
fix: drop `recordAutoAdvanceFallback` entirely. instead, instrument
the existing slow-path `play().catch(...)` site (which previously
only `console.error`'d) with `recordPlaybackRejection({fastPath:
false, ...})`. now BOTH paths emit the same event, and the
`playback.fast_path` field is the genuine discriminator for
comparing rejection rates between fast and slow paths. that's the
actual question the telemetry was trying to answer.
svelte-check: 0 errors / 0 warnings.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* chore(player): drop dead frontend telemetry plumbing
review feedback: I was writing comments and commit copy that referenced
"dashboards" for fast-vs-slow path comparison. There are no dashboards.
Frontend logfire is config-flagged off (`config.browser_observability`)
because it was destabilizing the backend; nobody is querying frontend
spans. So `recordPlaybackRejection` was emitting `logfire.info` against
an unconfigured client — net effect: dead code with imaginary purpose.
Removed:
- `recordPlaybackRejection` + `PlaybackRejectionContext` from
`lib/observability.ts`. `initObservability` itself stays — fetch /
XHR auto-instrumentation is the part that DOES propagate trace
headers to the backend, and that's still useful when the flag is on.
- Both call sites in Player.svelte (slow-path and fast-path
`play().catch(...)`) now `console.error` the same way the rest of
the file already did. If a user reports lock-screen playback
trouble, the actual debug pathway is "ask them to repro in
devtools and capture the console."
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
`test_cross_user_like` flakes intermittently in the staging integration
suite because of a real race in the like → unlike sequence:
1. user clicks LIKE → DB INSERT row R (atproto_like_uri=NULL),
`pds_create_like(R.id)` enqueued via docket.
2. user clicks UNLIKE before pds_create_like runs. atproto_like_uri
is still NULL so we just DELETE R; no PDS-delete is scheduled
because there's no URI yet.
3. `pds_create_like(R.id)` finally runs:
a. PDS create returns URI X.
b. SELECT R.id → row gone → orphan-cleanup branch fires.
c. `delete_record_by_uri(X)` is scheduled.
4. Jetstream emits the `app.bsky.feed.like` create event for X
BEFORE the matching delete event from (3c) propagates.
5. `ingest_like_create` finds no existing row for (track, user)
→ INSERTS a fresh row with URI X. **the like just resurrected
itself after the user explicitly unliked.**
6. eventually the delete event arrives and `ingest_like_delete`
by URI X clears the resurrected row — but in the gap the user
sees their unlike undone.
Fix: in (3c), tombstone the URI in Redis with a 5-minute TTL BEFORE
issuing the orphan PDS delete. `ingest_like_create` checks the
tombstone and drops the matching create event in (5). The TTL only
needs to cover Jetstream propagation; expiry is harmless because the
matching delete event still arrives shortly after.
Why Redis tombstone over a `cancelled_at` schema column: no migration,
no read-path filtering across ~15 query sites, scoped fix to the two
files actually involved in the race. Local Redis blip falls back to
the existing Jetstream-delete cleanup; user briefly sees the ghost
like but it's cleared seconds later.
Mirrors the existing track-tombstone pattern in `ingest.py` (which
prevents ghost tracks from cursor rewind) — same Redis primitive,
different prefix (`like_cancelled:` vs `plyr:tombstone:`) reflecting
the different concern (write race vs replay race).
Tests:
- tests/test_pds_create_like_tombstone.py — pds_create_like writes
the tombstone in the orphan branch and NOT on the happy path
(which would otherwise stall the user's own like indefinitely).
- tests/test_jetstream.py::TestIngestLikeCreate::test_skips_create_for_cancelled_uri
— ingest_like_create drops the create event when the URI is
tombstoned.
447/447 backend tests pass; ruff + ty clean.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the player bar already falls back to `track.album?.image_url` when the
per-track image is unset, but the track detail page, track-list items,
track grid cards, and the embed surface all rendered a placeholder
instead. result: the same track shows artwork in the player and a
blank in every other surface, including its own detail page.
extracted the inheritance rule into `lib/track-cover.ts`
(`trackCoverUrl` + `trackThumbnailUrl`) so every cover-rendering
surface routes through the same helper. semantically this models
the relationship correctly — the album HAS the art, the track
INHERITS unless it sets its own — instead of denormalizing the
album cover into each track row, which would silently go stale if
the album cover ever changed.
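the helper is essentially one expression per function (field names
assumed from the player-bar fallback described above):

```ts
// lib/track-cover.ts (sketch)
import type { Track } from '$lib/types';

export function trackCoverUrl(track: Track): string | undefined {
  // the track's own art wins; otherwise inherit the album's at view time
  return track.image_url ?? track.album?.image_url ?? undefined;
}

export function trackThumbnailUrl(track: Track): string | undefined {
  return track.thumbnail_url ?? track.album?.thumbnail_url ?? trackCoverUrl(track);
}
```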
side benefit: the recent /tmp upload bug (#1336) orphaned 3 tracks
with `image_id IS NULL` while their album record kept its cover.
those tracks now render the album cover at view time without any
DB backfill, and without needing the artist to re-upload.
surfaces touched:
- routes/track/[id]/+page.svelte — visible cover + og:image cascade
both routed through the helper; previewIsTrackArt simplifies to
`coverUrl !== undefined`
- lib/components/TrackItem.svelte — list item (used in album page,
my tracks, search results, etc.)
- lib/components/TrackCard.svelte — grid card
- routes/embed/track/[id]/+page.svelte — third-party embed (bg blur,
desktop side art, mobile art card all share the same coverUrl)
ATProto track records are unchanged: artists who didn't upload a
per-track image still don't claim one in their portable record.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(uploads): stage audio + image to shared storage before enqueueing docket
PR #1331 moved POST /tracks/ + PUT /tracks/{id}/audio onto docket
to fix a connection-pool problem, but mechanically forwarded the same
request-handler `/tmp/...` paths over Redis. on production fly.io,
`relay-api` runs multiple machines per process group; the docket worker
frequently lands on a different machine than the request handler. that
machine has its own /tmp, so the upload silently fails:
`FileNotFoundError: [Errno 2] No such file or directory: '/tmp/tmpXXXX.wav'`.
evidence (prod, 2026-04-25 darkhart.bsky.social, 7 jobs):
4 failed at varied phases (`upload`, `pds_upload`, `atproto`) — all with
the same FileNotFoundError. the 3 that succeeded all hit the same
`atproto` phase. pure luck of which worker grabbed the job. the
successful tracks also had `image_id IS NULL` in `tracks` because
`_save_image_to_storage` reads `image_path` and silently swallows the
exception (returns `(None, None, None)` on failure). that's the
"cover art shows in the player bar but not on the track page" symptom.
shape of the fix:
HTTP handler:
1. stream client upload to a request-local temp file (size enforce)
2. extract duration once, while bytes are still local
3. `storage.save(file, filename)` -> audio_file_id
4. stream image to memory, `storage.save` -> image_id, image_url, thumb_url
5. delete request-local temp file
6. enqueue docket task with file_id / image_id / URLs ONLY
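handler side, sketched — `storage.save` is real; the streaming,
duration, and scheduling helpers are illustrative:

```python
import os
import tempfile

async def upload_handler(storage, audio_upload, image_upload):
    tmp = tempfile.NamedTemporaryFile(suffix=".audio", delete=False)
    try:
        await stream_to(tmp, audio_upload, max_bytes=MAX_AUDIO_BYTES)  # 1: size-enforced
        tmp.close()
        duration = extract_duration(tmp.name)                          # 2: bytes still local
        audio_file_id = await storage.save(tmp.name, audio_upload.filename)  # 3: durable
        image_id, image_url, thumb_url = await storage.save_image(
            await image_upload.read()                                  # 4
        )
    finally:
        os.unlink(tmp.name)                                            # 5: nothing local survives
    await schedule_track_upload(                                       # 6: ids/URLs only —
        audio_file_id=audio_file_id, duration=duration,                # no /tmp path ever
        image_id=image_id, image_url=image_url, thumb_url=thumb_url,   # crosses a machine
    )
```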
worker (`run_track_upload`, `run_track_audio_replace`):
- signatures take `audio_file_id`, never a `*_path`
- `_validate_audio` reads duration from the context (no I/O)
- `_store_audio` reuses the staged id directly for web-playable
formats; for lossless, downloads from storage, transcodes via a
worker-local /tmp (single-task, never crosses machine boundary),
saves transcoded result back to storage
- `_upload_to_pds` downloads bytes from storage when not transcoded
- `_store_image` is a no-op forward (URLs already resolved in handler)
this preserves PR #1331's connection-pool win (handler returns once
storage is durable + docket task is enqueued) and removes the
multi-machine fragility entirely.
- drops aiofiles use on this path; uses `storage.get_file_data`
- removes the temp-file cleanup in `_process_upload_background` —
there's nothing local to clean
- audio_replace handler also captures `support_gate` up front so the
staged bytes land in the right bucket (private vs public) before
the worker sees them
regression coverage:
the structural change (`UploadContext` no longer has `file_path`,
docket task signatures no longer have `*_path` args) is the contract.
existing tests (`test_upload_session_reload`, `test_upload_phases`,
`track_audio_replace/test_pipeline.py`) exercise the orchestrator
end-to-end through the new context shape and pass green (46 tests).
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(uploads): clean up staged storage on handler-side + pre-DB worker aborts
addresses three orphan-cleanup gaps reviewer flagged on the staging refactor:
1. **handler-side**: any abort between `stage_audio_to_storage` and a
successful schedule call left staged storage objects orphaned and
the job stuck in PROCESSING. wrap staging+enqueue in try/except;
on failure delete staged audio (private if gated, public otherwise)
and image, mark the job FAILED.
2. **replace orchestrator**: `new_file_id_for_rollback` was None until
`_store_audio` returned. the gated-FLAC path (handler stages new
bytes to private bucket → `_store_audio` raises "supporter-gated
tracks cannot use lossless formats yet") left those bytes stranded.
initialize from `ctx.audio_file_id` upfront, thread the playable-
file extension through `_rollback_new_files`. add `is_gated: bool`
to ReplaceContext (handler-time decision) so rollback selects the
bucket the bytes ACTUALLY live in even under a concurrent PATCH
that flips support_gate between request and worker.
3. **upload orchestrator**: phases 1-5 raise UploadPhaseError without
releasing staged bytes. add `_cleanup_staged_media_pre_db` and a
`db_row_owns_media` boundary flag — orchestrator cleans up only
before `_create_records`, deferring to its existing reserve-then-
publish cleanup past that. covers the transcoded-sibling case.
session-expired path on both workers also deletes the staged bytes
(no recovery without a fresh sign-in; orphans serve nothing).
regression tests:
- `tests/api/test_upload_storage_cleanup.py` (4 tests)
- `track_audio_replace/test_pipeline.py` (1 test):
early-abort rolls back staged file from the right bucket per
`ctx.is_gated`
370/370 tests pass locally; ruff + ty clean.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* chore: drop stray backend/loq.toml
* chore(uploads): consolidate cleanup helper, drop redundant deferred import
once-over after CI green:
- removed redundant `from backend._internal import get_session` deferred
re-import inside `_process_upload_background` — the symbol is already
imported at module scope. updated `test_upload_session_reload` to
patch where the symbol is used (`backend.api.tracks.uploads.get_session`)
rather than where it's defined, which is the right pattern anyway.
- audio_replace's handler + session-expired path were inlining the
same `delete_gated if gated else delete` pattern that uploads exposes
as `_delete_staged_audio`. import + reuse instead of duplicating.
no behavior change; 370/370 tests pass.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
12 concurrent uploads targeting the same album (artist_did, slug) raced
in `get_or_create_album`: the losers caught IntegrityError and called
`db.rollback()` on the caller's shared AsyncSession. under concurrent
load this left 2/12 uploads blowing up with MissingGreenlet on the very
next pool checkout, ~300ms after INSERT albums — observed on stg during
the 12-chromatic-drone smoke test (2026-04-24).
replace SELECT-then-INSERT-then-catch with a single
`INSERT ... ON CONFLICT DO NOTHING RETURNING`. the race resolves at the
DB level, no rollback on a shared session, no churn on pool state.
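the upsert, sketched with SQLAlchemy's postgres dialect (model and
constraint names assumed):

```python
from sqlalchemy import select
from sqlalchemy.dialects.postgresql import insert

async def get_or_create_album(db, artist_did: str, slug: str, title: str):
    stmt = (
        insert(Album)
        .values(artist_did=artist_did, slug=slug, title=title)
        .on_conflict_do_nothing(index_elements=["artist_did", "slug"])
        .returning(Album.id)
    )
    album_id = (await db.execute(stmt)).scalar_one_or_none()
    if album_id is not None:
        return album_id, True  # this caller won the race
    # DO NOTHING returns no row to the losers: the row exists — read it
    # back. no rollback ever touches the shared session.
    row = await db.execute(
        select(Album.id).where(Album.artist_did == artist_did, Album.slug == slug)
    )
    return row.scalar_one(), False
```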
regression test fires 12 concurrent `get_or_create_album` calls on
separate sessions with the same title and asserts exactly 1 row, 1
`created=True`, and all callers agree on the resulting album id.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the POST /tracks/ and PUT /tracks/{id}/audio handlers used
`fastapi.BackgroundTasks.add_task`, which runs the task within the
same ASGI request lifecycle after the response is sent. consequence:
any request-scoped DB session stays checked out of the pool until the
task finishes (20-100s per upload), and nothing bounds concurrency.
today flo.by uploaded 6 tracks in a single album-create fan-out. six
concurrent uploads held six of the 10 pool slots for over a minute
and starved every other request (/auth/me p95 hit 9.7s, /health 3s).
root cause: this pattern was in place from the very first streaming-
uploads commit (26a48c75, Nov 2025). docket landed a month later and
all post-upload tasks were migrated piecemeal (copyright, embedding,
genre, image moderation, atproto sync, teal, export, pds backfill)
but the upload orchestration itself never was. audio replace (#1311,
Apr 2026) copied the same pattern.
changes:
- uploads.py: add run_track_upload (docket task, primitives only,
rehydrates session, delegates to existing _process_upload_background)
+ schedule_track_upload helper
- audio_replace.py: same trio for replace
- handlers: drop `background_tasks: BackgroundTasks` param, call
await schedule_* instead
- _internal/tasks/__init__.py: register both tasks in the docket list
- test_endpoint.py: patch the scheduler helper, not the orchestrator
- tests/integration/test_album_upload.py: add
test_album_upload_10_tracks_concurrently as regression coverage —
fires 10 concurrent uploads through an album and asserts all complete
- loq.toml: relax limits on uploads.py + audio_replace.py to cover the
new wrapper functions
the existing orchestrators (_process_upload_background,
_process_replace_background) keep the same signature so every pipeline
test that drives them directly continues to pass unchanged.
buys us:
- HTTP handler returns in <1s; request-scoped DB session released on
response instead of 100s later
- per-op DB sessions via db_session() inside the task, not held across
the whole upload
- bounded concurrency via settings.docket.worker_concurrency (default
10/worker x 2 prod machines = 20 concurrent uploads, rest queue in
Redis rather than saturating the pool)
- fresh session rehydration if OAuth refreshed between queue and task
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the component used a CSS overlay with `z-index: 1000` and manual ESC /
focus-trap plumbing, while every other sheet / modal in the codebase
(AudioRevisionsSheet, LikersSheet, LogoutModal, SearchModal,
PdsMigrationModal, FeedbackModal, Toast, TermsOverlay) uses
`z-index: 9999`. opening a confirm from *inside* one of those sheets —
specifically "restore" inside the audio version-history sheet —
rendered the confirm behind the sheet, forcing the user to dismiss the
sheet before they could click confirm.
bumping the z-index to 10000 would have been whack-a-mole. using the
native <dialog> element with `.showModal()` puts the dialog in the
browser's top layer, which stacks above every other element on the
page regardless of z-index. by construction, nested modals work.
secondary benefits from switching to the platform primitive:
- focus trap, aria-modal, ESC handling all native — removed our
reimplementations
- ::backdrop pseudo-element for backdrop styling
- role="alertdialog" for semantic correctness on confirmation prompts
- oncancel handler blocks ESC-dismiss while an async confirm is in
flight (pending=true), so the user can't dismiss a pending operation
mid-run and leave parent state inconsistent with UI state
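the core of the component, sketched — prop names match the callsite
description below; everything else is assumed:

```svelte
<script lang="ts">
  let { open = false, pending = false, onCancel } = $props();
  let dialog: HTMLDialogElement | undefined;

  $effect(() => {
    if (!dialog) return;
    if (open && !dialog.open) dialog.showModal(); // top layer: above any z-index
    else if (!open && dialog.open) dialog.close();
  });
</script>

<dialog
  bind:this={dialog}
  role="alertdialog"
  oncancel={(e) => {
    if (pending) e.preventDefault(); // block ESC-dismiss mid-operation
    else onCancel?.();
  }}
>
  <!-- title / message / confirm + cancel buttons -->
</dialog>
```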
public API of the component is unchanged — both existing callsites
(replace-audio confirm + restore-revision confirm in portal/+page.svelte)
continue to pass `open={...}` one-way and manage close via `onCancel`.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
follow-up to #1326 — the × button that clears a selected file in the
audio-replace row is the only remaining <button> in that group that
lacked `font-family: inherit`. it currently renders only an SVG icon
so there's no visible font right now, but matching the sibling buttons
keeps the group consistent and protects against future "what if we add
tooltip text" changes.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the "version history" button rendered in the browser default sans-serif
instead of the user's selected global font (mono by default). <button>
elements don't inherit font-family by default, so an explicit
`font-family: inherit` is required. matches the pattern already used
in login, tag, and track routes.
also added to .audio-replace-btn (same root cause; not visible in the
current screenshot because it only appears after a file is selected)
and .audio-upload-btn (it's on a <label> so inherits implicitly, but
adding for consistency and to protect against future markup changes).
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the restore path used to strip audioBlob from the republished record
whenever the PDS had already GC'd the revision's original CID, silently
downgrading the track to audio_storage="r2". plyr.fm's core promise is
that users own their audio on their PDS — dropping the blob ref would
break that promise.
new behavior when PDS returns BlobNotFound on the first publish:
1. fetch the R2 bytes via storage.get_file_data(file_id, file_type)
2. upload them to the user's PDS to mint a fresh blob CID
3. republish the record with the fresh audioBlob ref
4. commit the track with audio_storage="both" + the new CID
fallback chain (rare): if R2 is also missing the bytes, or the PDS
rejects the re-upload (oversize, transient), we keep the old behavior
— republish without audioBlob and downgrade to r2-only. restore still
completes; playback keeps working via audio_url.
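retry path, sketched — `storage.get_file_data` is the real helper; the
PDS client and exception names are assumptions:

```python
try:
    await pds.put_record(record)                    # first publish, old blob ref attached
except BlobNotFoundError:                           # PDS already GC'd the revision's CID
    data = await storage.get_file_data(file_id, file_type)  # 1: fetch R2 bytes
    blob = await pds.upload_blob(data)                       # 2: mint a fresh CID
    await pds.put_record(with_audio_blob(record, blob))      # 3: republish with fresh ref
    track.audio_storage = "both"                             # 4: commit the new state
    track.pds_blob_cid = blob.cid
```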
verified via smoke test on stg.plyr.fm (track 2202) before the fix:
post-restore PDS record had no audioBlob, DB had audio_storage="r2",
pds_blob_cid=null. with this patch, the restored record carries a
first-class PDS blob ref again.
tests:
- rewrote test_restore_falls_back_when_pds_blob_gc →
test_restore_reuploads_blob_when_pds_gc: asserts the retry record
carries the re-uploaded ref and DB ends with audio_storage="both"
- added test_restore_falls_back_to_r2_when_reupload_also_fails: covers
the R2-miss path (retained fallback behavior)
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the CF Pages frontend build config described in environments.md was
stale. verified against the live project config on each recreate today:
- build command: `cd frontend && bun run build`
→ `cd frontend && bun install && bun run build`
(SKIP_DEPENDENCY_INSTALL=1 is set to skip CF's auto-install, so the
build command has to run `bun install` itself)
- build output: `frontend/build`
→ `frontend/.svelte-kit/cloudflare`
(matches `pages_build_output_dir` in `frontend/wrangler.toml`)
- env vars list: added `SKIP_DEPENDENCY_INSTALL=1` which was missing
- prod custom domain line: added `www.plyr.fm` alongside `plyr.fm`
same fixes applied to both prod and staging subsections.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
reverting the retry-poll I added in #1320 after investigating properly.
the test failure in the post-#1319 integration run was NOT flakiness —
it's a real race condition in the like → pds_create → jetstream ingest
pipeline. tracked in #1321.
a retry-poll would have papered over a real bug and made future
diagnosis harder ("oh the test just takes 5s sometimes"). reverting
to the original assertion so the failure remains visible until the
underlying race is fixed.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(restore): fall back when PDS has GC'd old blob
* test(integration): retry-poll the liked_tracks check in test_cross_user_like
failed in the #1319 post-deploy integration run: the liked_tracks list
was still showing the track immediately after unlike. pre-existing
eventual-consistency gap — the likes pipeline has a small lag between
the unlike write and the liked list read (cache / read-replica).
matches the pattern test_upload_searchable already uses for similar
eventually-consistent reads: retry up to 5 times with 1s sleep, fail
with a clear message if the track is still there.
caught by manual staging smoke: when restoring a revision that had
audio_storage="both" with a PDS blob ref, the restored PDS record was
being published WITHOUT an audioBlob field, silently dropping the user's
PDS-hosted copy of the audio.
root cause: the restore code explicitly passed `audio_blob=None` to
build_track_record with a misleading comment claiming "PDS blob not
re-uploaded on restore". the comment was right about the blob bytes
(they're already on PDS), but the BLOB REF must still be included in
the new record — PDS records can reference pre-uploaded blobs.
fix: if the revision carries a pds_blob_cid, construct a BlobRef
(using the stored size + the file_type's mime type) and pass it
through to build_track_record. PDS records now keep their audioBlob
field through the full replace → restore round trip.
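the ref itself is just lexicon JSON — sketch (revision field names are
from this commit; the mime lookup is an assumed helper):

```python
if revision.pds_blob_cid:
    audio_blob = {
        "$type": "blob",
        "ref": {"$link": revision.pds_blob_cid},
        "mimeType": mime_for(revision.file_type),  # assumed helper
        "size": revision.pds_blob_size,
    }
else:
    audio_blob = None
record = build_track_record(..., audio_blob=audio_blob)
```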
also adds a regression test that:
- sets up the same scenario the smoke hit (both → replace → restore)
- asserts the published record contains audioBlob pointing at the
original ref
- asserts the live track row keeps audio_storage="both" and the
correct pds_blob_cid / pds_blob_size after restore
note: if the user's PDS has already GC'd the old blob, the record is
still valid — playback falls back to audio_url (R2). we don't re-upload
the blob as part of restore; that would require hauling bytes through
the backend and is out of scope for v1.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* feat: audio revisions with confirm-before-replace and restore
closes the UX loop on the audio-replace feature shipped in #1311-1313.
two changes shipped together:
1. **confirmation gate** before audio replace fires. picking a file no
longer kicks off the irreversible upload — clicking "replace audio"
now opens a confirm dialog. addresses Alex's report that hitting
"cancel" after picking a file did not roll back the replace (because
nothing actually fired until "replace audio" was clicked, but the
coupling between picker and that button was confusing).
2. **track_revisions table** + restore endpoint + version-history sheet.
every audio replace snapshots the displaced audio into a TrackRevision
row in the same DB transaction as the swap. column names are
provider-neutral (audio_url, not r2_url) so swapping blob providers
later doesn't leave cruft behind. retention cap is 10 per track —
pruning deletes the backing blob if no other row still references
it. PDS-only audio is never deleted (user owns those blobs).
restore is an instant pointer-swap: the chosen revision becomes the
live audio, the displaced current is snapshotted into a new revision
row, and the chosen revision row is deleted (its content is now
current). PDS record is republished as part of the same flow — non-
negotiable so the user's PDS stays in sync with plyr.fm state.
restore is rejected with 409 if it would cross the public ↔ gated
boundary — moving blobs between buckets isn't built yet, and serving
gated audio from the public bucket would defeat the gate.
the version-history surface is a bottom-sheet on mobile / centered
modal on desktop, modeled on LikersSheet. trigger lives in the audio
file section of the track edit form. each row shows format,
relative time, duration, storage location, and a restore button.
new endpoints:
- GET /tracks/{id}/revisions
- POST /tracks/{id}/revisions/{revision_id}/restore
new components:
- ConfirmDialog.svelte — generic alertdialog (used for replace + restore)
- AudioRevisionsSheet.svelte — mobile-first version-history surface
related: #1314 (orphan R2 files) — revisions give R2 files an owner,
which removes the orphan path. #1315 (in-flight tasks writing stale
results) is orthogonal and not addressed here.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* test: integration coverage for audio revisions + restore
three end-to-end tests against staging (skip when PLYR_TEST_TOKEN_* unset):
- replace_audio_creates_revision — upload, replace, verify history holds
exactly one row capturing the displaced original
- restore_swaps_audio_and_rotates_revision — upload, replace, restore;
live audio is back to the original, chosen revision row is gone, the
displaced post-replace audio is now in history
- non_owner_cannot_list_or_restore — user2 gets 403 on both list and
restore against user1's track
each test cleans up via the SDK's delete(). new endpoints aren't in the
SDK yet, so raw httpx is used for replace + revisions/restore.
these will run automatically after the PR merges and staging deploys
(the integration-tests workflow fires on deploy staging completion).
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
"current: m4a" alone is too terse — the user has no sense of where the
audio lives or whether it was transcoded. swap to:
current: m4a · stored on your PDS
current: mp3 (transcoded from flac) · stored on plyr.fm
we don't have the original filename to show (it's content-hashed away
at upload), so format + storage location is the most useful signal.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
UI for the new PUT /tracks/{id}/audio endpoint shipped in #1311. Lets
artists swap a track's audio without deleting + re-uploading (and losing
likes / comments / plays / the URL).
how it's wired:
- uploader.replaceAudio(trackId, file, title, onComplete) mirrors the
existing uploader.upload XHR + SSE flow but PUTs to a track-specific
endpoint with just the file. progress is surfaced via the same toast
pattern as the initial upload.
- portal/+page.svelte edit form gains an "audio file" section next to
the existing "artwork" section. picker → "replace audio" button →
picker clears immediately and the SSE flow continues in the toast.
- Player.svelte's track-load $effect now also fires on file_id change,
not just track id change. so when the currently-playing track gets a
new audio file, the <audio> element src reloads in place. on
successful replace, we fetch the fresh track row and reassign
player.currentTrack so the effect picks up the new file_id.
deliberately separate from the metadata "save changes" flow because
the upload + transcode + PDS write can take 30s+ and has its own SSE
progress; conflating it with the fast PATCH would block the form.
manual smoke depends on backend being deployed to staging.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* feat(backend): replace audio on existing track via PUT /tracks/{id}/audio
artists currently delete + re-upload to fix bad audio (logfire shows
darkhart.bsky.social did this 3× in ~65min). that loses likes, comments,
plays, and the track URI. add an endpoint that swaps the audio bytes while
keeping the track's stable identity intact.
orchestration is atomic with rollback:
1. validate + store new audio (R2; transcode if lossless)
2. upload to PDS (best-effort, falls back to r2-only on size limit)
3. PUT updated ATProto record (URI stable, new CID)
4. DB row swap in single tx — file_id, r2_url, atproto_record_cid, duration,
pds_blob_*, audio_storage; clears stale genre_predictions provenance
5. delete old R2 object only on success
6. fire post-replace hooks: invalidate old CopyrightScan rows, re-fire
copyright/embedding/genre tasks; never re-notify followers
7. resync album list record so its strongRef carries the new track CID
if step 3 fails, rollback deletes the just-written R2 file and leaves the
track row untouched.
reuses upload phase helpers (_validate_audio, _store_audio, _upload_to_pds)
so the transcode/PDS-blob/gating logic stays in one place.
intentional non-changes:
- labeler labels on the (URI-stable) track are NOT auto-dismissed — that's
a moderation call left for manual review
- likes/playlists/comments retain stale strongRef CIDs; this is the same
CID-churn behavior that PATCH /tracks/{id} produces today
also fixes two pre-existing test failures uncovered while building this:
- conftest pg_trgm extension only created in xdist template path
- moderation report tests leaked rate-limit budget across the session
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(audio-replace): tighten rollback scope + handle gated bucket
addresses two review findings on #1311, plus two smaller follow-ups:
1. **post-commit failures triggered rollback (P1)**. the previous orchestrator
wrapped both pre- and post-commit work in one try block. if any side
effect after `_commit_db_swap` raised (post-replace hooks, album resync,
cache invalidation), the except path would delete the new R2 file even
though the track row + ATProto record were already pointing at it —
leaving production with a 404 for the freshly-replaced audio.
split into two phases: pre-commit may rollback; post-commit failures are
logged and swallowed (the swap stands). each post-commit side effect
gets its own try/log so one failure doesn't skip the others.
2. **gated tracks leaked private-bucket objects (P2)**. `R2Storage.delete()`
only probes the public audio + image buckets, so cleanup and rollback
silently no-op'd on supporter-gated tracks (which live in
`private_audio_bucket_name`).
added `delete_gated()` to `R2Storage` + `StorageProtocol` (mirrors
`delete()`'s refcount guard and key probing, against the private bucket).
`_cleanup_old_files` and `_rollback_new_files` now route based on the
track's `support_gate`. also fixes the same pre-existing leak for
gated tracks deleted via the API today (separate latent bug, but the
primitive is now there).
3. **defensive metadata refresh before publish**. a concurrent PATCH that
landed between `_load_and_authorize` and `_publish_record_update` would
have its title / album / features clobbered by the stale snapshot. now
re-loads the row right before building the new ATProto record.
4. **hoist deferred imports** in audio_replace.py + storage/r2.py per the
project's "no unnecessary deferred imports" rule (CLAUDE.md). the
`backend.api.albums` import doesn't have a real circular dep — i'd
copied the pattern from mutations.py without checking.
new tests:
- post-replace hook failure does NOT roll back the new file
- album list sync failure does NOT roll back the new file
- gated track success path uses `delete_gated` for old file
- gated track rollback uses `delete_gated` for new file
- concurrent PATCH title is reflected in the published ATProto record
full xdist suite: 815 passed (was 810 + 5 new tests).
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the Cmd+K modal was firing keyword and semantic (mood) searches in
parallel whenever the vibe-search flag was on, then merging both lists
by score (#858). BM25 relevance and cosine similarity are on different
scales — the sort produced jarring interleaves where mediocre semantic
matches outranked solid keyword hits.
revert to the #848 interaction model: explicit mode toggle, one mode at
a time. keyword is the default. flagged users can flip to mood when
they want it; the toggle is hidden for everyone else so there's no
change for non-flagged users.
- search.svelte.ts: add mode state, setMode(), stale-mode guards on
in-flight fetches. activeResults returns just the active mode's list.
drop dedupedSemanticResults / semanticResultIds / semanticSimilarityMap.
- SearchModal.svelte: render a small keyword/mood chip toggle below the
input, gated on search.semanticEnabled. placeholder copy follows the
active mode. mood similarity % only renders in semantic mode.
arc for the record: toggle (#848) → parallel + separator (#851) →
score-merged interleave (#858) → toggle again. #851/#858 were the wrong
direction given how uneven semantic ranking still is.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Restores the prior hover-tooltip + mobile-sheet model for liker
lists. The inline avatar strip was fighting too many surfaces
(track rows, cards, detail page, click propagation to play button,
SvelteKit nav hijacking) and broke enough of them that the churn
outweighed the UX win.
Reverts PRs #1302, #1303, #1304, #1305, #1306, #1307, #1308 in
one commit. Search modal stability (#1301) is preserved.
Removed by this revert:
- AvatarStack.svelte, LikersStrip.svelte (both introduced by the
  reverted PRs)
- LikerPreview schema, get_top_likers aggregation, top_likers on
TrackResponse, all the callsite wiring
Restored by this revert:
- LikersTooltip.svelte (desktop hover tooltip)
- LikersSheet.svelte + likers-sheet.svelte.ts (mobile bottom sheet)
- LikersSheet mount in +layout.svelte
- Original .likes span markup + CSS in TrackItem, TrackCard,
track/[id]/+page.svelte
- Original supporter-circle markup + CSS on u/[handle]/+page.svelte
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Clicking "+1" to reveal a single extra avatar expanded a strip that
scrolled nowhere — pointless. Skip the "+N" tile when the overflow
would be smaller than minOverflow (default 3) AND we have enough
users loaded to show everyone inline.
- backend `get_top_likers` default limit: 3 -> 5. tracks with 4 or 5
total likes now ship everyone in the preview, so the frontend can
render them all inline without a dead-end tile. cost per EXPLAIN
ANALYZE is still sub-millisecond on production.
- frontend `AvatarStack` gains `minOverflow` prop (default 3). only
skips "+N" when the overflow is small AND users.length >= total
(i.e. we have everyone loaded), so partial-data surfaces fall back
safely to the regular "+N" affordance.
Behavior:
- total <= 5: shows everyone inline, no "+N"
- total >= 6: shows 3 + "+N>=3" (expansion reveals meaningfully more)
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Previous fix slapped onclick={(e) => e.stopPropagation()} on the
LikersStrip root to prevent +N/× clicks from bubbling to the outer
play button. That also ate anchor clicks on individual avatars
before they could reach document, where SvelteKit's client-side
nav hijacker lives. With that listener never firing, the browser
fell back to a full page reload — which tears down the audio
element mid-playback.
Scope the stopPropagation to just the non-anchor interactive bits:
- +N handler in LikersStrip now stopPropagation's its own event
- × collapse button already stopPropagation'd
- root span no longer stops anything
Avatar links now reach document → SvelteKit intercepts → client-side
nav → player persists → audio keeps playing. The outer play button's
existing anchor guard (closest('a')) still prevents playback on
anchor clicks, so no regression on that front either.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Without a label the avatars looked like they belonged to the song
itself (artist, featured collaborators) rather than the people who
liked it. Adds a "liked by" label inside LikersStrip so the meaning
is unambiguous everywhere the strip appears — TrackItem, TrackCard,
and the track detail page.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
The strip lives inside the TrackItem play button. Clicking +N or ×
was bubbling up to the outer button's onclick, starting playback
before the expand/collapse could land. Stop click+keydown
propagation at the LikersStrip root so all interaction inside the
strip (avatar navigation, +N expand, × collapse) stays contained.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Previous cut lost timestamp info — hovering an avatar showed only
display name. Adding it back without reintroducing a separate panel:
- backend LikerPreview now includes `liked_at` (ISO string) pulled
from `track_likes.created_at` via the window-function query
- AvatarStack gets an optional `avatarTitle(user)` prop so parents
can customize the hover/focus tooltip
- LikersStrip passes a formatter that renders
"display name · liked 2h ago"
UserPreview.liked_at is optional — supporter avatars on the artist
page don't carry a timestamp and keep their existing display-name
tooltip.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
The prior PR moved liker avatars inline but left the hover tooltip
and mobile bottom sheet in place. On hover, a tooltip opened above
the track row and showed... the same avatars, just a few more of
them. That was redundant and, as you pointed out, the separate
panel opening on hover was the most egregious part.
New model: the inline strip *is* the interaction.
- hover → per-avatar lift (already worked via AvatarStack)
- click an avatar → navigate to /u/{handle}
- click "+N" → the stack itself expands in place to a
horizontally-scrollable strip of every liker. same widget, just
longer. lazy-fetched via the existing tooltip-cache so the
data is there the first time you expand and instant on
subsequent expansions
- click × (or click outside, or press Escape) → collapses back
No popover, no bottom sheet, no tooltip. One affordance, consistent
across mobile and desktop.
Implementation
- AvatarStack.svelte — new scrollable + maxScrollWidth props.
When scrollable, the container gets overflow-x: auto, scroll-snap,
and a thin scrollbar. Overlap is preserved so it stays visually
the same widget, not a different one.
- LikersStrip.svelte — new wrapper that owns the expansion state
and the lazy fetch. Parents pass trackId + likeCount + topLikers
and don't think about anything else.
- TrackItem, TrackCard, track/[id]/+page.svelte — all the
tooltip/sheet state, hover timers, click-to-open-sheet handlers,
cursor: help, tooltip-open z-index gymnastics — all gone.
Replaced with <LikersStrip>.
- Deleted: LikersTooltip.svelte, LikersSheet.svelte,
likers-sheet.svelte.ts. Removed mount from +layout.svelte.
tooltip-cache.svelte.ts stays — LikersStrip uses it for the
expansion fetch.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Replaces the plain "N likes" text next to tracks with an overlapping
strip of the 3 most recent liker avatars (+N if more) — matching the
existing supporter-row pattern on artist pages. Both sites now render
the same `AvatarStack` presentational component; only the data flow
differs (liker avatars are maintained in our artists DB via jetstream;
supporter avatars come from atprotofans via the /artists/batch
enrichment already in place).
Backend
- new `get_top_likers(db, track_ids, limit=3)` aggregation utility
  using `ROW_NUMBER() OVER (PARTITION BY track_id ORDER BY created_at
  DESC)`, filtered to `rn <= limit`. Postgres 15+ pushes the limit
  into the window aggregate (Run Condition), so work short-circuits
  per partition. EXPLAIN ANALYZE on production (308 likes, 20-track
  page): ~1ms execution, all in shared buffer cache. see the sketch
  after this list.
- `TrackResponse.top_likers: list[LikerPreview]` added; threaded
through every list endpoint that already batches aggregations
(for_you, tracks listing, tracks /top, tracks /me, tracks /me/broken,
albums listing, users/{handle}/likes, tracks/tags, tracks/shares,
lists/hydration, liked tracks list) plus single-track endpoints
(playback /by-uri, mutations update, mutations restore-record).
- queue and jams serializers continue to skip aggregations per their
existing comments — they pass no `top_likers`, and the field
defaults to `[]`, which the frontend renders as the plain count
(pre-existing behavior).
- `LikerPreview` lives in `utilities/aggregations.py` rather than
`schemas.py` to avoid a circular import (schemas.py imports from
aggregations.py for `CopyrightInfo`).
- tests in `test_aggregations.py`: default limit, custom limit,
ordering by most-recent-first, empty track list, and the
JOIN-on-Artist filter behavior (likers without an artist row are
omitted, matching the existing `GET /tracks/{id}/likes` semantics).
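Sketch of the window query referenced in the first bullet above (model
names assumed; the SQL shape is as described):

```python
from sqlalchemy import func, select

def top_likers_stmt(track_ids: list[int], limit: int = 3):
    rn = func.row_number().over(
        partition_by=TrackLike.track_id,
        order_by=TrackLike.created_at.desc(),
    ).label("rn")
    ranked = (
        select(TrackLike.track_id, Artist.handle, Artist.avatar_url,
               TrackLike.created_at, rn)
        .join(Artist, Artist.did == TrackLike.user_did)  # likers w/o artist row drop out
        .where(TrackLike.track_id.in_(track_ids))
        .subquery()
    )
    # Postgres 15+ turns rn <= limit into a window Run Condition: each
    # partition stops producing rows once the limit is reached
    return select(ranked).where(ranked.c.rn <= limit)
```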
Frontend
- `AvatarStack.svelte` — new purely-presentational component. Props:
`users`, `total`, `maxVisible`, `size`, `borderColor`, `moreHref`,
`onMoreClick`, `avatarHref`, `onAvatarClick`, `ariaLabel`, `class`.
Handles 0-N users, renders +overflow tile as link OR button
depending on the surface, supports fallback initials when
`avatar_url` is null.
- `UserPreview` type added to `types.ts`; matches backend
`LikerPreview` and the atprotofans-derived `Supporter` shape.
- `Track.top_likers?: UserPreview[]` added.
- wired into `TrackItem`, `TrackCard`, and `track/[id]/+page.svelte` —
the existing wrapper keeps the hover-tooltip (desktop) and
bottom-sheet (mobile) behavior on the whole strip; clicking an
individual avatar is intentionally a no-op so the detail sheet is
the canonical "see all likers" path.
- wired into `u/[handle]/+page.svelte` supporter row, replacing the
hand-rolled `.supporter-circle` markup and CSS (~65 lines deleted).
Avatars here DO link to `/u/{handle}` per existing UX; +N links
out to the atprotofans supporter page in a new tab.
- sizing is mobile-first: 20px avatars on mobile tracks, 22px on
desktop tracks, 18px in track cards, 28px/32px on the supporter
row.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
The Cmd+K search modal jolted visibly when the user started typing:
hints (~100px) disappeared immediately, nothing rendered during the
150ms debounce window (loading was still false), "no results for X"
briefly flashed, then collapsed again while the fetch was in flight,
then popped open to result height.
two fixes:
1. set `loading=true` synchronously inside `setQuery()` when query>=2
(and semanticLoading when query>=3) so the "no results" branch never
matches during the debounce window before the fetch fires.
2. wrap the body states in `.search-body` with a 104px min-height
(matching the hints' rendered height) and `interpolate-size:
allow-keywords` + `transition: height`. the body no longer collapses
between states, and the growth to result height animates smoothly on
browsers that support interpolate-size (chrome/safari/edge 2024+).
older browsers fall back to instant resize — no regression.
an explicit `.search-progress` placeholder covers the in-between state
when the user has typed 1 char or is waiting on the first response.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
adds a font picker alongside the existing accent color controls. six
options: mono (default), geist, inter, system, georgia, comic sans.
stored in ui_settings JSONB (no migration needed), cached in
localStorage for flash prevention, applied via --font-family CSS var.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This reverts commit 696fffa578155fe931c25fba80027ef885decaca.
increases gradient interpolation from 7 to 16 color stops for finer
transitions. removes brightness(1.08) oscillation from the breathing
animation — it amplified visible banding at color step boundaries.
no SVG noise filters this time.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This reverts commit 19d8c7c40998533eb43dba5e2aaa202fbfff6eb3.
the 135-degree gradient with only 7 color stops and a brightness-oscillating
animation created visible diagonal strips (color banding), especially in
dark/low-contrast weather palettes.
three fixes:
- increase gradient interpolation from 7 to 16 stops
- add SVG fractalNoise dither filter (soft-light blend) on the gradient layer
- remove brightness(1.08) from breathing animation (amplified banding)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the segmented pill control for latest/for-you looked busy and
inconsistent with the rest of the UI. replace it with an inline
cycling button that matches the top tracks period toggle pattern —
tap to cycle between "latest" and "for you".
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: status maintenance — SDK namespace, CDN caching, feed switcher, telemetry incident
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* chore: add TTS audio for status update (generated locally)
gemini-2.5-pro-tts free tier quota was exhausted in CI.
generated locally with OTHER_GOOGLE_API_KEY.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: zzstoatzz <thrast36@gmail.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
plyrfm CLI moved from flat commands to noun-first subcommands:
- `plyrfm delete` → `plyrfm tracks delete`
- `plyrfm upload` → `plyrfm tracks upload`
- `plyrfm my-tracks` → `plyrfm tracks my`
also update SDK examples in llms-full.txt for namespace API
(client.search → client.discover.search, etc.)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
jetstream ignored `kind=account` events entirely — deactivation left
stale cdn.bsky.app avatar URLs (dead 404s), and reactivation never
refreshed them. identity events also skipped avatar updates.
- handle `account` events in jetstream consumer (dispatch new
`ingest_account_status_change` task)
- on deactivation: clear avatar_url so frontend doesn't show broken img
- on reactivation: fetch fresh avatar from Bluesky profile
- add avatar refresh to `ingest_identity_update` (covers PDS migrations
and handle changes too)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the frontend toggle from #1288 only prevents new page loads from
calling initObservability(). stale cached clients (Cloudflare Pages)
continue hammering POST /logfire-proxy — 3,458 requests in 24 minutes
averaging 1.9s each, saturating the threadpool and causing /tracks/top
to take 10-18s.
guard the backend endpoint directly: return 204 immediately when the
flag is off, so no stale client can reach logfire_proxy().
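the guard, sketched — route path from this commit; the settings flag
and forwarding helper are assumed names:

```python
from fastapi import Request, Response

@router.post("/logfire-proxy/{path:path}")
async def logfire_proxy(path: str, request: Request) -> Response:
    if not settings.browser_observability:
        # stale cached clients get a cheap no-op instead of a threadpool hop
        return Response(status_code=204)
    return await _forward_to_logfire(path, request)  # existing proxy body, assumed
```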
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the logfire browser SDK proxies all browser trace data through the
backend (POST /logfire-proxy/v1/traces) because Logfire requires
server-side auth. the proxy uses run_in_threadpool for a synchronous
HTTP call — under load, this saturates the threadpool and starves
async handlers including DB queries.
adds BROWSER_OBSERVABILITY env var (default: true) exposed via
GET /config. frontend gates initObservability() on this flag.
set BROWSER_OBSERVABILITY=false to disable browser telemetry proxy
and eliminate the proxy load on the backend.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the pool warmup (added in #1025) only opened a single connection.
with pool_size=10, the other 9 connections still hit TCP+SSL setup
on the first burst of requests after deploy. logfire traces show 17
simultaneous connect events taking 1.5-5.5s each during deploys,
causing simple PK lookups to take 12s+ while connections queue up.
fix: warm all pool_size connections concurrently at startup using
asyncio.gather. connections execute SELECT 1 then return to the pool
ready for immediate use. partial failures are logged but don't block
startup.
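sketch, assuming an async SQLAlchemy engine — hold every connection
open before releasing any, so each one actually dials TCP+SSL:

```python
import asyncio
from sqlalchemy import text

async def warm_pool(engine, pool_size: int) -> None:
    async def checkout():
        conn = await engine.connect()
        await conn.execute(text("SELECT 1"))
        return conn

    results = await asyncio.gather(
        *(checkout() for _ in range(pool_size)), return_exceptions=True
    )
    for r in results:
        if isinstance(r, BaseException):
            logger.warning("pool warmup connection failed: %s", r)  # never block startup
        else:
            await r.close()  # returns to the pool, ready for immediate reuse
```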
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
auth.isAuthenticated starts false and flips true after /auth/me
resolves. the probe $effect's else branch fired immediately on
mount (before auth resolved), saw feedMode === 'for-you', and
reset it to 'latest' — clobbering the localStorage-persisted
preference on every page load.
fix: gate the else branch on !auth.loading so it only fires after
auth has actually resolved and the user is genuinely not authenticated.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
three bugs from staging:
1. the probe $effect called forYouCache.fetch(), which synchronously
reads this.loading ($state) — creating a reactive dependency. any
cache state change (e.g. from setTags) re-triggered the probe, and
if tag-filtered results were empty, forYouAvailable flipped to false,
hiding the switcher. fix: use a raw fetch(limit=1) with no reactive
cache reads.
2. tag state wasn't shared between feeds. ForYouCache initialized
activeTags as empty, not from localStorage. switching feeds didn't
sync tags. tags set while in for-you mode weren't persisted. fix:
both caches initialize from localStorage.active_tags; toggleFeed
syncs tags from outgoing to incoming cache; onTagsChange persists
regardless of mode.
3. empty state said "no tracks yet" when tags filtered to zero. fix:
show "no tracks match these tags" when active tags are set.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
two issues from staging review:
1. segmented control used opaque --bg-secondary background and --radius-xl,
visually inconsistent with track items and cards which use translucent
--track-bg/--track-border and --radius-md. switched to match.
2. tag filters were hidden when viewing for-you feed. added optional
`tags` query param to GET /for-you/ — filters candidates to tracks
with at least one matching tag (same inclusive semantics as /tracks/).
ForYouCache now supports setTags(), and the homepage shows tag filters
regardless of feed mode.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the inline "for you" text next to the heading read as a sentence
("latest tracks for you") rather than a toggle. replaced with a
proper segmented control — two pill buttons with clear active/inactive
states using the same color-mix accent pattern as settings theme buttons.
heading simplified to "tracks" with the switcher sitting alongside.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
adds a feed mode toggle to the homepage's main infinite-scroll section.
authenticated users with engagement history see a clickable toggle
(same style as the top tracks period toggle) to switch between "latest
tracks" and "for you". unauthenticated users or those without enough
engagement data see no toggle — identical to today.
- new ForYouCache state module ($lib/for-you.svelte.ts) mirroring
TracksCache's interface but hitting /for-you/
- feed mode persisted to localStorage
- tag filters hidden when viewing for-you (backend handles hidden tags)
- infinite scroll dispatches to the active cache's fetchMore()
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Remove 8 scripts for migrations that completed months ago:
- copy_r2_buckets.py (relay → audio-prod, Nov 2025)
- migrate_r2_bucket.py (same with DB updates)
- migrate_images_to_new_buckets.py (audio → images buckets, Nov 2025)
- migrate_sensitive_images.py (Jan 2026)
- backfill_image_urls.py (Nov 2025)
- backfill_atproto_records.py (Nov 2025)
- backfill_avatars.py (Dec 2025)
- backfill_duration.py (Dec 2025)
Add migrate_cdn_urls.py for the r2.dev → custom domain URL migration.
Dry-run by default, auto-detects environment from DATABASE_URL,
updates tracks/albums/playlists URL columns.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
HEAD requests to R2 custom domains always return cf-cache-status:
DYNAMIC. Use GET to verify real cache status.
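For example (object key is a placeholder):

```python
import httpx

def cf_cache_status(url: str) -> str:
    # HEAD against an R2 custom domain always reports DYNAMIC;
    # a real GET exposes the actual edge cache state
    return httpx.get(url).headers.get("cf-cache-status", "<missing>")

# cf_cache_status("https://audio.plyr.fm/<object-key>")
# -> "MISS" on the first fetch, "HIT" once the edge has cached it
```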
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add CacheControl headers to R2 uploads, consolidate S3 client config
Set Cache-Control: public, max-age=31536000, immutable on all R2 uploads
(audio, images, thumbnails). Objects are content-hashed so they never
change — this tells Cloudflare's CDN and browsers to cache aggressively.
Also consolidate the S3 client connection config into _s3_client() helper
method. The same 5-line endpoint/credentials block was repeated 9 times.
Now it's one method, making an S3/R2 swap a one-line change.
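Roughly this shape (a sketch — the real helper is a method on the
storage class and reads its config from settings):

```python
import boto3

def make_r2_client(endpoint_url: str, key_id: str, secret: str):
    # the consolidated endpoint/credentials block, written once
    return boto3.client(
        "s3",
        endpoint_url=endpoint_url,
        aws_access_key_id=key_id,
        aws_secret_access_key=secret,
    )

def upload_immutable(client, bucket: str, key: str, data: bytes, content_type: str) -> None:
    client.put_object(
        Bucket=bucket,
        Key=key,
        Body=data,
        ContentType=content_type,
        # content-hashed objects never change: let the edge cache and
        # browsers keep them for a year without revalidation
        CacheControl="public, max-age=31536000, immutable",
    )
```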
Prep for switching from r2.dev URLs (no CDN caching) to custom domains
(audio.plyr.fm, images.plyr.fm) which are already provisioned.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* docs: update R2 references for custom domain CDN migration
Replace r2.dev URLs with custom domain URLs (audio.plyr.fm,
images.plyr.fm) in public docs, internal docs, and config examples.
Drop "R2" from "R2 CDN" references — the CDN is Cloudflare's edge
cache, not R2 itself.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add fetch_list_item_uris() to _internal/atproto/records/fm_plyr/list.py
— fetches an ATProto list record and returns ordered item URIs. Replaces
5 copy-pasted fetch-then-extract blocks across playlists, albums, and
recommendations.
Add hydrate_tracks_from_uris() to api/lists/hydration.py — loads tracks
by AT-URI, batch-aggregates like/comment counts, resolves liked state,
returns ordered TrackResponses. Collapses the identical ~35-line hydration
block duplicated between get_playlist and get_playlist_by_uri.
playlists.py: 952 → 843 lines. Six unused imports removed as a side
effect (the hydration helper absorbed them).
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* refactor: decompose lists.py and albums.py into subpackages, fix PDS URL healing
Split two monolithic API files into subpackages following the existing
api/tracks/ pattern:
- lists.py (1149 lines) → lists/{router,schemas,reorder,resolver,playlists}.py
- albums.py (995 lines) → albums/{router,schemas,cache,listing,mutations}.py
Also moves PDS URL healing from lazy per-request side effects (copy-pasted
in 5 API endpoints) to the jetstream identity event handler, where it
belongs. Identity events fire on both handle changes and PDS migrations,
so resolving the DID there keeps the cached pds_url warm proactively
instead of discovering staleness at request time.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: correct mock targets for decomposed module paths
- AsyncDidResolver: patch at source (atproto_identity.did.resolver)
since ingest.py uses a deferred import
- get_async_redis_client: update to backend.api.albums.cache
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: hoist deferred imports to top-level in decomposed modules
Move ~15 deferred imports to module-level where they don't risk circular
dependencies. The only remaining deferred import in the new packages is
backend.api.tracks.mutations.delete_track (cross-package API call).
Also keeps the AsyncDidResolver import in ingest.py deferred — it's a
heavy external dependency in a background task module.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: update mock targets for hoisted imports in album tests
With imports at top-level, mocks must target the importing module's
namespace, not the source module.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Osprey rules engine was never committed (services/osprey/ contained only
__pycache__ artifacts). Remove the stale STATUS.md reference and a
duplicate loq override for u/[handle]/+page.svelte that used a different
glob escape syntax than the rest of the file.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix(player): synchronous fast path for auto-advance to survive locked-screen autoplay
Reported in zzstoatzz.io/plyr.fm#1: on Android with the screen locked,
album / playlist playback stops at the end of each track instead of
advancing to the next. The reporter notes it worked in early February.
## Root cause
The chain from `<audio onended>` to `audio.play()` on the next track
goes through ~5 microtask boundaries plus an `await getAudioSource(...)`:
ended → handleTrackEnded → queue.next()
→ $effect: queue → player.currentTrack
→ $effect: load new src (await getCachedAudioUrl, fetch HEAD if gated)
→ audio.src = src; audio.load(); wait for loadeddata
→ $effect: shouldAutoPlay && !isLoadingTrack → player.paused = false
→ $effect: paused-sync → audio.play()
On a foregrounded tab this is milliseconds and works fine. On Android
with the screen locked, Chrome aggressively throttles non-foreground JS
and treats the page as "no longer audible" the moment the previous
track ends. By the time `audio.play()` finally runs, the implicit-
playback grace is gone and the call rejects with NotAllowedError. The
only way to resume is via a Media Session action handler (an explicit
lock-screen button press), which is exactly the workaround the
reporter was using.
This is not a regression from any one commit — the chain has had this
shape since before February. Most likely Chrome on Android tightened
locked-screen autoplay/freeze behavior between then and now, exposing
a long-standing fragility.
## Fix
Three coordinated changes:
1. **`queue.autoAdvanceTrack` getter** — single seam for "what should
natural end-of-track continuation play next". Today returns
`tracks[currentIndex + 1]`. Future continuation strategies (album
tail, feed continuation, recommendations) plug in here.
2. **Next-track prefetcher** — `resolveAudioSource` (extracted to
`lib/audio-source.ts`) returns a structured `ResolvedSource`
discriminator (ready / gated-denied / failed). A `$effect`
opportunistically resolves `queue.autoAdvanceTrack` while the
current track plays and stores the result in `preloadedNext`.
IndexedDB cache lookup and gated HEAD check move out of the
critical path.
3. **Synchronous fast path in `handleTrackEnded`** — when the
prefetcher has a ready source for the next track and we're not in
jam mode, swap `audio.src` and call `audio.play()` in the same tick
as the `ended` event. Reactivity (queue.next, player.currentTrack)
updates AFTER, so the autoplay grace is preserved. Pre-bumping
previousTrackId/previousFileId/previousQueueIndex before
`player.currentTrack = next; queue.next()` keeps downstream
effects no-ops; without it the queue→player sync effect's
`indexChanged` branch would seek the just-started audio back to 0.
When the preload isn't ready (race, jam active, gated denial), we
fall back to the existing reactive chain — same behavior as today.
Plus structured telemetry (`recordPlaybackRejection`) logging
errorName, visibilityState, audio.readyState, fast-path flag, and
preload state so we can confirm in production whether the fast path
actually dodges the autoplay block per browser bucket.
## What this PR does NOT do
- Does not change collection needle-drop semantics. Album/playlist
row clicks still call `queue.playNow(track)` and discard collection
context — separate problem. The new `autoAdvanceTrack` getter is
the seam where a future "soft context" continuation strategy plugs in.
- Does not refactor `TrackItem.svelte`'s `$effect.pre` reset block
or other pre-existing patterns. Scoped to the auto-advance chain.
## Validation
- `just frontend check`: 0 errors / 0 warnings.
- Reviewed via `svelte:svelte-file-editor` agent — confirmed prefetch
effect's reactivity (correct), fast-path state-write ordering
(correct, with comment-strengthening applied), and blob-URL
accounting (correct across both paths).
- `lib/audio-source.ts` extracted so Player.svelte's growth reflects
the actual fast-path/prefetch substance, not pure helpers that could
live elsewhere.
## Test plan
- [x] svelte-check clean.
- [ ] After deploy: reproduce on Android (screen locked) with an album
that has 3+ tracks; confirm auto-advance works end-to-end.
- [ ] Confirm desktop foreground playback unchanged.
- [ ] Confirm gated-track skipping still works (denial via prefetch
consumes the cached entry; active gated denial still triggers
the toast).
- [ ] After 24h on prod: query logfire for `audio play() rejected`
events; analyze fast-path vs slow-path rejection rates per
`error.name` and `document.visibility_state` bucket.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(player): preserve auto-advance through gated tracks; fix telemetry pollution
review feedback on #1339:
1. **gated auto-advance no longer kept playing.** my parameterized
`handleGatedDenial(err, fromAutoAdvance)` was ALWAYS called with
`false`, including from the loader effect when consuming a cached
`gated-denied` preload. so after `handleTrackEnded` set
`shouldAutoPlay = true` and `queue.next()` advanced into the
gated-denied track, `handleGatedDenial` clobbered shouldAutoPlay
back to false before `queue.goTo(nextPlayable)` — playback
stopped instead of skipping the gated track and continuing.
pre-fast-path code unconditionally set `shouldAutoPlay = true` in
this branch.
fix: drop the `fromAutoAdvance` parameter; always intend to
auto-play after a gated skip. matches pre-PR behavior. whether
the user clicked a gated track or auto-advance landed on one,
the user wants the next playable track to start.
2. **fallback telemetry was polluting the rejection metric.**
`recordAutoAdvanceFallback` emitted via `recordPlaybackRejection`,
whose event name is `audio play() rejected`, even though no
`play()` had been attempted on the slow path at that point. any
dashboard query filtering on that event name would have counted
slow-path-fallback markers as play rejections.
fix: drop `recordAutoAdvanceFallback` entirely. instead, instrument
the existing slow-path `play().catch(...)` site (which previously
only `console.error`'d) with `recordPlaybackRejection({fastPath:
false, ...})`. now BOTH paths emit the same event, and the
`playback.fast_path` field is the genuine discriminator for
comparing rejection rates between fast and slow paths. that's the
actual question the telemetry was trying to answer.
svelte-check: 0 errors / 0 warnings.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* chore(player): drop dead frontend telemetry plumbing
review feedback: I was writing comments and commit copy that referenced
"dashboards" for fast-vs-slow path comparison. There are no dashboards.
Frontend logfire is config-flagged off (`config.browser_observability`)
because it was destabilizing the backend; nobody is querying frontend
spans. So `recordPlaybackRejection` was emitting `logfire.info` against
an unconfigured client — net effect: dead code with imaginary purpose.
Removed:
- `recordPlaybackRejection` + `PlaybackRejectionContext` from
`lib/observability.ts`. `initObservability` itself stays — fetch /
XHR auto-instrumentation is the part that DOES propagate trace
headers to the backend, and that's still useful when the flag is on.
- Both call sites in Player.svelte (slow-path and fast-path
`play().catch(...)`) now `console.error` the same way the rest of
the file already did. If a user reports lock-screen playback
trouble, the actual debug pathway is "ask them to repro in
devtools and capture the console."
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
`test_cross_user_like` flakes intermittently in the staging integration
suite because of a real race in the like → unlike sequence:
1. user clicks LIKE → DB INSERT row R (atproto_like_uri=NULL),
`pds_create_like(R.id)` enqueued via docket.
2. user clicks UNLIKE before pds_create_like runs. atproto_like_uri
is still NULL so we just DELETE R; no PDS-delete is scheduled
because there's no URI yet.
3. `pds_create_like(R.id)` finally runs:
a. PDS create returns URI X.
b. SELECT R.id → row gone → orphan-cleanup branch fires.
c. `delete_record_by_uri(X)` is scheduled.
4. Jetstream emits the `app.bsky.feed.like` create event for X
BEFORE the matching delete event from (3c) propagates.
5. `ingest_like_create` finds no existing row for (track, user)
→ INSERTS a fresh row with URI X. **the like just resurrected
itself after the user explicitly unliked.**
6. eventually the delete event arrives and `ingest_like_delete`
by URI X clears the resurrected row — but in the gap the user
sees their unlike undone.
Fix: in (3c), tombstone the URI in Redis with a 5-minute TTL BEFORE
issuing the orphan PDS delete. `ingest_like_create` checks the
tombstone and drops the matching create event in (5). The TTL only
needs to cover Jetstream propagation; expiry is harmless because the
matching delete event still arrives shortly after.
Why Redis tombstone over a `cancelled_at` schema column: no migration,
no read-path filtering across ~15 query sites, scoped fix to the two
files actually involved in the race. Local Redis blip falls back to
the existing Jetstream-delete cleanup; user briefly sees the ghost
like but it's cleared seconds later.
Mirrors the existing track-tombstone pattern in `ingest.py` (which
prevents ghost tracks from cursor rewind) — same Redis primitive,
different prefix (`like_cancelled:` vs `plyr:tombstone:`) reflecting
the different concern (write race vs replay race).
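The primitive itself is tiny — a sketch (prefix from above; helper
names illustrative):

```python
import redis.asyncio as redis

TTL_SECONDS = 300  # only needs to outlive Jetstream propagation
PREFIX = "like_cancelled:"

async def tombstone_cancelled_like(r: redis.Redis, uri: str) -> None:
    # called in pds_create_like's orphan branch (3c), BEFORE scheduling
    # delete_record_by_uri(X)
    await r.set(f"{PREFIX}{uri}", "1", ex=TTL_SECONDS)

async def like_was_cancelled(r: redis.Redis, uri: str) -> bool:
    # checked by ingest_like_create (step 5) before inserting a row
    return await r.exists(f"{PREFIX}{uri}") == 1
```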
Tests:
- tests/test_pds_create_like_tombstone.py — pds_create_like writes
the tombstone in the orphan branch and NOT on the happy path
(which would otherwise stall the user's own like indefinitely).
- tests/test_jetstream.py::TestIngestLikeCreate::test_skips_create_for_cancelled_uri
— ingest_like_create drops the create event when the URI is
tombstoned.
447/447 backend tests pass; ruff + ty clean.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the player bar already falls back to `track.album?.image_url` when the
per-track image is unset, but the track detail page, track-list items,
track grid cards, and the embed surface all rendered a placeholder
instead. result: the same track shows artwork in the player and a
blank in every other surface, including its own detail page.
extracted the inheritance rule into `lib/track-cover.ts`
(`trackCoverUrl` + `trackThumbnailUrl`) so every cover-rendering
surface routes through the same helper. semantically this models
the relationship correctly — the album HAS the art, the track
INHERITS unless it sets its own — instead of denormalizing the
album cover into each track row, which would silently go stale if
the album cover ever changed.
side benefit: the recent /tmp upload bug (#1336) orphaned 3 tracks
with `image_id IS NULL` while their album record kept its cover.
those tracks now render the album cover at view time without any
DB backfill, and without needing the artist to re-upload.
surfaces touched:
- routes/track/[id]/+page.svelte — visible cover + og:image cascade
both routed through the helper; previewIsTrackArt simplifies to
`coverUrl !== undefined`
- lib/components/TrackItem.svelte — list item (used in album page,
my tracks, search results, etc.)
- lib/components/TrackCard.svelte — grid card
- routes/embed/track/[id]/+page.svelte — third-party embed (bg blur,
desktop side art, mobile art card all share the same coverUrl)
ATProto track records are unchanged: artists who didn't upload a
per-track image still don't claim one in their portable record.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(uploads): stage audio + image to shared storage before enqueueing docket
PR #1331 moved POST /tracks/ + PUT /tracks/{id}/audio onto docket
to fix a connection-pool problem, but mechanically forwarded the same
request-handler `/tmp/...` paths over Redis. on production fly.io,
`relay-api` runs multiple machines per process group; the docket worker
frequently lands on a different machine than the request handler. that
machine has its own /tmp, so the upload silently fails:
`FileNotFoundError: [Errno 2] No such file or directory: '/tmp/tmpXXXX.wav'`.
evidence (prod, 2026-04-25 darkhart.bsky.social, 7 jobs):
4 failed at varied phases (`upload`, `pds_upload`, `atproto`) — all with
the same FileNotFoundError. the 3 that succeeded all hit the same
`atproto` phase. pure luck of which worker grabbed the job. the
successful tracks also had `image_id IS NULL` in `tracks` because
`_save_image_to_storage` reads `image_path` and silently swallows the
exception (returns `(None, None, None)` on failure). that's the
"cover art shows in the player bar but not on the track page" symptom.
shape of the fix:
HTTP handler:
1. stream client upload to a request-local temp file (size enforce)
2. extract duration once, while bytes are still local
3. `storage.save(file, filename)` -> audio_file_id
4. stream image to memory, `storage.save` -> image_id, image_url, thumb_url
5. delete request-local temp file
6. enqueue docket task with file_id / image_id / URLs ONLY
worker (`run_track_upload`, `run_track_audio_replace`):
- signatures take `audio_file_id`, never a `*_path`
- `_validate_audio` reads duration from the context (no I/O)
- `_store_audio` reuses the staged id directly for web-playable
formats; for lossless, downloads from storage, transcodes via a
worker-local /tmp (single-task, never crosses machine boundary),
saves transcoded result back to storage
- `_upload_to_pds` downloads bytes from storage when not transcoded
- `_store_image` is a no-op forward (URLs already resolved in handler)
this preserves PR #1331's connection-pool win (handler returns once
storage is durable + docket task is enqueued) and removes the
multi-machine fragility entirely.
- drops aiofiles use on this path; uses `storage.get_file_data`
- removes the temp-file cleanup in `_process_upload_background` —
there's nothing local to clean
- audio_replace handler also captures `support_gate` up front so the
staged bytes land in the right bucket (private vs public) before
the worker sees them
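handler-side, the staging order looks roughly like this (helper names
are stand-ins; image staging and size enforcement elided):

```python
import os
import tempfile

from fastapi import UploadFile

async def stage_and_enqueue(upload: UploadFile, storage, schedule) -> None:
    # 1. stream the client upload to a request-local temp file
    fd, tmp_path = tempfile.mkstemp()
    try:
        with os.fdopen(fd, "wb") as f:
            while chunk := await upload.read(1 << 20):
                f.write(chunk)
        # 2. extract duration once, while the bytes are still local
        duration = probe_duration(tmp_path)  # hypothetical ffprobe wrapper
        # 3. stage audio to shared storage -> durable audio_file_id
        with open(tmp_path, "rb") as f:
            audio_file_id = await storage.save(f, upload.filename)
    finally:
        os.unlink(tmp_path)  # 5. nothing in /tmp survives the request
    # 6. enqueue with ids/URLs ONLY — no /tmp path ever crosses the
    # Redis boundary to a worker on another machine
    await schedule(audio_file_id=audio_file_id, duration=duration)
```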
regression coverage:
the structural change (`UploadContext` no longer has `file_path`,
docket task signatures no longer have `*_path` args) is the contract.
existing tests (`test_upload_session_reload`, `test_upload_phases`,
`track_audio_replace/test_pipeline.py`) exercise the orchestrator
end-to-end through the new context shape and pass green (46 tests).
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(uploads): clean up staged storage on handler-side + pre-DB worker aborts
addresses three orphan-cleanup gaps reviewer flagged on the staging refactor:
1. **handler-side**: any abort between `stage_audio_to_storage` and a
successful schedule call left staged storage objects orphaned and
the job stuck in PROCESSING. wrap staging+enqueue in try/except;
on failure delete staged audio (private if gated, public otherwise)
and image, mark the job FAILED.
2. **replace orchestrator**: `new_file_id_for_rollback` was None until
`_store_audio` returned. the gated-FLAC path (handler stages new
bytes to private bucket → `_store_audio` raises "supporter-gated
tracks cannot use lossless formats yet") left those bytes stranded.
initialize from `ctx.audio_file_id` upfront, thread the playable-
file extension through `_rollback_new_files`. add `is_gated: bool`
to ReplaceContext (handler-time decision) so rollback selects the
bucket the bytes ACTUALLY live in even under a concurrent PATCH
that flips support_gate between request and worker.
3. **upload orchestrator**: phases 1-5 raise UploadPhaseError without
releasing staged bytes. add `_cleanup_staged_media_pre_db` and a
`db_row_owns_media` boundary flag — orchestrator cleans up only
before `_create_records`, deferring to its existing reserve-then-
publish cleanup past that. covers the transcoded-sibling case.
session-expired path on both workers also deletes the staged bytes
(no recovery without a fresh sign-in; orphans serve nothing).
regression tests:
- `tests/api/test_upload_storage_cleanup.py` (4 tests)
- `track_audio_replace/test_pipeline.py` (1 test):
early-abort rolls back staged file from the right bucket per
`ctx.is_gated`
370/370 tests pass locally; ruff + ty clean.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* chore: drop stray backend/loq.toml
* chore(uploads): consolidate cleanup helper, drop redundant deferred import
once-over after CI green:
- removed redundant `from backend._internal import get_session` deferred
re-import inside `_process_upload_background` — the symbol is already
imported at module scope. updated `test_upload_session_reload` to
patch where the symbol is used (`backend.api.tracks.uploads.get_session`)
rather than where it's defined, which is the right pattern anyway.
- audio_replace's handler + session-expired path were inlining the
same `delete_gated if gated else delete` pattern that uploads exposes
as `_delete_staged_audio`. import + reuse instead of duplicating.
no behavior change; 370/370 tests pass.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
12 concurrent uploads targeting the same album (artist_did, slug) raced
in `get_or_create_album`: the losers caught IntegrityError and called
`db.rollback()` on the caller's shared AsyncSession. under concurrent
load this left 2/12 uploads blowing up with MissingGreenlet on the very
next pool checkout, ~300ms after INSERT albums — observed on stg during
the 12-chromatic-drone smoke test (2026-04-24).
replace SELECT-then-INSERT-then-catch with a single
`INSERT ... ON CONFLICT DO NOTHING RETURNING`. the race resolves at the
DB level, no rollback on a shared session, no churn on pool state.
regression test fires 12 concurrent `get_or_create_album` calls on
separate sessions with the same title and asserts exactly 1 row, 1
`created=True`, and all callers agree on the resulting album id.
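shape of the new statement (assumed `Album` model with a UNIQUE
(artist_did, slug) constraint):

```python
from sqlalchemy import select
from sqlalchemy.dialects.postgresql import insert

async def get_or_create_album(db, artist_did: str, slug: str, title: str):
    stmt = (
        insert(Album)
        .values(artist_did=artist_did, slug=slug, title=title)
        .on_conflict_do_nothing(index_elements=["artist_did", "slug"])
        .returning(Album.id)
    )
    album_id = (await db.execute(stmt)).scalar_one_or_none()
    if album_id is not None:
        return album_id, True  # this caller won the INSERT
    # DO NOTHING yields no RETURNING row for the losers — no exception,
    # no rollback on the shared session; just SELECT the winner's row
    result = await db.execute(
        select(Album.id).where(Album.artist_did == artist_did, Album.slug == slug)
    )
    return result.scalar_one(), False
```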
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the POST /tracks/ and PUT /tracks/{id}/audio handlers used
`fastapi.BackgroundTasks.add_task`, which runs the task within the
same ASGI request lifecycle after the response is sent. consequence:
any request-scoped DB session stays checked out of the pool until the
task finishes (20-100s per upload), and nothing bounds concurrency.
today flo.by uploaded 6 tracks in a single album-create fan-out. six
concurrent uploads held six of the 10 pool slots for over a minute
and starved every other request (/auth/me p95 hit 9.7s, /health 3s).
root cause: this pattern was in place from the very first streaming-
uploads commit (26a48c75, Nov 2025). docket landed a month later and
all post-upload tasks were migrated piecemeal (copyright, embedding,
genre, image moderation, atproto sync, teal, export, pds backfill)
but the upload orchestration itself never was. audio replace (#1311,
Apr 2026) copied the same pattern.
changes:
- uploads.py: add run_track_upload (docket task, primitives only,
rehydrates session, delegates to existing _process_upload_background)
+ schedule_track_upload helper
- audio_replace.py: same trio for replace
- handlers: drop `background_tasks: BackgroundTasks` param, call
await schedule_* instead
- _internal/tasks/__init__.py: register both tasks in the docket list
- test_endpoint.py: patch the scheduler helper, not the orchestrator
- tests/integration/test_album_upload.py: add
test_album_upload_10_tracks_concurrently as regression coverage —
fires 10 concurrent uploads through an album and asserts all complete
- loq.toml: relax limits on uploads.py + audio_replace.py to cover the
new wrapper functions
the existing orchestrators (_process_upload_background,
_process_replace_background) keep the same signature so every pipeline
test that drives them directly continues to pass unchanged.
buys us:
- HTTP handler returns in <1s; request-scoped DB session released on
response instead of 100s later
- per-op DB sessions via db_session() inside the task, not held across
the whole upload
- bounded concurrency via settings.docket.worker_concurrency (default
10/worker x 2 prod machines = 20 concurrent uploads, rest queue in
Redis rather than saturating the pool)
- fresh session rehydration if OAuth refreshed between queue and task
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the component used a CSS overlay with `z-index: 1000` and manual ESC /
focus-trap plumbing, while every other sheet / modal in the codebase
(AudioRevisionsSheet, LikersSheet, LogoutModal, SearchModal,
PdsMigrationModal, FeedbackModal, Toast, TermsOverlay) uses
`z-index: 9999`. opening a confirm from *inside* one of those sheets —
specifically "restore" inside the audio version-history sheet —
rendered the confirm behind the sheet, forcing the user to dismiss the
sheet before they could click confirm.
bumping the z-index to 10000 would have been whack-a-mole. using the
native <dialog> element with `.showModal()` puts the dialog in the
browser's top layer, which stacks above every other element on the
page regardless of z-index. by construction, nested modals work.
secondary benefits from switching to the platform primitive:
- focus trap, aria-modal, ESC handling all native — removed our
reimplementations
- ::backdrop pseudo-element for backdrop styling
- role="alertdialog" for semantic correctness on confirmation prompts
- oncancel handler blocks ESC-dismiss while an async confirm is in
flight (pending=true), so the user can't dismiss a pending operation
mid-run and leave parent state inconsistent with UI state
public API of the component is unchanged — both existing callsites
(replace-audio confirm + restore-revision confirm in portal/+page.svelte)
continue to pass `open={...}` one-way and manage close via `onCancel`.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
follow-up to #1326 — the × button that clears a selected file in the
audio-replace row is the only remaining <button> in that group that
lacked `font-family: inherit`. it currently renders only an SVG icon
so there's no visible font right now, but matching the sibling buttons
keeps the group consistent and protects against future "what if we add
a tooltip text" changes.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the "version history" button rendered in the browser default sans-serif
instead of the user's selected global font (mono by default). <button>
elements don't inherit font-family by default, so an explicit
`font-family: inherit` is required. matches the pattern already used
in login, tag, and track routes.
also added to .audio-replace-btn (same root cause; not visible in the
current screenshot because it only appears after a file is selected)
and .audio-upload-btn (it's on a <label> so inherits implicitly, but
adding for consistency and to protect against future markup changes).
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the restore path used to strip audioBlob from the republished record
whenever the PDS had already GC'd the revision's original CID, silently
downgrading the track to audio_storage="r2". plyr.fm's core promise is
that users own their audio on their PDS — dropping the blob ref would
break that promise.
new behavior when PDS returns BlobNotFound on the first publish:
1. fetch the R2 bytes via storage.get_file_data(file_id, file_type)
2. upload them to the user's PDS to mint a fresh blob CID
3. republish the record with the fresh audioBlob ref
4. commit the track with audio_storage="both" + the new CID
fallback chain (rare): if R2 is also missing the bytes, or the PDS
rejects the re-upload (oversize, transient), we keep the old behavior
— republish without audioBlob and downgrade to r2-only. restore still
completes; playback keeps working via audio_url.
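the retry, roughly (pds/storage helper names are stand-ins, not the
real call sites):

```python
class BlobNotFoundError(Exception):
    """stand-in for the PDS's BlobNotFound error surface."""

async def publish_restored_record(pds, storage, track, record: dict) -> None:
    try:
        await pds.put_record(track.atproto_uri, record)
    except BlobNotFoundError:
        # 1. fetch the R2 bytes
        data = await storage.get_file_data(track.file_id, track.file_type)
        # 2. re-upload to the user's PDS to mint a fresh blob CID
        blob_ref = await pds.upload_blob(data, mime_type=track.mime_type)
        # 3. republish with the fresh audioBlob ref
        record["audioBlob"] = blob_ref
        await pds.put_record(track.atproto_uri, record)
        # 4. committing audio_storage="both" + the new CID happens
        #    downstream (elided), as does the R2-miss fallback chain
```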
verified via smoke test on stg.plyr.fm (track 2202) before the fix:
post-restore PDS record had no audioBlob, DB had audio_storage="r2",
pds_blob_cid=null. with this patch, the restored record carries a
first-class PDS blob ref again.
tests:
- rewrote test_restore_falls_back_when_pds_blob_gc →
test_restore_reuploads_blob_when_pds_gc: asserts the retry record
carries the re-uploaded ref and DB ends with audio_storage="both"
- added test_restore_falls_back_to_r2_when_reupload_also_fails: covers
the R2-miss path (retained fallback behavior)
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the CF Pages frontend build config described in environments.md was
stale. verified against the live project config on each recreate today:
- build command: `cd frontend && bun run build`
→ `cd frontend && bun install && bun run build`
(SKIP_DEPENDENCY_INSTALL=1 is set to skip CF's auto-install, so the
build command has to run `bun install` itself)
- build output: `frontend/build`
→ `frontend/.svelte-kit/cloudflare`
(matches `pages_build_output_dir` in `frontend/wrangler.toml`)
- env vars list: added `SKIP_DEPENDENCY_INSTALL=1` which was missing
- prod custom domain line: added `www.plyr.fm` alongside `plyr.fm`
same fixes applied to both prod and staging subsections.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
reverting the retry-poll I added in #1320 after investigating properly.
the test failure in the post-#1319 integration run was NOT flakiness —
it's a real race condition in the like → pds_create → jetstream ingest
pipeline. tracked in #1321.
a retry-poll would have papered over a real bug and made future
diagnosis harder (\"oh the test just takes 5s sometimes\"). reverting
to the original assertion so the failure remains visible until the
underlying race is fixed.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* test(integration): retry-poll the liked_tracks check in test_cross_user_like
failed in the #1319 post-deploy integration run: the liked_tracks list
was still showing the track immediately after unlike. pre-existing
eventual-consistency gap — the likes pipeline has a small lag between
the unlike write and the liked list read (cache / read-replica).
matches the pattern test_upload_searchable already uses for similar
eventually-consistent reads: retry up to 5 times with 1s sleep, fail
with a clear message if the track is still there.
* fix(restore): fall back when PDS has GC'd old blob
caught by manual staging smoke: when restoring a revision that had
audio_storage="both" with a PDS blob ref, the restored PDS record was
being published WITHOUT an audioBlob field, silently dropping the user's
PDS-hosted copy of the audio.
root cause: the restore code explicitly passed `audio_blob=None` to
build_track_record with a misleading comment claiming "PDS blob not
re-uploaded on restore". the comment was right about the blob bytes
(they're already on PDS), but the BLOB REF must still be included in
the new record — PDS records can reference pre-uploaded blobs.
fix: if the revision carries a pds_blob_cid, construct a BlobRef
(using the stored size + the file_type's mime type) and pass it
through to build_track_record. PDS records now keep their audioBlob
field through the full replace → restore round trip.
also adds a regression test that:
- sets up the same scenario the smoke hit (both → replace → restore)
- asserts the published record contains audioBlob pointing at the
original ref
- asserts the live track row keeps audio_storage="both" and the
correct pds_blob_cid / pds_blob_size after restore
note: if the user's PDS has already GC'd the old blob, the record is
still valid — playback falls back to audio_url (R2). we don't re-upload
the blob as part of restore; that would require hauling bytes through
the backend and is out of scope for v1.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* feat: audio revisions with confirm-before-replace and restore
closes the UX loop on the audio-replace feature shipped in #1311-1313.
two changes shipped together:
1. **confirmation gate** before audio replace fires. picking a file no
longer kicks off the irreversible upload — clicking "replace audio"
now opens a confirm dialog. addresses Alex's report that hitting
"cancel" after picking a file did not roll back the replace (because
nothing actually fired until "replace audio" was clicked, but the
coupling between picker and that button was confusing).
2. **track_revisions table** + restore endpoint + version-history sheet.
every audio replace snapshots the displaced audio into a TrackRevision
row in the same DB transaction as the swap. column names are
provider-neutral (audio_url, not r2_url) so swapping blob providers
later doesn't leave cruft behind. retention cap is 10 per track —
pruning deletes the backing blob if no other row still references
it. PDS-only audio is never deleted (user owns those blobs).
restore is an instant pointer-swap: the chosen revision becomes the
live audio, the displaced current is snapshotted into a new revision
row, and the chosen revision row is deleted (its content is now
current). PDS record is republished as part of the same flow — non-
negotiable so the user's PDS stays in sync with plyr.fm state.
restore is rejected with 409 if it would cross the public ↔ gated
boundary — moving blobs between buckets isn't built yet, and serving
gated audio from the public bucket would defeat the gate.
the version-history surface is a bottom-sheet on mobile / centered
modal on desktop, modeled on LikersSheet. trigger lives in the audio
file section of the track edit form. each row shows format,
relative time, duration, storage location, and a restore button.
new endpoints:
- GET /tracks/{id}/revisions
- POST /tracks/{id}/revisions/{revision_id}/restore
new components:
- ConfirmDialog.svelte — generic alertdialog (used for replace + restore)
- AudioRevisionsSheet.svelte — mobile-first version-history surface
related: #1314 (orphan R2 files) — revisions give R2 files an owner,
which removes the orphan path. #1315 (in-flight tasks writing stale
results) is orthogonal and not addressed here.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* test: integration coverage for audio revisions + restore
three end-to-end tests against staging (skip when PLYR_TEST_TOKEN_* unset):
- replace_audio_creates_revision — upload, replace, verify history holds
exactly one row capturing the displaced original
- restore_swaps_audio_and_rotates_revision — upload, replace, restore;
live audio is back to the original, chosen revision row is gone, the
displaced post-replace audio is now in history
- non_owner_cannot_list_or_restore — user2 gets 403 on both list and
restore against user1's track
each test cleans up via the SDK's delete(). new endpoints aren't in the
SDK yet, so raw httpx is used for replace + revisions/restore.
these will run automatically after the PR merges and staging deploys
(the integration-tests workflow fires on deploy staging completion).
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
"current: m4a" alone is too terse — the user has no sense of where the
audio lives or whether it was transcoded. swap to:
current: m4a · stored on your PDS
current: mp3 (transcoded from flac) · stored on plyr.fm
we don't have the original filename to show (it's content-hashed away
at upload), so format + storage location is the most useful signal.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
UI for the new PUT /tracks/{id}/audio endpoint shipped in #1311. Lets
artists swap a track's audio without deleting + re-uploading (and losing
likes / comments / plays / the URL).
how it's wired:
- uploader.replaceAudio(trackId, file, title, onComplete) mirrors the
existing uploader.upload XHR + SSE flow but PUTs to a track-specific
endpoint with just the file. progress is surfaced via the same toast
pattern as the initial upload.
- portal/+page.svelte edit form gains an "audio file" section next to
the existing "artwork" section. picker → "replace audio" button →
picker clears immediately and the SSE flow continues in the toast.
- Player.svelte's track-load $effect now also fires on file_id change,
not just track id change. so when the currently-playing track gets a
new audio file, the <audio> element src reloads in place. on
successful replace, we fetch the fresh track row and reassign
player.currentTrack so the effect picks up the new file_id.
deliberately separate from the metadata "save changes" flow because
the upload + transcode + PDS write can take 30s+ and has its own SSE
progress; conflating it with the fast PATCH would block the form.
manual smoke depends on backend being deployed to staging.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
* feat(backend): replace audio on existing track via PUT /tracks/{id}/audio
artists currently delete + re-upload to fix bad audio (logfire shows
darkhart.bsky.social did this 3× in ~65min). that loses likes, comments,
plays, and the track URI. add an endpoint that swaps the audio bytes while
keeping the track's stable identity intact.
orchestration is atomic with rollback:
1. validate + store new audio (R2; transcode if lossless)
2. upload to PDS (best-effort, falls back to r2-only on size limit)
3. PUT updated ATProto record (URI stable, new CID)
4. DB row swap in single tx — file_id, r2_url, atproto_record_cid, duration,
pds_blob_*, audio_storage; clears stale genre_predictions provenance
5. delete old R2 object only on success
6. fire post-replace hooks: invalidate old CopyrightScan rows, re-fire
copyright/embedding/genre tasks; never re-notify followers
7. resync album list record so its strongRef carries the new track CID
if step 3 fails, rollback deletes the just-written R2 file and leaves the
track row untouched.
reuses upload phase helpers (_validate_audio, _store_audio, _upload_to_pds)
so the transcode/PDS-blob/gating logic stays in one place.
intentional non-changes:
- labeler labels on the (URI-stable) track are NOT auto-dismissed — that's
a moderation call left for manual review
- likes/playlists/comments retain stale strongRef CIDs; this is the same
CID-churn behavior that PATCH /tracks/{id} produces today
also fixes two pre-existing test failures uncovered while building this:
- conftest pg_trgm extension only created in xdist template path
- moderation report tests leaked rate-limit budget across the session
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
* fix(audio-replace): tighten rollback scope + handle gated bucket
addresses two review findings on #1311:
1. **post-commit failures triggered rollback (P1)**. the previous orchestrator
wrapped both pre- and post-commit work in one try block. if any side
effect after `_commit_db_swap` raised (post-replace hooks, album resync,
cache invalidation), the except path would delete the new R2 file even
though the track row + ATProto record were already pointing at it —
leaving production with a 404 for the freshly-replaced audio.
split into two phases: pre-commit may rollback; post-commit failures are
logged and swallowed (the swap stands). each post-commit side effect
gets its own try/log so one failure doesn't skip the others (shape
sketched after this list).
2. **gated tracks leaked private-bucket objects (P2)**. `R2Storage.delete()`
only probes the public audio + image buckets, so cleanup and rollback
silently no-op'd on supporter-gated tracks (which live in
`private_audio_bucket_name`).
added `delete_gated()` to `R2Storage` + `StorageProtocol` (mirrors
`delete()`'s refcount guard and key probing, against the private bucket).
`_cleanup_old_files` and `_rollback_new_files` now route based on the
track's `support_gate`. also fixes the same pre-existing leak for
gated tracks deleted via the API today (separate latent bug, but the
primitive is now there).
3. **defensive metadata refresh before publish**. a concurrent PATCH that
landed between `_load_and_authorize` and `_publish_record_update` would
have its title / album / features clobbered by the stale snapshot. now
re-loads the row right before building the new ATProto record.
4. **hoist deferred imports** in audio_replace.py + storage/r2.py per the
project's "no unnecessary deferred imports" rule (CLAUDE.md). the
`backend.api.albums` import doesn't have a real circular dep — i'd
copied the pattern from mutations.py without checking.
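The two-phase shape from (1), sketched with stand-in helper names:

```python
import logging

logger = logging.getLogger(__name__)

async def _orchestrate_replace(track, ctx) -> None:
    # --- pre-commit: any failure rolls back the new files ---
    try:
        await _store_audio(ctx)
        await _upload_to_pds(ctx)
        await _publish_record_update(track, ctx)
        await _commit_db_swap(track, ctx)
    except Exception:
        # bucket-aware per the track's support_gate (delete vs delete_gated)
        await _rollback_new_files(track, ctx)
        raise
    # --- post-commit: the swap stands; log failures and keep going ---
    for side_effect in (_fire_post_replace_hooks, _resync_album_list):
        try:
            await side_effect(track, ctx)
        except Exception:
            logger.exception("post-commit side effect failed; swap stands")
```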
new tests:
- post-replace hook failure does NOT roll back the new file
- album list sync failure does NOT roll back the new file
- gated track success path uses `delete_gated` for old file
- gated track rollback uses `delete_gated` for new file
- concurrent PATCH title is reflected in the published ATProto record
full xdist suite: 815 passed (was 810 + 5 new tests).
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
the Cmd+K modal was firing keyword and semantic (mood) searches in
parallel whenever the vibe-search flag was on, then merging both lists
by score (#858). BM25 relevance and cosine similarity are on different
scales — the sort produced jarring interleaves where mediocre semantic
matches outranked solid keyword hits.
revert to the #848 interaction model: explicit mode toggle, one mode at
a time. keyword is the default. flagged users can flip to mood when
they want it; the toggle is hidden for everyone else so there's no
change for non-flagged users.
- search.svelte.ts: add mode state, setMode(), stale-mode guards on
in-flight fetches. activeResults returns just the active mode's list.
drop dedupedSemanticResults / semanticResultIds / semanticSimilarityMap.
- SearchModal.svelte: render a small keyword/mood chip toggle below the
input, gated on search.semanticEnabled. placeholder copy follows the
active mode. mood similarity % only renders in semantic mode.
arc for the record: toggle (#848) → parallel + separator (#851) →
score-merged interleave (#858) → toggle again. #851/#858 were the wrong
direction given how uneven semantic ranking still is.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Restores the prior hover-tooltip + mobile-sheet model for liker
lists. The inline avatar strip was fighting too many surfaces
(track rows, cards, detail page, click propagation to play button,
SvelteKit nav hijacking) and broke enough of them that the churn
outweighed the UX win.
Reverts PRs #1302, #1303, #1304, #1305, #1306, #1307, #1308 in
one commit. Search modal stability (#1301) is preserved.
Removed by this revert:
- AvatarStack.svelte, LikersStrip.svelte (never existed before)
- LikerPreview schema, get_top_likers aggregation, top_likers on
TrackResponse, all the callsite wiring
Restored by this revert:
- LikersTooltip.svelte (desktop hover tooltip)
- LikersSheet.svelte + likers-sheet.svelte.ts (mobile bottom sheet)
- LikersSheet mount in +layout.svelte
- Original .likes span markup + CSS in TrackItem, TrackCard,
track/[id]/+page.svelte
- Original supporter-circle markup + CSS on u/[handle]/+page.svelte
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Clicking "+1" to reveal a single extra avatar expanded a strip that
scrolled nowhere — pointless. Skip the "+N" tile when the overflow
would be smaller than minOverflow (default 3) AND we have enough
users loaded to show everyone inline.
- backend `get_top_likers` default limit: 3 -> 5. tracks with 4 or 5
total likes now ship everyone in the preview, so the frontend can
render them all inline without a dead-end tile. cost per EXPLAIN
ANALYZE is still sub-millisecond on production.
- frontend `AvatarStack` gains `minOverflow` prop (default 3). only
skips "+N" when the overflow is small AND users.length >= total
(i.e. we have everyone loaded), so partial-data surfaces fall back
safely to the regular "+N" affordance.
Behavior:
- total <= 5: shows everyone inline, no "+N"
- total >= 6: shows 3 + "+N>=3" (expansion reveals meaningfully more)
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Previous fix slapped onclick={(e) => e.stopPropagation()} on the
LikersStrip root to prevent +N/× clicks from bubbling to the outer
play button. That also ate anchor clicks on individual avatars
before they could reach document, where SvelteKit's client-side
nav hijacker lives. With that listener never firing, the browser
fell back to a full page reload — which tears down the audio
element mid-playback.
Scope the stopPropagation to just the non-anchor interactive bits:
- +N handler in LikersStrip now stopPropagation's its own event
- × collapse button already stopPropagation'd
- root span no longer stops anything
Avatar links now reach document → SvelteKit intercepts → client-side
nav → player persists → audio keeps playing. The outer play button's
existing anchor guard (closest('a')) still prevents playback on
anchor clicks, so no regression on that front either.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Without a label the avatars looked like they belonged to the song
itself (artist, featured collaborators) rather than the people who
liked it. Adds a "liked by" label inside LikersStrip so the meaning
is unambiguous everywhere the strip appears — TrackItem, TrackCard,
and the track detail page.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
The strip lives inside the TrackItem play button. Clicking +N or ×
was bubbling up to the outer button's onclick, starting playback
before the expand/collapse could land. Stop click+keydown
propagation at the LikersStrip root so all interaction inside the
strip (avatar navigation, +N expand, × collapse) stays contained.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Previous cut lost timestamp info — hovering an avatar showed only
display name. Adding it back without reintroducing a separate panel:
- backend LikerPreview now includes `liked_at` (ISO string) pulled
from `track_likes.created_at` via the window-function query
- AvatarStack gets an optional `avatarTitle(user)` prop so parents
can customize the hover/focus tooltip
- LikersStrip passes a formatter that renders
"display name · liked 2h ago"
UserPreview.liked_at is optional — supporter avatars on the artist
page don't carry a timestamp and keep their existing display-name
tooltip.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
The prior PR moved liker avatars inline but left the hover tooltip
and mobile bottom sheet in place. On hover, a tooltip opened above
the track row and showed... the same avatars, just a few more of
them. That was redundant and, as you pointed out, the separate
panel opening on hover was the most egregious part.
New model: the inline strip *is* the interaction.
- hover → per-avatar lift (already worked via AvatarStack)
- click an avatar → navigate to /u/{handle}
- click "+N" → the stack itself expands in place to a
horizontally-scrollable strip of every liker. same widget, just
longer. lazy-fetched via the existing tooltip-cache so the
data is there the first time you expand and instant on
subsequent expansions
- click × (or click outside, or press Escape) → collapses back
No popover, no bottom sheet, no tooltip. One affordance, consistent
across mobile and desktop.
Implementation
- AvatarStack.svelte — new scrollable + maxScrollWidth props.
When scrollable, the container gets overflow-x: auto, scroll-snap,
and a thin scrollbar. Overlap is preserved so it stays visually
the same widget, not a different one.
- LikersStrip.svelte — new wrapper that owns the expansion state
and the lazy fetch. Parents pass trackId + likeCount + topLikers
and don't think about anything else.
- TrackItem, TrackCard, track/[id]/+page.svelte — all the
tooltip/sheet state, hover timers, click-to-open-sheet handlers,
cursor: help, tooltip-open z-index gymnastics — all gone.
Replaced with <LikersStrip>.
- Deleted: LikersTooltip.svelte, LikersSheet.svelte,
likers-sheet.svelte.ts. Removed mount from +layout.svelte.
tooltip-cache.svelte.ts stays — LikersStrip uses it for the
expansion fetch.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
Replaces the plain "N likes" text next to tracks with an overlapping
strip of the 3 most recent liker avatars (+N if more) — matching the
existing supporter-row pattern on artist pages. Both sites now render
the same `AvatarStack` presentational component; only the data flow
differs (liker avatars are maintained in our artists DB via jetstream;
supporter avatars come from atprotofans via the /artists/batch
enrichment already in place).
Backend
- new `get_top_likers(db, track_ids, limit=3)` aggregation utility
using `ROW_NUMBER() OVER (PARTITION BY track_id ORDER BY created_at
DESC)`, filtered to `rn <= limit`. Postgres 15+ pushes the limit
into the window aggregate (Run Condition), so work short-circuits
per partition. EXPLAIN ANALYZE on production (308 likes, 20-track
page): ~1ms execution, all in shared buffer cache (query shape
sketched after this list).
- `TrackResponse.top_likers: list[LikerPreview]` added; threaded
through every list endpoint that already batches aggregations
(for_you, tracks listing, tracks /top, tracks /me, tracks /me/broken,
albums listing, users/{handle}/likes, tracks/tags, tracks/shares,
lists/hydration, liked tracks list) plus single-track endpoints
(playback /by-uri, mutations update, mutations restore-record).
- queue and jams serializers continue to skip aggregations per their
existing comments — they pass no `top_likers`, and the field
defaults to `[]`, which the frontend renders as the plain count
(pre-existing behavior).
- `LikerPreview` lives in `utilities/aggregations.py` rather than
`schemas.py` to avoid a circular import (schemas.py imports from
aggregations.py for `CopyrightInfo`).
- tests in `test_aggregations.py`: default limit, custom limit,
ordering by most-recent-first, empty track list, and the
JOIN-on-Artist filter behavior (likers without an artist row are
omitted, matching the existing `GET /tracks/{id}/likes` semantics).
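for reference, the shape of the ranked query (a sketch only: the
`likes`/`artists` table and column names here are assumptions, not the
real schema):

```python
# illustrative shape of the get_top_likers query — table and column
# names are assumptions, not the real schema
TOP_LIKERS_SQL = """
SELECT track_id, handle, avatar_url
FROM (
    SELECT l.track_id,
           a.handle,
           a.avatar_url,
           ROW_NUMBER() OVER (
               PARTITION BY l.track_id
               ORDER BY l.created_at DESC
           ) AS rn
    FROM likes l
    JOIN artists a ON a.did = l.liker_did  -- likers without an artist row are omitted
    WHERE l.track_id = ANY(:track_ids)
) ranked
WHERE rn <= :limit  -- Postgres 15+ folds this into a window run condition
"""
```

the run condition is what keeps this cheap: each partition stops
producing rows once `rn` passes the limit, instead of ranking every
like on every track.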
Frontend
- `AvatarStack.svelte` — new purely-presentational component. Props:
`users`, `total`, `maxVisible`, `size`, `borderColor`, `moreHref`,
`onMoreClick`, `avatarHref`, `onAvatarClick`, `ariaLabel`, `class`.
Handles 0-N users, renders +overflow tile as link OR button
depending on the surface, supports fallback initials when
`avatar_url` is null.
- `UserPreview` type added to `types.ts`; matches backend
`LikerPreview` and the atprotofans-derived `Supporter` shape.
- `Track.top_likers?: UserPreview[]` added.
- wired into `TrackItem`, `TrackCard`, and `track/[id]/+page.svelte` —
the existing wrapper keeps the hover-tooltip (desktop) and
bottom-sheet (mobile) behavior on the whole strip; clicking an
individual avatar is intentionally a no-op so the detail sheet is
the canonical "see all likers" path.
- wired into `u/[handle]/+page.svelte` supporter row, replacing the
hand-rolled `.supporter-circle` markup and CSS (~65 lines deleted).
Avatars here DO link to `/u/{handle}` per existing UX; +N links
out to the atprotofans supporter page in a new tab.
- sizing is mobile-first: 20px avatars on mobile tracks, 22px on
desktop tracks, 18px in track cards, 28px/32px on the supporter
row.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
The Cmd+K search modal jolted visibly when the user started typing:
the hints (~100px tall) disappeared immediately, nothing rendered
during the 150ms debounce window (loading was still false), "no
results for X" flashed briefly, the body collapsed again while the
fetch was in flight, then popped open to result height.
two fixes:
1. set `loading=true` synchronously inside `setQuery()` when query>=2
(and semanticLoading when query>=3) so the "no results" branch never
matches during the debounce window before the fetch fires.
2. wrap the body states in `.search-body` with a 104px min-height
(matching the hints' rendered height) and `interpolate-size:
allow-keywords` + `transition: height`. the body no longer collapses
between states, and the growth to result height animates smoothly on
browsers that support interpolate-size (chromium-based: chrome/edge
129+, late 2024; not safari or firefox yet). other browsers fall
back to instant resize — no regression.
an explicit `.search-progress` placeholder covers the in-between state
when the user has typed 1 char or is waiting on the first response.
Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>
adds a font picker alongside the existing accent color controls. six
options: mono (default), geist, inter, system, georgia, comic sans.
stored in ui_settings JSONB (no migration needed), cached in
localStorage for flash prevention, applied via --font-family CSS var.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
increases gradient interpolation from 7 to 16 color stops for finer
transitions. removes brightness(1.08) oscillation from the breathing
animation — it amplified visible banding at color step boundaries.
no SVG noise filters this time.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the 135-degree gradient with only 7 color stops and a brightness-oscillating
animation created visible diagonal stripes (color banding), especially in
dark/low-contrast weather palettes.
three fixes:
- increase gradient interpolation from 7 to 16 stops
- add SVG fractalNoise dither filter (soft-light blend) on the gradient layer
- remove brightness(1.08) from breathing animation (amplified banding)
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the segmented pill control for latest/for-you looked busy and
inconsistent with the rest of the UI. replace it with an inline
cycling button that matches the top tracks period toggle pattern —
tap to cycle between "latest" and "for you".
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: status maintenance — SDK namespace, CDN caching, feed switcher, telemetry incident
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* chore: add TTS audio for status update (generated locally)
gemini-2.5-pro-tts free tier quota was exhausted in CI.
generated locally with OTHER_GOOGLE_API_KEY.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: zzstoatzz <thrast36@gmail.com>
plyrfm CLI moved from flat commands to noun-first subcommands:
- `plyrfm delete` → `plyrfm tracks delete`
- `plyrfm upload` → `plyrfm tracks upload`
- `plyrfm my-tracks` → `plyrfm tracks my`
also update SDK examples in llms-full.txt for namespace API
(client.search → client.discover.search, etc.)
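a minimal sketch of the noun-first layout, assuming a typer-style CLI
(the actual framework, arguments, and help text may differ):

```python
# hypothetical sketch of the nested-subcommand layout with typer
import typer

app = typer.Typer()
tracks = typer.Typer(help="manage your tracks")
app.add_typer(tracks, name="tracks")

@tracks.command("upload")
def upload(path: str) -> None:
    """plyrfm tracks upload <path> (was: plyrfm upload)"""
    ...

@tracks.command("delete")
def delete(track_id: str) -> None:
    """plyrfm tracks delete <id> (was: plyrfm delete)"""
    ...

@tracks.command("my")
def my() -> None:
    """plyrfm tracks my (was: plyrfm my-tracks)"""
    ...

if __name__ == "__main__":
    app()
```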
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
jetstream ignored `kind=account` events entirely — deactivation left
stale cdn.bsky.app avatar URLs (dead 404s), and reactivation never
refreshed them. identity events also skipped avatar updates.
- handle `account` events in jetstream consumer (dispatch new
`ingest_account_status_change` task)
- on deactivation: clear avatar_url so frontend doesn't show broken img
- on reactivation: fetch fresh avatar from Bluesky profile
- add avatar refresh to `ingest_identity_update` (covers PDS migrations
and handle changes too)
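roughly the dispatch and task shape (a sketch: jetstream account events
carry `{"did": ..., "kind": "account", "account": {"active": bool}}`;
the dispatch call and avatar helpers below are illustrative names, not
the real module):

```python
# sketch of the new handling — only the event shape and the task name
# come from this commit; everything else is illustrative
async def handle_jetstream_event(event: dict) -> None:
    if event.get("kind") == "account":
        await dispatch(  # however background tasks are enqueued here
            ingest_account_status_change,
            did=event["did"],
            active=event["account"].get("active", False),
        )

async def ingest_account_status_change(did: str, active: bool) -> None:
    if not active:
        # deactivation: clear the avatar so the frontend never renders
        # a dead cdn.bsky.app URL
        await set_avatar_url(did, None)
    else:
        # reactivation: pull a fresh avatar from the Bluesky profile
        profile = await fetch_bsky_profile(did)
        await set_avatar_url(did, profile.get("avatar"))
```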
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the frontend toggle from #1288 only prevents new page loads from
calling initObservability(). stale cached clients (Cloudflare Pages)
continue hammering POST /logfire-proxy — 3,458 requests in 24 minutes
averaging 1.9s each, saturating the threadpool and causing /tracks/top
to take 10-18s.
guard the backend endpoint directly: return 204 immediately when the
flag is off, so no stale client can reach logfire_proxy().
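the guard, sketched (assuming a pydantic-style settings object carrying
the BROWSER_OBSERVABILITY flag described in the next commit below; the
route body is illustrative):

```python
# sketch: return 204 before any proxy work so stale clients cost ~nothing
from fastapi import APIRouter, Request, Response

router = APIRouter()

@router.post("/logfire-proxy/{path:path}")
async def logfire_proxy(path: str, request: Request) -> Response:
    if not settings.BROWSER_OBSERVABILITY:  # settings object assumed
        # stale cached bundles still POST here; drop the payload without
        # touching the threadpool or upstream Logfire
        return Response(status_code=204)
    return await forward_to_logfire(path, request)  # illustrative helper
```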
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the logfire browser SDK proxies all browser trace data through the
backend (POST /logfire-proxy/v1/traces) because Logfire requires
server-side auth. the proxy uses run_in_threadpool for a synchronous
HTTP call — under load, this saturates the threadpool and starves
async handlers including DB queries.
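for context, the saturating pattern looks roughly like this (a sketch,
not the real proxy code). starlette's run_in_threadpool draws from one
shared anyio limiter (about 40 worker threads by default), so ~2s
synchronous calls under load exhaust it quickly:

```python
import httpx
from fastapi.concurrency import run_in_threadpool

def _post_traces_sync(body: bytes, headers: dict[str, str]) -> int:
    # synchronous HTTP call: blocks one worker thread for the entire
    # upstream round-trip (~1.9s average during the incident)
    resp = httpx.post(
        "https://logfire-api.pydantic.dev/v1/traces",  # illustrative URL
        content=body,
        headers=headers,
    )
    return resp.status_code

async def proxy_traces(body: bytes, headers: dict[str, str]) -> int:
    # every browser trace POST pins a thread; enough concurrent calls
    # exhaust the shared pool and anything else needing a thread waits
    return await run_in_threadpool(_post_traces_sync, body, headers)
```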
adds BROWSER_OBSERVABILITY env var (default: true) exposed via
GET /config. frontend gates initObservability() on this flag.
set BROWSER_OBSERVABILITY=false to disable browser telemetry proxy
and eliminate the proxy load on the backend.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the pool warmup (added in #1025) only opened a single connection.
with pool_size=10, the other 9 connections still hit TCP+SSL setup
on the first burst of requests after deploy. logfire traces show 17
simultaneous connect events taking 1.5-5.5s each during deploys,
causing simple PK lookups to take 12s+ while connections queue up.
fix: warm all pool_size connections concurrently at startup using
asyncio.gather. connections execute SELECT 1 then return to the pool
ready for immediate use. partial failures are logged but don't block
startup.
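a sketch of the concurrent warmup, assuming a SQLAlchemy AsyncEngine
(function and logger names illustrative):

```python
import asyncio
import logging

from sqlalchemy import text
from sqlalchemy.ext.asyncio import AsyncEngine

logger = logging.getLogger(__name__)

async def warm_pool(engine: AsyncEngine, pool_size: int = 10) -> None:
    # check out all connections at once so the pool dials pool_size
    # distinct TCP+SSL sessions instead of reusing the first one
    conns = await asyncio.gather(
        *(engine.connect() for _ in range(pool_size)),
        return_exceptions=True,  # partial failures must not block startup
    )
    live = []
    for conn in conns:
        if isinstance(conn, BaseException):
            logger.warning("pool warmup connection failed: %s", conn)
        else:
            live.append(conn)
    await asyncio.gather(*(c.execute(text("SELECT 1")) for c in live))
    # closing returns each warmed connection to the pool for reuse
    await asyncio.gather(*(c.close() for c in live))
```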
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
auth.isAuthenticated starts false and flips true after /auth/me
resolves. the probe $effect's else branch fired immediately on
mount (before auth resolved), saw feedMode === 'for-you', and
reset it to 'latest' — clobbering the localStorage-persisted
preference on every page load.
fix: gate the else branch on !auth.loading so it only fires after
auth has actually resolved and the user is genuinely not authenticated.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
three bugs from staging:
1. the probe $effect called forYouCache.fetch(), which synchronously
reads this.loading ($state) — creating a reactive dependency. any
cache state change (e.g. from setTags) re-triggered the probe, and
if tag-filtered results were empty, forYouAvailable flipped to false,
hiding the switcher. fix: use a raw fetch(limit=1) with no reactive
cache reads.
2. tag state wasn't shared between feeds. ForYouCache initialized
activeTags as empty, not from localStorage. switching feeds didn't
sync tags. tags set while in for-you mode weren't persisted. fix:
both caches initialize from localStorage.active_tags; toggleFeed
syncs tags from outgoing to incoming cache; onTagsChange persists
regardless of mode.
3. empty state said "no tracks yet" when tags filtered to zero. fix:
show "no tracks match these tags" when active tags are set.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
two issues from staging review:
1. segmented control used opaque --bg-secondary background and --radius-xl,
visually inconsistent with track items and cards which use translucent
--track-bg/--track-border and --radius-md. switched to match.
2. tag filters were hidden when viewing for-you feed. added optional
`tags` query param to GET /for-you/ — filters candidates to tracks
with at least one matching tag (same inclusive semantics as /tracks/).
ForYouCache now supports setTags(), and the homepage shows tag filters
regardless of feed mode.
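the parameter shape, roughly (a sketch; the real candidate pipeline and
its filtering are more involved, and `rank_candidates` is hypothetical):

```python
from fastapi import APIRouter, Query

router = APIRouter(prefix="/for-you")

@router.get("/")
async def for_you(tags: list[str] | None = Query(None), limit: int = 20):
    candidates = await rank_candidates(limit=limit)  # hypothetical ranker
    if tags:
        wanted = set(tags)
        # inclusive semantics, same as /tracks/: keep a track if it has
        # at least one of the requested tags
        candidates = [t for t in candidates if wanted & set(t.tags)]
    return candidates
```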
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
the inline "for you" text next to the heading read as a sentence
("latest tracks for you") rather than a toggle. replaced with a
proper segmented control — two pill buttons with clear active/inactive
states using the same color-mix accent pattern as settings theme buttons.
heading simplified to "tracks" with the switcher sitting alongside.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
adds a feed mode toggle to the homepage's main infinite-scroll section.
authenticated users with engagement history see a clickable toggle
(same style as the top tracks period toggle) to switch between "latest
tracks" and "for you". unauthenticated users or those without enough
engagement data see no toggle — identical to today.
- new ForYouCache state module ($lib/for-you.svelte.ts) mirroring
TracksCache's interface but hitting /for-you/
- feed mode persisted to localStorage
- tag filters hidden when viewing for-you (backend handles hidden tags)
- infinite scroll dispatches to the active cache's fetchMore()
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Remove 8 scripts for migrations that completed months ago:
- copy_r2_buckets.py (relay → audio-prod, Nov 2025)
- migrate_r2_bucket.py (same with DB updates)
- migrate_images_to_new_buckets.py (audio → images buckets, Nov 2025)
- migrate_sensitive_images.py (Jan 2026)
- backfill_image_urls.py (Nov 2025)
- backfill_atproto_records.py (Nov 2025)
- backfill_avatars.py (Dec 2025)
- backfill_duration.py (Dec 2025)
Add migrate_cdn_urls.py for the r2.dev → custom domain URL migration.
Dry-run by default, auto-detects environment from DATABASE_URL,
updates tracks/albums/playlists URL columns.
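the new script's shape, sketched (the r2.dev hosts and the environment
heuristic below are illustrative, not the real values):

```python
import os

# assumed r2.dev hosts per bucket — illustrative, not the real hashes
HOST_MAP = {
    "pub-audio.r2.dev": "audio.plyr.fm",
    "pub-images.r2.dev": "images.plyr.fm",
}

def detect_env(database_url: str) -> str:
    # crude heuristic for the sketch; the real detection may differ
    return "production" if "prod" in database_url else "staging"

def rewrite(url: str) -> str:
    for old, new in HOST_MAP.items():
        url = url.replace(f"https://{old}/", f"https://{new}/")
    return url

def main(apply: bool = False) -> None:
    env = detect_env(os.environ["DATABASE_URL"])
    print(f"{env}: {'APPLYING' if apply else 'dry run'}")
    # for each of tracks / albums / playlists, select rows whose URL
    # columns still point at *.r2.dev, print old -> new, and UPDATE
    # only when --apply is passed (DB plumbing elided in this sketch)
```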
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* feat: add CacheControl headers to R2 uploads, consolidate S3 client config
Set Cache-Control: public, max-age=31536000, immutable on all R2 uploads
(audio, images, thumbnails). Objects are content-hashed so they never
change — this tells Cloudflare's CDN and browsers to cache aggressively.
Also consolidate the S3 client connection config into a _s3_client() helper
method. The same 5-line endpoint/credentials block was repeated 9 times.
Now it's one method, making an S3/R2 swap a one-line change.
Prep for switching from r2.dev URLs (no CDN caching) to custom domains
(audio.plyr.fm, images.plyr.fm) which are already provisioned.
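the consolidation, sketched with boto3 (class and attribute names are
illustrative, not the real module):

```python
import boto3

CACHE_FOREVER = "public, max-age=31536000, immutable"

class R2Storage:
    def __init__(self, endpoint_url: str, access_key_id: str, secret_access_key: str):
        self.endpoint_url = endpoint_url
        self.access_key_id = access_key_id
        self.secret_access_key = secret_access_key

    def _s3_client(self):
        # the endpoint/credentials block that was repeated 9 times,
        # now in one place; an S3/R2 swap is a one-line endpoint change
        return boto3.client(
            "s3",
            endpoint_url=self.endpoint_url,
            aws_access_key_id=self.access_key_id,
            aws_secret_access_key=self.secret_access_key,
        )

    def upload(self, bucket: str, key: str, body: bytes, content_type: str) -> None:
        # keys are content-hashed, so the object at a key never changes:
        # tell the CDN and browsers to cache it for a year, immutably
        self._s3_client().put_object(
            Bucket=bucket, Key=key, Body=body,
            ContentType=content_type, CacheControl=CACHE_FOREVER,
        )
```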
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* docs: update R2 references for custom domain CDN migration
Replace r2.dev URLs with custom domain URLs (audio.plyr.fm,
images.plyr.fm) in public docs, internal docs, and config examples.
Drop "R2" from "R2 CDN" references — the CDN is Cloudflare's edge
cache, not R2 itself.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add fetch_list_item_uris() to _internal/atproto/records/fm_plyr/list.py
— fetches an ATProto list record and returns ordered item URIs. Replaces
5 copy-pasted fetch-then-extract blocks across playlists, albums, and
recommendations.
Add hydrate_tracks_from_uris() to api/lists/hydration.py — loads tracks
by AT-URI, batch-aggregates like/comment counts, resolves liked state,
returns ordered TrackResponses. Collapses the identical ~35-line hydration
block duplicated between get_playlist and get_playlist_by_uri.
playlists.py: 952 → 843 lines. Six unused imports removed as a side
effect (the hydration helper absorbed them).
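the two contracts, sketched (signatures per this commit; the bodies and
helper names are illustrative):

```python
async def fetch_list_item_uris(pds_url: str, list_uri: str) -> list[str]:
    """fetch an fm.plyr list record and return its item AT-URIs in order."""
    record = await get_record(pds_url, list_uri)  # hypothetical fetch helper
    return [item["uri"] for item in record["value"].get("items", [])]

async def hydrate_tracks_from_uris(db, uris: list[str], viewer_did: str | None) -> list:
    """load tracks by AT-URI, batch-aggregate like/comment counts, resolve
    the viewer's liked state, and return TrackResponses in input order."""
    tracks = await load_tracks_by_uris(db, uris)              # hypothetical
    aggs = await batch_aggregate_counts(db, [t.id for t in tracks])
    by_uri = {t.atproto_uri: t for t in tracks}
    return [
        to_track_response(by_uri[u], aggs, viewer_did)
        for u in uris
        if u in by_uri  # skip URIs that no longer resolve to a track
    ]
```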
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* refactor: decompose lists.py and albums.py into subpackages, fix PDS URL healing
Split two monolithic API files into subpackages following the existing
api/tracks/ pattern:
- lists.py (1149 lines) → lists/{router,schemas,reorder,resolver,playlists}.py
- albums.py (995 lines) → albums/{router,schemas,cache,listing,mutations}.py
Also moves PDS URL healing from lazy per-request side effects (copy-pasted
in 5 API endpoints) to the jetstream identity event handler, where it
belongs. Identity events fire on both handle changes and PDS migrations,
so resolving the DID there keeps the cached pds_url warm proactively
instead of discovering staleness at request time.
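the healing move, sketched (resolver and persistence helpers are
illustrative; the `#atproto_pds` service id comes from the atproto DID
document spec):

```python
async def ingest_identity_update(did: str, handle: str | None) -> None:
    doc = await resolve_did_document(did)  # hypothetical resolver wrapper
    pds_url = next(
        (
            svc["serviceEndpoint"]
            for svc in doc.get("service", [])
            if svc.get("id") == "#atproto_pds"
        ),
        None,
    )
    if pds_url:
        # identity events fire on handle changes and PDS migrations alike,
        # so the cached pds_url is refreshed before any request needs it
        await update_artist_identity(did, handle=handle, pds_url=pds_url)
```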
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: correct mock targets for decomposed module paths
- AsyncDidResolver: patch at source (atproto_identity.did.resolver)
since ingest.py uses a deferred import
- get_async_redis_client: update to backend.api.albums.cache
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: hoist deferred imports to top-level in decomposed modules
Move ~15 deferred imports to module-level where they don't risk circular
dependencies. The only remaining deferred import in the new packages is
backend.api.tracks.mutations.delete_track (cross-package API call).
Also keeps the AsyncDidResolver import in ingest.py deferred — it's a
heavy external dependency in a background task module.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix: update mock targets for hoisted imports in album tests
With imports at top-level, mocks must target the importing module's
namespace, not the source module.
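the rule of thumb behind both mock fixes, as a sketch (module paths are
illustrative except where the commits above name them):

```python
from unittest.mock import patch

# a top-level `from backend.redis import get_async_redis_client` in
# albums/cache.py binds the name into cache.py at import time, so the
# patch must target the importing module's namespace:
with patch("backend.api.albums.cache.get_async_redis_client"):
    ...  # album cache code sees the mock

# deferred imports are the inverse: ingest.py imports AsyncDidResolver
# inside the function body, so the name is looked up at call time and
# the patch must target the source module instead:
with patch("atproto_identity.did.resolver.AsyncDidResolver"):
    ...  # ingest code sees the mock on its next call
```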
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Osprey rules engine was never committed (services/osprey/ contained only
__pycache__ artifacts). Remove the stale STATUS.md reference and a
duplicate loq override for u/[handle]/+page.svelte that used a different
glob escape syntax than the rest of the file.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>