bluejay-infra

Author	SHA1	Message	Date
Andrew Stoltz	437f346aee	fc-ttsreader: register ttsreader-modern Deployment + Service Adds the Deployment + Service for the fc-modern-tts container that landed in the previous commit. Same shape as ttsreader-biblical: runAsNonRoot uid 1654, dnsPolicy: None to bypass the iamworkin.lan hijack on Microsoft endpoint lookups, /health probes, modest CPU/mem since edge-tts is network-bound. Service surfaces ttsreader-modern.fc-ttsreader.svc:10403 for the web pod to call when the operator picks a he-IL-* or el-GR-* voice. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 18:39:58 -05:00
Andrew Stoltz	263d06acb9	fc-ttsreader: bump web to v202604251750 (Lane H Slice 1: inline Study view + chapter notes)	2026-04-25 17:54:09 -05:00
Andrew Stoltz	25dbb2967f	fc-ttsreader: bump web to v202604251714 (BuildRenderPlan splits each mood block via SpeechSentenceSegmenter)	2026-04-25 17:18:09 -05:00
Andrew Stoltz	a89a774eaf	fc-ttsreader: deploy eSpeak-NG biblical-tts (Ancient Greek + Hebrew) Adds a third TTS engine alongside Piper (modern English/multi-lang) and Kokoro (high-quality English): a small FastAPI wrapper around eSpeak-NG with built-in support for Ancient Greek (grc), Hebrew (he), and Modern Greek (el). Same shape as fc-speech-align so AiStation talks to all the TTS/alignment services with one HTTP client pattern. biblical-tts/Dockerfile + app.py: - Python 3.12 base + apt-get espeak-ng + libsndfile1 + ffmpeg-free deps. - POST /tts -> WAV audio bytes (audio/wav). - POST /timings -> word-level timings derived from espeak's --pho phoneme duration stream, distributed across whitespace-split words proportional to character count. Accuracy is good enough for chip-level read-along highlighting (~30-80ms per-word jitter). - GET /voices for catalog discovery, GET /health for probes. - Body shape mirrors AlignmentRequest from FlowerCore.Shared.Speech so the .NET BiblicalTtsClient round-trips it cleanly. K8s deployment in fc-ttsreader namespace: - ttsreader-biblical Deployment + Service on port 10402. - localhost/fc-biblical-tts:v1, imagePullPolicy: Never (built on noc1, imported to all 3 RKE2 nodes via ctr). - runAsNonRoot uid 1654 to match the namespace's standard security ctx. - Modest resources (100m/128Mi req, 1000m/512Mi limit) — eSpeak is CPU-cheap. - Probes hit /health which returns the supported language list. Verified live: container started, /health returns ok with grc/el/he, POST /timings on Ἐν ἀρχῇ ἦν ὁ λόγος returned 5 words / 1714ms. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 17:17:38 -05:00
Andrew Stoltz	dc39747f3f	fc-ttsreader: piper memory 1Gi -> 3Gi to stop OOMKill mid-render	2026-04-25 17:10:20 -05:00
Andrew Stoltz	87050e72a9	fc-ttsreader: deploy Kokoro to the cluster (replaces BLUEJAY-WS host pointer) The cluster ttsreader-web was reaching across to BLUEJAY-WS:10401 for Kokoro synthesis, which meant a workstation-down event broke render- pipeline TTS. Add a cluster-native ttsreader-kokoro Deployment and Service inside fc-ttsreader so the cluster owns the engine. - Image: ghcr.io/remsky/kokoro-fastapi-cpu:latest. Model + 67 voices ship inside the image, so no PVC is required. - Port 8880 (the kokoro-fastapi default; the entrypoint hardcodes it). - Resources: 250m/1Gi request, 2000m/3Gi limit. CPU-only inference matches what AiStation runs locally on BLUEJAY-WS. - dnsPolicy: None to bypass CoreDNS's *.iamworkin.lan template hijack on huggingface.co lookups, same shape as ttsreader-align. - Probes hit /v1/audio/voices since the kokoro server doesn't expose /health; that endpoint is cheap (lists configured voice files). ttsreader-web env var TtsReader__Kokoro__BaseUrl flips from the workstation pointer to the cluster service: http://ttsreader-kokoro.fc-ttsreader.svc.cluster.local.:8880. AiStation keeps its local http://localhost:8880 since the workstation operator still wants the audio to render on the local sound device without a network hop. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 16:56:39 -05:00
Andrew Stoltz	e8c5d2afd2	fc-ttsreader: bump web to v202604251544 (Unicode sanitize + continue-on-segment-fail)	2026-04-25 15:50:16 -05:00
Andrew Stoltz	eef492125f	fc-ttsreader: bump web to v202604251534 (SignalR 8MB + DOM-peek + poll loop refresh fix)	2026-04-25 15:39:18 -05:00
Andrew Stoltz	b51ee35bfa	fc-speech-align: v3 — emit FlowerCore.Shared.Speech word contract The /align endpoint was returning Whisper-native word fields (word/startSeconds/endSeconds/confidence), but FlowerCore.Shared.Speech's FasterWhisperAlignmentClient on master deserializes FasterWhisperWord against [JsonPropertyName("text")/("startMs")/("endMs")]. Result: ttsreader-web reported alignment.source="whisper" with words[] present but every entry had Text="" and StartMs=EndMs=0 — visible in the 2026-04-25 hello-world smoke against ttsreader.iamworkin.lan. Match the published Common contract instead of the Python model's native shape: emit text/startMs/endMs (millisecond ints, not float seconds). Confidence stays on the wire as informational; the deployed C# client ignores it but a future fc-align operator UI can surface low-confidence words. Bump tag to v3 and bump the Deployment image accordingly. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 11:52:14 -05:00
Andrew Stoltz	4abc2fa95d	fc-speech-align: add dnsPolicy: None to bypass CoreDNS *.iamworkin.lan template hijack on huggingface.co	2026-04-25 11:12:21 -05:00
Andrew Stoltz	d7628a6945	fc-speech-align: bump to v2 with explicit requests dep (faster-whisper 1.0.3 missing transitive)	2026-04-25 10:55:51 -05:00
Andrew Stoltz	df115e4d1e	fc-ttsreader: ship cluster-native fc-speech-align (faster-whisper) + bump web - New ttsreader-align Deployment + Service + 5Gi PVC under apps/fc-ttsreader/. Wraps SYSTRAN/faster-whisper in a small FastAPI app exposing POST /align (fc-align contract used by Shared.Speech) AND POST /transcribe (audio-in feature consumed by ttsreader-web Lane G). Source: apps/fc-ttsreader/speech-align/ (Dockerfile + app.py + requirements.txt). Built locally (apt-get RUN steps need BLUEJAY-WS, not noc1) and ctr-imported to all 3 RKE2 nodes. - ttsreader-web env: flip Speech__Alignment__Enabled=true and point BaseUrl at http://ttsreader-align.fc-ttsreader.svc.cluster.local.:9200. Add new TtsReader__Transcription__* env triplet pointing at the same service (same /transcribe endpoint). - Bump ttsreader-web image to v202604251046 (carries the TranscriptionController + MCP tool + Quick.razor InputFile UI).	2026-04-25 10:50:45 -05:00
Andrew Stoltz	9df26620b8	fc-ttsreader: disable Whisper, fall back to estimator until backend is reachable The cluster-wide pod cannot reach BLUEJAY-WS speaches on 10.0.56.20:9200 because the rootless+host-net podman setup binds 127.0.0.1 only on the WSL machine; nothing on the LAN-facing interface. The openai-compatible Backend value also relied on a Common change still on feat/shared-indexing rather than master, so the deployed image's Shared.Speech only knows the FC-native /align shape. Disable Speech:Alignment for now. EstimatedAlignmentClient kicks in and keeps /api/v1/voices/preview-with-timings returning word-aligned JSON, just with uniform-distribution timings instead of real Whisper output. Re-enable once: (a) Common's openai-compatible Backend lands on master and a new TtsReader image ships, or (b) we point at a LAN-routable backend (e.g. an aiohttp /align shim, or speaches running on a node that's actually reachable from cluster pods). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 10:28:21 -05:00
Andrew Stoltz	08aa7a5bff	fc-ttsreader: route Whisper alignment at openai-compatible backend The fc-speech-align container on BLUEJAY-WS (port 9200) is the speaches build of faster-whisper-server, which exposes the OpenAI-compatible /v1/audio/transcriptions contract — not the FlowerCore /align contract. FasterWhisperAlignmentClient (FlowerCore.Common a1b3bfc) supports both shapes; tell it explicitly to talk OpenAI-compatible here so requests land on the right endpoint and verbose_json gets adapted into the FC alignment response. Also pin the Model id to one speaches recognizes. Switch back to fc-align once a native /align backend is deployed (or wire a tiny FastAPI shim in front of speaches if we want a stable contract). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 10:25:24 -05:00
Andrew Stoltz	38e20a8b64	fc-ttsreader: bump to v202604251018 (resume banner + scope-ID rebuild fix)	2026-04-25 10:24:39 -05:00
Andrew Stoltz	c945d44b9e	fc-ttsreader: bump to v202604250729 (playback bookmark + breaker fix)	2026-04-25 07:33:38 -05:00
Andrew Stoltz	76ece92cfd	fc-ttsreader: enable real Whisper alignment via fc-speech-align Flips Speech__Alignment__Enabled=true and points BaseUrl at the new BLUEJAY-WS podman quadlet running fc-speech-align (faster- whisper, /align contract). When Lane 1δ's /api/v1/voices/preview-with-timings runs after this lands, the alignment.source field flips from 'estimated' to 'whisper' and the per-word timings come from real audio analysis instead of uniform-spacing estimates. No image rebuild — the Lane 1α DI registration already routes IWhisperAlignmentClient to FasterWhisperAlignmentClient when Speech:Alignment:Enabled is true. Companion firewall rule from FlowerCore.Puppet@bbc02ea + @05504ed (whisper_align_enabled flag on bluejay-ws-linux Hiera) opens port 9200 to RKE2 pod CIDR durably.	2026-04-24 23:37:30 -05:00
Andrew Stoltz	9fb526c7c5	fc-ttsreader: bump to v202604242301readwithtimings Picks up Lane 1δ + 3β: - Lane 3β: GET /reader-embed iframe-friendly host route + public- read CORS for /api/v1/voices and /_content/.../embed/ - Lane 1δ: GET /api/v1/voices/preview-with-timings — pairs synthesized audio with per-word alignment timings (faster- whisper or estimated fallback) so embed bundle / FcReaderOverlay / in-app /voices preview can word-highlight in one round-trip - Latest FlowerCore.UI.Components from Common master: FcReaderOverlay annotation popover (Lane 2γ) + <fc-reader> standalone embed bundle (Phase 3) Built from FlowerCore.TtsReader@06ef815 (master) against FlowerCore.Common@d23d4c3 (master). Image imported on rke2-server / rke2-agent1 / rke2-agent2.	2026-04-24 23:05:41 -05:00
Andrew Stoltz	1d4ad64226	chore(ttsreader): bump to v202604241555backlog (rebuild with correct publish/) Previous v202604241543backlog image accidentally used a stale publish/ directory at the TtsReader repo root (Dockerfile.deploy says COPY publish/ but my ad-hoc publish wrote to artifacts/publish/). Rebuilt with a clean copy from artifacts/publish/ to publish/ first. Confirmed new image has appsettings.json Preview section + the quick-swipe-gestures.js asset baked in. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 15:53:55 -05:00
Andrew Stoltz	774f82c431	chore(ttsreader): bump image to v202604241543backlog Merges three parallel backlog lanes onto longsegfix: - B (hotfix/preview-timeout-2026-04-24): 25s preview-path timeout, 504 on expiry, config-tunable via TtsReader:Preview:TimeoutSeconds. - C (feature/swipe-gestures-2026-04-24): PointerEvent swipe gestures on /quick (prev/next sentence) + CSS fade-hint. - D (feature/bible-defaults-button-2026-04-24): Apply Bible speech defaults button on project detail page with confirm + toast. TtsReader master octopus merge: 730e7fa. Tests 155/155 .NET + 13/13 JS. 28 MCP tools. Built + imported on all 3 RKE2 nodes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 15:47:13 -05:00
Andrew Stoltz	d2cc36ea0e	chore(ttsreader): bump image to v202604241345longsegfix Hotfix for two live render errors: - Kokoro chapter render failed with "count ('-1') must be non-negative" — streaming-WAV chunk-size sentinel (0xFFFFFFFF) read as -1. - Piper render timed out on book-chapter paragraphs with no sentence punctuation — one giant segment exceeded the 2-min timeout. Source fix: FlowerCore.TtsReader@826589b. 153/153 tests green. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 13:58:45 -05:00
Andrew Stoltz	299070e4bf	fc-ttsreader: bump to v202604241332 (XL sprint — sidebar shell + CHAP + offline ZIP + breaker) Merges four XL-sprint lanes to TtsReader master: - FcSidebarLayout swap (MainLayout + NavMenu, deletes 88-line shell CSS) - ID3 CHAP chapter markers in rendered MP3s (Apple Podcasts chapter list) - POST /api/v1/projects/{id}/export/zip offline bundle + MCP tool - Per-engine TtsCircuitBreaker with Kokoro→Piper fallback + /voices engine-health panel + GET /api/v1/voices/engines/status 153/153 tests green on Linux dotnet build. Image imported to rke2-server / rke2-agent1 / rke2-agent2. No app env changes beyond the image tag.	2026-04-24 13:37:33 -05:00
Andrew Stoltz	a37fc83584	ttsreader: bump to v202604240117 (fix blazor-error-ui scope drift)	2026-04-24 01:21:38 -05:00
Andrew Stoltz	5c0c21790e	ttsreader: bump image to v202604240101 (Kokoro voices merge fix)	2026-04-24 01:05:55 -05:00
Andrew Stoltz	bb39a0c1fd	ttsreader: bump image to v202604240053 + Kokoro env vars Adds TtsReader__Kokoro__Enabled=true + BaseUrl=http://10.0.56.20:10401 + TimeoutSeconds=120 so the pod routes kokoro-tagged voices to the Kokoro-FastAPI backend running on BLUEJAY-WS. Multi-engine router falls through to Piper for piper-tagged and untagged voices. Requires nftables on BLUEJAY-WS to permit tcp/10401 from 10.0.56/23 and 10.42.0.0/16. Applied to the live ruleset — Puppet Hiera path is the durable fix (kokoro_server_enabled under profile::security::firewall). Tests 107 → 114 (+7 MultiEngineSpeechSynthesizerTests).	2026-04-24 00:57:41 -05:00
Andrew Stoltz	297a2a9bbc	ttsreader: bump image to v202604240023 for P3 one-click book render New surfaces: POST /api/v1/bible/projects (one-click whole-book render), GET /api/v1/bible/books, GET /api/v1/bible/books/{book}/preview, MCP tools render_tts_reader_bible_book + list_tts_reader_bible_books, Dashboard "Render a Bible book" card. 107/107 tests, +7 from previous.	2026-04-24 00:25:48 -05:00
Andrew Stoltz	fc0b67f670	ttsreader: bump image to v202604232334 for iTunes RSS + ID3 tags Pulls in FlowerCore.TtsReader@9e2497f: P2.3 iTunes-namespace podcast feed (author, summary, category, cover art, episode numbering, duration, atom:self link, serial channel type for Bible projects) and P2.4 ID3v2 tags on MP3 export + Vorbis comments on OGG (title, artist with Piper voice humanized, album, track N/M, genre defaulting to Religion & Spirituality for Bible or Audiobook for text sources, date). Phones and podcast apps now show proper track info instead of "Unknown - Unknown". Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 23:37:53 -05:00
Andrew Stoltz	6c1375b21a	ttsreader: bump image to v202604232310 for Media Session API + Bible lexicon Pulls in FlowerCore.TtsReader@63e6b62: P1.1 Media Session API wiring in fc-media-session.js + quick-player.js + rendered-chapter-player.js, and P1.2 biblical-name pronunciation lexicon auto-seed on Bible-source project creation plus apply-bible-defaults endpoint + MCP tool for existing projects. Tests 81 -> 97 all green. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 23:15:53 -05:00
Andrew Stoltz	9272abc225	Use absolute cluster DNS for ttsreader piper	2026-04-23 20:49:40 -05:00
Andrew Stoltz	3999634b06	Seed ttsreader piper voices before startup	2026-04-23 17:18:57 -05:00
Andrew Stoltz	f9593e494a	Allow ttsreader piper voice downloads	2026-04-23 16:50:21 -05:00
Andrew Stoltz	a76eeb5c39	Add dedicated selectable piper for ttsreader	2026-04-23 16:37:03 -05:00
Andrew Stoltz	686dbacc66	Bump TTS Reader image for render follow-along	2026-04-23 15:54:07 -05:00
Andrew Stoltz	4da60820c6	Deploy TTS Reader queue presentation fix	2026-04-23 15:13:21 -05:00
Andrew Stoltz	1cc4324cfb	Deploy TTS Reader import and preview fixes	2026-04-23 14:28:08 -05:00
Andrew Stoltz	ffef5c9126	Deploy TTS Reader annotation timeout fix	2026-04-23 13:06:17 -05:00
Andrew Stoltz	634e90a9ee	Deploy TTS Reader quick hardening release	2026-04-23 12:47:45 -05:00
Andrew Stoltz	1c5caf3f40	Deploy TTS Reader v20260423114016	2026-04-23 11:57:39 -05:00
Andrew Stoltz	a3aa84bdae	fc-ttsreader: bump image to v20260422201135 (Quick Read highlight no-reflow fix) Quick Read's active-sentence highlight was changing font-weight from regular to semibold, which shifted glyph widths and reflowed the whole paragraph mid-playback. New image drops the weight change and uses a 1px box-shadow ring instead for a stable layout. Built from FlowerCore.TtsReader@e77d69d.	2026-04-22 20:20:30 -05:00
Andrew Stoltz	01cb9a557f	fc-ttsreader: deploy fixed reader image	2026-04-22 16:13:15 -05:00
Andrew Stoltz	0fa46ad53b	fc-ttsreader: deploy reader UI split image	2026-04-22 15:57:58 -05:00
Andrew Stoltz	3cf675b8c3	ttsreader: wire operator secrets through 1password	2026-04-17 10:05:24 -05:00
Andrew Stoltz	2a9f2e4540	Improve TTS Reader workspace card layout	2026-04-17 03:57:23 -05:00
Andrew Stoltz	b15a35a258	Fix TTS Reader character layout	2026-04-17 03:48:03 -05:00
Andrew Stoltz	3f4985ee13	Deploy TTS Reader queue feedback fix	2026-04-17 03:34:28 -05:00
Andrew Stoltz	e535a8d34b	Deploy TTS Reader voice preview update	2026-04-17 02:13:09 -05:00
Andrew Stoltz	6ddbd2cae5	Point TTS Reader at Pi Ollama defaults	2026-04-17 00:53:45 -05:00
Andrew Stoltz	e9608651f7	Bump TTS Reader image to v20260417001119	2026-04-17 00:33:29 -05:00
Andrew Stoltz	abdb7a806e	Bump TTS Reader image to v20260416234817	2026-04-16 23:53:42 -05:00
Andrew Stoltz	7afb5043c4	Fix ttsreader forwarded header handling	2026-04-16 21:55:46 -05:00

1 2

54 Commits