Incidents | TWF
Incidents reported on status page for TWF
https://status-page.thewilsons.space/
Language: en

memories.thewilsons.space recovered
Mon, 23 Mar 2026 10:29:25 +0000
https://status-page.thewilsons.space/#7a9581c9cf21cfa215cb8a0cc53fa930b1ca0d2efc3c734e1f54937b0c627188

memories.thewilsons.space went down
Mon, 23 Mar 2026 10:11:41 +0000
https://status-page.thewilsons.space/#7a9581c9cf21cfa215cb8a0cc53fa930b1ca0d2efc3c734e1f54937b0c627188

Maintenance - Service disruption
Mon, 23 Feb 2026 05:59:28 -0000
https://status-page.thewilsons.space/incident/830709#c3b68f55e7cc69eb28bb0ed62d1aabecab713093147ae20fa4a49d982232555e
Maintenance completed

Maintenance - Service disruption
Sun, 22 Feb 2026 06:01:28 -0000
https://status-page.thewilsons.space/incident/830709#c4afdbde9e46da24187e6e572ccb5c15fc4b38a6c44c81191a72230b1d5289b0
This is a major overhaul of the R730xd server hardware and a complete platform migration for all services.

Hardware Upgrades:
- Installing an 8x 480GB SSD RAID 10 array on an additional physical RAID card for VM storage
- Adding a second GPU (enables GPU workload high availability)
- Installing 10GbE SFP+ networking (10x faster uplink)
- Fixing CPU thermal issues and automatic fan control
- Migrating all equipment into an enclosed rack (physical relocation)

Platform Migration:
- Moving all services from Docker VMs to a 3-node Kubernetes (k3s) cluster. This part may take months, not weeks.
- Deploying Longhorn distributed block storage with high availability
- GPU nodes will use dedicated NVMe storage for faster caching
- Consolidating from 5 separate Docker host VMs to a unified k3s platform

Why the disruptions:
- Initial downtime: Hardware installation (CPUs, RAID card, GPUs, thermal paste, cleaning)
- Rolling disruptions: Migrating VMs to new storage (zero-downtime Storage vMotion, but services may be slower)
- Service redeployment: Moving each service from Docker Compose to Kubernetes (brief outage per service)
- Physical relocation: Moving server and network gear into the new rack

End result: Faster performance, better reliability, GPU failover for transcoding, and a much cleaner infrastructure setup.