All notable changes to the Madagascar cluster will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
ExecStop syntax in tb-enlist@.service caused failed unit teardown on Thunderbolt device removal [ISSUE-2026-001]tb-enlist@.service now stays active until network.target stops, so NFS storages routed over thunderbridge can unmount cleanly before Thunderbolt ports are detached; this is the Thunderbolt-side fix for the cluster-wide maintenance shutdown incident [ISSUE-2026-002]tb-recover.service) and periodic timer (tb-recover.timer) for flap resilience [ISSUE-2026-001]tb-recover.sh now escalates recovery by restarting bolt.service when rescan alone does not recreate thunderbolt net devices [ISSUE-2026-001]tb-recover.sh now includes cooldowned Thunderbolt NHI PCI remove+rescan fallback (soft replug path) for reboot cases where netdev is missing [ISSUE-2026-001]tb-recover.sh now retries the Thunderbolt NHI reset within the same recovery run when a peer xdomain host reappears without its *.0 network service [ISSUE-2026-001]tb-recover.sh now probes the expected peer behind each Thunderbolt port and cycles the affected interface with ifdown/ifup when a port stays attached but logically detached [ISSUE-2026-001]