|
Bogdan Timofte
authored
3 months ago
|
1
|
# Changelog
|
|
|
2
|
|
|
|
3
|
All notable changes to the Madagascar cluster will be documented in this file.
|
|
|
4
|
|
|
|
5
|
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
|
|
|
6
|
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
|
|
|
7
|
|
|
|
8
|
## [Unreleased]
|
|
|
9
|
|
|
|
10
|
### Fixed
|
|
|
11
|
- Invalid `ExecStop` syntax in `tb-enlist@.service` caused failed unit teardown on Thunderbolt device removal [ISSUE-2026-001]
|
|
|
12
|
- Tapia-Baobab Thunderbolt recovery path hardened after reboot-time disconnect/reconnect events [ISSUE-2026-001]
|
|
Bogdan Timofte
authored
3 months ago
|
13
|
- `tb-enlist@.service` now stays active until `network.target` stops, so NFS storages routed over `thunderbridge` can unmount cleanly before Thunderbolt ports are detached; this is the Thunderbolt-side fix for the cluster-wide maintenance shutdown incident [ISSUE-2026-002]
|
|
Bogdan Timofte
authored
3 months ago
|
14
|
|
|
|
15
|
### Added
|
|
|
16
|
- Automatic Thunderbolt recovery worker (`tb-recover.service`) and periodic timer (`tb-recover.timer`) for flap resilience [ISSUE-2026-001]
|
|
|
17
|
|
|
|
18
|
### Changed
|
|
|
19
|
- `tb-recover.sh` now escalates recovery by restarting `bolt.service` when rescan alone does not recreate thunderbolt net devices [ISSUE-2026-001]
|
|
|
20
|
- `tb-recover.sh` now includes cooldowned Thunderbolt NHI PCI `remove+rescan` fallback (soft replug path) for reboot cases where netdev is missing [ISSUE-2026-001]
|
|
|
21
|
- `tb-recover.sh` now retries the Thunderbolt NHI reset within the same recovery run when a peer xdomain host reappears without its `*.0` network service [ISSUE-2026-001]
|
|
|
22
|
- `tb-recover.sh` now probes the expected peer behind each Thunderbolt port and cycles the affected interface with `ifdown/ifup` when a port stays attached but logically detached [ISSUE-2026-001]
|
|
|
23
|
- Added standardized shared-runtime install/uninstall flow that manages scripts, unit files, and udev rules without rewriting host network configuration
|
|
|
24
|
|
|
|
25
|
## [2025-10-30]
|
|
|
26
|
|
|
|
27
|
### Fixed
|
|
|
28
|
- Thunderbolt interfaces not in bridge after MTU fix deployment [ISSUE-2025-002]
|
|
|
29
|
- MTU reset to 1500 after systemctl restart networking [ISSUE-2025-001]
|
|
|
30
|
|
|
|
31
|
### Added
|
|
|
32
|
- Issue tracking system with structured templates
|
|
|
33
|
- Defense-in-depth for thunderbolt network configuration (udev + ifupdown2 hooks)
|
|
|
34
|
|
|
|
35
|
### Changed
|
|
|
36
|
- Enhanced udev rules for thunderbolt device handling
|
|
|
37
|
- Updated network interfaces.d with post-up hooks for MTU and bridge membership
|
|
|
38
|
|
|
|
39
|
## [2025-10-29]
|
|
|
40
|
|
|
|
41
|
### Added
|
|
|
42
|
- Initial issue tracking setup
|
|
|
43
|
- COPILOT_BACKUPS_INSTRUCTIONS.md for backup procedures
|
|
|
44
|
- CHANGELOG.md for change documentation</content>
|