So I found some log entries showing that the network PCIe device was being dropped on Ubuntu 22.04:
igc (...) eno1: PCIe link lost, device now detached
After looking it up, I concluded that this particular chip overheats, which causes the kernel to drop the device.
Other than adding a heatsink, the solution is to change the OS configuration so that the chip runs slower and doesn’t overheat:
- Adding these two kernel parameters: pcie_port_pm=off pcie_aspm.policy=performance
- Disabling a bunch of TCP offload features (with these arguments, RX and TX checksumming):
sudo ethtool --offload eno1 rx off tx off
I personally find this option a bit scary, so I ended up reverting it.
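
For the first option, here is a minimal sketch of how the two kernel parameters could be added. It rests on assumptions that are not part of the notes above: that the machine boots through GRUB, and that /etc/default/grub defines GRUB_CMDLINE_LINUX_DEFAULT on a single double-quoted line. The helper name add_kernel_params is hypothetical, and it takes the file path as an argument so it can be tried on a copy first.

```shell
#!/bin/sh
# Sketch only: append kernel parameters to GRUB_CMDLINE_LINUX_DEFAULT.
# Assumes GNU sed (as shipped with Ubuntu) and a single-line,
# double-quoted GRUB_CMDLINE_LINUX_DEFAULT="..." entry in the file.
# add_kernel_params is a hypothetical helper name, not an existing tool.
add_kernel_params() {
    file="$1"
    shift
    for param in "$@"; do
        # Skip parameters that are already on the line, so that
        # running the function twice does not duplicate them.
        if ! grep -q "^GRUB_CMDLINE_LINUX_DEFAULT=.*$param" "$file"; then
            # Insert the parameter just before the closing quote.
            sed -i "s|^\(GRUB_CMDLINE_LINUX_DEFAULT=\"[^\"]*\)\"|\1 $param\"|" "$file"
        fi
    done
}

# Example: try it on a copy before touching the real file.
#   cp /etc/default/grub /tmp/grub.test
#   add_kernel_params /tmp/grub.test pcie_port_pm=off pcie_aspm.policy=performance
#   cat /tmp/grub.test
```

After changing the real file (as root), running sudo update-grub and rebooting should apply the parameters, and cat /proc/cmdline shows what the running kernel actually booted with. For the second option, ethtool --show-offload eno1 lists the current offload state, and sudo ethtool --offload eno1 rx on tx on turns the two checksum offloads back on. As far as I know, ethtool changes made this way are runtime settings and do not persist across a reboot unless something reapplies them.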