I have spent many hours today investigating this issue. I have a ClearLinux server on a mini PC. This PC has a Realtek rtl8168h Ethernet Interface.
Ever since what I assume was the last automatic system update, I cannot run anything that puts a medium to high load on the network interface. CPU and local processing are all fine. No changes have been made apart from the automatic ones. I can do very minor network tasks, but the second I put load on it, such as downloading a bundle or running a speed test, the network connection becomes unusable.
- Testing with Ubuntu live USB does not have the issue.
- Using a USB C Ethernet adapter (same cable and port) does not have the issue.
- Using the live ClearLinux server USB bootable has the exact same issue.
There are no related logs that I could find in the journal.
firmware-version: rtl8168h-2_0.0.2 02/26/15
lsmod | grep r8169
r8169 135168 0
mdio_devres 12288 1 r8169
libphy 225280 3 r8169,mdio_devres,realtek
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
is the device info.
Running a speed test, or any container that generates moderate to high network traffic, causes the vast majority of pings to and from the device to drop (sometimes a single ping takes 4-5 seconds) for up to 5 minutes, depending on how long the attempt runs. It is 100% repeatable. I have confirmed everything else on the network is fine. Pings to the gateway are just fine at the same time. SSH goes down of course, so I am having to use the console.
Perhaps it is an issue with the latest stable 6.6.9-1394 Linux kernel? Searching for this issue turns up nothing at all.
For reference the Ubuntu live USB is running:
Firmware version rtl8168h-2_0.0.2 02/26/15 (same)
lsmod | grep r8169
r8169 114688 0
I have followed the instructions here and tried various other repairs, but no luck: clear-linux-documentation/source/guides/maintenance/fix-broken-install.rst at 160e230e7acfee7cd08e26b86611ab0d7c66c0f0 (clearlinux/clear-linux-documentation on GitHub)
I was looking into rolling back the kernel version. This is the furthest back it seems I can go. Is this safe to do?
- org.clearlinux.native.6.6.9-1394 (current version running)
Then using clr-boot-manager set-kernel
Any help would be much appreciated.
It turns out this was an issue introduced by a change. The fix is here:
sudo sh -c 'echo 0 > /proc/sys/net/core/busy_poll'
sudo sh -c 'echo 0 > /proc/sys/net/core/busy_read'
It doesn’t persist across a reboot, so I’m trying to get that sorted now.
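One way the two settings might be made persistent is a sysctl drop-in file, which systemd-based systems normally read at boot. This is only a sketch and I have not confirmed it on this ClearLinux install: the function name write_busy_poll_conf and the file name 99-busy-poll.conf are my own choices, and it assumes /etc/sysctl.d is honoured here.

```shell
#!/bin/sh
# Sketch: write a sysctl drop-in that sets both busy-poll values to 0.
# The function name and the file name 99-busy-poll.conf are illustrative
# assumptions, not something taken from ClearLinux documentation.
write_busy_poll_conf() {
    conf_dir="$1"    # target directory, e.g. /etc/sysctl.d
    mkdir -p "$conf_dir" || return 1
    # These keys map to /proc/sys/net/core/busy_poll and busy_read.
    printf '%s\n' \
        'net.core.busy_poll = 0' \
        'net.core.busy_read = 0' > "$conf_dir/99-busy-poll.conf"
}
```

To use it, run `write_busy_poll_conf /etc/sysctl.d` as root, then `sysctl --system` to apply the file without rebooting. After the next reboot, `sysctl net.core.busy_poll net.core.busy_read` should show whether the values stuck.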