↩ Accueil

Vue lecture

Adjusting One Line Of Linux Code Yields 5x Wakeup Latency Reduction For Modern Xeon CPUs

A new patch posted to the Linux kernel mailing list aims to address the high wake-up latency experienced on modern Intel Xeon server platforms. With Sapphire Rapids and newer, "excessive" wakeup latencies with the Linux menu governor and NOHZ_FULL configuration can negatively impair Xeon CPUs for latency-sensitive workloads but a 16 line patch aims to better improve the situation. That is, changing one line of actual code and the rest being code comments...
  •  

New Linux Patch Improved NVMe Performance +15% With CPU Cluster-Aware Handling

Intel Linux engineers have been working on enhancing the NVMe storage performance with today's high core count processors. Due to situations where multiple CPUs could end up sharing the same NVMe IRQ(s), performance penalties can arise if the IRQ affinity and the CPU's cluster do not align. There is a pending patch to address this situation. A 15% performance improvement was reported with the pending patch...
  •  

Linux 6.19 ATA Fixes Address Power Management Regression For The Past Year

It's typically rare these days for the ATA subsystem updates in the Linux kernel to contain anything really noteworthy. But today some important fixes were merged for the ATA code to deal with a reported power management regression affecting the past number of Linux kernel releases over the last year. ATAPI devices with dummy ports weren't hitting their low-power state and in turn preventing the CPU from reaching low-power C-states but thankfully that is now resolved with this code...
  •  

AMD Making It Easier To Install vLLM For ROCm

Deploying vLLM for LLM inference and serving on NVIDIA hardware can be as easy as pip3 install vllm. Beautifully simple just as many of the AI/LLM Python libraries can deploy straight-away and typically "just work" on NVIDIA. Running vLLM atop AMD Radeon/Instinct hardware though has traditionally meant either compiling vLLM from source yourself or AMD's recommended approach of using Docker containers that contain pre-built versions of vLLM. Finally there is now a blessed Python wheel for making it easier to install vLLM without Docker and leveraging ROCm...
  •  

LLVM Adopts "Human In The Loop" Policy For AI/Tool-Assisted Contributions

Following recent discussions over AI contributions to the LLVM open-source compiler project, they have come to an agreement on allowing AI/tool-assisted contributions but that there must be a human involved that is first looking over the code before opening any pull request and similar. Strictly AI-driven contributions without any human vetting will not be permitted...
  •  

DragonFlyBSD Now Allows Optional AMD GCN 1.1 Support In AMDGPU Driver

DragonFlyBSD's AMDGPU kernel graphics driver continues to be a port of the AMDGPU Linux kernel driver. Their latest porting effort for AMD graphics on DragonFlyBSD is now enabling optional support for the GCN 1.1 "Sea Islands (CIK) graphics processors on this modern alternative to the prior Radeon kernel driver...
  •  

New Patches From Valve Bring AMDGPU Power Management Improvements For Old GCN 1.0 GPUs

Last year Valve contractor Timur Kristóf managed to improve the AMDGPU driver enough for old GCN 1.0 Southern Islands and GCN 1.1 Sea Islands GPUs that with Linux 6.19 AMDGPU is now the default for those GPUs with better performance, RADV Vulkan out-of-the-box, and other benefits. He isn't done though improving the old GCN 1.0/1.1 era GPU support on this modern AMDGPU kernel driver - a new patch series posted today brings some power management fixes...
  •  

Mozilla Now Providing RPM Packages For Firefox Nightly Builds

In late 2023 Mozilla began providing Debian packages of Firefox Nightly builds complete with an APT repository. Those on Debian/Ubuntu distributions have a much easier path for enjoying Firefox Nightly since then and now Mozilla engineers are providing similar RPM builds of Firefox nightly too...
  •  

How NVIDIA GB10 Performance With the Dell Pro Max GB10 Compares To The GH200

Earlier this month we looked at the Dell Pro Max GB10 performance up against AMD's Ryzen AI Max+ "Strix Halo" with the superior performance for the green team for performance and power efficiency. For those wondering how the Dell Pro Max GB10 performance comes up for the much talked about NVIDIA GH200, here are some comparison benchmarks.
  •  

New Patches Provide HDMI VRR & Auto Low Latency Mode Gaming Features For AMD Linux GPU Driver

Support for newer HDMI features in the open-source AMD Linux graphics driver have been limited due to being blocked by the HDMI Forum. There are though some new HDMI gaming features being enabled via new AMDGPU kernel driver patches that are coming outside of AMD and based on public knowledge and/or "trying things out until they work/break" for functionality like HDMI Variable Refresh Rate (VRR) and Auto Low Latency Mode...
  •  

RADV Vulkan Driver Now Implements HPLOC For Even Faster Ray-Tracing Performance

There have been a number of nice RADV driver Vulkan ray-tracing performance optimizations for Mesa in recent times... Here is yet another merge request now merged for Mesa 26.0 and helping deliver some nice performance uplift for ray-traced games on Linux. And, yes, this is yet another Valve contribution to this open-source AMD Radeon Linux graphics driver...
  •  
❌