refs/heads/sched/test - pub/scm/linux/kernel/git/peterz/queue

commit: bbeb9f7451783fa0cc68aba85fb5eaaa4c34adb0
[log]
author: Peter Zijlstra <peterz@infradead.org>
Fri May 01 12:40:06 2026 +0200
committer: Peter Zijlstra <peterz@infradead.org>
Fri May 01 16:52:41 2026 +0200
tree: 2147660bb58256a084201b47b15d0db356858277
parent: 8088b65b8e90e89601a942285332da08c34451f2 [diff]

sched/fair: Fix overflow in vruntime_eligible()

On Wed, Apr 29, 2026 at 10:33:48AM +0530, K Prateek Nayak wrote:

Construct a worst case:

So if you have this cgroup crap on, then you can have an entity of
weight 2, and vlag should then be bounded by: (slice+TICK_NSEC) *
NICE_0_LOAD, which is around 44 bits as per the comment on entity_key().

The other end is 100*NICE_0_LOAD, so lets wake that, then you get:

{key, weight}[] := {
  puny: { (slice + TICK_NSEC) * NICE_0_LOAD, 2               },
  max:  { 0,                                 100*NICE_0_LOAD },
}

The avg_vruntime() would end up being very close to 0 (which is
zero_vruntime), so no real help making that more accurate.

vruntime_eligible(puny) ends up with:

 avg    = 2 * puny.key (+ 0)
 weight = 2 + 100 * NICE_0_LOAD

 avg >= puny.key * weight

And that is: (slice + TICK_NSEC) * NICE_0_LOAD * NICE_0_LOAD * 100

--

Anyway, I had a poke around with godbolt, and the below seems to
generate the best code for things like x86_64 and arm64.

Specifically, the __builtin_mul_overflow() already has to compute the
128 bit product anyway for most architectures, so using that directly
then leads to saner asm and easier to understand code.

AFAICT HPPA64 is the only 64bit architecture that doesn't implement
__int128 and will thus be demoted to doing what we do on 32bit.

Now all I need to do is write a coherent Changelog to go with it ...
*sigh*

Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Link: https://patch.msgid.link/20260501104006.GA3102624@noisy.programming.kicks-ass.net

2 files changed

tree: 2147660bb58256a084201b47b15d0db356858277