x86/pti: don't waste 512-64 words of stack

This is only important because the slower stack trampoline is done in
C as opposed to rep movsb which is faster as also guarantees it can
fit in 64 words unlike the C version.
1 file changed