cve/published/2024/CVE-2024-26687.mbox - pub/scm/linux/security/vulns - Git at Google

 From bippy-5f407fcff5a0 Mon Sep 17 00:00:00 2001
 From: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
 To: <linux-cve-announce@vger.kernel.org>
 Reply-to: <cve@kernel.org>, <linux-kernel@vger.kernel.org>
 Subject: CVE-2024-26687: xen/events: close evtchn after mapping cleanup

 Description
 ===========

 In the Linux kernel, the following vulnerability has been resolved:

 xen/events: close evtchn after mapping cleanup

 shutdown_pirq and startup_pirq are not taking the
 irq_mapping_update_lock because they can't due to lock inversion. Both
 are called with the irq_desc->lock being taking. The lock order,
 however, is first irq_mapping_update_lock and then irq_desc->lock.

 This opens multiple races:
 - shutdown_pirq can be interrupted by a function that allocates an event
   channel:

   CPU0                        CPU1
   shutdown_pirq {
     xen_evtchn_close(e)
                               __startup_pirq {
                                 EVTCHNOP_bind_pirq
                                   -> returns just freed evtchn e
                                 set_evtchn_to_irq(e, irq)
                               }
     xen_irq_info_cleanup() {
       set_evtchn_to_irq(e, -1)
     }
   }

   Assume here event channel e refers here to the same event channel
   number.
   After this race the evtchn_to_irq mapping for e is invalid (-1).

 - __startup_pirq races with __unbind_from_irq in a similar way. Because
   __startup_pirq doesn't take irq_mapping_update_lock it can grab the
   evtchn that __unbind_from_irq is currently freeing and cleaning up. In
   this case even though the event channel is allocated, its mapping can
   be unset in evtchn_to_irq.

 The fix is to first cleanup the mappings and then close the event
 channel. In this way, when an event channel gets allocated it's
 potential previous evtchn_to_irq mappings are guaranteed to be unset already.
 This is also the reverse order of the allocation where first the event
 channel is allocated and then the mappings are setup.

 On a 5.10 kernel prior to commit 3fcdaf3d7634 ("xen/events: modify internal
 [un]bind interfaces"), we hit a BUG like the following during probing of NVMe
 devices. The issue is that during nvme_setup_io_queues, pci_free_irq
 is called for every device which results in a call to shutdown_pirq.
 With many nvme devices it's therefore likely to hit this race during
 boot because there will be multiple calls to shutdown_pirq and
 startup_pirq are running potentially in parallel.

   ------------[ cut here ]------------
   blkfront: xvda: barrier or flush: disabled; persistent grants: enabled; indirect descriptors: enabled; bounce buffer: enabled
   kernel BUG at drivers/xen/events/events_base.c:499!
   invalid opcode: 0000 [#1] SMP PTI
   CPU: 44 PID: 375 Comm: kworker/u257:23 Not tainted 5.10.201-191.748.amzn2.x86_64 #1
   Hardware name: Xen HVM domU, BIOS 4.11.amazon 08/24/2006
   Workqueue: nvme-reset-wq nvme_reset_work
   RIP: 0010:bind_evtchn_to_cpu+0xdf/0xf0
   Code: 5d 41 5e c3 cc cc cc cc 44 89 f7 e8 2b 55 ad ff 49 89 c5 48 85 c0 0f 84 64 ff ff ff 4c 8b 68 30 41 83 fe ff 0f 85 60 ff ff ff <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 0f 1f 44 00 00
   RSP: 0000:ffffc9000d533b08 EFLAGS: 00010046
   RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000006
   RDX: 0000000000000028 RSI: 00000000ffffffff RDI: 00000000ffffffff
   RBP: ffff888107419680 R08: 0000000000000000 R09: ffffffff82d72b00
   R10: 0000000000000000 R11: 0000000000000000 R12: 00000000000001ed
   R13: 0000000000000000 R14: 00000000ffffffff R15: 0000000000000002
   FS:  0000000000000000(0000) GS:ffff88bc8b500000(0000) knlGS:0000000000000000
   CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
   CR2: 0000000000000000 CR3: 0000000002610001 CR4: 00000000001706e0
   DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
   DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
   Call Trace:
    ? show_trace_log_lvl+0x1c1/0x2d9
    ? show_trace_log_lvl+0x1c1/0x2d9
    ? set_affinity_irq+0xdc/0x1c0
    ? __die_body.cold+0x8/0xd
    ? die+0x2b/0x50
    ? do_trap+0x90/0x110
    ? bind_evtchn_to_cpu+0xdf/0xf0
    ? do_error_trap+0x65/0x80
    ? bind_evtchn_to_cpu+0xdf/0xf0
    ? exc_invalid_op+0x4e/0x70
    ? bind_evtchn_to_cpu+0xdf/0xf0
    ? asm_exc_invalid_op+0x12/0x20
    ? bind_evtchn_to_cpu+0xdf/0xf0
    ? bind_evtchn_to_cpu+0xc5/0xf0
    set_affinity_irq+0xdc/0x1c0
    irq_do_set_affinity+0x1d7/0x1f0
    irq_setup_affinity+0xd6/0x1a0
    irq_startup+0x8a/0xf0
    __setup_irq+0x639/0x6d0
    ? nvme_suspend+0x150/0x150
    request_threaded_irq+0x10c/0x180
    ? nvme_suspend+0x150/0x150
    pci_request_irq+0xa8/0xf0
    ? __blk_mq_free_request+0x74/0xa0
    queue_request_irq+0x6f/0x80
    nvme_create_queue+0x1af/0x200
    nvme_create_io_queues+0xbd/0xf0
    nvme_setup_io_queues+0x246/0x320
    ? nvme_irq_check+0x30/0x30
    nvme_reset_work+0x1c8/0x400
    process_one_work+0x1b0/0x350
    worker_thread+0x49/0x310
    ? process_one_work+0x350/0x350
    kthread+0x11b/0x140
    ? __kthread_bind_mask+0x60/0x60
    ret_from_fork+0x22/0x30
   Modules linked in:
   ---[ end trace a11715de1eee1873 ]---

 The Linux kernel CVE team has assigned CVE-2024-26687 to this issue.


 Affected and fixed versions
 ===========================

 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 5.4.274 with commit 9470f5b2503cae994098dea9682aee15b313fa44
 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 5.10.215 with commit 0fc88aeb2e32b76db3fe6a624b8333dbe621b8fd
 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 5.15.154 with commit ea592baf9e41779fe9a0424c03dd2f324feca3b3
 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.1.81 with commit 585a344af6bcac222608a158fc2830ff02712af5
 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.6.19 with commit 20980195ec8d2e41653800c45c8c367fa1b1f2b4
 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.7.6 with commit 9be71aa12afa91dfe457b3fb4a444c42b1ee036b
 	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.8 with commit fa765c4b4aed2d64266b694520ecb025c862c5a9

 Please see https://www.kernel.org for a full list of currently supported
 kernel versions by the kernel community.

 Unaffected versions might change over time as fixes are backported to
 older supported kernel versions.  The official CVE entry at
 	https://cve.org/CVERecord/?id=CVE-2024-26687
 will be updated if fixes are backported, please check that for the most
 up to date information about this issue.


 Affected files
 ==============

 The file(s) affected by this issue are:
 	drivers/xen/events/events_base.c


 Mitigation
 ==========

 The Linux kernel CVE team recommends that you update to the latest
 stable kernel version for this, and many other bugfixes.  Individual
 changes are never tested alone, but rather are part of a larger kernel
 release.  Cherry-picking individual commits is not recommended or
 supported by the Linux kernel community at all.  If however, updating to
 the latest release is impossible, the individual changes to resolve this
 issue can be found at these commits:
 	https://git.kernel.org/stable/c/9470f5b2503cae994098dea9682aee15b313fa44
 	https://git.kernel.org/stable/c/0fc88aeb2e32b76db3fe6a624b8333dbe621b8fd
 	https://git.kernel.org/stable/c/ea592baf9e41779fe9a0424c03dd2f324feca3b3
 	https://git.kernel.org/stable/c/585a344af6bcac222608a158fc2830ff02712af5
 	https://git.kernel.org/stable/c/20980195ec8d2e41653800c45c8c367fa1b1f2b4
 	https://git.kernel.org/stable/c/9be71aa12afa91dfe457b3fb4a444c42b1ee036b
 	https://git.kernel.org/stable/c/fa765c4b4aed2d64266b694520ecb025c862c5a9
	From bippy-5f407fcff5a0 Mon Sep 17 00:00:00 2001
	From: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
	To: <linux-cve-announce@vger.kernel.org>
	Reply-to: <cve@kernel.org>, <linux-kernel@vger.kernel.org>
	Subject: CVE-2024-26687: xen/events: close evtchn after mapping cleanup

	Description
	===========

	In the Linux kernel, the following vulnerability has been resolved:

	xen/events: close evtchn after mapping cleanup

	shutdown_pirq and startup_pirq are not taking the
	irq_mapping_update_lock because they can't due to lock inversion. Both
	are called with the irq_desc->lock being taking. The lock order,
	however, is first irq_mapping_update_lock and then irq_desc->lock.

	This opens multiple races:
	- shutdown_pirq can be interrupted by a function that allocates an event
	channel:

	CPU0 CPU1
	shutdown_pirq {
	xen_evtchn_close(e)
	__startup_pirq {
	EVTCHNOP_bind_pirq
	-> returns just freed evtchn e
	set_evtchn_to_irq(e, irq)
	}
	xen_irq_info_cleanup() {
	set_evtchn_to_irq(e, -1)
	}
	}

	Assume here event channel e refers here to the same event channel
	number.
	After this race the evtchn_to_irq mapping for e is invalid (-1).

	- __startup_pirq races with __unbind_from_irq in a similar way. Because
	__startup_pirq doesn't take irq_mapping_update_lock it can grab the
	evtchn that __unbind_from_irq is currently freeing and cleaning up. In
	this case even though the event channel is allocated, its mapping can
	be unset in evtchn_to_irq.

	The fix is to first cleanup the mappings and then close the event
	channel. In this way, when an event channel gets allocated it's
	potential previous evtchn_to_irq mappings are guaranteed to be unset already.
	This is also the reverse order of the allocation where first the event
	channel is allocated and then the mappings are setup.

	On a 5.10 kernel prior to commit 3fcdaf3d7634 ("xen/events: modify internal
	[un]bind interfaces"), we hit a BUG like the following during probing of NVMe
	devices. The issue is that during nvme_setup_io_queues, pci_free_irq
	is called for every device which results in a call to shutdown_pirq.
	With many nvme devices it's therefore likely to hit this race during
	boot because there will be multiple calls to shutdown_pirq and
	startup_pirq are running potentially in parallel.

	------------[ cut here ]------------
	blkfront: xvda: barrier or flush: disabled; persistent grants: enabled; indirect descriptors: enabled; bounce buffer: enabled
	kernel BUG at drivers/xen/events/events_base.c:499!
	invalid opcode: 0000 [#1] SMP PTI
	CPU: 44 PID: 375 Comm: kworker/u257:23 Not tainted 5.10.201-191.748.amzn2.x86_64 #1
	Hardware name: Xen HVM domU, BIOS 4.11.amazon 08/24/2006
	Workqueue: nvme-reset-wq nvme_reset_work
	RIP: 0010:bind_evtchn_to_cpu+0xdf/0xf0
	Code: 5d 41 5e c3 cc cc cc cc 44 89 f7 e8 2b 55 ad ff 49 89 c5 48 85 c0 0f 84 64 ff ff ff 4c 8b 68 30 41 83 fe ff 0f 85 60 ff ff ff <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 0f 1f 44 00 00
	RSP: 0000:ffffc9000d533b08 EFLAGS: 00010046
	RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000006
	RDX: 0000000000000028 RSI: 00000000ffffffff RDI: 00000000ffffffff
	RBP: ffff888107419680 R08: 0000000000000000 R09: ffffffff82d72b00
	R10: 0000000000000000 R11: 0000000000000000 R12: 00000000000001ed
	R13: 0000000000000000 R14: 00000000ffffffff R15: 0000000000000002
	FS: 0000000000000000(0000) GS:ffff88bc8b500000(0000) knlGS:0000000000000000
	CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
	CR2: 0000000000000000 CR3: 0000000002610001 CR4: 00000000001706e0
	DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
	DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
	Call Trace:
	? show_trace_log_lvl+0x1c1/0x2d9
	? show_trace_log_lvl+0x1c1/0x2d9
	? set_affinity_irq+0xdc/0x1c0
	? __die_body.cold+0x8/0xd
	? die+0x2b/0x50
	? do_trap+0x90/0x110
	? bind_evtchn_to_cpu+0xdf/0xf0
	? do_error_trap+0x65/0x80
	? bind_evtchn_to_cpu+0xdf/0xf0
	? exc_invalid_op+0x4e/0x70
	? bind_evtchn_to_cpu+0xdf/0xf0
	? asm_exc_invalid_op+0x12/0x20
	? bind_evtchn_to_cpu+0xdf/0xf0
	? bind_evtchn_to_cpu+0xc5/0xf0
	set_affinity_irq+0xdc/0x1c0
	irq_do_set_affinity+0x1d7/0x1f0
	irq_setup_affinity+0xd6/0x1a0
	irq_startup+0x8a/0xf0
	__setup_irq+0x639/0x6d0
	? nvme_suspend+0x150/0x150
	request_threaded_irq+0x10c/0x180
	? nvme_suspend+0x150/0x150
	pci_request_irq+0xa8/0xf0
	? __blk_mq_free_request+0x74/0xa0
	queue_request_irq+0x6f/0x80
	nvme_create_queue+0x1af/0x200
	nvme_create_io_queues+0xbd/0xf0
	nvme_setup_io_queues+0x246/0x320
	? nvme_irq_check+0x30/0x30
	nvme_reset_work+0x1c8/0x400
	process_one_work+0x1b0/0x350
	worker_thread+0x49/0x310
	? process_one_work+0x350/0x350
	kthread+0x11b/0x140
	? __kthread_bind_mask+0x60/0x60
	ret_from_fork+0x22/0x30
	Modules linked in:
	---[ end trace a11715de1eee1873 ]---

	The Linux kernel CVE team has assigned CVE-2024-26687 to this issue.


	Affected and fixed versions
	===========================

	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 5.4.274 with commit 9470f5b2503cae994098dea9682aee15b313fa44
	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 5.10.215 with commit 0fc88aeb2e32b76db3fe6a624b8333dbe621b8fd
	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 5.15.154 with commit ea592baf9e41779fe9a0424c03dd2f324feca3b3
	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.1.81 with commit 585a344af6bcac222608a158fc2830ff02712af5
	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.6.19 with commit 20980195ec8d2e41653800c45c8c367fa1b1f2b4
	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.7.6 with commit 9be71aa12afa91dfe457b3fb4a444c42b1ee036b
	Issue introduced in 2.6.37 with commit d46a78b05c0e37f76ddf4a7a67bf0b6c68bada55 and fixed in 6.8 with commit fa765c4b4aed2d64266b694520ecb025c862c5a9

	Please see https://www.kernel.org for a full list of currently supported
	kernel versions by the kernel community.

	Unaffected versions might change over time as fixes are backported to
	older supported kernel versions. The official CVE entry at
	https://cve.org/CVERecord/?id=CVE-2024-26687
	will be updated if fixes are backported, please check that for the most
	up to date information about this issue.


	Affected files
	==============

	The file(s) affected by this issue are:
	drivers/xen/events/events_base.c


	Mitigation
	==========

	The Linux kernel CVE team recommends that you update to the latest
	stable kernel version for this, and many other bugfixes. Individual
	changes are never tested alone, but rather are part of a larger kernel
	release. Cherry-picking individual commits is not recommended or
	supported by the Linux kernel community at all. If however, updating to
	the latest release is impossible, the individual changes to resolve this
	issue can be found at these commits:
	https://git.kernel.org/stable/c/9470f5b2503cae994098dea9682aee15b313fa44
	https://git.kernel.org/stable/c/0fc88aeb2e32b76db3fe6a624b8333dbe621b8fd
	https://git.kernel.org/stable/c/ea592baf9e41779fe9a0424c03dd2f324feca3b3
	https://git.kernel.org/stable/c/585a344af6bcac222608a158fc2830ff02712af5
	https://git.kernel.org/stable/c/20980195ec8d2e41653800c45c8c367fa1b1f2b4
	https://git.kernel.org/stable/c/9be71aa12afa91dfe457b3fb4a444c42b1ee036b
	https://git.kernel.org/stable/c/fa765c4b4aed2d64266b694520ecb025c862c5a9