| From bippy-1.0.0 Mon Sep 17 00:00:00 2001 |
| From: Greg Kroah-Hartman <gregkh@kernel.org> |
| To: <linux-cve-announce@vger.kernel.org> |
| Reply-to: <cve@kernel.org>, <linux-kernel@vger.kernel.org> |
| Subject: CVE-2022-48802: fs/proc: task_mmu.c: don't read mapcount for migration entry |
| |
| Description |
| =========== |
| |
| In the Linux kernel, the following vulnerability has been resolved: |
| |
| fs/proc: task_mmu.c: don't read mapcount for migration entry |
| |
| The syzbot reported the below BUG: |
| |
| kernel BUG at include/linux/page-flags.h:785! |
| invalid opcode: 0000 [#1] PREEMPT SMP KASAN |
| CPU: 1 PID: 4392 Comm: syz-executor560 Not tainted 5.16.0-rc6-syzkaller #0 |
| Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011 |
| RIP: 0010:PageDoubleMap include/linux/page-flags.h:785 [inline] |
| RIP: 0010:__page_mapcount+0x2d2/0x350 mm/util.c:744 |
| Call Trace: |
| page_mapcount include/linux/mm.h:837 [inline] |
| smaps_account+0x470/0xb10 fs/proc/task_mmu.c:466 |
| smaps_pte_entry fs/proc/task_mmu.c:538 [inline] |
| smaps_pte_range+0x611/0x1250 fs/proc/task_mmu.c:601 |
| walk_pmd_range mm/pagewalk.c:128 [inline] |
| walk_pud_range mm/pagewalk.c:205 [inline] |
| walk_p4d_range mm/pagewalk.c:240 [inline] |
| walk_pgd_range mm/pagewalk.c:277 [inline] |
| __walk_page_range+0xe23/0x1ea0 mm/pagewalk.c:379 |
| walk_page_vma+0x277/0x350 mm/pagewalk.c:530 |
| smap_gather_stats.part.0+0x148/0x260 fs/proc/task_mmu.c:768 |
| smap_gather_stats fs/proc/task_mmu.c:741 [inline] |
| show_smap+0xc6/0x440 fs/proc/task_mmu.c:822 |
| seq_read_iter+0xbb0/0x1240 fs/seq_file.c:272 |
| seq_read+0x3e0/0x5b0 fs/seq_file.c:162 |
| vfs_read+0x1b5/0x600 fs/read_write.c:479 |
| ksys_read+0x12d/0x250 fs/read_write.c:619 |
| do_syscall_x64 arch/x86/entry/common.c:50 [inline] |
| do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80 |
| entry_SYSCALL_64_after_hwframe+0x44/0xae |
| |
| The reproducer was trying to read /proc/$PID/smaps when calling |
| MADV_FREE at the mean time. MADV_FREE may split THPs if it is called |
| for partial THP. It may trigger the below race: |
| |
| CPU A CPU B |
| ----- ----- |
| smaps walk: MADV_FREE: |
| page_mapcount() |
| PageCompound() |
| split_huge_page() |
| page = compound_head(page) |
| PageDoubleMap(page) |
| |
| When calling PageDoubleMap() this page is not a tail page of THP anymore |
| so the BUG is triggered. |
| |
| This could be fixed by elevated refcount of the page before calling |
| mapcount, but that would prevent it from counting migration entries, and |
| it seems overkilling because the race just could happen when PMD is |
| split so all PTE entries of tail pages are actually migration entries, |
| and smaps_account() does treat migration entries as mapcount == 1 as |
| Kirill pointed out. |
| |
| Add a new parameter for smaps_account() to tell this entry is migration |
| entry then skip calling page_mapcount(). Don't skip getting mapcount |
| for device private entries since they do track references with mapcount. |
| |
| Pagemap also has the similar issue although it was not reported. Fixed |
| it as well. |
| |
| [shy828301@gmail.com: v4] |
| [nathan@kernel.org: avoid unused variable warning in pagemap_pmd_range()] |
| |
| The Linux kernel CVE team has assigned CVE-2022-48802 to this issue. |
| |
| |
| Affected and fixed versions |
| =========================== |
| |
| Issue introduced in 4.5 with commit e9b61f19858a5d6c42ce2298cf138279375d0d9b and fixed in 5.10.102 with commit db3f3636e4aed2cba3e4e7897a053323f7a62249 |
| Issue introduced in 4.5 with commit e9b61f19858a5d6c42ce2298cf138279375d0d9b and fixed in 5.15.25 with commit a8dd0cfa37792863b6c4bf9542975212a6715d49 |
| Issue introduced in 4.5 with commit e9b61f19858a5d6c42ce2298cf138279375d0d9b and fixed in 5.16.10 with commit 05d3f8045efa59457b323caf00bdb9273b7962fa |
| Issue introduced in 4.5 with commit e9b61f19858a5d6c42ce2298cf138279375d0d9b and fixed in 5.17 with commit 24d7275ce2791829953ed4e72f68277ceb2571c6 |
| |
| Please see https://www.kernel.org for a full list of currently supported |
| kernel versions by the kernel community. |
| |
| Unaffected versions might change over time as fixes are backported to |
| older supported kernel versions. The official CVE entry at |
| https://cve.org/CVERecord/?id=CVE-2022-48802 |
| will be updated if fixes are backported, please check that for the most |
| up to date information about this issue. |
| |
| |
| Affected files |
| ============== |
| |
| The file(s) affected by this issue are: |
| fs/proc/task_mmu.c |
| |
| |
| Mitigation |
| ========== |
| |
| The Linux kernel CVE team recommends that you update to the latest |
| stable kernel version for this, and many other bugfixes. Individual |
| changes are never tested alone, but rather are part of a larger kernel |
| release. Cherry-picking individual commits is not recommended or |
| supported by the Linux kernel community at all. If however, updating to |
| the latest release is impossible, the individual changes to resolve this |
| issue can be found at these commits: |
| https://git.kernel.org/stable/c/db3f3636e4aed2cba3e4e7897a053323f7a62249 |
| https://git.kernel.org/stable/c/a8dd0cfa37792863b6c4bf9542975212a6715d49 |
| https://git.kernel.org/stable/c/05d3f8045efa59457b323caf00bdb9273b7962fa |
| https://git.kernel.org/stable/c/24d7275ce2791829953ed4e72f68277ceb2571c6 |