powerpc/mm/thp: Make page table walk safe against thp split/collapse
We can disable a THP split or a hugepage collapse by disabling irq. We do send IPI to all the cpus in the early part of split/collapse, and disabling local irq ensure we don't make progress with split/collapse. If the THP is getting split we return NULL from find_linux_pte_or_hugepte(). For all the current callers it should be ok. We need to be careful if we want to use returned pte_t pointer outside the irq disabled region. W.r.t to THP split, the pfn remains the same, but then a hugepage collapse will result in a pfn change. There are few steps we can take to avoid a hugepage collapse.One way is to take page reference inside the irq disable region. Other option is to take mmap_sem so that a parallel collapse will not happen. We can also disable collapse by taking pmd_lock. Another method used by kvm subsystem is to check whether we had a mmu_notifer update in between using mmu_notifier_retry(). Signed-off-by:Aneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com> Signed-off-by:
Michael Ellerman <mpe@ellerman.id.au>
Showing
- arch/powerpc/include/asm/pgtable.h 10 additions, 1 deletionarch/powerpc/include/asm/pgtable.h
- arch/powerpc/kernel/eeh.c 4 additions, 2 deletionsarch/powerpc/kernel/eeh.c
- arch/powerpc/kernel/io-workarounds.c 5 additions, 5 deletionsarch/powerpc/kernel/io-workarounds.c
- arch/powerpc/kvm/book3s_64_mmu_hv.c 3 additions, 2 deletionsarch/powerpc/kvm/book3s_64_mmu_hv.c
- arch/powerpc/kvm/book3s_hv_rm_mmu.c 23 additions, 7 deletionsarch/powerpc/kvm/book3s_hv_rm_mmu.c
- arch/powerpc/kvm/e500_mmu_host.c 12 additions, 2 deletionsarch/powerpc/kvm/e500_mmu_host.c
- arch/powerpc/mm/hash_utils_64.c 1 addition, 1 deletionarch/powerpc/mm/hash_utils_64.c
- arch/powerpc/mm/hugetlbpage.c 16 additions, 6 deletionsarch/powerpc/mm/hugetlbpage.c
- arch/powerpc/perf/callchain.c 14 additions, 10 deletionsarch/powerpc/perf/callchain.c
Loading
Please register or sign in to comment