KVM: x86: Optimize mmio spte zapping when creating/moving memslot When we create or move a memory slot, we need to zap mmio sptes. Currently, zap_all() is used for this and this is causing two problems: - extra page faults after zapping mmu pages - long mmu_lock hold time during zapping mmu pages For the latter, Marcelo reported a disastrous mmu_lock hold time during hot-plug, which made the guest unresponsive for a long time. This patch takes a simple way to fix these problems: do not zap mmu pages unless they are marked mmio cached. On our test box, this took only 50us for the 4GB guest and we did not see ms of mmu_lock hold time any more. Note that we still need to do zap_all() for other cases. So another work is also needed: Xiao's work may be the one. Reviewed-by: Marcelo Tosatti <mtosatti@redhat.com> Signed-off-by: Takuya Yoshikawa <yoshikawa_takuya_b1@lab.ntt.co.jp> Signed-off-by: Gleb Natapov <gleb@redhat.com>

commit: 982b3394dd23eec6e5a2f7871238435a167b63cc [log] [tgz]
author: Takuya Yoshikawa <yoshikawa_takuya_b1@lab.ntt.co.jp> Tue Mar 12 17:45:30 2013 +0900
committer: Gleb Natapov <gleb@redhat.com> Thu Mar 14 10:21:21 2013 +0200
tree: 24e7cbbfdfa7500aa1e685b80aee205bf2ff17af
parent: 95b0430d1a53541076ffbaf453f8b49a547cceba [diff] [blame]
diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c
index de45ec1..c1a9b7b 100644
--- a/arch/x86/kvm/mmu.c
+++ b/arch/x86/kvm/mmu.c

@@ -4189,6 +4189,24 @@
 	spin_unlock(&kvm->mmu_lock);
 }
 
+void kvm_mmu_zap_mmio_sptes(struct kvm *kvm)
+{
+	struct kvm_mmu_page *sp, *node;
+	LIST_HEAD(invalid_list);
+
+	spin_lock(&kvm->mmu_lock);
+restart:
+	list_for_each_entry_safe(sp, node, &kvm->arch.active_mmu_pages, link) {
+		if (!sp->mmio_cached)
+			continue;
+		if (kvm_mmu_prepare_zap_page(kvm, sp, &invalid_list))
+			goto restart;
+	}
+
+	kvm_mmu_commit_zap_page(kvm, &invalid_list);
+	spin_unlock(&kvm->mmu_lock);
+}
+
 static int mmu_shrink(struct shrinker *shrink, struct shrink_control *sc)
 {
 	struct kvm *kvm;
commit	982b3394dd23eec6e5a2f7871238435a167b63cc	[log] [tgz]
author	Takuya Yoshikawa <yoshikawa_takuya_b1@lab.ntt.co.jp>	Tue Mar 12 17:45:30 2013 +0900
committer	Gleb Natapov <gleb@redhat.com>	Thu Mar 14 10:21:21 2013 +0200
tree	24e7cbbfdfa7500aa1e685b80aee205bf2ff17af
parent	95b0430d1a53541076ffbaf453f8b49a547cceba [diff] [blame]