From: Jeremy Fitzhardinge on 30 Jun 2010 06:00

On 06/30/2010 11:11 AM, Peter Zijlstra wrote:
> On Wed, 2010-06-30 at 10:00 +0100, Jan Beulich wrote:
>
>>>>> On 30.06.10 at 10:05, Peter Zijlstra <peterz(a)infradead.org> wrote:
>>>>>
>>> On Tue, 2010-06-29 at 15:31 +0100, Jan Beulich wrote:
>>>
>>>> Add optional (alternative instructions based) callout hooks to the
>>>> contended ticket lock and the ticket unlock paths, to allow hypervisor
>>>> specific code to be used for reducing/eliminating the bad effects
>>>> ticket locks have on performance when running virtualized.
>>>>
>>> Uhm, I'd much rather see a single alternative implementation, not a
>>> per-hypervisor lock implementation.
>>>
>> How would you imagine this to work? I can't see how the mechanism
>> could be hypervisor agnostic. Just look at the Xen implementation
>> (patch 2) - do you really see room for meaningful abstraction there?
>>
> I tried not to, it made my eyes bleed..
>
> But from what I hear all virt people are suffering from spinlocks (and
> fair spinlocks in particular), so I was thinking it'd be a good idea to
> get all interested parties to collaborate on one. Fragmentation like
> this hardly ever works out well.
>

The fastpath of the spinlocks can be common, but if it ends up spinning
too long (however that might be defined), then it needs to call out to a
hypervisor-specific piece of code which is effectively "yield this vcpu
until it's worth trying again".

In Xen we can set up an event channel that the waiting CPU can block on,
and the current lock holder can tickle it when it releases the lock
(ideally it would just tickle the CPU with the next ticket, but that's a
further refinement).

I'm not sure what the corresponding implementation for KVM or HyperV
would look like.
Modern Intel chips have a "do a VMEXIT if you've run pause in a tight
loop for too long" feature, which deals with the "spinning too long"
part, but I'm not sure about the blocking mechanism (something based on
monitor/mwait perhaps).

    J
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
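To illustrate the split described in the message above (a common fast path, plus a callout once a waiter has spun too long), here is a minimal userspace sketch in C11. It is not the code from the patches under discussion. The names pv_lock_ops, wait, kick, SPIN_THRESHOLD and the stub backend are all hypothetical, and the threshold value is arbitrary. A real backend would block the vcpu (for Xen, on an event channel as described above) rather than just count calls.

```c
#include <assert.h>
#include <stdatomic.h>

/* Hypothetical backend hooks; illustrative names, not kernel APIs. */
struct pv_lock_ops {
    /* "Yield this vcpu until it's worth trying again." */
    void (*wait)(unsigned my_ticket);
    /* Called on unlock so a blocked waiter can be woken. */
    void (*kick)(unsigned next_ticket);
};

struct ticket_lock {
    atomic_uint next;   /* next ticket to hand out */
    atomic_uint owner;  /* ticket currently being served */
};

/* Arbitrary value: how long to spin before calling out to the backend. */
#define SPIN_THRESHOLD 1024u

static void ticket_lock_init(struct ticket_lock *l)
{
    atomic_init(&l->next, 0);
    atomic_init(&l->owner, 0);
}

static void ticket_lock_acquire(struct ticket_lock *l,
                                const struct pv_lock_ops *ops)
{
    /* Common fast path: take a ticket and check whether it is our turn. */
    unsigned me = atomic_fetch_add_explicit(&l->next, 1,
                                            memory_order_relaxed);
    unsigned spins = 0;

    while (atomic_load_explicit(&l->owner, memory_order_acquire) != me) {
        /* Slow path: after spinning too long, hand over to the backend. */
        if (++spins >= SPIN_THRESHOLD) {
            ops->wait(me);
            spins = 0;
        }
    }
}

static void ticket_lock_release(struct ticket_lock *l,
                                const struct pv_lock_ops *ops)
{
    unsigned cur = atomic_load_explicit(&l->owner, memory_order_relaxed);

    /* The lock must be held: at least one ticket is outstanding. */
    assert(cur != atomic_load_explicit(&l->next, memory_order_relaxed));

    atomic_store_explicit(&l->owner, cur + 1, memory_order_release);
    ops->kick(cur + 1);  /* let the backend wake a blocked waiter */
}

/* Stand-in backend that only counts calls; it never actually blocks. */
static atomic_uint stub_waits, stub_kicks;

static void stub_wait(unsigned my_ticket)
{
    (void)my_ticket;
    atomic_fetch_add(&stub_waits, 1);
}

static void stub_kick(unsigned next_ticket)
{
    (void)next_ticket;
    atomic_fetch_add(&stub_kicks, 1);
}

static const struct pv_lock_ops stub_ops = { stub_wait, stub_kick };
```

In this sketch all ticket handling lives in the generic functions, and the backend sees only the two hooks. An uncontended acquire never reaches wait, so the fast path stays identical whichever backend is plugged in.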
From: Jeremy Fitzhardinge on 30 Jun 2010 07:00

On 06/30/2010 11:11 AM, Peter Zijlstra wrote:
>>> Uhm, I'd much rather see a single alternative implementation, not a
>>> per-hypervisor lock implementation.
>>>
>> How would you imagine this to work? I can't see how the mechanism
>> could be hypervisor agnostic. Just look at the Xen implementation
>> (patch 2) - do you really see room for meaningful abstraction there?
>>
> I tried not to, it made my eyes bleed..
>
> But from what I hear all virt people are suffering from spinlocks (and
> fair spinlocks in particular), so I was thinking it'd be a good idea to
> get all interested parties to collaborate on one. Fragmentation like
> this hardly ever works out well.
>

Yes. Now that I've looked at it a bit more closely I think these
patches put way too much logic into the per-hypervisor part of the code.

> Ah, right, after looking a bit more at patch 2 I see you indeed
> implement a ticket-like lock. Although why you need both a ticket and a
> FIFO list is beyond me.
>

That appears to be a mechanism to allow it to take interrupts while
spinning on the lock, which is something that stock ticket locks don't
allow. If that's a useful thing to do, it should happen in the generic
ticketlock code rather than in the per-hypervisor backend (otherwise we
end up with all kinds of subtle differences in lock behaviour depending
on the exact environment, which is just going to be messy). Even if
interrupts-while-spinning isn't useful on native hardware, it is going
to be equally applicable to all virtual environments.

    J