Prev: kexec: set prstatus.pr_pid to cpu id when current->pid is 0
Next: [PATCH] ucb1400_core: avoid calling platform_device_put() twice in ucb1400_core_probe error path
From: Peter Zijlstra on 3 Aug 2010 04:40 On Fri, 2010-07-16 at 18:03 -0700, Jeremy Fitzhardinge wrote: > @@ -26,6 +26,9 @@ typedef struct arch_spinlock { > __ticket_t head, tail; > } tickets; > }; > +#ifdef CONFIG_PARAVIRT_SPINLOCKS > + __ticket_t waiting; > +#endif > } arch_spinlock_t; This bloats spinlock_t from u32 to u64 on most distro configs I think, since they'll have NR_CPUS=4096 or something large like that and probably also want to have this PARAVIRT_SPINLOCKS thing. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Nick Piggin on 3 Aug 2010 05:50 On Tue, Aug 03, 2010 at 10:32:40AM +0200, Peter Zijlstra wrote: > On Fri, 2010-07-16 at 18:03 -0700, Jeremy Fitzhardinge wrote: > > @@ -26,6 +26,9 @@ typedef struct arch_spinlock { > > __ticket_t head, tail; > > } tickets; > > }; > > +#ifdef CONFIG_PARAVIRT_SPINLOCKS > > + __ticket_t waiting; > > +#endif > > } arch_spinlock_t; > > This bloats spinlock_t from u32 to u64 on most distro configs I think, > since they'll have NR_CPUS=4096 or something large like that and > probably also want to have this PARAVIRT_SPINLOCKS thing. Which sucks for carefully packed data structures like dentry. They'll tend to grow by 8 bytes per spinlock. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Jeremy Fitzhardinge on 3 Aug 2010 11:50
On 08/03/2010 01:32 AM, Peter Zijlstra wrote: > On Fri, 2010-07-16 at 18:03 -0700, Jeremy Fitzhardinge wrote: >> @@ -26,6 +26,9 @@ typedef struct arch_spinlock { >> __ticket_t head, tail; >> } tickets; >> }; >> +#ifdef CONFIG_PARAVIRT_SPINLOCKS >> + __ticket_t waiting; >> +#endif >> } arch_spinlock_t; > This bloats spinlock_t from u32 to u64 on most distro configs I think, > since they'll have NR_CPUS=4096 or something large like that and > probably also want to have this PARAVIRT_SPINLOCKS thing. Yes, it is very unfortunate. In principle I could make it work with a single bit: set it when a cpu blocks and will need kicking; clear it when the lock becomes uncontended again (since everything is FIFO, so if something blocks itself there's a good chance that everything afterwards will also decide to block). But a bit takes as much space as a word (and it isn't obvious to me how to implement the "clearing when it becomes uncontended" part in a race-free way). (But see below for some handwaving) I could store this out in a secondary structure, but it really needs to be efficient to access from the unlock fast-path (to determine whether it needs to do the slow-path kick), so something out of line isn't going to work. Without the separate "waiting" counter, the previous code just checked to see if there was anyone else has a ticket on the lock. This is too pessimistic, since it isn't all that uncommon to have someone else who has been waiting on the lock for a little while, but has not yet decided to block themselves, and it results in many spurious calls into the unlock slow path. I was thinking of getting a bit in the ticket lock structure by stealing a counter bit: halve the supported number of cpus (128 for byte, 32k for word), add/sub 2, and use the LSB for the "needs kicking" flag. And that actually gives us two bits to play with, which may be useful for dealing with the clearing race (perhaps can be done cleverly in the unlock itself, or something). But I haven't thought this through in detail. J -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ |