Prev: [PATCH -next] staging/cxt1e1: use netdev_priv to fix build
Next: [GIT PULL] XFS update for 2.6.34-rc6 or -rc7
From: Andrew Morton on 30 Apr 2010 13:20 On Thu, 29 Apr 2010 14:43:37 +0200 Steffen Klassert <steffen.klassert(a)secunet.com> wrote: > padata_get_next had a bogus check that returned always true, > so the try_again loop in padata_reorder was never taken. A better changelog would have told us what this "bogus check" _is_. > This can lead to object leaks in some rare cases. And a better changelog would describe those leaks! > This patch > implements a timer that processes the reorder queues if noone > else does it in appropriate time. Under what circumstances would "noone else do it in appropriate time"? Would that be a bug, or what? > @@ -273,13 +274,22 @@ try_again: > > spin_unlock_bh(&pd->lock); > > - if (atomic_read(&pd->reorder_objects)) > - goto try_again; > + if (atomic_read(&pd->reorder_objects) > + && !(pinst->flags & PADATA_RESET)) > + mod_timer(&pd->timer, jiffies + HZ); > + else > + del_timer(&pd->timer); > > -out: > return; > } I'd feel more comfortable if the above was in the locked region. Is there a race whereby another CPU can set pd->reorder_objects, but we forgot to arm the timer? -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Steffen Klassert on 30 Apr 2010 14:20 On Thu, Apr 29, 2010 at 04:06:44PM -0700, Andrew Morton wrote: > On Thu, 29 Apr 2010 14:43:37 +0200 > Steffen Klassert <steffen.klassert(a)secunet.com> wrote: > > > padata_get_next had a bogus check that returned always true, > > so the try_again loop in padata_reorder was never taken. > > A better changelog would have told us what this "bogus check" _is_. > > > This can lead to object leaks in some rare cases. > > And a better changelog would describe those leaks! I'll try to write a better one and resent. > > > This patch > > implements a timer that processes the reorder queues if noone > > else does it in appropriate time. > > Under what circumstances would "noone else do it in appropriate time"? > Would that be a bug, or what? > We need to ensure that only one cpu can work on dequeueing of the reorder queue the time. Calculating in which percpu reorder queue the next object will arrive takes some time. A spinlock would be highly contended. Also it is not clear in which order the objects arrive to the reorder queues. So a cpu could wait to get the lock just to notice that there is nothing to do at the moment. Therefore we use a trylock and let the holder of the lock care for all the objects enqueued during the holdtime of the lock. The timer is to handle a race that appears with the trylock. If cpu1 queues an object to the reorder queue while cpu2 holds the pd->lock but left the while loop in padata_reorder already, cpu2 can't care for this object but cpu1 exits because it can't get the lock. Usually the next cpu that takes the lock cares for this object too. We need the timer just if this object was the last one that arrives to the reorder queues. The timer function sends it out in this case. > > @@ -273,13 +274,22 @@ try_again: > > > > spin_unlock_bh(&pd->lock); > > > > - if (atomic_read(&pd->reorder_objects)) > > - goto try_again; > > + if (atomic_read(&pd->reorder_objects) > > + && !(pinst->flags & PADATA_RESET)) > > + mod_timer(&pd->timer, jiffies + HZ); > > + else > > + del_timer(&pd->timer); > > > > -out: > > return; > > } > > I'd feel more comfortable if the above was in the locked region. Is > there a race whereby another CPU can set pd->reorder_objects, but we > forgot to arm the timer? > We could hit the race that the timer handles, if we move this into the lock. cpu1 cpu2 spin_trylock_bh() | | | test pd->reorder_objects == 0 delete timer | hardinterrupt | set pd->reorder_objects == 1 | enqueue object | spin_trylock_bh() busy | exit | spin_unlock_bh() -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Herbert Xu on 2 May 2010 23:40
On Fri, Apr 30, 2010 at 01:20:00PM +0200, Steffen Klassert wrote: > On Thu, Apr 29, 2010 at 04:06:44PM -0700, Andrew Morton wrote: > > On Thu, 29 Apr 2010 14:43:37 +0200 > > Steffen Klassert <steffen.klassert(a)secunet.com> wrote: > > > > > padata_get_next had a bogus check that returned always true, > > > so the try_again loop in padata_reorder was never taken. > > > > A better changelog would have told us what this "bogus check" _is_. > > > > > This can lead to object leaks in some rare cases. > > > > And a better changelog would describe those leaks! > > I'll try to write a better one and resent. I've picked up patches 1-5. Thanks! -- Visit Openswan at http://www.openswan.org/ Email: Herbert Xu ~{PmV>HI~} <herbert(a)gondor.apana.org.au> Home Page: http://gondor.apana.org.au/~herbert/ PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ |