Prev: [-next July 9 - s390 ] Badness at fs/sysfs/symlink.c:82 during qeth initalization
Next: Badness at fs/sysfs/symlink.c:82 during qeth initalization
From: Vivek Goyal on 3 Aug 2010 16:20 On Tue, Aug 03, 2010 at 10:31:33AM -0400, Munehiro Ikeda wrote: [..] > >Muuh, > > > >You will require one more piece and that is support for per cgroup request > >descriptors on request queue. With writes, it is so easy to consume those > >128 request descriptors. > > Hi Vivek, > > Yes. Thank you for the comment. > I have two concerns to do that. > > (1) technical concern > If there is fixed device-wide limitation and there are so many groups, > the number of request descriptors distributed to each group can be too > few. My only idea for this is to make device-wide limitation flexible, > but I'm not sure if it is the best or even can be allowed. > > (2) implementation concern > Now the limitation is done by generic block layer which doesn't know > about grouping. The idea in my head to solve this is to add a new > interface on elevator_ops to ask IO scheduler if a new request can > be allocated. > Acutally it is good point. We already call into CFQ (cfq_may_queue()) for doing some kind of determination regarding what is the urgency of request allocation. May be we can just keep track of how many outstanding requests are there per group in CFQ. And inside CFQ always allow request allocation for the active group. We can probably not allow this if a group has already got many requests backlogged (say more than 16). We might overshoot number of request descriptors on device wide limitation but we do any way (allow upto 50% more requests descriptors etc). So not introducing per group limit through sysfs and just doing some rough internal calculations in CFQ and being little flexible with over allocation of request descriptors, it might reduce complexity. But it probably will not solve the problem of higher layer asking if queue is congested or not. It might happen that request queue is overall congested but a high priority group should not be affected by that and still be able to submit requests. I think this primarily is used only in WRITE paths. So READ path should still be fine. Once WRITE support is in, we need to probably introduce additional mechanism where we can queury per bdi per group congestion instead of per bdi congestion. One group might be congested and but not the other one. I had done that in my previous postings. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Munehiro Ikeda on 4 Aug 2010 10:40
Nauman Rafique wrote, on 08/03/2010 03:24 PM: > On Tue, Aug 3, 2010 at 7:31 AM, Munehiro Ikeda<m-ikeda(a)ds.jp.nec.com> wrote: >> Vivek Goyal wrote, on 08/02/2010 04:58 PM: >>> You will require one more piece and that is support for per cgroup request >>> descriptors on request queue. With writes, it is so easy to consume those >>> 128 request descriptors. >> >> Hi Vivek, >> >> Yes. Thank you for the comment. >> I have two concerns to do that. >> >> (1) technical concern >> If there is fixed device-wide limitation and there are so many groups, >> the number of request descriptors distributed to each group can be too >> few. My only idea for this is to make device-wide limitation flexible, >> but I'm not sure if it is the best or even can be allowed. >> >> (2) implementation concern >> Now the limitation is done by generic block layer which doesn't know >> about grouping. The idea in my head to solve this is to add a new >> interface on elevator_ops to ask IO scheduler if a new request can >> be allocated. > > Muuhh, > We have already done the work of forward porting the request > descriptor patch that Vivek had in his earlier patch sets. We also > taken care of the two concerns you have mentioned above. We have been > testing it, and getting good numbers. So if you want, I can send the > patch your way so it can be included in this same patch series. > > Thanks. Hi Nauman, It is the patch that I'm thinking we should be based on. You have already done the forward porting, great! Please post it to LKML, container-list etc. independently if you don't mind. I appreciate your suggestion to include it into my patch series, but I'm worrying about that the patch set becomes larger beyond my poor antique brain processor. The issue of request limitation may be significant when async write is supported, but I don't think it is limited to it. It should be beneficial for current blkio controller. And we can combine them after independent posts if needed. Thanks a lot, Muuhh -- IKEDA, Munehiro NEC Corporation of America m-ikeda(a)ds.jp.nec.com -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ |