Linux/Guest cooperative unmapped page cache control [Kernel]

Prev: Dear Account Owner,
Next: [PATCH] rt3070: Fixed a line over 80 character warning reported by checkpatch.pl tool

From: Balbir Singh on 14 Jun 2010 13:50

* Avi Kivity <avi(a)redhat.com> [2010-06-14 19:34:00]:

> On 06/14/2010 06:55 PM, Dave Hansen wrote:
> >On Mon, 2010-06-14 at 18:44 +0300, Avi Kivity wrote:
> >>On 06/14/2010 06:33 PM, Dave Hansen wrote:
> >>>At the same time, I see what you're trying to do with this. It really
> >>>can be an alternative to ballooning if we do it right, since ballooning
> >>>would probably evict similar pages. Although it would only work in idle
> >>>guests, what about a knob that the host can turn to just get the guest
> >>>to start running reclaim?
> >>Isn't the knob in this proposal the balloon? AFAICT, the idea here is
> >>to change how the guest reacts to being ballooned, but the trigger
> >>itself would not change.
> >I think the patch was made on the following assumptions:
> >1. Guests will keep filling their memory with relatively worthless page
> > cache that they don't really need.
> >2. When they do this, it hurts the overall system with no real gain for
> > anyone.
> >
> >In the case of a ballooned guest, they _won't_ keep filling memory. The
> >balloon will prevent them. So, I guess I was just going down the path
> >of considering if this would be useful without ballooning in place. To
> >me, it's really hard to justify _with_ ballooning in place.
>
> There are two decisions that need to be made:
>
> - how much memory a guest should be given
> - given some guest memory, what's the best use for it
>
> The first question can perhaps be answered by looking at guest I/O
> rates and giving more memory to more active guests. The second
> question is hard, but not any different than running non-virtualized
> - except if we can detect sharing or duplication. In this case,
> dropping a duplicated page is worthwhile, while dropping a shared
> page provides no benefit.

I think there is another way of looking at it, give some free memory

1. Can the guest run more applications or run faster
2. Can the host potentially get this memory via ballooning or some
other means to start newer guest instances

I think the answer to 1 and 2 is yes.

>
> How the patch helps answer either question, I'm not sure. I don't
> think preferential dropping of unmapped page cache is the answer.
>

Preferential dropping as selected by the host, that knows about the
setup and if there is duplication involved. While we use the term
preferential dropping, remember it is still via LRU and we don't
always succeed. It is a best effort (if you can and the unmapped pages
are not highly referenced) scenario.

> >>My issue is that changing the type of object being preferentially
> >>reclaimed just changes the type of workload that would prematurely
> >>suffer from reclaim. In this case, workloads that use a lot of unmapped
> >>pagecache would suffer.
> >>
> >>btw, aren't /proc/sys/vm/swapiness and vfs_cache_pressure similar knobs?
> >Those tell you how to balance going after the different classes of
> >things that we can reclaim.
> >
> >Again, this is useless when ballooning is being used. But, I'm thinking
> >of a more general mechanism to force the system to both have MemFree
> >_and_ be acting as if it is under memory pressure.
>
> If there is no memory pressure on the host, there is no reason for
> the guest to pretend it is under pressure. If there is memory
> pressure on the host, it should share the pain among its guests by
> applying the balloon. So I don't think voluntarily dropping cache
> is a good direction.
>

There are two situations

1. Voluntarily drop cache, if it was setup to do so (the host knows
that it caches that information anyway)
2. Drop the cache on either a special balloon option, again the host
knows it caches that very same information, so it prefers to free that
up first.

--
Three Cheers,
Balbir
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Balbir Singh on 14 Jun 2010 13:50

* Avi Kivity <avi(a)redhat.com> [2010-06-14 18:34:58]:

> On 06/14/2010 06:12 PM, Dave Hansen wrote:
> >On Mon, 2010-06-14 at 14:18 +0530, Balbir Singh wrote:
> >>1. A slab page will not be freed until the entire page is free (all
> >>slabs have been kfree'd so to speak). Normal reclaim will definitely
> >>free this page, but a lot of it depends on how frequently we are
> >>scanning the LRU list and when this page got added.
> >You don't have to be freeing entire slab pages for the reclaim to have
> >been useful. You could just be making space so that _future_
> >allocations fill in the slab holes you just created. You may not be
> >freeing pages, but you're reducing future system pressure.
>
> Depends. If you've evicted something that will be referenced soon,
> you're increasing system pressure.
>

I don't think slab pages care about being referenced soon, they are
either allocated or freed. A page is just a storage unit for the data
structure. A new one can be allocated on demand.

--
Three Cheers,
Balbir
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Avi Kivity on 15 Jun 2010 03:00

On 06/14/2010 08:45 PM, Balbir Singh wrote:
>
>> There are two decisions that need to be made:
>>
>> - how much memory a guest should be given
>> - given some guest memory, what's the best use for it
>>
>> The first question can perhaps be answered by looking at guest I/O
>> rates and giving more memory to more active guests. The second
>> question is hard, but not any different than running non-virtualized
>> - except if we can detect sharing or duplication. In this case,
>> dropping a duplicated page is worthwhile, while dropping a shared
>> page provides no benefit.
>>
> I think there is another way of looking at it, give some free memory
>
> 1. Can the guest run more applications or run faster
>

That's my second question. How to best use this memory. More
applications == drop the page from cache, faster == keep page in cache.

All we need is to select the right page to drop.

> 2. Can the host potentially get this memory via ballooning or some
> other means to start newer guest instances
>

Well, we already have ballooning. The question is can we improve the
eviction algorithm.

> I think the answer to 1 and 2 is yes.
>
>
>> How the patch helps answer either question, I'm not sure. I don't
>> think preferential dropping of unmapped page cache is the answer.
>>
>>
> Preferential dropping as selected by the host, that knows about the
> setup and if there is duplication involved. While we use the term
> preferential dropping, remember it is still via LRU and we don't
> always succeed. It is a best effort (if you can and the unmapped pages
> are not highly referenced) scenario.
>

How can the host tell if there is duplication? It may know it has some
pagecache, but it has no idea whether or to what extent guest pagecache
duplicates host pagecache.

>>> Those tell you how to balance going after the different classes of
>>> things that we can reclaim.
>>>
>>> Again, this is useless when ballooning is being used. But, I'm thinking
>>> of a more general mechanism to force the system to both have MemFree
>>> _and_ be acting as if it is under memory pressure.
>>>
>> If there is no memory pressure on the host, there is no reason for
>> the guest to pretend it is under pressure. If there is memory
>> pressure on the host, it should share the pain among its guests by
>> applying the balloon. So I don't think voluntarily dropping cache
>> is a good direction.
>>
>>
> There are two situations
>
> 1. Voluntarily drop cache, if it was setup to do so (the host knows
> that it caches that information anyway)
>

It doesn't, really. The host only has aggregate information about
itself, and no information about the guest.

Dropping duplicate pages would be good if we could identify them. Even
then, it's better to drop the page from the host, not the guest, unless
we know the same page is cached by multiple guests.

But why would the guest voluntarily drop the cache? If there is no
memory pressure, dropping caches increases cpu overhead and latency even
if the data is still cached on the host.

> 2. Drop the cache on either a special balloon option, again the host
> knows it caches that very same information, so it prefers to free that
> up first.
>

Dropping in response to pressure is good. I'm just not convinced the
patch helps in selecting the correct page to drop.

--
error compiling committee.c: too many arguments to function

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Avi Kivity on 15 Jun 2010 03:20

On 06/14/2010 08:40 PM, Balbir Singh wrote:
> * Avi Kivity<avi(a)redhat.com> [2010-06-14 18:34:58]:
>
>
>> On 06/14/2010 06:12 PM, Dave Hansen wrote:
>>
>>> On Mon, 2010-06-14 at 14:18 +0530, Balbir Singh wrote:
>>>
>>>> 1. A slab page will not be freed until the entire page is free (all
>>>> slabs have been kfree'd so to speak). Normal reclaim will definitely
>>>> free this page, but a lot of it depends on how frequently we are
>>>> scanning the LRU list and when this page got added.
>>>>
>>> You don't have to be freeing entire slab pages for the reclaim to have
>>> been useful. You could just be making space so that _future_
>>> allocations fill in the slab holes you just created. You may not be
>>> freeing pages, but you're reducing future system pressure.
>>>
>> Depends. If you've evicted something that will be referenced soon,
>> you're increasing system pressure.
>>
>>
> I don't think slab pages care about being referenced soon, they are
> either allocated or freed. A page is just a storage unit for the data
> structure. A new one can be allocated on demand.
>

If we're talking just about slab pages, I agree. If we're applying
pressure on the shrinkers, then you are removing live objects which can
be costly to reinstantiate.

--
error compiling committee.c: too many arguments to function

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Balbir Singh on 15 Jun 2010 04:00

* Avi Kivity <avi(a)redhat.com> [2010-06-15 09:58:33]:

> On 06/14/2010 08:45 PM, Balbir Singh wrote:
> >
> >>There are two decisions that need to be made:
> >>
> >>- how much memory a guest should be given
> >>- given some guest memory, what's the best use for it
> >>
> >>The first question can perhaps be answered by looking at guest I/O
> >>rates and giving more memory to more active guests. The second
> >>question is hard, but not any different than running non-virtualized
> >>- except if we can detect sharing or duplication. In this case,
> >>dropping a duplicated page is worthwhile, while dropping a shared
> >>page provides no benefit.
> >I think there is another way of looking at it, give some free memory
> >
> >1. Can the guest run more applications or run faster
>
> That's my second question. How to best use this memory. More
> applications == drop the page from cache, faster == keep page in
> cache.
>
> All we need is to select the right page to drop.
>

Do we need to drop to the granularity of the page to drop? I think
figuring out the class of pages and making sure that we don't write
our own reclaim logic, but work with what we have to identify the
class of pages is a good start.

> >2. Can the host potentially get this memory via ballooning or some
> >other means to start newer guest instances
>
> Well, we already have ballooning. The question is can we improve
> the eviction algorithm.
>
> >I think the answer to 1 and 2 is yes.
> >
> >>How the patch helps answer either question, I'm not sure. I don't
> >>think preferential dropping of unmapped page cache is the answer.
> >>
> >Preferential dropping as selected by the host, that knows about the
> >setup and if there is duplication involved. While we use the term
> >preferential dropping, remember it is still via LRU and we don't
> >always succeed. It is a best effort (if you can and the unmapped pages
> >are not highly referenced) scenario.
>
> How can the host tell if there is duplication? It may know it has
> some pagecache, but it has no idea whether or to what extent guest
> pagecache duplicates host pagecache.
>

Well it is possible in host user space, I for example use memory
cgroup and through the stats I have a good idea of how much is duplicated.
I am ofcourse making an assumption with my setup of the cached mode,
that the data in the guest page cache and page cache in the cgroup
will be duplicated to a large extent. I did some trivial experiments
like drop the data from the guest and look at the cost of bringing it
in and dropping the data from both guest and host and look at the
cost. I could see a difference.

Unfortunately, I did not save the data, so I'll need to redo the
experiment.

> >>>Those tell you how to balance going after the different classes of
> >>>things that we can reclaim.
> >>>
> >>>Again, this is useless when ballooning is being used. But, I'm thinking
> >>>of a more general mechanism to force the system to both have MemFree
> >>>_and_ be acting as if it is under memory pressure.
> >>If there is no memory pressure on the host, there is no reason for
> >>the guest to pretend it is under pressure. If there is memory
> >>pressure on the host, it should share the pain among its guests by
> >>applying the balloon. So I don't think voluntarily dropping cache
> >>is a good direction.
> >>
> >There are two situations
> >
> >1. Voluntarily drop cache, if it was setup to do so (the host knows
> >that it caches that information anyway)
>
> It doesn't, really. The host only has aggregate information about
> itself, and no information about the guest.
>
> Dropping duplicate pages would be good if we could identify them.
> Even then, it's better to drop the page from the host, not the
> guest, unless we know the same page is cached by multiple guests.
>

On the exact pages to drop, please see my comments above on the class
of pages to drop.
There are reasons for wanting to get the host to cache the data

Unless the guest is using cache = none, the data will still hit the
host page cache
The host can do a better job of optimizing the writeouts

> But why would the guest voluntarily drop the cache? If there is no
> memory pressure, dropping caches increases cpu overhead and latency
> even if the data is still cached on the host.
>

So, there are basically two approaches

1. First patch, proactive - enabled by a boot option
2. When ballooned, we try to (please NOTE try to) reclaim cached pages
first. Failing which, we go after regular pages in the alloc_page()
call in the balloon driver.

> >2. Drop the cache on either a special balloon option, again the host
> >knows it caches that very same information, so it prefers to free that
> >up first.
>
> Dropping in response to pressure is good. I'm just not convinced
> the patch helps in selecting the correct page to drop.
>

That is why I've presented data on the experiments I've run and
provided more arguments to backup the approach.

--
Three Cheers,
Balbir
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

First | Prev | Next | Last
Pages: 1 2 3 4 5 6
Prev: Dear Account Owner,
Next: [PATCH] rt3070: Fixed a line over 80 character warning reported by checkpatch.pl tool