SpinLock/Mutex : Difference ? [Win32 Kernel]

Prev: Worm in msvcr80.dll?
Next: object manager name space \?? \\. \?

From: Chris M. Thomasson on 22 Dec 2009 01:50

"m" <m(a)b.c> wrote in message news:elNVmCqgKHA.2160(a)TK2MSFTNGP02.phx.gbl...
>I have no experience with a lock of that type. I does assume that either
>the set of all threads that will access the lock is known beforehand, or
>that a new thread must register itself

Yes, exactly.

> and wait for a sequence point to inject its new lock, before accessing the
> object for the first time.

I might be totally misunderstanding you but a thread can actually use it's
own lock right after registration occurs. Pseudo-code for dynamic
registration might look like:
_____________________________________________________________
struct thread
{
mutex m_mutex;

void read_lock()
{
m_mutex.lock();
}

void read_unlock()
{
m_mutex.unlock();
}
};

struct rwmutex
{
collection<thread*> m_threads;
mutex m_mutex;

void register(thread* t)
{
mutex::scoped_lock lock(m_mutex);
m_threads.push(t);
}

void unregister(thread* t)
{
mutex::scoped_lock lock(m_mutex);
m_threads.pop(t);
}

void write_lock()
{
m_mutex.lock();

for each thread in `m_threads' as i
{
i->read_lock();
}
}

void write_unlock()
{
for each thread in `m_threads' as i
{
i->read_unlock();
}

m_mutex.unlock();
}
};
_____________________________________________________________

This has some fairly decent scalability characteristics and, registration
aside for a moment, moves the heavy lifting to writers which should be few
and far between. IIRC, this type of lock is called a `brlock' in Linux.

> At a high level, the algorithm relies on signals (writer pending, reader
> pending) + a status code to maintain the lock. The only concern in my
> mind about this approach is the poor performance on NUMA. But as I said,
> it is a minimal implementation! The best attribute is that reader
> acquisition when uncontended by writers (99%+ of actual cases) requires
> only a single atomic increment.

Indeed. A simple atomic increment is pretty good for a fast-path. Humm. Have
you experimented with so-called asymmetric synchronization?

http://groups.google.com/group/comp.programming.threads/msg/6d6c97d11e432db8

The cool thing about this is that Windows Vista and onward provides
everything one needs to get this done! The reader acquisition and release
are made up plain loads and stores without any memory barriers. The writer
invokes membar on behalf of the readers by executing something like
`FlushProcess WriteBuffers()':

http://msdn.microsoft.com/en-us/library/ms683148(VS.85).aspx

From: Chris M. Thomasson on 22 Dec 2009 01:51

"Chris M. Thomasson" <no(a)spam.invalid> wrote in message
news:ZYZXm.28822$_b5.17297(a)newsfe22.iad...
[...]
> This has some fairly decent scalability characteristics and, registration
> aside for a moment, moves the heavy lifting to writers which should be few
> and far between. IIRC, this type of lock is called a `brlock' in Linux.

FWIW, in Linux, the locks are kept on a per-CPU basis.

From: m on 22 Dec 2009 19:38

While maintaining locks on a per CPU basis is obviously inappropriate for UM
code, your other point about registration is valid. In your case, you use a
standard lock to guard the list of threads, whereas I was thinking of a
lock-free quiescence method. Neither handle thread ABEND without leaks, but
that is usually not a problem since you are likely about to die anyway.

The implementation that you referenced for asymmetric locks is similar to
one I have used before.

What is chiefly bothering me at the moment is how to deal with NUMA effects
(access speed) on single volatiles in a general way. I know that if I mess
with thread affinity, and use a NUMA aware allocator, I can allocate node
local sub-structures and guard them with node local locks, but it would be
better if a solution that did not depend on known preconditions could be
found.

"Chris M. Thomasson" <no(a)spam.invalid> wrote in message
news:RZZXm.28825$_b5.9225(a)newsfe22.iad...
> "Chris M. Thomasson" <no(a)spam.invalid> wrote in message
> news:ZYZXm.28822$_b5.17297(a)newsfe22.iad...
> [...]
>> This has some fairly decent scalability characteristics and,
>> registration aside for a moment, moves the heavy lifting to writers which
>> should be few and far between. IIRC, this type of lock is called a
>> `brlock' in Linux.
>
> FWIW, in Linux, the locks are kept on a per-CPU basis.

From: Hugo gleaves on 24 Dec 2009 04:24

First: The order in which my text appears is simply the default way the edit
box presents itself to me in IE when I click "Reply", so gripe at MS for this.

Second: A spinlock is NOT a "kernel mode mechanism" period. Yes the NT
kernel uses spinlocks and has a small API to support them, but a spinlock is
not something that one can only use or have in kernel mode, this the first of
many errors you make.

Read and learn somthing:

http://en.wikipedia.org/wiki/Spinlock

Note the sentence "In software engineering, a spinlock is a lock where the
thread simply waits in a loop ("spins") repeatedly checking until the lock
becomes available."

Where does that say "kernel mode"?

In fact it says "For this reason, spinlocks are often used inside operating
system kernels" note the term "often" clearly stating that they are also used
elsewhere.

If you are going to post rude and disparaging remarks simply when someone
dares to disagree with you, then you fully deserve any humiliating responses
such as this.

H

"tanix" wrote:

> In article <59F509C6-D5A5-4049-A075-53DF0E2656DE(a)microsoft.com>, =?Utf-8?B?SHVnbw==?= gleaves(a)hotmail.com> <hugh<underbar> wrote:
> >Since you have drifted off into insults
>
> Not really, if you comprehend what was being said.
>
> > and did not bother to read my orignal
> >post
>
> Not sure how original was the post I was following up on.
>
> > (I have said three times here that I do not in an way suggest user mode
> >code should use kernel mode mechanisms,
>
> Spinlocks ARE a kernel mode mechanism.
> There is a reason Microsoft suggests to hold them for extremely
> short periods of time. These guys are not exactly dumb.
>
> Anyway, I really have not much time or interest to deal with this.
>
> Just one point: realize that by top posting (writing your response
> BEFORE the stuff you are following up, is one of the most despised
> ways to post. You are presenting your position out of context
> and readers do not know what in your response relates to what
> in the article you are following up.
>
> That single thing by itself indicates to me you have a problem
> with slopiness, and I would not be surprised it will eventually
> translate into how you design and code things.
>
> It is best to reply to some specific statement in place.
>
> Beyond that, do as thou wilt.
>
> Cya.
>
> > but you simply didn't bother to read
> >that) I will not bother commenting on what you wrote, except for this:
>
> >You seem to think that a spinlock is is something that only the kernel can
> >do? is that your understanding of sofware engineering? Yes, the kernel has a
> >spinlock API designed for use within the kernel.
> >
> >But anyone can write an equivalent API for user mode, all you need to do is
> >use the Win32 InterlockXXX functions and you can easily create a real,
> >working spinlock API.
> >
> >This is what I have been discussing.
> >
> >What if my shared memory contains thousands of distinct data items? What if
> >we have a need to read/update these in real-time from multiple processes each
> >running many threads? perhaps tens of thousands of times per second?
> >
> >Do you naively think a Mutex or two or a hundred is going to help?
> >
> >Is it not better to simply better to define a tny struct typedef "SpinLock"
> >and have a tiny API that uses InterlockXXX to "lock" and the struct? spinning
> >if the item is already locked?
> >
> >Is this not a very good solution especially if we KNOW that the updates are
> >very short indeed?
> >
> >Once again I am not in way suggesting access too the kernel, you have failed
> >to read my posts.
> >
> >Hugh
> >
> >
> >
> >
> >Hugh
> >
> >
> >"tanix" wrote:
> >
> >> In article <6B8D1539-D8C5-44AF-9646-261E8DDBEEBB(a)microsoft.com>,
> > =?Utf-8?B?SHVnbw==?= gleaves(a)hotmail.com> <hugh<underbar> wrote:
> >> >Well let me be perfectly clear I am not suggesting or advocating any
> >> >interaction with any kernel mode mechanisms.
> >> >
> >> >If you work with shared mapped memory a great deal as we do, then you need
> >> >very fast means of locking shared data structures, especially of this is
> >> >happening very often.
> >>
> >> Fine. I had to deal with that kind of thing.
> >> And then?
> >>
> >> >For example we have a shared heap library that we designed, it obvioulsy
> >> >must handle safe updates to heap control metadata by many threads and many
> >> >processes at the same time, perhaps tens of thousands of times a second.
> >>
> >> Understood. I had to deal with a similar situation on a global
> >> voice messaging system, just like many people do with their products.
> >>
> >> >A mutex simply can't deliver the speed and efficiency.
> >>
> >> How do you know where is your REAL bottlenecks?
> >>
> >> Are you sure your system architecture is up to snuff?
> >>
> >> May be the whole thing is not correctly archtected?
> >>
> >> You can not just go for any critical point without seeing the whole
> >> picture. ESPECIALLY, once you start suggesting things that FUNDAMENTALLY
> >> violate the generations old principles of separation of kernel and
> >> user mode?
> >>
> >> Do you think people that created these concepts are stoopid?
> >> Infantile?
> >>
> >> You know better?
> >>
> >> Well, can YOU propose the solution then?
> >> Do you think the idea of separating the user mode and kernel mode
> >> is flawed? How so?
> >>
> >> >Getting/releaseing the Mutex costs far far more CPU cycles then updating
> >> >four fields in a little structure and a pointer or two.
> >>
> >> Yep, that is what ALL of them say.
> >> It is called looking for the trees and not seeing the forest.
> >> OBVIOUSLY.
> >>
> >> Do you have an estimate on what kinds of problems your approach may
> >> cause the the whole LIVELIHOOD of your system?
> >>
> >> Do you have an estimate on how many light years is it going to take
> >> to deal with problems are you going to create as a result of your
> >> great "improvement"?
> >>
> >> How many man years others would have to spend, forever fixing the
> >> problems are you going to create, pretty much inevitably?
> >>
> >> Do you have a mathematical proof that your concept is going to be
> >> stable?
> >>
> >> What DO you have beyond these small greedy things, tring to
> >> squeeze some CPU cycles even from the places that clearly have
> >> a label:
> >>
> >> DO NOT ENTER. KERNEL LEVEL. NO MORONS ALLOWED.
> >>
> >> Enough.
> >>
> >> >Most people perhaps use shared memory for the odd neat trick or some useful
> >> >little mechanism, but if you rely on it as a vehicle for sharing/storing
> > many
> >> >gigs of data then you really do need speedy locks.
> >> >
> >> >Hugo
> >> >
> >> >
> >> >"tanix" wrote:
> >> >
> >> >> In article <OFiPAJ$fKHA.1112(a)TK2MSFTNGP04.phx.gbl>, "Alexander Grigoriev"
> >> > <alegr(a)earthlink.net> wrote:
> >> >> >By definition, spinlock is a busy loop. It only makes sense to do if the
> >> >> >spinlock owner is guaranteed to never get preempted by another thread. In
> >
> >> >> >kernel mode, this is done by setting IRQL to DISPATCH_LEVEL. In user
> > mode,
> >> >> >this is not possible.
> >> >>
> >> >> Good point.
> >> >>
> >> >> Actually, when I was doing a contract with Intel on a multi-gigabit
> >> >> network card, there was one guy that was suggesting to do something
> >> >> totally out of the wall in my view. Pretty much the same idea as in
> >> >> this thread.
> >> >>
> >> >> They were trying to squeze out as much performance as possible.
> >> >> So, this "smart", fat blob, came up with the idea to manipulate
> >> >> the kernel level synchronization objects from the app level, such
> >> >> as resetting the count of kernel level semaphores under some contitions.
> >> >>
> >> >> On a weekly project meeting I told him: you will never prove this
> >> >> thing is going to work. That is the whole point with semaphores.
> >> >> They have been mathematically proven to work. Once you start resetting
> >> >> the counts at will, outside of the normal semaphore mechanics,
> >> >> you'll be fixing this thing for years to come.
> >> >>
> >> >> The result?
> >> >>
> >> >> They fired me. Not him.
> >> >> I wonder if they were ever able to make that network card to work.
> >> >>
> >> >> When they gave me this project to create a hack to manipulate the
> >> >> kernel level structures from the user mode, and gave me a couple
> >> >> of weeks to do that, I could not believe these smart donkeys.
> >> >>
> >> >> It takes YEARS to work on things like these. These are some of
> >> >> the central concepts in O/S design and the very idea of separation
> >> >> of kernel and user mode. I nearly flipped out when manager told
> >> >> me my "project".
> >> >>
> >> >> Some people really need their brains examined.
> >> >> Where did they get their estimates on "improving performance"
> >> >> by slaughtering some of the most fundamental concepts in o/s
> >> >> design and architecture just escapes me.
> >> >>
> >> >> >CRITICAL_SECTION provide an option of limited spinning in case of
> >> >> >contention, if you expect that the CS will mostly be held for very short
> >> >> >time.
> >> >> >
> >> >> >
> >> >> >"Hugo gleaves(a)hotmail.com>" <hugh<underbar> wrote in message
> >> >> >news:574400EE-AFAF-4620-BDDD-7E49608392E0(a)microsoft.com...
> >> >> >> The idea of a spinlock is to ensure secure access/updates to some
> > datum,
> >> >> >> usually for operations that are very very brief. So although some other
> >> >> >> thread/process may have to "spin" for a time, the time is deemed to
> > brief
> >> >> >> as
> >> >> >> to be not a serious performance issue.
> >> >> >>
> >> >> >> The kind of operations that might be suited to a spinlock protection
> > are:
> >> >> >>
> >> >> >> 1) Updating several fields atomically in some shared structure.
> >> >> >> 2) Adding/removing an entry from a shared list.
> >> >> >>
> >> >> >> As Don says a Mutex is much more sophisticated and expensive and
> > although
> >> >> >> very uself in Win32 programming is no match for a spinlock when it
> > comes
> >> >> >> to
> >> >> >> raw speed (this is why device drivers use them).
> >> >> >>
> >> >> >> The real question here is why has MS not provided developers with a
> >> >> >> spinlock
> >> >> >> API for user mode code?
> >> >> >>
> >> >> >> I had to hand craft such a beast in C, it has to support:
> >> >> >>
> >> >> >> 1) Nested locking (a spinlock can be acquired 'n' times and must then
> > be
> >> >> >> released 'n' times).
> >> >> >> 2) Read/Write locking modes.
> >> >> >> 3) Not end up dragging its heels because of these needs.
> >> >> >>
> >> >> >> If you do a lot of work with shared/mapped memory as we do, then such
> > an
> >> >> >> API
> >> >> >> is very important, if you rely on Mutexes it will cost dearly, also the
> >> >> >> interlock functions alone wont let you easily do the above (but they do
> >> >> >> provide the foundation).
> >> >> >>
> >> >> >> I've been interested for some time in seeing open source or public
> > domain
> >> >> >> implementations of a user-mode spinlock API (one that is truly tested
> > on
> >> >> >> multicore H/W) but there seems to be absolutely nothing, what we have
> >> >> >> seems
> >> >> >> to be pretty good but it is pretty complex and I worry sometimes...
> >> >> >>
> >> >> >> ;)
> >> >> >>
> >> >> >>
> >> >> >> "o0Zz" wrote:
> >> >> >>
> >> >> >>> Thank you very much !
> >> >> >>>
> >> >> >>> "Daniel Terhell" <daniel(a)resplendence.com> a ecrit dans le message de
> >> >> >>> groupe
> >> >> >>> de discussion : emRp7$LdKHA.2188(a)TK2MSFTNGP04.phx.gbl...
> >> >> >>> > "m" <m(a)b.c> wrote in message
> >> >> >>> > news:eP9kilIdKHA.6096(a)TK2MSFTNGP02.phx.gbl...
> >> >> >>> >> The loop will consume all 100% of a single CPU until the lock is
> >> >> >>> >> acquired, the thread is interrupted by a higher priority thread or
> > the
> >> >> >>> >> thread quantum ends (and the OS schedules another thread).
> >> >> >>> >>
> >> >> >>> >
> >> >> >>> > That information is incorrect, on Windows a spinlock raises to
> >> >> >>> > DISPATCH_LEVEL (or higher) so it is not possible to get interrupted
> > by
> >> >> >>> > any
> >> >> >>> > thread. It's not even possible for the thread to finish its quantum
> >> >> >>> > while
> >> >> >>> > it spins or for the dispatcher to schedule another thread on the
> > same
> >> >> >>> > CPU.
> >> >> >>> > It can get interrupted by an ISR but it will return (to the spinning
> >> >> >>> > state) as soon as the ISR has executed and returns control.
> >> >> >>> >
> >> >> >>> > //Daniel
> >> >> >>> >
> >> >> >>> >
> >> >> >>> .
> >> >>
> >> >> --
> >> >> Programmer's Goldmine collections:
> >> >>
> >> >> http://preciseinfo.org
> >> >>
> >> >> Tens of thousands of code examples and expert discussions on
> >> >> C++, MFC, VC, ATL, STL, templates, Java, Python, Javascript,
> >> >> organized by major topics of language, tools, methods, techniques.
> >> >>
> >> >> .
> >> >>

From: tanix on 24 Dec 2009 15:30

In article <7854FD96-2DDB-4A81-A2D0-184F37600308(a)microsoft.com>, =?Utf-8?B?SHVnbw==?= gleaves(a)hotmail.com> <hugh<underbar> wrote:
>First: The order in which my text appears is simply the default way the edit
>box presents itself to me in IE when I click "Reply", so gripe at MS for this.
>
>Second: A spinlock is NOT a "kernel mode mechanism" period. Yes the NT
>kernel uses spinlocks and has a small API to support them, but a spinlock is
>not something that one can only use or have in kernel mode, this the first of
>many errors you make.
>
>Read and learn somthing:
>
>http://en.wikipedia.org/wiki/Spinlock
>
>Note the sentence "In software engineering, a spinlock is a lock where the
>thread simply waits in a loop ("spins") repeatedly checking until the lock
>becomes available."

Correct. And so it spends 100% of your processing time
and nothing else may happen mean while.

That is NOT the solution for user mode.

That is all I am willing to say on this.
Cya.

>Where does that say "kernel mode"?
>
>In fact it says "For this reason, spinlocks are often used inside operating
>system kernels" note the term "often" clearly stating that they are also used
>elsewhere.
>
>If you are going to post rude and disparaging remarks simply when someone
>dares to disagree with you, then you fully deserve any humiliating responses
>such as this.
>
>H
>
>
>"tanix" wrote:
>
>> In article <59F509C6-D5A5-4049-A075-53DF0E2656DE(a)microsoft.com>,
> =?Utf-8?B?SHVnbw==?= gleaves(a)hotmail.com> <hugh<underbar> wrote:
>> >Since you have drifted off into insults
>>
>> Not really, if you comprehend what was being said.
>>
>> > and did not bother to read my orignal
>> >post
>>
>> Not sure how original was the post I was following up on.
>>
>> > (I have said three times here that I do not in an way suggest user mode
>> >code should use kernel mode mechanisms,
>>
>> Spinlocks ARE a kernel mode mechanism.
>> There is a reason Microsoft suggests to hold them for extremely
>> short periods of time. These guys are not exactly dumb.
>>
>> Anyway, I really have not much time or interest to deal with this.
>>
>> Just one point: realize that by top posting (writing your response
>> BEFORE the stuff you are following up, is one of the most despised
>> ways to post. You are presenting your position out of context
>> and readers do not know what in your response relates to what
>> in the article you are following up.
>>
>> That single thing by itself indicates to me you have a problem
>> with slopiness, and I would not be surprised it will eventually
>> translate into how you design and code things.
>>
>> It is best to reply to some specific statement in place.
>>
>> Beyond that, do as thou wilt.
>>
>> Cya.
>>
>> > but you simply didn't bother to read
>> >that) I will not bother commenting on what you wrote, except for this:
>>
>> >You seem to think that a spinlock is is something that only the kernel can
>> >do? is that your understanding of sofware engineering? Yes, the kernel has a
>
>> >spinlock API designed for use within the kernel.
>> >
>> >But anyone can write an equivalent API for user mode, all you need to do is
>> >use the Win32 InterlockXXX functions and you can easily create a real,
>> >working spinlock API.
>> >
>> >This is what I have been discussing.
>> >
>> >What if my shared memory contains thousands of distinct data items? What if
>> >we have a need to read/update these in real-time from multiple processes
> each
>> >running many threads? perhaps tens of thousands of times per second?
>> >
>> >Do you naively think a Mutex or two or a hundred is going to help?
>> >
>> >Is it not better to simply better to define a tny struct typedef "SpinLock"
>> >and have a tiny API that uses InterlockXXX to "lock" and the struct?
> spinning
>> >if the item is already locked?
>> >
>> >Is this not a very good solution especially if we KNOW that the updates are
>> >very short indeed?
>> >
>> >Once again I am not in way suggesting access too the kernel, you have failed
>
>> >to read my posts.
>> >
>> >Hugh
>> >
>> >
>> >
>> >
>> >Hugh
>> >
>> >
>> >"tanix" wrote:
>> >
>> >> In article <6B8D1539-D8C5-44AF-9646-261E8DDBEEBB(a)microsoft.com>,
>> > =?Utf-8?B?SHVnbw==?= gleaves(a)hotmail.com> <hugh<underbar> wrote:
>> >> >Well let me be perfectly clear I am not suggesting or advocating any
>> >> >interaction with any kernel mode mechanisms.
>> >> >
>> >> >If you work with shared mapped memory a great deal as we do, then you
> need
>> >> >very fast means of locking shared data structures, especially of this is
>> >> >happening very often.
>> >>
>> >> Fine. I had to deal with that kind of thing.
>> >> And then?
>> >>
>> >> >For example we have a shared heap library that we designed, it obvioulsy
>> >> >must handle safe updates to heap control metadata by many threads and
> many
>> >> >processes at the same time, perhaps tens of thousands of times a second.
>> >>
>> >> Understood. I had to deal with a similar situation on a global
>> >> voice messaging system, just like many people do with their products.
>> >>
>> >> >A mutex simply can't deliver the speed and efficiency.
>> >>
>> >> How do you know where is your REAL bottlenecks?
>> >>
>> >> Are you sure your system architecture is up to snuff?
>> >>
>> >> May be the whole thing is not correctly archtected?
>> >>
>> >> You can not just go for any critical point without seeing the whole
>> >> picture. ESPECIALLY, once you start suggesting things that FUNDAMENTALLY
>> >> violate the generations old principles of separation of kernel and
>> >> user mode?
>> >>
>> >> Do you think people that created these concepts are stoopid?
>> >> Infantile?
>> >>
>> >> You know better?
>> >>
>> >> Well, can YOU propose the solution then?
>> >> Do you think the idea of separating the user mode and kernel mode
>> >> is flawed? How so?
>> >>
>> >> >Getting/releaseing the Mutex costs far far more CPU cycles then updating
>> >> >four fields in a little structure and a pointer or two.
>> >>
>> >> Yep, that is what ALL of them say.
>> >> It is called looking for the trees and not seeing the forest.
>> >> OBVIOUSLY.
>> >>
>> >> Do you have an estimate on what kinds of problems your approach may
>> >> cause the the whole LIVELIHOOD of your system?
>> >>
>> >> Do you have an estimate on how many light years is it going to take
>> >> to deal with problems are you going to create as a result of your
>> >> great "improvement"?
>> >>
>> >> How many man years others would have to spend, forever fixing the
>> >> problems are you going to create, pretty much inevitably?
>> >>
>> >> Do you have a mathematical proof that your concept is going to be
>> >> stable?
>> >>
>> >> What DO you have beyond these small greedy things, tring to
>> >> squeeze some CPU cycles even from the places that clearly have
>> >> a label:
>> >>
>> >> DO NOT ENTER. KERNEL LEVEL. NO MORONS ALLOWED.
>> >>
>> >> Enough.
>> >>
>> >> >Most people perhaps use shared memory for the odd neat trick or some
> useful
>> >> >little mechanism, but if you rely on it as a vehicle for sharing/storing
>> > many
>> >> >gigs of data then you really do need speedy locks.
>> >> >
>> >> >Hugo
>> >> >
>> >> >
>> >> >"tanix" wrote:
>> >> >
>> >> >> In article <OFiPAJ$fKHA.1112(a)TK2MSFTNGP04.phx.gbl>, "Alexander
> Grigoriev"
>> >> > <alegr(a)earthlink.net> wrote:
>> >> >> >By definition, spinlock is a busy loop. It only makes sense to do if
> the
>> >> >> >spinlock owner is guaranteed to never get preempted by another thread.
> In
>> >
>> >> >> >kernel mode, this is done by setting IRQL to DISPATCH_LEVEL. In user
>> > mode,
>> >> >> >this is not possible.
>> >> >>
>> >> >> Good point.
>> >> >>
>> >> >> Actually, when I was doing a contract with Intel on a multi-gigabit
>> >> >> network card, there was one guy that was suggesting to do something
>> >> >> totally out of the wall in my view. Pretty much the same idea as in
>> >> >> this thread.
>> >> >>
>> >> >> They were trying to squeze out as much performance as possible.
>> >> >> So, this "smart", fat blob, came up with the idea to manipulate
>> >> >> the kernel level synchronization objects from the app level, such
>> >> >> as resetting the count of kernel level semaphores under some
> contitions.
>> >> >>
>> >> >> On a weekly project meeting I told him: you will never prove this
>> >> >> thing is going to work. That is the whole point with semaphores.
>> >> >> They have been mathematically proven to work. Once you start resetting
>> >> >> the counts at will, outside of the normal semaphore mechanics,
>> >> >> you'll be fixing this thing for years to come.
>> >> >>
>> >> >> The result?
>> >> >>
>> >> >> They fired me. Not him.
>> >> >> I wonder if they were ever able to make that network card to work.
>> >> >>
>> >> >> When they gave me this project to create a hack to manipulate the
>> >> >> kernel level structures from the user mode, and gave me a couple
>> >> >> of weeks to do that, I could not believe these smart donkeys.
>> >> >>
>> >> >> It takes YEARS to work on things like these. These are some of
>> >> >> the central concepts in O/S design and the very idea of separation
>> >> >> of kernel and user mode. I nearly flipped out when manager told
>> >> >> me my "project".
>> >> >>
>> >> >> Some people really need their brains examined.
>> >> >> Where did they get their estimates on "improving performance"
>> >> >> by slaughtering some of the most fundamental concepts in o/s
>> >> >> design and architecture just escapes me.
>> >> >>
>> >> >> >CRITICAL_SECTION provide an option of limited spinning in case of
>> >> >> >contention, if you expect that the CS will mostly be held for very
> short
>> >> >> >time.
>> >> >> >
>> >> >> >
>> >> >> >"Hugo gleaves(a)hotmail.com>" <hugh<underbar> wrote in message
>> >> >> >news:574400EE-AFAF-4620-BDDD-7E49608392E0(a)microsoft.com...
>> >> >> >> The idea of a spinlock is to ensure secure access/updates to some
>> > datum,
>> >> >> >> usually for operations that are very very brief. So although some
> other
>> >> >> >> thread/process may have to "spin" for a time, the time is deemed to
>> > brief
>> >> >> >> as
>> >> >> >> to be not a serious performance issue.
>> >> >> >>
>> >> >> >> The kind of operations that might be suited to a spinlock protection
>> > are:
>> >> >> >>
>> >> >> >> 1) Updating several fields atomically in some shared structure.
>> >> >> >> 2) Adding/removing an entry from a shared list.
>> >> >> >>
>> >> >> >> As Don says a Mutex is much more sophisticated and expensive and
>> > although
>> >> >> >> very uself in Win32 programming is no match for a spinlock when it
>> > comes
>> >> >> >> to
>> >> >> >> raw speed (this is why device drivers use them).
>> >> >> >>
>> >> >> >> The real question here is why has MS not provided developers with a
>> >> >> >> spinlock
>> >> >> >> API for user mode code?
>> >> >> >>
>> >> >> >> I had to hand craft such a beast in C, it has to support:
>> >> >> >>
>> >> >> >> 1) Nested locking (a spinlock can be acquired 'n' times and must
> then
>> > be
>> >> >> >> released 'n' times).
>> >> >> >> 2) Read/Write locking modes.
>> >> >> >> 3) Not end up dragging its heels because of these needs.
>> >> >> >>
>> >> >> >> If you do a lot of work with shared/mapped memory as we do, then
> such
>> > an
>> >> >> >> API
>> >> >> >> is very important, if you rely on Mutexes it will cost dearly, also
> the
>> >> >> >> interlock functions alone wont let you easily do the above (but they
> do
>> >> >> >> provide the foundation).
>> >> >> >>
>> >> >> >> I've been interested for some time in seeing open source or public
>> > domain
>> >> >> >> implementations of a user-mode spinlock API (one that is truly
> tested
>> > on
>> >> >> >> multicore H/W) but there seems to be absolutely nothing, what we
> have
>> >> >> >> seems
>> >> >> >> to be pretty good but it is pretty complex and I worry sometimes...
>> >> >> >>
>> >> >> >> ;)
>> >> >> >>
>> >> >> >>
>> >> >> >> "o0Zz" wrote:
>> >> >> >>
>> >> >> >>> Thank you very much !
>> >> >> >>>
>> >> >> >>> "Daniel Terhell" <daniel(a)resplendence.com> a ecrit dans le message
> de
>> >> >> >>> groupe
>> >> >> >>> de discussion : emRp7$LdKHA.2188(a)TK2MSFTNGP04.phx.gbl...
>> >> >> >>> > "m" <m(a)b.c> wrote in message
>> >> >> >>> > news:eP9kilIdKHA.6096(a)TK2MSFTNGP02.phx.gbl...
>> >> >> >>> >> The loop will consume all 100% of a single CPU until the lock is
>> >> >> >>> >> acquired, the thread is interrupted by a higher priority thread
> or
>> > the
>> >> >> >>> >> thread quantum ends (and the OS schedules another thread).
>> >> >> >>> >>
>> >> >> >>> >
>> >> >> >>> > That information is incorrect, on Windows a spinlock raises to
>> >> >> >>> > DISPATCH_LEVEL (or higher) so it is not possible to get
> interrupted
>> > by
>> >> >> >>> > any
>> >> >> >>> > thread. It's not even possible for the thread to finish its
> quantum
>> >> >> >>> > while
>> >> >> >>> > it spins or for the dispatcher to schedule another thread on the
>> > same
>> >> >> >>> > CPU.
>> >> >> >>> > It can get interrupted by an ISR but it will return (to the
> spinning
>> >> >> >>> > state) as soon as the ISR has executed and returns control.
>> >> >> >>> >
>> >> >> >>> > //Daniel
>> >> >> >>> >
>> >> >> >>> >
>> >> >> >>> .
>> >> >>
>> >> >> --
>> >> >> Programmer's Goldmine collections:
>> >> >>
>> >> >> http://preciseinfo.org
>> >> >>
>> >> >> Tens of thousands of code examples and expert discussions on
>> >> >> C++, MFC, VC, ATL, STL, templates, Java, Python, Javascript,
>> >> >> organized by major topics of language, tools, methods, techniques.
>> >> >>
>> >> >> .
>> >> >>

--
Programmer's Goldmine collections:

http://preciseinfo.org

Tens of thousands of code examples and expert discussions on
C++, MFC, VC, ATL, STL, templates, Java, Python, Javascript,
organized by major topics of language, tools, methods, techniques.

First | Prev | Next | Last
Pages: 1 2 3 4 5 6 7 8 9
Prev: Worm in msvcr80.dll?
Next: object manager name space \?? \\. \?