Random number generators [Fortran]

Prev: pointer and allocatable arrays
Next: f95 to windows dll

From: nmm1 on 19 Nov 2009 15:38

In article <%ShNm.55991$ze1.49888(a)news-server.bigpond.net.au>,
robin <robin_v(a)bigpond.com> wrote:
>"David Flower" <DavJFlower(a)AOL.COM> wrote in message
>news:8b69b09f-fd08-44a2-a48b-03afab751b88(a)z41g2000yqz.googlegroups.com...
>
>>Posters may be interested in the following reference:
>
>>A.C.M. Trans. Math, Software, 5, #2, 132 (1979) by Linus Schrage
>
>It's a bit ancient.
>Those by George Marsaglia are not only portable,
>they also have extremely long periods.
>
>His RNGs include 32-bit generators and 64-bit generators.

32-bit generators should never be used in any simulation which
uses more than a million numbers in total. To a first approximation,
ALWAYS use 64-bit ones.

Not all of Marsaglia's generators are free from 32-bit defects, even
when they are used in 64-bit forms, though most of them are good or
very good. Their actual code isn't always very portable, but that's
easy to clean up.

Regards,
Nick Maclaren.

From: glen herrmannsfeldt on 19 Nov 2009 16:25

nmm1(a)cam.ac.uk wrote:
(snip)

> 32-bit generators should never be used in any simulation which
> uses more than a million numbers in total. To a first approximation,
> ALWAYS use 64-bit ones.

All the ones I knew from 40 years ago were 32 bit (or 31 bit).
For many problems, that is probably enough. I have noticed that
the games that come with Windows have bad RNGs. Of course closed
source so we really don't know how they do it.

> Not all of Marsaglia's generators are free from 32-bit defects, even
> when they are used in 64-bit forms, though most of them are good or
> very good. Their actual code isn't always very portable, but that's
> easy to clean up.

-- glen

From: frank on 19 Nov 2009 19:40

On Wed, 18 Nov 2009 12:03:34 +0000, glen herrmannsfeldt wrote:

> David Jones <dajxxxx(a)ceh.ac.uk> wrote:
>> steve wrote:
>>> On Nov 16, 6:04 pm, frank <fr...(a)example.invalid> wrote:
> (snip)
>
>>>> Isn't there an asymmetry in the unit interval though as to which
>>>> endpoint is included? So if there's N outcomes on one side of .5
>>>> there would be N +-1 on the other.
>
>>> Can you restate your question without a little more detail?
>> <snip>
>
>> The point centres on the explicit statement of the range in the
>> standard as 0<= x < 1. Thus an exact value of zero is allowed as an
>> output. while an exact value of one is not. So there is asymmetry on
>> this point. For some uses, the question of whether or not exact zeroes
>> and ones are ever returned is important and is sometimes addressed by
>> testing for such values and re-generating another as necessary.
>> Obviously such testing should be avoided if the zero or one case is
>> guaranteed not to occur. Random number generators are such as to
>> naturally include zero as a possible output, and to avoid this would
>> involve slowing the code down.
>
> Consider a true random number generator (not a pseudo-random number
> generator, as is usually used) generating an IEEE double with a 53 bit
> significand generating uniformly 2**53 possible values, 0<=x<1. The
> expected standard deviation of the mean after averaging N values is
> 1/sqrt(N). To notice a deviation of 1 in 2**53, the offset due to the
> asymmetry, requires N of about 2**104, or about 1e31. At 1GHz, that
> takes 1e22 seconds, or, if I did it right, about 6e14 years. Good PRNG
> generate enough additional bits internally that it will be about as hard
> to notice.

That's on one end of the spectrum. On the other is me, PC Joe in single
precision. I've worked a lot with this material, for example when I was
working up different sorts.

Things look pretty glompy down here:

program rand

implicit integer(a-t)

integer, parameter :: r = 10

real, dimension(1:r) :: myarray

real u
integer m

call init_seed

do i =1,r

call random_number(u)

myarray(i) = u

end do

m = 2**30 * 1.9

print *, "myarray is ", myarray

print *, "m is ", m

contains

subroutine init_seed()

integer :: n, ival(8), v(3), i

integer, allocatable :: seed(:)

call date_and_time(values=ival)

v(1) = ival(8) + 2048*ival(7)

v(2) = ival(6) + 64*ival(5) ! value(4) isn't really 'random'

v(3) = ival(3) + 32*ival(2) + 32*8*ival(1)

call random_seed(size=n)

allocate(seed(n))

call random_seed() ! Give the seed an implementation-dependent kick

call random_seed(get=seed)

do i=1, n

seed(i) = seed(i) + v(mod(i-1, 3) + 1)

enddo

call random_seed(put=seed)

deallocate(seed)

end subroutine

end program rand

! gfortran ran1.f90 -Wall -Wextra -o out

dan(a)dan-desktop:~/source$ gfortran ran1.f90 -Wall -Wextra -o out
dan(a)dan-desktop:~/source$ ./out
myarray is 0.92712229 0.85859424 0.79266590
0.57997841 0.31713003 0.38777798 0.23500705
0.40011257 0.90164834 0.72985947
m is 2040109440
dan(a)dan-desktop:~/source$

Since 2**31 overflowed, I think m is close to maybe 2.1 billion outcomes
to expect on the unit interval. If m =~ 2n +-1, then n is on the order
of one billion, which was only a large number while the great William
Proxmire was alive.
>
>> A description of a RNG should explicity state the range and
>> discretisation of the values that might be reurned, so that it would be
>> possible to see whether just rejecting zero values would leave the set
>> of allowed values symmetric about zero.
>
> If processor speeds double every year for the next 100 years then you
> might have a chance to notice. If by then the generators switch to a
> longer return value then you won't.

Glen, I would almost be scared if our processor speeds trended along the
same curve for the next forty years.
--
frank

"Guns: yes, they are harmful."

From: frank on 19 Nov 2009 19:45

On Mon, 16 Nov 2009 18:48:57 -0800, steve wrote:

>> Isn't there an asymmetry in the unit interval though as to which
>> endpoint is included? So if there's N outcomes on one side of .5 there
>> would be N +-1 on the other.
>
> Can you restate your question without a little more detail? For
> example,
> for N=4, random_number in gfortran returns 0.998, 0.567, 0.966, 0.748.
> If I draw 10000 numbers and count the number in [0,0.5) and [0.5,1), the
> outcome is 4926 and 5074. If I draw 10000000 numbers, I get 4999679 and
> 50000321. If I reseed the rng with a different seed, these numbers
> change.

At least conceptually I think it's important to get the [ ) [ )
correct on this on the test, as you have it. So the test would be if
(x.gt. (.5)).
--
frank

"Guns: yes, they are harmful."

From: steve on 19 Nov 2009 20:22

On Nov 19, 4:45 pm, frank <fr...(a)example.invalid> wrote:
> On Mon, 16 Nov 2009 18:48:57 -0800, steve wrote:
> >> Isn't there an asymmetry in the unit interval though as to which
> >> endpoint is included? So if there's N outcomes on one side of .5 there
> >> would be N +-1 on the other.
>
> > Can you restate your question without a little more detail? For
> > example,
> > for N=4, random_number in gfortran returns 0.998, 0.567, 0.966, 0.748..
> > If I draw 10000 numbers and count the number in [0,0.5) and [0.5,1), the
> > outcome is 4926 and 5074. If I draw 10000000 numbers, I get 4999679 and
> > 50000321. If I reseed the rng with a different seed, these numbers
> > change.
>
> At least conceptually I think it's important to get the [ ) [ )
> correct on this on the test, as you have it. So the test would be if
> (x.gt. (.5)).

With your suggested test, the intervals are [0,0.5] and (0.5,1).
See Ron Shepard's post with respect to finite-precision floating point
numbers.

--
steve

First | Prev | Next | Last
Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13
Prev: pointer and allocatable arrays
Next: f95 to windows dll