Pseudorandom number generator: Difference between revisions
en>Theopolisme m Reverted edit(s) by 117.254.148.26 identified as unconstructive or controvertial (and unsourced) using STiki |
en>Monkbot |
||
Line 1: | Line 1: | ||
The '''Mersenne twister''' is a [[pseudorandom number generator]] (PRNG). It is, by far, the most widely used PRNG.<ref>E.g. Marsland S. (2011) ''Machine Learning'' ([[CRC Press]]), §4.1.1. Also see the section "Adoption in software systems".</ref> Its name derives from the fact that its period length is chosen to be a [[Mersenne prime]]. | |||
The Mersenne Twister was developed in 1997 by {{nihongo|[[Makoto Matsumoto]]|松本 眞}} and {{nihongo|[[Takuji Nishimura]]|西村 拓士}}.<ref>{{cite doi|10.1145/272991.272995}}</ref> It was designed specifically to rectify most of the flaws found in older PRNGs. It was the first PRNG to provide fast generation of high-quality pseudorandom integers. | |||
The most commonly-used version of the Mersenne Twister algorithm is based on the Mersenne prime 2<sup>19937</sup>−1. The standard implementation of that, MT19937, uses a 32-bit word length. There is another implementation that uses a 64-bit word length, MT19937-64; it generates a different sequence. | |||
== Adoption in software systems == | |||
The Mersenne Twister is the default PRNG for | |||
[[R (programming language)|R]],<ref>{{cite web|title=Random Number Generators|work=CRAN Task View: Probability Distributions | |||
|url=http://cran.r-project.org/web/views/Distributions.html|accessdate=2012-05-29}}</ref> | |||
[[Python (programming language)|Python]],<ref>{{cite web|title=9.6 random — Generate pseudo-random numbers|work=Python v2.6.8 documentation | |||
|url=http://docs.python.org/release/2.6.8/library/random.html|accessdate=2012-05-29}}</ref><ref>{{cite web|title=8.6 random — Generate pseudo-random numbers | |||
|work=Python v3.2 documentation|url=http://docs.python.org/release/3.2/library/random.html | |||
|accessdate=2012-05-29}}</ref> | |||
[[Ruby (programming language)|Ruby]],<ref>{{cite web|title="Random" class documentation|work=Ruby 1.9.3 documentation | |||
|url=http://www.ruby-doc.org/core-1.9.3/Random.html|accessdate=2012-05-29}}</ref> | |||
[[IDL (programming language)|IDL]],<ref>{{cite web|title=RANDOMU (IDL Reference)|work=Exelis VIS Docs Center | |||
|url=http://www.exelisvis.com/docs/RANDOMU.html|accessdate=2013-08-23}}</ref> | |||
[[Free Pascal]],<ref>{{cite web|title=random|work=free pascal documentation | |||
|url=http://www.freepascal.org/docs-html/rtl/system/random.html|accessdate=2013-11-28}}</ref> | |||
[[PHP]],<ref>{{cite web|title=mt_srand|work=php documentation | |||
|url=http://php.net/manual/en/function.mt-srand.php|accessdate=2012-05-29}}</ref> | |||
[[Maple (software)|Maple]],<ref>{{cite web|title=random number generator|work=Maple Online Help | url=http://www.maplesoft.com/support/help/Maple/view.aspx?path=rand|accessdate=2013-11-21}}</ref> | |||
[[MATLAB]], | |||
[[GAUSS (software)|GAUSS]],<ref>[http://www.aptech.com/wp-content/uploads/2014/01/GAUSS14_LR.pdf GAUSS 14 Language Reference]</ref> | |||
[[CMU Common Lisp]],<ref>{{cite web|title=Design choices and extensions|work=CMUCL User's Manual | |||
|url=http://common-lisp.net/project/cmucl/doc/cmu-user/extensions.html |accessdate=2014-02-03}}</ref> | |||
the [[GNU Multiple Precision Arithmetic Library]],<ref>{{cite web|title=Randum Number Algorithms | work= GNU MP| url=http://gmplib.org/manual/Random-Number-Algorithms.html |accessdate=2013-11-21}}</ref> | |||
and the [[GNU Scientific Library]].<ref>{{cite web|title=Random number environment variables| work=GNU Scientific Library| url=http://www.gnu.org/software/gsl/manual/html_node/Random-number-environment-variables.html | accessdate=2013-11-24}}</ref> | |||
It is also available in [[C++]]<ref>{{cite web|title=std::mersenne_twister_engine|work=Pseudo Random Number Generation| | |||
url=http://en.cppreference.com/w/cpp/numeric/random/mersenne_twister_engine|accessdate=2012-09-25}}</ref> since [[C++11]]. Add-on implementations are provided by the [[Boost (C++ libraries)|Boost C++ Libraries]],<ref> | |||
{{cite web | |||
|title=boost/random/mersenne_twister.hpp | |||
|work=Boost C++ Libraries | |||
|url=http://www.boost.org/doc/libs/1_49_0/boost/random/mersenne_twister.hpp | |||
|accessdate=2012-05-29}}</ref> | |||
[[Glib]],<ref> | |||
{{cite web | |||
|title=Changes to GLib | |||
|work=GLib Reference Manual | |||
|url=http://developer.gnome.org/glib/stable/glib-changes.html | |||
|accessdate=2012-05-29}}</ref> | |||
and the [[NAG Numerical Library]].<ref> | |||
{{cite web | |||
|title=G05 – Random Number Generators | |||
|work=NAG Library Chapter Introduction | |||
|url=http://www.nag.co.uk/numeric/fl/nagdoc_fl23/xhtml/G05/g05intro.xml | |||
|accessdate=2012-05-29}}</ref> | |||
The Mersenne Twister is one of two PRNGs in [[SPSS]]: the other generator is kept only for compatibility with older programs, and the Mersenne Twister is stated to be "more reliable".<ref>{{cite web| title=Random Number Generators | work=IBM SPSS Statistics| url= http://pic.dhe.ibm.com/infocenter/spssstat/v20r0m0/index.jsp?topic=%2Fcom.ibm.spss.statistics.help%2Fidh_seed.htm | accessdate=2013-11-21}}</ref> | |||
The Mersenne Twister is similarly one of the PRNGs in [[SAS (software)|SAS]]: the other generators are older and deprecated.<ref>{{cite web| title=Using Random-Number Functions| work=SAS Language Reference|url=http://support.sas.com/documentation/cdl/en/lrdict/64316/HTML/default/viewer.htm#a001281561.htm | accessdate=2013-11-21}}</ref> | |||
==Advantages== | |||
The commonly-used version of Mersenne Twister, MT19937, which produces a sequence of 32-bit integers, has the following desirable properties: | |||
# It has a very long period of 2<sup>19937</sup> − 1. While a long period is not a guarantee of quality in a random number generator, short periods (such as the 2<sup>32</sup> common in many older software packages) can be problematic.<ref>Note: 2<sup>19937</sup> is approximately 4.3 × 10<sup>6001</sup>; this is many orders of magnitude larger than the estimated number of particles in the [[observable universe#Matter content|observable universe]], which is 10<sup>87</sup>.</ref> | |||
# It is ''k''-distributed to 32-bit accuracy for every 1 ≤ ''k'' ≤ 623 (see definition below). | |||
# It passes numerous tests for statistical randomness, including the [[Diehard tests]]. | |||
==Disadvantages== | |||
It passes most, but not all, of the stringent [http://www.iro.umontreal.ca/~simardr/testu01/tu01.html TestU01] [[TestU01|Crush]] randomness tests.<ref>P. L'Ecuyer and R. Simard, [http://www.iro.umontreal.ca/~lecuyer/myftp/papers/testu01.pdf TestU01: "A C Library for Empirical Testing of Random Number Generators"], ACM Transactions on Mathematical Software, 33, 4, Article 22, August 2007.</ref> | |||
It can take a long time to turn a non-random initial state—particularly an initial state with many zeros—into output that passes [[randomness tests]]. A consequence of this is that two instances of the generator, started with an almost the same initial state will output nearly the same sequence for a long time before eventually diverging. | |||
== ''k''-distribution == | |||
A pseudorandom sequence ''x<sub>i</sub>'' of ''w''-bit integers of period ''P'' is said to be ''k''-distributed to ''v''-bit accuracy if the following holds. | |||
: Let ''trunc<sub>v</sub>(x)'' denote the number formed by the leading ''v'' bits of ''x'', and consider ''P'' of the ''kv''-bit vectors | |||
:: <math> (\text{trunc}_v(x_i), \, \text{trunc}_v(x_{i+1}), \, ..., \, \text{trunc}_v(x_{i+k-1})) \quad (0\leq i< P) </math>. | |||
: Then each of the 2<sup>''kv''</sup> possible combinations of bits occurs the same number of times in a period, except for the all-zero combination that occurs once less often. | |||
== Alternatives == | |||
The algorithm in its native form is not suitable for [[cryptography]] (unlike [[Blum Blum Shub]]). Observing a sufficient number of iterations (624 in the case of MT19937, since this is the size of the state vector from which future iterations are produced) allows one to predict all future iterations. A pair of cryptographic stream ciphers based on output from Mersenne Twister has been proposed by Makoto Matsumoto et al. The authors claim speeds 1.5 to 2 times faster than [[Advanced Encryption Standard]] in [[counter mode]].<ref>{{Cite web|title=Cryptographic Mersenne Twister and Fubuki Stream/Block Cipher|last1=Matsumoto|first1=Makoto|last2=Nishimura|first2=Takuji|last3=Hagita|first3=Mariko|last4=Saito|first4=Mutsuo|year=2005|url=http://eprint.iacr.org/2005/165.pdf|postscript=<!--None-->}}</ref> | |||
The Mersenne Twister is sensitive to poor initialization and can take a long time to recover from a zero-excess initial state. An alternative, [[Well Equidistributed Long-period Linear|WELL ("Well Equidistributed Long-period Linear")]], has quicker recovery, the same or better performance and equal randomness.<ref>P. L'Ecuyer, "Uniform Random Number Generators", ''International Encyclopedia of Statistical Science'', Lovric, Miodrag (Ed.), Springer-Verlag, 2010.</ref> | |||
== Algorithmic detail == | |||
For a ''k''-bit word length, the Mersenne Twister generates integers in the range [0, 2<sup>''k''</sup>−1]. | |||
The Mersenne Twister algorithm is based on a [[Recurrence relation|matrix linear recurrence]] over a finite [[binary numeral system|binary]] [[field (mathematics)|field]] ''F''<sub>2</sub>. The algorithm is a twisted [[generalised feedback shift register]]<ref>{{cite doi|10.1145/146382.146383}}</ref> (twisted GFSR, or TGFSR) of [[rational normal form]] (TGFSR(R)), with state bit reflection and tempering. It is characterized by the following quantities: | |||
* ''w'': word size (in number of bits) | |||
* ''n'': degree of recurrence | |||
* ''m'': middle word, or the number of parallel sequences, 1 ≤ ''m'' ≤ ''n'' | |||
* ''r'': separation point of one word, or the number of bits of the lower bitmask, 0 ≤ ''r'' ≤ ''w'' - 1 | |||
* '''''a''''': coefficients of the rational normal form twist matrix | |||
* '''''b''''', '''''c''''': TGFSR(R) tempering bitmasks | |||
* ''s'', ''t'': TGFSR(R) tempering bit shifts | |||
* ''u'', ''l'': additional Mersenne Twister tempering bit shifts | |||
with the restriction that 2<sup>''nw'' − ''r''</sup> − 1 is a Mersenne prime. This choice simplifies the primitivity test and ''k''-distribution test that are needed in the parameter search. | |||
For a word '''''x''''' with ''w'' bit width, it is expressed as the recurrence relation | |||
:<math>x_{k+n} := x_{k+m} \oplus ({x_k}^u \mid {x_{k+1}}^l) A \qquad \qquad k=0,1,\ldots</math> | |||
with | as the bitwise [[Logical disjunction|or]] and <math>\oplus</math> as the bitwise [[exclusive or]] (XOR), '''''x'''''<sup>''u''</sup>, '''''x'''''<sup>''l''</sup> being '''''x''''' with upper and lower bitmasks applied. The twist transformation ''A'' is defined in rational normal form | |||
<math> | |||
A = R = \begin{pmatrix} 0 & I_{w - 1} \\ a_{w-1} & (a_{w - 2}, \ldots , a_0) \end{pmatrix} | |||
</math> | |||
with ''I''<sub>''n'' − 1</sub> as the (''n'' − 1) × (''n'' − 1) identity matrix (and in contrast to normal matrix multiplication, bitwise XOR replaces addition). The rational normal form has the benefit that it can be efficiently expressed as | |||
<math> | |||
\boldsymbol{x}A = \begin{cases}\boldsymbol{x} \gg 1 & x_0 = 0\\(\boldsymbol{x} \gg 1) \oplus \boldsymbol{a} & x_0 = 1\end{cases} | |||
</math> | |||
where | |||
:<math>\boldsymbol{x} := ({x_k}^u \mid {x_{k+1}}^l) \qquad \qquad k=0,1,\ldots</math> | |||
In order to achieve the 2<sup>''nw'' − ''r''</sup> − 1 theoretical upper limit of the period in a TGFSR, ''φ''<sub>''B''</sub>(''t'') must be a [[primitive polynomial (field theory)|primitive polynomial]], ''φ''<sub>''B''</sub>(''t'') being the [[characteristic polynomial]] of | |||
<math> | |||
B = \begin{pmatrix} | |||
0 & I_{w} & \cdots & 0 & 0 \\ | |||
\vdots & & & & \\ | |||
I_{w} & \vdots & \ddots & \vdots & \vdots \\ | |||
\vdots & & & & \\ | |||
0 & 0 & \cdots & I_{w} & 0 \\ | |||
0 & 0 & \cdots & 0 & I_{w - r} \\ | |||
S & 0 & \cdots & 0 & 0 | |||
\end{pmatrix} | |||
\begin{matrix} | |||
\\ \\ \leftarrow m\hbox{-th row} \\ \\ \\ \\ | |||
\end{matrix} | |||
</math> | |||
<math> | |||
S = \begin{pmatrix} 0 & I_{r} \\ I_{w - r} & 0 \end{pmatrix} A | |||
</math> | |||
The twist transformation improves the classical GFSR with the following key properties: | |||
* Period reaches the theoretical upper limit 2<sup>''nw'' − ''r''</sup> − 1 (except if initialized with 0) | |||
* Equidistribution in ''n'' dimensions (e.g. [[linear congruential generator]]s can at best manage reasonable distribution in 5 dimensions) | |||
As like TGFSR(R), the Mersenne Twister is cascaded with a [[Tempered representation|tempering transform]] to compensate for the reduced dimensionality of equidistribution (because of the choice of ''A'' being in the rational normal form), which is equivalent to the transformation ''A'' = ''R'' → ''A'' = ''T''<sup>−1</sup>''RT'', ''T'' invertible. The tempering is defined in the case of Mersenne Twister as | |||
:'''''y''''' := '''''x''''' ⊕ ('''''x''''' >> ''u'') | |||
:'''''y''''' := :'''''y''''' ⊕ (('''''y''''' << ''s'') & '''''b''''') | |||
:'''''y''''' := :'''''y''''' ⊕ (('''''y''''' << ''t'') & '''''c''''') | |||
:'''''z''''' := '''''y''''' ⊕ ('''''y''''' >> ''l'') | |||
with <<, >> as the bitwise left and right shifts, and & as the bitwise [[Logical conjunction|and]]. The first and last transforms are added in order to improve lower bit equidistribution. From the property of TGFSR, <math>s + t \ge \lfloor w/2 \rfloor - 1</math> is required to reach the upper bound of equidistribution for the upper bits. | |||
The coefficients for MT19937 are: | |||
* (''w'', ''n'', ''m'', ''r'') = (32, 624, 397, 31) | |||
* '''''a''''' = 9908B0DF<sub>16</sub> | |||
* ''u'' = 11 | |||
* (''s'', '''''b''''') = (7, 9D2C5680<sub>16</sub>) | |||
* (''t'', '''''c''''') = (15, EFC60000<sub>16</sub>) | |||
* ''l'' = 18 | |||
A small [[lagged Fibonacci generator]] or [[linear congruential generator]] usually is used to seed the Mersenne Twister with random initial values.{{Citation needed|date=February 2013}} | |||
==Pseudocode== | |||
The following piece of [[pseudocode]] generates uniformly distributed 32-bit integers in the range [0, 2<sup>32</sup> − 1] with the MT19937 algorithm: | |||
''// Create a length 624 array to store the state of the generator'' | |||
'''int'''[0..623] MT | |||
'''int''' index = 0 | |||
''// Initialize the generator from a seed'' | |||
'''function''' initialize_generator('''int''' seed) { | |||
index := 0 | |||
MT[0] := seed | |||
'''for''' i '''from''' 1 '''to''' 623 { ''// loop over each other element'' | |||
MT[i] := '''last''' 32 bits '''of'''(1812433253 * (MT[i-1] '''[[Bitwise operation#XOR|xor]]''' ('''right shift by''' 30 bits(MT[i-1]))) + i) ''// 0x6c078965'' | |||
} | |||
} | |||
''// Extract a tempered pseudorandom number based on the index-th value,'' | |||
''// calling generate_numbers() every 624 numbers'' | |||
'''function''' extract_number() { | |||
'''if''' index == 0 { | |||
generate_numbers() | |||
} | |||
'''int''' y := MT[index] | |||
y := y '''[[Bitwise operation#XOR|xor]]''' ('''right shift by''' 11 bits(y)) | |||
y := y '''[[Bitwise operation#XOR|xor]]''' ('''left shift by''' 7 bits(y) '''[[Bitwise operation#AND|and]]''' (2636928640)) ''// 0x9d2c5680'' | |||
y := y '''[[Bitwise operation#XOR|xor]]''' ('''left shift by''' 15 bits(y) '''[[Bitwise operation#AND|and]]''' (4022730752)) ''// 0xefc60000'' | |||
y := y '''[[Bitwise operation#XOR|xor]]''' ('''right shift by''' 18 bits(y)) | |||
index := (index + 1) '''[[modulo operation|mod]]''' 624 | |||
'''return''' y | |||
} | |||
''// Generate an array of 624 untempered numbers'' | |||
'''function''' generate_numbers() { | |||
'''for''' i '''from''' 0 '''to''' 623 { | |||
'''int''' y := (MT[i] '''[[Bitwise operation#AND|and]]''' 0x80000000) ''// bit 31 (32nd bit) of MT[i]'' | |||
+ (MT[(i+1) '''[[modulo operation|mod]]''' 624] '''[[Bitwise operation#AND|and]] '''0x7fffffff) ''// bits 0-30 (first 31 bits) of MT[...]'' | |||
MT[i] := MT[(i + 397) '''[[modulo operation|mod]]''' 624] '''[[Bitwise operation#XOR|xor]]''' ('''right shift by''' 1 bit(y)) | |||
'''if''' (y '''[[modulo operation|mod]]''' 2) != 0 { ''// y is odd'' | |||
MT[i] := MT[i] '''[[Bitwise operation#XOR|xor]]''' (2567483615) ''// 0x9908b0df'' | |||
} | |||
} | |||
} | |||
== SFMT == | |||
{{Expand section|date=June 2007}} | |||
SFMT, the [[SIMD]]-oriented Fast Mersenne Twister, is a variant of Mersenne Twister, introduced in 2006,<ref>[http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/SFMT/index.html SIMD-oriented Fast Mersenne Twister (SFMT)<!-- Bot generated title -->]</ref> designed to be fast when it runs on 128-bit SIMD. | |||
*It is roughly twice as fast as Mersenne Twister.<ref>[http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/SFMT/speed.html SFMT:Comparison of speed<!-- Bot generated title -->]</ref> | |||
*It has a better [[equidistribution]] property of v-bit accuracy than MT but worse than [[Well Equidistributed Long-period Linear|WELL ("Well Equidistributed Long-period Linear")]]. | |||
*It has quicker recovery from zero-excess initial state than MT, but slower than WELL. | |||
*It supports various periods from 2<sup>607</sup>−1 to 2<sup>216091</sup>−1. | |||
Intel [[SSE2]] and [[PowerPC]] AltiVec are supported by SFMT. It is also used for games with the [[Cell (microprocessor)|Cell BE]] in the [[PlayStation 3]].<ref>[http://www.scei.co.jp/ps3-license/index.html PLAYSTATION 3 License<!-- Bot generated title -->]</ref> | |||
==MTGP== | |||
MTGP is a variant of Mersenne Twister optimised for [[Graphics processing unit|GPUs]] published by Mutsuo Saito and Makoto Matsumoto.<ref>{{cite arXiv |eprint=1005.4973|version= v3 |title= Variants of Mersenne Twister Suitable for Graphic Processors|class= cs.MS|year= 2010|accessdate= |author1= Mutsuo Saito |author2= Makoto Matsumoto }}</ref> The basic linear recurrence operations are extended from MT and parameters are chosen to allow many threads to compute the recursion in parallel, while sharing their state space to reduce memory load. Sample code [http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/MTGP/index.html] for [[CUDA]] includes parameter sets suitable for 256, 512 and 1024 parallel threads per block, and up to 200 blocks generating independent random streams. The paper claims improved [[equidistribution]] over MT and performance on a high specification GPU ([[Nvidia]] GTX260 with 192 cores) of 4.7ms for 5x10<sup>7</sup> random 32-bit integers. | |||
==Implementations in various languages== | |||
<div style="-moz-column-count:2; column-count:2;"> | |||
* [http://help.sap.com/abapdocu_70/en/ABENCL_ABAP_MATH.htm ABAP] | |||
* [http://flashexperiments.insh-allah.com/#Mersenne_Twister_ported_to_ActionScript ActionScript 1] | |||
* [https://github.com/skyboy/AS3-Utilities/blob/master/skyboy/utils/Random.as ActionScript 3.0] | |||
* [http://adrianhoe.com/adrianhoe/projects/adamt19937/ Ada] | |||
* [http://www.mrpt.org/Example:Random_number_generation C++] | |||
* [http://randomlib.sourceforge.net C++] | |||
* [http://my.opera.com/metrallik/blog/2013/04/19/c-class-for-random-generation-with-mersenne-twister-method C++] | |||
* [http://www.hackinghat.com/index.php/lisp/mersenne-twister-in-clojure Clojure] | |||
* [http://clean.cs.ru.nl/Download/Download_Libraries/mt/body_mt.html Clean] | |||
* [http://adam.ierymenko.name/files/MersenneTwister32_spe.cpp C++ Sony Cell Broadband Engine] | |||
* [http://code.msdn.microsoft.com/MersenneTwister C# ] | |||
* [https://github.com/D-Programming-Language/phobos/blob/master/std/random.d D] | |||
* [https://github.com/analogic/mersenne_twister Dart] | |||
* [http://github.com/jj1bdx/sfmt-erlang/ Erlang] | |||
* [http://www.rapideuphoria.com/mt.zip Euphoria] | |||
* [http://www.ntrand.com/ Excel addin] | |||
* [http://www.falvotech.com/blog2/blog.fs Forth] | |||
* [http://theo.phys.sci.hiroshima-u.ac.jp/~ishikawa/PRNG/mt_stream_en.html Fortran 95:] | |||
* [http://pastebin.com/jgFzXs1H F#] | |||
* [https://github.com/seehuhn/mt19937 Go] | |||
* [http://www.gnu.org/software/gsl/ The GNU Scientific Library (GSL)] | |||
* [http://www.augustsson.net/Darcs/MT/ Haskell] | |||
* [http://hackage.haskell.org/package/mersenne-random-pure64 Haskell] | |||
* [http://www.cs.gmu.edu/~sean/research/ Java] | |||
* [http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/VERSIONS/JAVASCRIPT/java-script.html JavaScript] | |||
* [https://gist.github.com/300494 JavaScript] | |||
* [http://cybertiggyr.com/gene/jmt/ Lisp] | |||
* [http://www.tecgraf.puc-rio.br/~lhf/ftp/lua/ Lua] | |||
* [http://www.mitrionics.com/?page=page49058d1d215e8 Mitrion-C] | |||
* [http://www.wolfgang-ehrhardt.de/misc_en.html#prng Pascal/FreePascal/Delphi] | |||
* [http://search.cpan.org/search?module=Math%3A%3ARandom%3A%3AMT%3A%3AAuto Perl] | |||
* [http://kingfisher.nfshost.com/sw/twister/ PHP 5.3.0] | |||
* [http://my.opera.com/metrallik/blog/2013/04/19/python-class-for-random-generation-with-mersenne-twister Python] | |||
* [http://stat.ethz.ch/R-manual/R-patched/library/base/html/Random.html R] | |||
* [http://web.archive.org/web/20080218152740/http://www.aaronballman.com/programming/REALbasic/Rand.php REALbasic] | |||
* [http://mlton.org/cgi-bin/viewsvn.cgi/mltonlib/trunk/org/mlton/ville/mersenne-twister/unstable/ Standard ML] | |||
* [http://www.simul8.com/prng.htm SIMUL8] | |||
* [https://gist.github.com/887771 Scala] | |||
* [http://www.quantcode.com/modules/mydownloads/singlefile.php?cid=9&lid=610/ VBA] | |||
* [http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/VERSIONS/BASIC/basic.html Visual Basic] | |||
</div> | |||
==References== | |||
{{Reflist|30em}} | |||
== External links == | |||
* [http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/ARTICLES/earticles.html The academic paper for MT, and related articles by Makoto Matsumoto] | |||
* [http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/emt.html Mersenne Twister home page, with codes in C, Fortran, Java, Lisp and some other languages] | |||
* [http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/SFMT/index.html SIMD-oriented Fast Mersenne Twister (SFMT)] | |||
{{DEFAULTSORT:Mersenne Twister}} | |||
[[Category:Pseudorandom number generators]] | |||
[[Category:Articles with example pseudocode]] |
Revision as of 00:00, 29 January 2014
The Mersenne twister is a pseudorandom number generator (PRNG). It is, by far, the most widely used PRNG.[1] Its name derives from the fact that its period length is chosen to be a Mersenne prime.
The Mersenne Twister was developed in 1997 by Template:Nihongo and Template:Nihongo.[2] It was designed specifically to rectify most of the flaws found in older PRNGs. It was the first PRNG to provide fast generation of high-quality pseudorandom integers.
The most commonly-used version of the Mersenne Twister algorithm is based on the Mersenne prime 219937−1. The standard implementation of that, MT19937, uses a 32-bit word length. There is another implementation that uses a 64-bit word length, MT19937-64; it generates a different sequence.
Adoption in software systems
The Mersenne Twister is the default PRNG for R,[3] Python,[4][5] Ruby,[6] IDL,[7] Free Pascal,[8] PHP,[9] Maple,[10] MATLAB, GAUSS,[11] CMU Common Lisp,[12] the GNU Multiple Precision Arithmetic Library,[13] and the GNU Scientific Library.[14] It is also available in C++[15] since C++11. Add-on implementations are provided by the Boost C++ Libraries,[16] Glib,[17] and the NAG Numerical Library.[18]
The Mersenne Twister is one of two PRNGs in SPSS: the other generator is kept only for compatibility with older programs, and the Mersenne Twister is stated to be "more reliable".[19] The Mersenne Twister is similarly one of the PRNGs in SAS: the other generators are older and deprecated.[20]
Advantages
The commonly-used version of Mersenne Twister, MT19937, which produces a sequence of 32-bit integers, has the following desirable properties:
- It has a very long period of 219937 − 1. While a long period is not a guarantee of quality in a random number generator, short periods (such as the 232 common in many older software packages) can be problematic.[21]
- It is k-distributed to 32-bit accuracy for every 1 ≤ k ≤ 623 (see definition below).
- It passes numerous tests for statistical randomness, including the Diehard tests.
Disadvantages
It passes most, but not all, of the stringent TestU01 Crush randomness tests.[22]
It can take a long time to turn a non-random initial state—particularly an initial state with many zeros—into output that passes randomness tests. A consequence of this is that two instances of the generator, started with an almost the same initial state will output nearly the same sequence for a long time before eventually diverging.
k-distribution
A pseudorandom sequence xi of w-bit integers of period P is said to be k-distributed to v-bit accuracy if the following holds.
- Let truncv(x) denote the number formed by the leading v bits of x, and consider P of the kv-bit vectors
- Then each of the 2kv possible combinations of bits occurs the same number of times in a period, except for the all-zero combination that occurs once less often.
Alternatives
The algorithm in its native form is not suitable for cryptography (unlike Blum Blum Shub). Observing a sufficient number of iterations (624 in the case of MT19937, since this is the size of the state vector from which future iterations are produced) allows one to predict all future iterations. A pair of cryptographic stream ciphers based on output from Mersenne Twister has been proposed by Makoto Matsumoto et al. The authors claim speeds 1.5 to 2 times faster than Advanced Encryption Standard in counter mode.[23]
The Mersenne Twister is sensitive to poor initialization and can take a long time to recover from a zero-excess initial state. An alternative, WELL ("Well Equidistributed Long-period Linear"), has quicker recovery, the same or better performance and equal randomness.[24]
Algorithmic detail
For a k-bit word length, the Mersenne Twister generates integers in the range [0, 2k−1].
The Mersenne Twister algorithm is based on a matrix linear recurrence over a finite binary field F2. The algorithm is a twisted generalised feedback shift register[25] (twisted GFSR, or TGFSR) of rational normal form (TGFSR(R)), with state bit reflection and tempering. It is characterized by the following quantities:
- w: word size (in number of bits)
- n: degree of recurrence
- m: middle word, or the number of parallel sequences, 1 ≤ m ≤ n
- r: separation point of one word, or the number of bits of the lower bitmask, 0 ≤ r ≤ w - 1
- a: coefficients of the rational normal form twist matrix
- b, c: TGFSR(R) tempering bitmasks
- s, t: TGFSR(R) tempering bit shifts
- u, l: additional Mersenne Twister tempering bit shifts
with the restriction that 2nw − r − 1 is a Mersenne prime. This choice simplifies the primitivity test and k-distribution test that are needed in the parameter search.
For a word x with w bit width, it is expressed as the recurrence relation
with | as the bitwise or and as the bitwise exclusive or (XOR), xu, xl being x with upper and lower bitmasks applied. The twist transformation A is defined in rational normal form
with In − 1 as the (n − 1) × (n − 1) identity matrix (and in contrast to normal matrix multiplication, bitwise XOR replaces addition). The rational normal form has the benefit that it can be efficiently expressed as
where
In order to achieve the 2nw − r − 1 theoretical upper limit of the period in a TGFSR, φB(t) must be a primitive polynomial, φB(t) being the characteristic polynomial of
The twist transformation improves the classical GFSR with the following key properties:
- Period reaches the theoretical upper limit 2nw − r − 1 (except if initialized with 0)
- Equidistribution in n dimensions (e.g. linear congruential generators can at best manage reasonable distribution in 5 dimensions)
As like TGFSR(R), the Mersenne Twister is cascaded with a tempering transform to compensate for the reduced dimensionality of equidistribution (because of the choice of A being in the rational normal form), which is equivalent to the transformation A = R → A = T−1RT, T invertible. The tempering is defined in the case of Mersenne Twister as
- y := x ⊕ (x >> u)
- y := :y ⊕ ((y << s) & b)
- y := :y ⊕ ((y << t) & c)
- z := y ⊕ (y >> l)
with <<, >> as the bitwise left and right shifts, and & as the bitwise and. The first and last transforms are added in order to improve lower bit equidistribution. From the property of TGFSR, is required to reach the upper bound of equidistribution for the upper bits.
The coefficients for MT19937 are:
- (w, n, m, r) = (32, 624, 397, 31)
- a = 9908B0DF16
- u = 11
- (s, b) = (7, 9D2C568016)
- (t, c) = (15, EFC6000016)
- l = 18
A small lagged Fibonacci generator or linear congruential generator usually is used to seed the Mersenne Twister with random initial values.Potter or Ceramic Artist Truman Bedell from Rexton, has interests which include ceramics, best property developers in singapore developers in singapore and scrabble. Was especially enthused after visiting Alejandro de Humboldt National Park.
Pseudocode
The following piece of pseudocode generates uniformly distributed 32-bit integers in the range [0, 232 − 1] with the MT19937 algorithm:
// Create a length 624 array to store the state of the generator int[0..623] MT int index = 0 // Initialize the generator from a seed function initialize_generator(int seed) { index := 0 MT[0] := seed for i from 1 to 623 { // loop over each other element MT[i] := last 32 bits of(1812433253 * (MT[i-1] xor (right shift by 30 bits(MT[i-1]))) + i) // 0x6c078965 } } // Extract a tempered pseudorandom number based on the index-th value, // calling generate_numbers() every 624 numbers function extract_number() { if index == 0 { generate_numbers() } int y := MT[index] y := y xor (right shift by 11 bits(y)) y := y xor (left shift by 7 bits(y) and (2636928640)) // 0x9d2c5680 y := y xor (left shift by 15 bits(y) and (4022730752)) // 0xefc60000 y := y xor (right shift by 18 bits(y)) index := (index + 1) mod 624 return y } // Generate an array of 624 untempered numbers function generate_numbers() { for i from 0 to 623 { int y := (MT[i] and 0x80000000) // bit 31 (32nd bit) of MT[i] + (MT[(i+1) mod 624] and 0x7fffffff) // bits 0-30 (first 31 bits) of MT[...] MT[i] := MT[(i + 397) mod 624] xor (right shift by 1 bit(y)) if (y mod 2) != 0 { // y is odd MT[i] := MT[i] xor (2567483615) // 0x9908b0df } } }
SFMT
SFMT, the SIMD-oriented Fast Mersenne Twister, is a variant of Mersenne Twister, introduced in 2006,[26] designed to be fast when it runs on 128-bit SIMD.
- It is roughly twice as fast as Mersenne Twister.[27]
- It has a better equidistribution property of v-bit accuracy than MT but worse than WELL ("Well Equidistributed Long-period Linear").
- It has quicker recovery from zero-excess initial state than MT, but slower than WELL.
- It supports various periods from 2607−1 to 2216091−1.
Intel SSE2 and PowerPC AltiVec are supported by SFMT. It is also used for games with the Cell BE in the PlayStation 3.[28]
MTGP
MTGP is a variant of Mersenne Twister optimised for GPUs published by Mutsuo Saito and Makoto Matsumoto.[29] The basic linear recurrence operations are extended from MT and parameters are chosen to allow many threads to compute the recursion in parallel, while sharing their state space to reduce memory load. Sample code [1] for CUDA includes parameter sets suitable for 256, 512 and 1024 parallel threads per block, and up to 200 blocks generating independent random streams. The paper claims improved equidistribution over MT and performance on a high specification GPU (Nvidia GTX260 with 192 cores) of 4.7ms for 5x107 random 32-bit integers.
Implementations in various languages
- ABAP
- ActionScript 1
- ActionScript 3.0
- Ada
- C++
- C++
- C++
- Clojure
- Clean
- C++ Sony Cell Broadband Engine
- C#
- D
- Dart
- Erlang
- Euphoria
- Excel addin
- Forth
- Fortran 95:
- F#
- Go
- The GNU Scientific Library (GSL)
- Haskell
- Haskell
- Java
- JavaScript
- JavaScript
- Lisp
- Lua
- Mitrion-C
- Pascal/FreePascal/Delphi
- Perl
- PHP 5.3.0
- Python
- R
- REALbasic
- Standard ML
- SIMUL8
- Scala
- VBA
- Visual Basic
References
43 year old Petroleum Engineer Harry from Deep River, usually spends time with hobbies and interests like renting movies, property developers in singapore new condominium and vehicle racing. Constantly enjoys going to destinations like Camino Real de Tierra Adentro.
External links
- The academic paper for MT, and related articles by Makoto Matsumoto
- Mersenne Twister home page, with codes in C, Fortran, Java, Lisp and some other languages
- SIMD-oriented Fast Mersenne Twister (SFMT)
- ↑ E.g. Marsland S. (2011) Machine Learning (CRC Press), §4.1.1. Also see the section "Adoption in software systems".
- ↑ Template:Cite doi
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ GAUSS 14 Language Reference
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Note: 219937 is approximately 4.3 × 106001; this is many orders of magnitude larger than the estimated number of particles in the observable universe, which is 1087.
- ↑ P. L'Ecuyer and R. Simard, TestU01: "A C Library for Empirical Testing of Random Number Generators", ACM Transactions on Mathematical Software, 33, 4, Article 22, August 2007.
- ↑ Template:Cite web
- ↑ P. L'Ecuyer, "Uniform Random Number Generators", International Encyclopedia of Statistical Science, Lovric, Miodrag (Ed.), Springer-Verlag, 2010.
- ↑ Template:Cite doi
- ↑ SIMD-oriented Fast Mersenne Twister (SFMT)
- ↑ SFMT:Comparison of speed
- ↑ PLAYSTATION 3 License
- ↑ Template:Cite arXiv