gumbel

hana_ml.algorithms.pal.random.gumbel(conn_context, location=0, scale=1, num_random=100, seed=None, thread_ratio=None)

Draw samples from a Gumbel distribution, which is one of a class of Generalized Extreme Value (GEV) distributions used in modeling extreme value problems.

Parameters
conn_contextConnectionContext

Database connection object.

locationfloat, optional

Defaults to 0.

scalefloat, optional

Defaults to 1.

num_randomint, optional

Specifies the number of random data to be generated.

Defaults to 100.

seedint, optional

Indicates the seed used to initialize the random number generator:

  • 0: Uses the system time.

  • Not 0: Uses the specified seed.

Note

When multithreading is enabled, the random number sequences of different runs might be different even if the SEED value remains the same.

Defaults to 0.

thread_ratiofloat, optional

Controls the proportion of available threads to use.

The value range is from 0 to 1, where 0 indicates a single thread, and 1 indicates up to all available threads.

Values between 0 and 1 will use that percentage of available threads.

Values outside the range [0, 1] tell PAL to heuristically determine the number of threads to use.

Defaults to 0.

Returns
DataFrame

Dataframe containing the generated random samples, structured as follows:

  • ID, type INTEGER, ID column.

  • GENERATED_NUMBER, type DOUBLE, sample value.

Examples

Draw samples from a gumbel distribution.

>>> res = gumbel(conn_context=cc, location=0, scale=1, num_random=10)
>>> res.collect()
   ID  GENERATED_NUMBER
0   0          1.544054
1   1          0.339531
2   2          0.394224
3   3          3.161123
4   4          1.208050
5   5         -0.276447
6   6          1.694589
7   7          1.406419
8   8         -0.443717
9   9          0.156404