13 Downloads

According to the Gemma team, the optimal config for inference is temperature = 1.0, top_k = 64, top_p = 0.95, min_p = 0.0 (optional 0.01)

Updated by
yazon
on July 11

PRESET

Parameters
Limit Response Length
Disabled
Min P Sampling
0.01
Repeat Penalty
1
Temperature
1
Top K Sampling
64