Min P Sampler Should be Available

heralax · February 5, 2024, 11:31pm

Sampling parameters have advanced a ton in the past few months. OpenAI’s combination of temp, top k, and top p is beyond antiquated. OpenAI should offer the min p sampler and ideally other modern samplers too (ie, quadratic sampling, mirostat). I know it’s more options but I’d bet that most people advanced enough to use the API can handle a few more sliders, and it’d probably improve generation quality a bunch.

This is actively hurting my projects at this point.

Diet · February 5, 2024, 11:46pm

yeah but the sampler actually needs to be implemented.

If it really is that good, it will probably eventually come with newer models, maybe they’ll treat temperature as a mirostat bias or something

How is it hurting your projects?

edit: that said, just hypothetically theoretically, the MoE thing could maybe make mirostat obsolete anyways

heralax · February 6, 2024, 12:59am

it’s hurting my projects because basically every API, even the open source ones, use OpenAI’s library for LLM API calls. OAI has managed to dominate the LLM API space and their library’s lack of sampler options is reducing the output quality I can get for my clients. So its relative antiquation is hurting my work.

Also I mentioned mirostat since it’s known, but since it’s so hard to use I’m really more interested in things that Kalomaze has made, ie min_p and quadratic. We wouldn’t have to wait for them to “come with” new models, samplers are how you pick the token that the model generates, they aren’t actually tied to any specific model.

min_p works with MoE (I’ve used Mixtral with min_p before and the outputs are good) so I don’t quite understand your edit. Clarify?

Diet · February 6, 2024, 1:54am

I meant that using dynamic samplers could introduce some weirdness when used on a model with an expert router, because they’re two completely cooks working on sorta the same soup

it’s hurting my projects because basically every API, even the open source ones, use OpenAI’s library for LLM API calls. OAI has managed to dominate the LLM API space

i wholeheartedly agree with this

However, i kinda disagree with this - while yes, you theoretically should be able to play with all this, APIs are products - and the sampler is part of that product. they have their temp, top_p, logit bias and presence penalty, although some of them are kinda wonky with some models if I recall correctly. If they add all these things they’d have to support all the combinations. Maybe not the best idea if they’re already struggling with what they have.

Ideally they’d do their own evals and release models with the optimal default parameters. But considering 0314 is still the best model all around, they’re also struggling with that.

stefan.hg · February 7, 2025, 3:46pm

Wondering why isn’t OpenAI implementing this.

From the Min-P paper accepted at ICLR’25 now:

Community Adoption: Min-p sampling has been rapidly adopted by the open-source community, with over 54,000 GitHub repositories using it, amassing a cumulative 1.1 million stars across these projects.

_j · February 7, 2025, 10:13pm

Maybe the sign and symptom will be that they no longer allow temperature or top_p to be sent… just as with reasoning models, now a year later.

Topic		Replies	Views
Temperature in GPT-5 models API gpt-5	33	70196	September 8, 2025
How to set temperature and other sampling parameters of model in Open AI Assistant api? API assistants-api	41	21051	May 31, 2024
The models should be fine-tuned for the latest API features Feedback model-tuning	2	158	October 11, 2024
Is there a single sampling method used during inference, or there's a logic to use different sampling methods based on a given input? Community llm , llm-output	1	1132	April 28, 2024
Seeking clarity on limited availability of "o1-mini" and "o1-preview" models: Technical constraints or strategic decision? API api , o1 , o1-mini , o1-preview	3	560	October 9, 2024

Min P Sampler Should be Available

Related topics