wait so just to confirm, with a low n, the input tokens are only counted once toward the input token limit, and not as n * tokenizer(input prompt string)?