Need help with prompt: "Can you generate 1000 random tokens? "

EricGT · May 11, 2023, 8:34pm

Did you see this?

Language models can explain neurons in language models

among other things there is a tool with published code on GitHub.

If you use the tool you will see that it has access to every neuron in GPT-2 (not GPT-4) and also the tokens.

If you look at the code you will find

openai/automated-interpretability/blob/main/neuron-viewer/src/interpAPI.ts#L44-L62


      
          // # (derived from az://oaialignment/datasets/interp/gpt2_xl/v1/webtext1/len_nomax/n_50000/mlp_post_act/ranked_by_max_activation)
          // const NEURON_RECORDS_PATH = "az://oaisbills/rcall/oss/migrated_make_crow_datasets/gpt2_xl_n_50000_64_token/neurons"
          const NEURON_RECORDS_PATH = "https://openaipublic.blob.core.windows.net/neuron-explainer/data/collated-activations"
          
          
// # (derived from az://oaialignment/datasets/interp/gpt2_xl/v1/webtext1/len_nomax/n_50000/mlp_post_act/ranked_by_max_activation/neurons/explanations/canonical-run-v1)
          // const EXPLANATIONS_PATH = "az://oaisbills/rcall/oss/migrated_explanation_datasets/canonical_gpt2_xl_all_neurons"
          const EXPLANATIONS_PATH = "https://openaipublic.blob.core.windows.net/neuron-explainer/data/explanations"
          
          
// weight-based
          // const WHOLE_LAYER_WEIGHT_TOKENS_PATH = "az://oaidan/rcall/data/interpretability/connections/gpt2-xl/mlp/unnorm_token_representations_uncommon_vanilla"
          // const WEIGHT_TOKENS_PATH = "az://oaijeffwu/jeffwu-data/interpretability/neuron-connections/gpt2-xl/weight-based"
          const WEIGHT_TOKENS_PATH = "https://openaipublic.blob.core.windows.net/neuron-explainer/data/related-tokens/weight-based"
          // lookup table
          // const WHOLE_LAYER_ACTIVATION_TOKENS_PATH = "az://oaidan/rcall/data/interpretability/connections/gpt2_xl/mlp/unnorm_token_representations_vanilla_and_common_in_colangv2_unigram"
          // const ACTIVATION_TOKENS_PATH = "az://oaijeffwu/jeffwu-data/interpretability/neuron-connections/gpt2-xl/lookup-table"
          const ACTIVATION_TOKENS_PATH = "https://openaipublic.blob.core.windows.net/neuron-explainer/data/related-tokens/activation-based"
          
          
// const CONNECTIONS_PATH = "az://oaialignment/datasets/interp/connections/gpt2/neuron_space/incl_attn_False"
          const CONNECTIONS_PATH = "https://openaipublic.blob.core.windows.net/neuron-explainer/data/related-neurons/weight-based"

So while it may not be the tokens you are looking for, it might be part of what you seek in the long run but for a simpler model.

Topic		Replies	Views
Hypothetical Token-increase Strategy . Community gpt-4 , chatgpt	21	347	March 17, 2025
Do 'MAX tokens' include the follow up prompts and completion in a single chat session API token	22	5432	August 25, 2023
"Do this occasionally" - A potential (but strange) method to implement randomness Prompting gpt-4	12	2396	August 18, 2023
Suppose I want to write a story which is longer than 4000 tokens Prompting	27	11887	December 17, 2023
Seen anything novel by o1-preview? Community o1-preview	15	1980	September 16, 2024

Need help with prompt: "Can you generate 1000 random tokens? "

Related topics