Generating similarities for code generation

mmark · June 24, 2023, 11:10am

Got it, so maybe a ‘hybrid’ approach? I.e. encode the code snippets as class/function/interface name + parameter_names + docstrings as a ‘syntactic’ embedding, and then use a code2seq or the like to generate embeddings based on their AST paths (and get the ‘semantic’ meaning as well). Then whatever the user prompts, I can generate an embedding based off of his prompt (whether a textual description or code) and see if I get some good similarity results for relevant coding snippets. Does this make any sense?

Topic		Replies	Views
Prompting with the chat/completions API against a large transcript file API	5	3824	October 4, 2023
How does the generative aspect of GPT impacts my models? Documentation	12	1086	February 14, 2023
Teaching GPT the information it will be working on API gpt-4 , assistants	8	2385	November 19, 2023
How to fine tune so GPT knows a new API and then how to prompt to use that API Prompting	4	1544	March 29, 2023
[GitHub] Embeddings for Entire GitHub Code Repository API	4	5213	September 30, 2025

Generating similarities for code generation

Related topics