To develop an understanding of the science and engineering behind Large Language Models (LLMs), I've trained a small transformer-based language model from scratch and then fine-tuned it.
Since I'm self-learning and experimenting, it's possible I've developed a flawed understanding of the science and engineering practices behind LLMs.
I'm sharing the detailed process I followed, along with some questions that came up along the way. I'd appreciate the community's thoughts.
Here’s a link to the report: mittalh.notion.site/Small-Language-Model-Report-Questions-43a86edbdd954cd0a7f4df2c9dcd8408
Looking forward to your feedback.