Technique
Supervised fine-tuning (SFT)
Fine-tuning a base LLM on a dataset of (input, ideal-output) pairs so it learns to produce that style of response — the first step of post-training.
Technique
Fine-tuning a base LLM on a dataset of (input, ideal-output) pairs so it learns to produce that style of response — the first step of post-training.
We use cookies
Anonymous analytics help us improve the site. You can opt out anytime. Learn more