Everything you need to know falls into one of three categories: models, runtimes, and training tools.
A model is a file such as:
model.gguf
model.safetensors
model.bin
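If you want to check which kind of model file you have, here is a minimal sketch. It assumes the usual conventions only: the extension is a hint, not a guarantee, and the only content check is the 4-byte "GGUF" marker at the start of GGUF files. The function name `guess_model_format` and the labels are made up for illustration.

```python
from pathlib import Path

# Assumption: these labels reflect common conventions, not strict rules.
FORMAT_BY_EXTENSION = {
    ".gguf": "GGUF (used by llama.cpp-based runtimes)",
    ".safetensors": "safetensors (common on Hugging Face)",
    ".bin": "generic binary (often a PyTorch checkpoint)",
}


def guess_model_format(path):
    """Guess a model file's format from its name, then check for the GGUF marker."""
    path = Path(path)
    label = FORMAT_BY_EXTENSION.get(path.suffix.lower(), "unknown")
    # If the file exists, its first four bytes override the extension-based guess.
    if path.is_file():
        with path.open("rb") as handle:
            if handle.read(4) == b"GGUF":
                label = FORMAT_BY_EXTENSION[".gguf"]
    return label
```

Calling `guess_model_format("model.safetensors")` returns the safetensors label even if the file is not on disk, because in that case only the name is inspected.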
Examples of models:
LLaMA-3 8B
LLaMA-3 70B
Mistral 7B
Mixtral 8x7B
Qwen 2 7B
Phi-3
A model file contains the learned weights (just numbers). Think of them as:
Neurons
Synapses
Every learned pattern
All the intelligence
A model does NOT:
It only takes text in → text out.
Runtimes are the programs that LOAD and RUN the model's brain.
Think of a “runtime” as the machine that runs a model file.
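To make the split concrete, here is a toy sketch, not how any real runtime works inside. The "model file" is a tiny JSON table of next-word choices, and `ToyRuntime` is a hypothetical class that loads it and turns text into text. Every name and the file format are invented for illustration.

```python
import json
import tempfile
from pathlib import Path


class ToyRuntime:
    """Hypothetical stand-in for a runtime: loads a model file, then maps text to text."""

    def __init__(self, model_path):
        # The "model file" is only data: here, a JSON table of next-word choices.
        self.weights = json.loads(Path(model_path).read_text())

    def generate(self, prompt, max_new_words=3):
        words = prompt.split()
        for _ in range(max_new_words):
            next_word = self.weights.get(words[-1]) if words else None
            if next_word is None:
                break  # the table has nothing to say after this word
            words.append(next_word)
        return " ".join(words)


def demo():
    """Write a toy model file, load it with the runtime, and run one prompt."""
    with tempfile.TemporaryDirectory() as folder:
        model_file = Path(folder) / "toy-model.json"
        model_file.write_text(json.dumps({"the": "cat", "cat": "sat", "sat": "down"}))
        runtime = ToyRuntime(model_file)
        return runtime.generate("the")
```

The file on disk never does anything by itself. Nothing happens until the runtime loads it and calls `generate`.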
===== Runtimes include =====
✔ Ollama
✔ LM Studio
✔ GPT4All
✔ koboldcpp
Faraday / Unsloth / Axolotl are different: they are training tools, not chatbots.
You don’t talk to them; you use them to train a model.
Models = The brains
Runtimes = The engines that run the brains
Training tools = The machines that modify the brains
Training is where real learning happens. Not prompts. Not memory.
Not RAG (the library model). Actual neural adjustment.
Tools include:
✔ Axolotl
✔ Unsloth
✔ Faraday
These tools:
* Read your dataset
This is real learning.
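Here is what "neural adjustment" means at the smallest possible scale. This is a toy sketch with one weight and a made-up dataset, not the API of Axolotl, Unsloth, or any other tool. The names `predict` and `train_step` are invented for illustration. Real training applies the same idea to billions of weights.

```python
def predict(weight, x):
    """The whole 'model': one weight multiplied by the input."""
    return weight * x


def train_step(weight, dataset, learning_rate=0.1):
    """One gradient-descent step on mean squared error for y = weight * x."""
    gradient = sum(2 * (predict(weight, x) - y) * x for x, y in dataset) / len(dataset)
    return weight - learning_rate * gradient


# Made-up dataset where the right answer is always y = 2 * x.
dataset = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]

weight = 0.0  # the untrained "brain"
for _ in range(50):
    weight = train_step(weight, dataset)  # training changes the weight itself
```

After the loop, `weight` has moved from 0.0 to very close to 2.0. Calling `predict` any number of times never changes it, which is the difference between prompting a model and training it.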