LET'S DEFINE THE PIECES

Everything you need to know falls into one of three categories:

1. MODELS (LLMs)

These are the brains.
They are NOT software.
They are NOT programs.
They are NOT plugins.
They are neural networks stored as weight files (one file, or several shards for large models), usually:

model.gguf
model.safetensors
model.bin

Examples of models:
LLaMA-3 8B
LLaMA-3 70B
Mistral 7B
Mixtral 8x7B
Qwen 2 7B
Phi-3

A model file contains:
Weights (billions of learned numbers)
The “neurons” and “synapses” those numbers encode
Every learned pattern
All the intelligence

A model does NOT:

Have a UI
Open PDFs
Connect to NASA
Run indexing
Provide a chat window

It only takes text in → text out.
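
To see that a model file is data rather than a program, you can read its header yourself. The sketch below assumes a GGUF file in the version 2 or later layout, stored little-endian: a 4-byte "GGUF" marker, a 32-bit version, then 64-bit tensor and metadata counts. The function name and the example path are made up for illustration.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF file.

    Assumes the GGUF v2+ header layout, little-endian.
    """
    with open(path, "rb") as f:
        header = f.read(24)
    if len(header) < 24 or header[:4] != GGUF_MAGIC:
        raise ValueError("not a GGUF file")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count
    version, tensor_count, kv_count = struct.unpack("<IQQ", header[4:24])
    return version, tensor_count, kv_count
```

Calling read_gguf_header("model.gguf") on a real file returns three numbers. Nothing in the file gets executed; a runtime has to load it.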

2. MODEL RUNTIMES (ENGINES)

These are the programs that LOAD and RUN the model's brain.
Think of a “runtime” as the machine that runs a model file.

Runtimes include:
✔ Ollama

Terminal-based
Local API
Can load fine-tuned adapters (it does not train models itself)
Good for automation
Good for pipelines
Very flexible
Acts like a backend server
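
Here is a rough sketch of what "Local API" and "backend server" mean in practice. It assumes Ollama is running on its default address (localhost, port 11434) and that a model named "llama3" has already been pulled; the model name and the helper function names are placeholders, not part of Ollama.

```python
import json
import urllib.request

# Default address of a locally running Ollama server (assumption: not changed)
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model, prompt):
    """Build the HTTP request for a single, non-streaming completion."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model, prompt):
    """Send the prompt to the runtime and return the generated text."""
    with urllib.request.urlopen(build_generate_request(model, prompt)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, generate("llama3", "Say hello in five words.") returns the model's reply as a string. Text in → text out, with the runtime doing the loading and running.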

✔ LM Studio

GUI desktop app
Easy model downloading
Drag-and-drop PDFs
File chat
Rudimentary RAG
Easy to test many models
Great for tinkering

✔ GPT4All

GUI
Also a runtime
Similar to LM Studio
Not as modern

✔ koboldcpp

Runtime specialized for story-writing/roleplay
GUI
No training tools (it only runs models)

✔ Faraday

GUI desktop chat app
Runs models locally
Aimed at character chat/roleplay

✔ Unsloth / Axolotl (training tools, not chatbots)

These are training engines, not chat apps.

You don’t talk to them; you use them to train a model.

Summary

Models = The brains
Runtimes = The engines that run the brains
Training tools = The machines that modify the brains

3. TRAINING TOOLS (NEURAL LEARNING, “NEW SYNAPSES”)

This is where real learning happens. Not prompts. Not memory.
Not RAG (the “library” model). Actual neural adjustment.

Tools include:

✔ Axolotl

A widely used toolkit for fine-tuning LLaMA, Mistral, and similar open models.
Driven by YAML config files; popular with researchers and hobbyists alike.

✔ Unsloth

A faster, more GPU-efficient training toolkit.
Great for consumer GPUs like yours.

Faraday is NOT in this group: it is a desktop chat app (a runtime), not a training tool.

These tools:

Read your dataset

Modify the model’s weights
Produce a new personalized model

This is real learning.
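
To make "modify the model's weights" concrete, here is a toy sketch with a one-weight "model" trained by plain gradient descent. It is not how Axolotl or Unsloth are used, and every name in it is made up for illustration; real fine-tuning runs the same kind of loop over billions of weights.

```python
def train(examples, weight=0.0, learning_rate=0.05, epochs=50):
    """Fit y = weight * x by gradient descent on squared error."""
    for _ in range(epochs):
        for x, y in examples:
            error = weight * x - y
            # Gradient of error**2 with respect to weight is 2 * error * x
            weight -= learning_rate * 2 * error * x
    return weight


# The "dataset": inputs paired with the outputs we want (here y = 2 * x)
dataset = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]

# The "new model": the same structure, but with an adjusted weight
new_weight = train(dataset)
```

The weight starts at 0.0 and ends very close to 2.0. No prompt was involved; the number itself changed.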
