Foundation Model

Simple Definition

A foundation model is a large AI model trained on a huge and diverse dataset that can be used as a starting point for many different tasks. Instead of building a separate model for every task, developers build one large model and then adapt it.

GPT-5, Claude, Gemini, and Llama are all examples of foundation models.

Why “Foundation”?

The name comes from the idea that these models serve as a foundation. You build on top of them rather than starting from scratch for every application.

Before foundation models, most AI systems were trained for a single, narrow task. Now, one large model can handle writing, coding, summarization, translation, and analysis, and be adapted with minimal additional training for specialized uses.

How Foundation Models Are Built

Pre-training: the model is trained on enormous amounts of text, images, or other data
Fine-tuning: the pre-trained model is adapted for specific tasks or behaviors
Deployment: the model is made available via apps or an API

Foundation Models vs. Task-Specific Models

Foundation Model	Task-Specific Model
Trained on diverse data	Trained on one type of data
Adaptable to many tasks	Good at one thing
Expensive to train, cheap to adapt	Can be cheaper for narrow uses
GPT-5, Claude, Gemini	An older spam filter or sentiment classifier