AI Glossary
Inference
The process of using a trained AI model to generate predictions, classifications, or outputs.
Inference
Overview
When people hear about AI, they often focus on training.
Training is where a model learns patterns from data.
But training is only part of the story.
Once a model has been trained, it must actually be used.
This is where inference comes in.
Inference is the process of applying a trained model to new information in order to generate a prediction, recommendation, classification, or response.
Every time you interact with a chatbot, receive a recommendation from a streaming service, or use an AI-powered search tool, inference is taking place.
A helpful way to think about inference is as the model putting its knowledge into action.
Training teaches the model.
Inference allows the model to use what it has learned.
For most users, inference is the part of AI they experience directly because it powers the outputs they see every day.
Understanding inference helps explain how AI systems deliver value after the training process has been completed.
Why It Matters
Inference is the stage where AI models generate real-world outputs and provide practical value to users.
Real-World Example
When a chatbot generates a response to a user’s question, it is performing inference using a trained language model.
Related Concepts
- Model Training
- Model Deployment
- Machine Learning
- Prediction
- Monitoring