AI Glossary: A to Z
An A-to-Z glossary of AI terms, created with help from AI itself. Because in 2026, the best way to study AI is apparently to ask AI itself. π€£
Written for beginners and practitioners alike. Each term includes a plain English definition and a real-world example.
A Β· B Β· C Β· D Β· E Β· F Β· G Β· H Β· I Β· J Β· K Β· L Β· M Β· N Β· O Β· P Β· Q Β· R Β· S Β· T Β· U Β· V Β· W Β· X Β· Y Β· Z
β Back to top
| Term | Definition | Example |
|---|---|---|
| Agent (AI Agent) | An AI system that perceives its environment, makes decisions, and takes autonomous actions to achieve a goal | A coding agent that writes, runs, and debugs its own code without human intervention |
| AGI (Artificial General Intelligence) | A hypothetical AI that can match or exceed human-level intelligence across any task β does not yet exist | Often cited as a long-term goal by companies like OpenAI and DeepMind |
| AI (Artificial Intelligence) | The field of computer science focused on building machines that can perform tasks normally requiring human intelligence | ChatGPT writing an essay, an algorithm detecting cancer in X-rays |
| AI Ethics | The principles and practices for developing and deploying AI in ways that are fair, transparent, and safe | Auditing a hiring algorithm to ensure it doesn't discriminate by gender or race |
| AI Safety | The field dedicated to ensuring AI systems remain reliable, controllable, and beneficial as they grow more capable | Research into preventing AI from pursuing goals that harm people |
| Alignment | The challenge of ensuring an AI system's goals and behaviour match what its designers and users actually intend | Preventing a powerful AI from optimising for a metric in a way that causes unintended harm |
| Annotation | The process of labelling raw data so it can be used to train supervised learning models | Humans drawing bounding boxes around cars in images to train a self-driving model |
| API (Application Programming Interface) | A defined interface that lets software systems communicate with each other | Calling the OpenAI API to add GPT-powered responses to your own application |
| API Key | A private authentication token that identifies you when making API requests | Pasting your secret key into code so it has permission to use Claude or OpenAI's API |
| Attention Mechanism | The component of a transformer that lets a model focus on the most relevant parts of the input when producing each output | A model knowing that "it" in "The cat sat because it was tired" refers to the cat |
| Augmented Intelligence | Using AI to enhance human decision-making rather than replace it entirely | A radiologist using AI to flag suspicious areas in a scan, then making the final call |
| AutoML | Automated Machine Learning β tools that automatically select models, tune hyperparameters, and build pipelines | Google AutoML letting non-experts build a custom image classifier without coding |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Backpropagation | The algorithm used to train neural networks by calculating how much each parameter contributed to the error and adjusting accordingly | How a neural network "learns" by working backwards from its mistakes to fix its weights |
| Batch Size | The number of training examples processed together before the model's weights are updated | A batch size of 64 means the model updates after every 64 training samples |
| Benchmark | A standardised test used to measure and compare AI model performance | MMLU (Massive Multitask Language Understanding) and HumanEval for coding ability |
| Bias (Data Bias) | Systematic unfairness in AI outputs caused by skewed or unrepresentative training data | A facial recognition system that performs poorly on darker skin tones because training data was mostly light-skinned faces |
| BLEU Score | A metric used to evaluate the quality of AI-generated text by comparing it to human reference text | Measuring how close a machine translation is to a professional human translation |
| Bot | A software program that performs automated tasks, often simulating human interaction | A customer service chatbot that answers FAQs on a website 24/7 |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Chain-of-Thought Prompting | A technique that encourages an AI to reason step by step before giving a final answer, improving accuracy on complex tasks | Adding "Think step by step" to a maths problem prompt dramatically improves the model's answer |
| Chatbot | A software application that simulates conversation with users, typically powered by an LLM or rule-based system | ChatGPT, customer support bots, virtual assistants on bank websites |
| Classification | A machine learning task where a model predicts which category an input belongs to | Labelling emails as spam or not spam; detecting whether a tumour is malignant or benign |
| Clustering | Grouping similar data points together without predefined labels, used in unsupervised learning | Segmenting customers into groups based on purchasing behaviour |
| CNN (Convolutional Neural Network) | A type of neural network designed specifically for processing grid-like data such as images | Used in face recognition, medical imaging, and object detection |
| Computer Vision | The field of AI focused on enabling machines to interpret and understand visual information | A self-driving car detecting pedestrians; a quality control camera spotting defects |
| Context Window | The maximum amount of text an AI model can process and retain in a single interaction | A model with a 200,000-token context window can read roughly 150,000 words at once |
| Copilot | An AI assistant integrated into a tool or workflow to help users complete tasks more efficiently | GitHub Copilot suggesting code completions as a developer types |
| Cross-Validation | A technique for evaluating how well a model generalises by training and testing it on different subsets of the data | Splitting data into 5 "folds" and rotating which one is the test set each time |
| CUDA | A parallel computing platform by NVIDIA that enables GPUs to be used for AI training and inference | Virtually every large AI model is trained using CUDA on NVIDIA hardware |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Data Augmentation | Artificially expanding a training dataset by creating modified versions of existing data | Flipping, rotating, and cropping images to give a computer vision model more variety |
| Data Pipeline | An automated workflow that collects, processes, and delivers data for AI training or inference | A system that ingests raw sensor data, cleans it, and feeds it to a fraud detection model |
| Dataset | A structured collection of data used to train or evaluate an AI model | ImageNet β a dataset of 14 million labelled images used to train and benchmark vision models |
| Deep Learning | An advanced form of machine learning that uses multi-layered neural networks to learn complex patterns | Powering speech recognition, image generation, and language understanding |
| Deepfake | AI-generated media (video, audio, or images) that realistically depicts someone saying or doing something they never did | Synthetic video of a public figure making a false statement |
| Deployment | The process of making a trained AI model available for use in a real-world product or system | Releasing a trained customer churn model into a company's CRM platform |
| Diffusion Model | A type of generative AI that learns to create data by learning to reverse a process of adding noise | Stable Diffusion and DALLΒ·E use diffusion models to generate images from text prompts |
| Distillation | A technique where a smaller "student" model is trained to mimic the behaviour of a larger "teacher" model, reducing size and cost | Creating a lightweight model for mobile devices by distilling a large cloud-based model |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Edge AI | Running AI models directly on a local device rather than sending data to the cloud | A smart security camera that detects intruders locally without needing an internet connection |
| Embeddings | Numerical vector representations of text (or other data) that capture semantic meaning and relationships | Words with similar meanings have embeddings that are close together in vector space |
| Epoch | One complete pass through the entire training dataset during model training | Training for 10 epochs means the model has seen every training example 10 times |
| Ensemble Learning | Combining multiple models and averaging their outputs to get better predictions than any single model | Random Forests, which combine hundreds of decision trees to make more accurate predictions |
| Evaluation Metrics | Measurements used to assess how well an AI model is performing | Accuracy, precision, recall, F1 score, and BLEU score |
| Explainable AI (XAI) | AI systems designed so their reasoning and decisions can be understood and audited by humans | A loan-rejection system that shows which factors (income, debt ratio) drove the decision |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Feature | An individual measurable property used as input to a machine learning model | In a house-price model: square footage, number of bedrooms, and location are features |
| Feature Engineering | The process of selecting, transforming, or creating input variables to improve model performance | Combining "day of week" and "time of day" into a single "rush hour" feature for a traffic model |
| Few-Shot Prompting | Giving an AI a small number of examples in the prompt before asking it to complete a task | Showing 3 example customer reviews before asking the model to classify a new one |
| Fine-Tuning | Further training a pre-trained model on a specific, smaller dataset to specialise its behaviour | Training a general LLM on legal documents to create a legal research assistant |
| Foundation Model | A large AI model trained on broad, general data that can be adapted to many downstream tasks | GPT-4, Claude, and Gemini are all foundation models |
| Function Calling | A feature that allows an LLM to trigger external tools or APIs as part of generating a response | An AI assistant calling a weather API to answer "Should I bring an umbrella tomorrow?" |
β Back to top
| Term | Definition | Example |
|---|---|---|
| GAN (Generative Adversarial Network) | A model architecture where two networks β a generator and a discriminator β compete to produce increasingly realistic outputs | Used to generate photorealistic synthetic faces or artistic images |
| Generative AI | AI that can create new content β text, images, audio, video, or code β rather than just analysing existing data | ChatGPT writing an article; Midjourney generating artwork |
| GPU (Graphics Processing Unit) | Specialised hardware with thousands of cores that dramatically accelerates AI training and inference | NVIDIA A100 and H100 GPUs are the standard for training large AI models |
| Gradient Descent | The core optimisation algorithm that iteratively adjusts a model's weights to minimise prediction error during training | The mathematical engine behind how every neural network learns |
| Guardrails | Constraints or filters applied to an AI system to prevent it from producing harmful, offensive, or off-topic outputs | A customer service bot that refuses to discuss competitors or give legal advice |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Hallucination | When an AI model confidently generates information that is factually incorrect or entirely fabricated | An AI citing a scientific paper that doesn't exist, with a realistic-looking author and journal |
| Hugging Face | A popular open-source platform for sharing, discovering, and running AI models and datasets | Often called "the GitHub of AI" β thousands of models are freely available there |
| Human-in-the-Loop (HITL) | A system design where a human reviews or approves AI decisions before they take effect | A doctor reviewing an AI-flagged medical scan before acting on the recommendation |
| Hyperparameter | A configuration value set before training begins that controls how the model learns, not what it learns | Learning rate, batch size, and number of layers are all hyperparameters |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Image Recognition | AI's ability to identify and classify objects, people, or scenes within images | Google Photos automatically tagging people and places in your photo library |
| Inference | The process of using a trained AI model to generate predictions or outputs on new, unseen inputs | Every time you send a message to ChatGPT, it runs inference |
| Interpretability | The degree to which humans can understand why an AI model made a specific decision | Being able to explain why a credit scoring model rejected an application |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Jailbreak | A technique used to trick an AI model into bypassing its safety rules or guidelines | A roleplaying prompt designed to make an AI ignore its ethical restrictions |
| JSON Mode | A setting in some LLM APIs that forces the model to return responses in valid JSON format | Useful when building apps that need to parse AI output programmatically |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Knowledge Base | A structured repository of information that an AI can query to answer questions or complete tasks | A company's internal FAQ documents connected to a RAG-powered support chatbot |
| Knowledge Graph | A network of entities and the relationships between them, used to represent and query structured knowledge | Google's Knowledge Graph connecting "Albert Einstein" to "physicist", "Germany", and "Theory of Relativity" |
| Knowledge Distillation | Training a smaller model to replicate the performance of a larger one by learning from its outputs | Creating a fast, lightweight model for edge deployment by mimicking a large cloud model |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Label | The correct answer or category assigned to a training example in supervised learning | In a spam dataset, each email is labelled "spam" or "not spam" |
| Latency | The delay between sending a request to an AI model and receiving its response | A model with low latency feels instant; high latency feels slow and frustrating |
| Large Language Model (LLM) | An AI model trained on vast amounts of text data, capable of generating, summarising, and reasoning about language | GPT-4, Claude, Gemini, and Llama are all LLMs |
| Latent Space | The compressed internal representation a model learns, where similar concepts are encoded close together | In image generation models, nearby points in latent space produce visually similar images |
| Learning Rate | A hyperparameter that controls how large a step the model takes when updating its weights during training | Too high and the model overshoots; too low and it trains too slowly |
| LLMOps | The set of practices and tools for deploying, monitoring, and maintaining LLMs in production | Managing prompt versions, monitoring for drift, and evaluating model outputs at scale |
| LoRA (Low-Rank Adaptation) | A parameter-efficient fine-tuning technique that adds small trainable layers to a model without modifying the original weights | Fine-tuning a large model on a custom dataset using a fraction of the compute cost |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Machine Learning (ML) | A branch of AI where systems learn patterns from data rather than being explicitly programmed with rules | A spam filter that improves over time by learning from emails users mark as spam |
| MCP (Model Context Protocol) | An open standard created by Anthropic that allows AI models to connect to external tools, databases, and services in a consistent and secure way β think of it as a universal plug for AI integrations | Connecting Claude to your GitHub repo, Google Drive, or a SQL database so it can read, write, and act on real data |
| Model | A trained AI system that maps inputs to outputs based on what it learned from data | A trained neural network that predicts tomorrow's stock price from historical data |
| Model Card | A document published alongside an AI model describing its purpose, training data, capabilities, and limitations | Hugging Face model cards provide transparency about what a model can and can't do |
| Model Collapse | A phenomenon where AI models trained on AI-generated data degrade in quality over time | A concern as the internet fills with AI-generated content used to train future models |
| Multimodal AI | AI that can process and generate multiple types of content β text, images, audio, and video β together | GPT-4o accepting an image and a question, then answering about the image in text |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Natural Language Processing (NLP) | The field of AI focused on enabling machines to understand, interpret, and generate human language | Machine translation, sentiment analysis, chatbots, and text summarisation |
| Neural Network | A computational model loosely inspired by the structure of the human brain, made up of layers of interconnected nodes | The underlying architecture used by most modern AI systems |
| NLP Pipeline | A sequence of processing steps applied to text data, from raw input to final output | Tokenisation β embedding β classification β output |
| Node | An individual computational unit in a neural network that receives inputs, applies a function, and passes an output | Billions of nodes work together in a large neural network |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Object Detection | A computer vision task that identifies what objects are in an image and where they are located | A self-driving car identifying pedestrians, traffic lights, and other vehicles in real time |
| Ontology | A formal representation of concepts and the relationships between them within a specific domain | A medical ontology defining how "disease", "symptom", and "treatment" relate to each other |
| Open Source Model | An AI model whose weights and/or code are publicly available for anyone to use, modify, and distribute | Meta's Llama models, Mistral, and Stable Diffusion |
| Overfitting | When a model learns the training data too precisely β including its noise β and fails to generalise to new data | A model that scores 99% on training data but only 60% on real-world data |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Parameter | An internal numerical value a model learns during training that shapes how it processes and generates outputs | GPT-4 is estimated to have over a trillion parameters |
| Pre-training | The initial large-scale training phase where a model learns from a massive general dataset before specialisation | Training an LLM on hundreds of billions of words from the internet and books |
| Precision | The percentage of positive predictions that were actually correct | Of all emails the model flagged as spam, what percentage were truly spam? |
| Prompt | The instruction, question, or input you give to an AI model to guide its response | "Summarise this article in three bullet points for a non-technical audience" |
| Prompt Engineering | The practice of designing and refining prompts to get better, more reliable outputs from AI models | Using structured formatting, role assignment, and examples to improve response quality |
| Prompt Injection | An attack where malicious instructions hidden in content the AI reads attempt to hijack its behaviour | A webpage containing invisible text instructing a browsing AI to leak your personal data |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Quantisation | A technique that reduces a model's memory usage by representing its weights with lower numerical precision, making it faster and cheaper to run | Running a compressed Llama model on a laptop instead of a high-end server |
| Query | The input or question sent to an AI model or database to retrieve information | "What are the side effects of ibuprofen?" sent to a medical AI system |
β Back to top
| Term | Definition | Example |
|---|---|---|
| RAG (Retrieval-Augmented Generation) | A technique that combines real-time document retrieval with AI generation, reducing hallucination and keeping responses current | A chatbot that searches your company's knowledge base before answering a support question |
| Recall | The percentage of actual positives that the model successfully identified | Of all actual fraud cases, what percentage did the model correctly flag? |
| Recommendation System | An AI system that predicts and surfaces content or products a user is likely to want, based on past behaviour | Netflix's "Because you watched" suggestions; Spotify's Discover Weekly playlist |
| Red Teaming | Deliberately attempting to break or manipulate an AI system to discover safety vulnerabilities before release | Researchers probing a model with adversarial prompts to expose harmful outputs |
| Regression | A machine learning task where the model predicts a continuous numerical value | Predicting a house's sale price based on size, location, and age |
| Reinforcement Learning (RL) | Training a model through a system of rewards and penalties, so it learns to maximise cumulative reward | AlphaGo learning to play Go by playing millions of games and receiving rewards for winning |
| RLHF (Reinforcement Learning from Human Feedback) | A training technique where humans rate AI outputs, and the model learns to produce outputs humans prefer | The technique used to align ChatGPT and Claude to be helpful, harmless, and honest |
| RNN (Recurrent Neural Network) | A neural network designed for sequential data, where outputs feed back as inputs β largely replaced by transformers | Used in early speech recognition and text generation before transformers dominated |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Semantic Search | Search that understands the meaning and intent behind a query rather than matching exact keywords | Searching "how to fix a broken bone" and getting results about fracture treatment, not carpentry |
| Sentiment Analysis | AI that determines the emotional tone β positive, negative, or neutral β of a piece of text | Automatically classifying thousands of customer reviews to measure product satisfaction |
| Speech Recognition | AI that converts spoken audio into written text | Apple's Siri, Google Voice, and OpenAI's Whisper model |
| Supervised Learning | A training approach where the model learns from labelled input-output pairs | Training a model on thousands of (email, spam/not spam) pairs so it can classify new emails |
| Synthetic Data | Artificially generated data used to train or test models when real data is scarce, costly, or sensitive | Generating fake patient records to train a healthcare AI without privacy concerns |
| System Prompt | A hidden set of instructions given to an AI before the user conversation begins, used to shape its behaviour and persona | A company using a system prompt to make Claude respond only about their products |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Temperature | A setting that controls how predictable or creative an AI's outputs are β low is focused and deterministic, high is varied and creative | Set temperature low for factual Q&A; set it high for creative brainstorming |
| Text-to-Image | AI that generates images from a natural language description | DALLΒ·E, Midjourney, and Stable Diffusion generating artwork from a text prompt |
| Text-to-Speech (TTS) | AI that converts written text into natural-sounding spoken audio | ElevenLabs generating a realistic voice clone from a few seconds of audio |
| Token | The basic unit of text an LLM processes β roughly a word or part of a word | "Artificial" might be split into "Art", "ific", "ial" β three tokens |
| Top-p Sampling | A setting that controls output variety by limiting the pool of next-word candidates to a cumulative probability threshold | Often tuned alongside temperature to balance quality and creativity |
| TPU (Tensor Processing Unit) | Hardware designed specifically to accelerate AI workloads, developed by Google | Used to train Google's Gemini and other large models |
| Training | The process of exposing a model to data and adjusting its weights to minimise prediction error | Training GPT-4 required thousands of GPUs running for months |
| Transfer Learning | Reusing a model trained on one task as the starting point for a new but related task | Adapting a model trained on English text to work with French by fine-tuning on French data |
| Transformer | An attention-based neural network architecture that is the backbone of virtually all modern LLMs | GPT, Claude, Gemini, and Llama are all transformer-based models |
| TTS β see Text-to-Speech | β Back to top
| Term | Definition | Example |
|---|---|---|
| Underfitting | When a model is too simple to capture the underlying patterns in the data, resulting in poor performance | A linear model trying to predict stock prices β too simple for the complexity of the problem |
| Unsupervised Learning | Training a model on unlabelled data so it discovers its own patterns and structure | Grouping news articles into topic clusters without being told what the topics are |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Validation Set | A portion of data held back from training, used to tune the model and catch overfitting before final evaluation | Monitoring validation loss during training to decide when to stop |
| Vector | A list of numbers that represents data (like a word or image) in a mathematical space | The word "king" might be represented as a vector of 768 numbers in an embedding model |
| Vector Database | A database that stores and indexes embeddings (vectors) so AI can retrieve semantically relevant information quickly | Pinecone, Weaviate, and Chroma are popular vector databases used in RAG systems |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Weight | A numerical parameter inside a neural network that is adjusted during training to reduce error | A model with 70 billion parameters has 70 billion weights stored in memory |
| Weight Decay | A regularisation technique that penalises large weights during training to prevent overfitting | Commonly used alongside dropout to keep models from memorising training data |
β Back to top
| Term | Definition | Example |
|---|---|---|
| XAI (Explainable AI) | AI systems and techniques designed to make model decisions interpretable and understandable to humans | A credit scoring model that explains: "Rejected due to high debt-to-income ratio and short credit history" |
β Back to top
| Term | Definition | Example |
|---|---|---|
| YAML | A human-readable data format commonly used to write configuration files for AI tools and ML pipelines | Writing a training configuration file for a machine learning experiment |
| YOLO (You Only Look Once) | A real-time object detection algorithm known for its speed and efficiency | Detecting and tracking multiple objects in a live video feed at 60 frames per second |
β Back to top
| Term | Definition | Example |
|---|---|---|
| Zero-Shot Learning | A model's ability to perform a task it was never explicitly trained on, by generalising from related knowledge | Asking GPT-4 to translate a language it saw rarely during training with no translation-specific training |
| Zero-Shot Prompting | Giving an AI a task with no examples β relying entirely on its pre-trained knowledge | "Classify this review as positive or negative: 'The food was amazing!'" β no examples given |
This glossary covers 100+ terms across the full AI landscape. Bookmark it, share it, and revisit it as you grow.