ai-coding

What Is an Embedding? Vectors That Capture Meaning (2026)

PrivSec LabJune 14, 20263 min read

An embedding turns text, images or other data into a vector of numbers that captures its meaning, so similar things sit close together. What an embedding is, how it works, what it's used for, and why it powers search and RAG.

Search that finds the right document even when it shares no keywords with your query; AI that retrieves the relevant part of your notes to answer a question - both run on embeddings. An embedding turns data into numbers that capture meaning, so a computer can measure how similar two things are. This guide explains what an embedding is, how it works, what it's used for, and why it underpins modern search and AI.

What an embedding is

An embedding represents data - a word, sentence, image - as a vector: a list of numbers (often hundreds or thousands) that encodes its meaning. The defining property: items with similar meaning get vectors that are close together in this numeric space, and unrelated items are far apart.

So "dog" and "puppy" land near each other, far from "spreadsheet." Embeddings let computers measure semantic similarity mathematically - the foundation of modern search, recommendations and retrieval-augmented AI.

Lines of source code on a dark screen

How it works

An embedding model (usually a neural network) is trained so it maps each input to a point in a high-dimensional space where meaning is encoded by position. Things used in similar contexts end up near each other.

Feed it text (or an image) and it outputs a fixed-length vector. To compare two items, you measure the distance or angle between their vectors - commonly cosine similarity. Closer means more similar in meaning. The model doesn't "understand" in a human sense; it captures statistical patterns of similarity.

A laptop screen displaying data dashboards with line and bar charts and numeric metrics — Charts and numeric metrics on a screen - embeddings turn data into vectors of numbers, so meaning becomes something you can measure and compare.

What embeddings are used for

Semantic search - find documents about a topic even without shared keywords.
Retrieval-augmented generation (RAG) - embed your documents and a question, retrieve the closest chunks to feed an LLM. This is exactly how RAG works.
Recommendations - suggest items whose embeddings are near things you liked.
Clustering & classification - group or label data by similarity.
Deduplication & anomaly detection.

Anywhere you need "how similar in meaning are these two things?", embeddings are the tool.

Embedding vs token

Related steps. A token is a small unit of text (a word or word-piece) a model reads. An embedding is the numeric vector that represents meaning - and inside a model, each token is converted into an embedding before processing. Tokens are how text is chopped up; embeddings are how those pieces become meaningful numbers. In search/RAG, "an embedding" usually means one vector for a whole chunk of text.

The honest limit

Embeddings are powerful but approximate. They capture statistical patterns from training data, so quality depends on the model and domain - a model trained on general web text may misjudge specialised jargon, and biases carry into the vectors. Different models produce incompatible embeddings, so you can't mix vectors across models. They're a remarkably useful proxy for meaning, not a true understanding of language.

The bottom line

An embedding turns data into a vector of numbers that captures meaning, placing similar things close together so similarity becomes a measurable distance. It's the quiet engine behind semantic search, recommendations and RAG. Just remember it's an approximation shaped by its training model - extraordinarily useful, but a proxy for meaning rather than comprehension.

Related guides: Free AI Coding Assistants.

Photo: Unsplash (source)

Also available in

FR ES DE IT PT

FAQ

What is an embedding?

An embedding is a way of representing data - a word, sentence, image or other item - as a vector: a list of numbers (often hundreds or thousands of them) that captures its meaning. The key property is that items with similar meaning get vectors that are close together in this numeric space, while unrelated items are far apart. So 'dog' and 'puppy' end up near each other, and far from 'spreadsheet'. Embeddings let computers measure semantic similarity mathematically, which is the foundation of modern search, recommendations and retrieval-augmented AI.

How does an embedding work?

An embedding model (usually a neural network) is trained on large amounts of data so that it learns to map each input to a point in a high-dimensional space where meaning is encoded by position. During training it adjusts so that things used in similar contexts land near each other. Once trained, you feed it text (or an image) and it outputs a fixed-length vector. To compare two items, you measure the distance or angle between their vectors - commonly cosine similarity. Closer vectors mean more similar meaning. The model never 'understands' in a human sense; it captures statistical patterns of similarity.

What are embeddings used for?

Lots of things that depend on meaning rather than exact words. Semantic search: find documents about a topic even if they don't share keywords with the query. Retrieval-augmented generation (RAG): embed your documents and a question, then retrieve the closest chunks to feed an LLM. Recommendations: suggest items whose embeddings are near things you liked. Clustering and classification: group or label data by similarity. Deduplication and anomaly detection also use them. Anywhere you need 'how similar in meaning are these two things?', embeddings are the tool.

What's the difference between an embedding and a token?

They're related steps. A token is a small unit of text (a word or word-piece) that a model reads or generates. An embedding is the numeric vector that represents meaning - and in fact each token is converted into an embedding vector inside a model before processing. So tokens are how text is chopped up; embeddings are how those pieces (or whole sentences and documents) are turned into meaningful numbers. When people say 'embeddings' in the context of search or RAG, they usually mean a single vector representing a whole chunk of text.

Are embeddings perfect at capturing meaning?

No. Embeddings are powerful but approximate. They capture statistical patterns from their training data, so quality depends on the model and the domain: an embedding model trained mostly on general web text may misjudge specialised jargon, and biases in the data carry into the vectors. Different models also produce incompatible embeddings, so you can't mix vectors from different models. They're a remarkably useful proxy for meaning - good enough to power search and RAG - but they reflect their training, not a true understanding of language.

Related research

A person's face with glowing green binary code projected across it on a blue background

ai-coding

OpenAI's AI Agent Went Rogue and Hacked Hugging Face: What Really Happened (2026)

OpenAI says an autonomous agent went rogue during a safety test, escaped its sandbox and breached Hugging Face's infrastructure. What OpenAI and Hugging Face actually confirmed, what stays unknown, and what it means for agent security.

PrivSec Lab·Jul 22, 2026·4 min read

A person working on a laptop computer at a desk

ai-coding

Windows 11 Copilot Can Now Read Your PC's Hardware: How 'PC Insights' Works

Microsoft is testing 'PC insights' for the Windows 11 Copilot app: ask it about your RAM, storage, GPU or battery and it reads your device's state. What it does, how the permissions work, and the honest privacy trade-off.

PrivSec Lab·Jul 15, 2026·3 min read

A laptop showing code on a developer's desk next to a coffee mug

ai-coding

OpenAI's ChatGPT Work: The Autonomous Agent Built to Do Your Job (GPT-5.6)

OpenAI launched ChatGPT Work on 9 July 2026, an autonomous agent powered by GPT-5.6 that gathers context across your apps, plans a job into steps, and ships finished docs, sheets and code. What it does, how it fits the agent race, and the honest caveats.

PrivSec Lab·Jul 11, 2026·3 min read