What Is an LLM? Large Language Models Explained (2026)

PrivSec Lab · June 14, 2026 · 3 min read

An LLM (large language model) is a neural network trained on huge amounts of text to predict the next token. It is the technology behind ChatGPT, Claude and Llama. This article explains plainly what an LLM is, how it works, and what it can and can't do.

Chatbots, coding assistants, summarisers - almost every AI tool you've used recently is powered by an LLM. The term is everywhere in 2026, but rarely explained clearly. This guide explains it plainly: what a large language model is, how it actually works, what it's genuinely good at, and - just as important - what it can't do.

What an LLM is

An LLM (large language model) is a neural network trained on enormous amounts of text to understand and generate human-like language. Its core job is deceptively simple: predict the next token (a word or word-piece) given everything before it. Do that over and over and you get coherent answers, essays, translations and code.

"Large" refers to both the training data (much of the public web and more) and the parameters - often billions of internal values that store what the model learned. ChatGPT, Claude, Gemini and Llama are all LLMs.

How it works

Almost every modern LLM uses the transformer architecture. Training happens in stages:

Pretraining - the model reads vast amounts of text and learns patterns by repeatedly predicting the next token and adjusting its parameters when it is wrong. This is where most of its knowledge forms.
Fine-tuning & RLHF - it's then refined with curated examples and reinforcement learning from human feedback (RLHF) to be more helpful, follow instructions, and avoid harmful output.
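
The "predict, then correct" loop of pretraining can be sketched in a few lines. This is only a toy under loud assumptions: the three-word vocabulary, the learning rate and the directly adjustable scores are all made up for illustration, and a real LLM adjusts billions of transformer weights rather than three numbers. The loss used here is cross-entropy, the standard measure of how much probability the model gave to the token that actually came next.

```python
import math

# Hypothetical three-token vocabulary for the context "the cat sat on the ..."
VOCAB = ["mat", "dog", "moon"]

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def train_step(scores, target_index, learning_rate=0.5):
    """One predict-and-correct step: measure the error, then nudge the scores."""
    probs = softmax(scores)
    loss = -math.log(probs[target_index])  # cross-entropy for the true next token
    # Gradient of cross-entropy with respect to each score is (prob - 1) for the
    # correct token and (prob - 0) for every other token.
    new_scores = [
        s - learning_rate * (p - (1.0 if i == target_index else 0.0))
        for i, (s, p) in enumerate(zip(scores, probs))
    ]
    return new_scores, loss

scores = [0.0, 0.0, 0.0]          # untrained: every token equally likely
target = VOCAB.index("mat")       # the token that really came next
losses = []
for _ in range(20):
    scores, loss = train_step(scores, target)
    losses.append(loss)

print(f"first loss: {losses[0]:.3f}, last loss: {losses[-1]:.3f}")
print(f"probability of 'mat' after training: {softmax(scores)[target]:.2f}")
```

The loss shrinks with each step because the model shifts probability toward the token it should have predicted. Pretraining repeats this across an enormous number of contexts.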

At inference (when you use it), you give the model a prompt and it generates a response one token at a time. At each step it computes a probability for every possible next token and picks one. Crucially, on its own it isn't looking things up - it's predicting plausible text from patterns learned in training.
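
As a rough illustration of token-by-token generation, here is a minimal sketch. The probability table is invented for the example, and it conditions only on the previous token, whereas a real LLM conditions on the whole prompt and everything generated so far. The `greedy` flag is a name chosen here to contrast "always take the most likely token" with random sampling.

```python
import random

# Hypothetical "learned" probabilities: for each token, how likely each next token is.
NEXT_TOKEN_PROBS = {
    "<start>": {"the": 1.0},
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.7, "ran": 0.3},
    "dog": {"ran": 0.8, "sat": 0.2},
    "sat": {"<end>": 1.0},
    "ran": {"<end>": 1.0},
}

def generate(rng, max_tokens=10, greedy=False):
    """Generate tokens one at a time until <end> or the token limit."""
    token = "<start>"
    output = []
    for _ in range(max_tokens):
        distribution = NEXT_TOKEN_PROBS[token]
        if greedy:
            # Always pick the single most probable next token.
            token = max(distribution, key=distribution.get)
        else:
            # Sample a token in proportion to its probability.
            token = rng.choices(
                list(distribution), weights=list(distribution.values())
            )[0]
        if token == "<end>":
            break
        output.append(token)
    return output

print(generate(random.Random(0), greedy=True))  # ['the', 'cat', 'sat']
print(generate(random.Random(0)))               # a sampled sentence
```

Nothing in the loop checks whether the output is true. It only follows the probabilities, which is why fluent text and correct text are not the same thing.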

Image: a humanoid robot at a laptop. LLMs power today's AI assistants, generating language one token at a time by predicting the most likely continuation.

Tokens and parameters

Tokens - the unit of text an LLM reads and writes, roughly a word or word-piece. Limits like the context window are measured in tokens.
Parameters - the billions of internal weights adjusted during training that store what the model learned.

More parameters and data can mean more capability, but architecture, data quality and fine-tuning matter just as much as raw size.
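
To make the two units concrete, here is a back-of-the-envelope sketch. Both helpers are hypothetical and rest on stated assumptions: the memory estimate counts only the stored weights (not the extra memory used while running the model), and the four-characters-per-token figure is a rough rule of thumb for English text, since real tokenizers vary by model and language. The 7-billion-parameter model is an example size, not a specific product.

```python
def weight_memory_gb(num_parameters: int, bytes_per_parameter: float = 2.0) -> float:
    """Memory needed just to hold the weights, in gigabytes (1 GB = 1e9 bytes)."""
    return num_parameters * bytes_per_parameter / 1e9

def rough_token_count(text: str, chars_per_token: float = 4.0) -> int:
    """Very rough token estimate; an assumption, not a real tokenizer."""
    return max(1, round(len(text) / chars_per_token))

def fits_in_context(text: str, context_window_tokens: int) -> bool:
    """Check the estimated token count against a context window limit."""
    return rough_token_count(text) <= context_window_tokens

seven_billion = 7_000_000_000
print(weight_memory_gb(seven_billion))        # 14.0 (16-bit weights, 2 bytes each)
print(weight_memory_gb(seven_billion, 0.5))   # 3.5 (4-bit weights, half a byte each)
print(rough_token_count("a" * 400))           # 100
print(fits_in_context("a" * 400, context_window_tokens=50))  # False
```

The arithmetic shows why parameter count matters in practice: it largely determines how much memory a model needs before it can answer a single prompt.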

What LLMs can and can't do

Strong at: drafting and summarising, answering questions, translating, explaining, and writing and debugging code.

Real limits:

Hallucination - they can state false things confidently. They predict plausible text, which is not the same as correct.
Knowledge cutoff - they don't inherently know about anything that happened after their training data was collected.
No true understanding - no beliefs or grounding, just learned patterns.
Bias - they can reflect biases in their training data.

The fix for facts and freshness is to give them real sources at answer time - that's exactly what RAG (retrieval-augmented generation) does.
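
The idea behind RAG can be sketched as two steps: find the most relevant source, then put it in the prompt. This is a simplified illustration, not how production systems are built: it ranks documents by shared words, where real systems typically use embedding similarity, and the documents, function names and prompt wording are all made up for the example.

```python
import re

# Hypothetical knowledge base; in practice this would be your own documents.
DOCUMENTS = [
    "The office moved to Lyon in March 2026.",
    "The cafeteria serves lunch from noon.",
    "Support tickets are answered within two days.",
]

def words(text):
    """Lowercase the text and return its set of words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, documents, top_k=1):
    """Return the top_k documents sharing the most words with the question."""
    question_words = words(question)
    ranked = sorted(
        documents,
        key=lambda doc: len(question_words & words(doc)),
        reverse=True,
    )
    return ranked[:top_k]

def build_prompt(question, documents, top_k=1):
    """Put the retrieved sources in front of the question for the model."""
    sources = "\n".join(f"- {doc}" for doc in retrieve(question, documents, top_k))
    return f"Answer using only these sources:\n{sources}\n\nQuestion: {question}"

print(build_prompt("Where did the office move?", DOCUMENTS))
```

The resulting prompt is what gets sent to the LLM. The model still predicts text, but now the facts it needs are in front of it instead of having to come from training.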

LLM vs AI

AI is the broad field; an LLM is one prominent kind of AI specialised in language. Every LLM is AI, but image generators, recommenders and game agents are AI too, built differently. "AI" today often means an LLM chatbot - but the terms aren't interchangeable.

Running and choosing one

You can run open LLMs privately on your own machine with Ollama. For development specifically, see our guide to the best coding LLMs. The same fundamentals - tokens, parameters, next-token prediction - apply whether the model runs in the cloud or on your laptop.
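
For a sense of what running a model locally looks like from code, here is a minimal sketch that talks to Ollama's local HTTP API using only the Python standard library. It assumes Ollama is installed and running on its default port (11434) and that the model has already been downloaded. The model name `llama3.2` is only an example, and the helper names are chosen here for illustration.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default address.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_request(prompt, model="llama3.2"):
    """Build the HTTP request for a single, non-streamed completion."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def ask_local_model(prompt, model="llama3.2"):
    """Send the prompt to the local model and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as response:
        return json.loads(response.read())["response"]

# Example usage (requires a running Ollama server):
# print(ask_local_model("Explain what a token is in one sentence."))
```

Because the request goes to localhost, the prompt and the response stay on your own machine.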

The bottom line

An LLM is a neural network that generates language by predicting the next token, trained on huge amounts of text and refined with human feedback. It's remarkably capable at language and code, and genuinely limited by hallucination, a knowledge cutoff and the absence of real understanding. Use it for what it's good at, verify what matters, and add retrieval when you need current, grounded facts.

Photo: Unsplash (source)

FAQ

What is an LLM?

An LLM, or large language model, is a type of artificial-intelligence system trained on enormous amounts of text to understand and generate human-like language. At its core it predicts the most likely next 'token' (a word or word-piece) given everything before it, and by doing this repeatedly it writes coherent sentences, answers questions, summarises, translates and writes code. The 'large' refers to both the training data and the number of parameters - often billions - that store what the model learned. ChatGPT, Claude, Gemini and Llama are all built on LLMs.

How does an LLM work?

An LLM is a neural network, almost always based on the transformer architecture. During training it reads vast amounts of text and learns statistical patterns by repeatedly predicting the next token and adjusting its parameters when it's wrong. After this pretraining, it's often refined with fine-tuning and reinforcement learning from human feedback (RLHF) to be more helpful and safe. At use time ('inference'), you give it a prompt and it generates a response one token at a time, choosing each token from the probabilities it computes for what comes next. On its own it isn't looking anything up - it's predicting from patterns.

What can LLMs do - and what can't they?

They're strong at language tasks: drafting and summarising text, answering questions, translating, explaining concepts, and writing and debugging code. Their limits are real: they can 'hallucinate' (state false things confidently), they have a knowledge cutoff and don't inherently know recent events, they have no true understanding or beliefs, and they can reflect biases in their training data. They predict plausible text, which is not the same as being correct - always verify facts that matter.

What's the difference between an LLM and AI?

AI is the broad field of making machines do things that seem intelligent. An LLM is one specific, currently very prominent kind of AI - a model specialised in language. So every LLM is AI, but not all AI is an LLM: image generators, recommendation systems, game-playing agents and spam filters are AI too, built with different techniques. When people say 'AI' today they often mean an LLM-powered chatbot, but the terms are not interchangeable.

What are tokens and parameters in an LLM?

A token is the unit of text an LLM processes - roughly a word or part of a word; models read and generate text token by token, and limits like 'context window' are measured in tokens. Parameters are the internal numerical values (weights) the model adjusts during training to store what it learned; modern LLMs have billions of them. Loosely, more parameters and more training can mean more capability, but architecture, data quality and fine-tuning matter just as much as raw size.
