What Is an LLM? Large Language Models Explained (2026)

PrivSec Lab · June 14, 2026 · 3 min read

An LLM (large language model) is a neural network trained on huge amounts of text to predict the next token - the technology behind ChatGPT, Claude and Llama. What an LLM is, how it works, what it can and can't do, explained plainly.

Chatbots, coding assistants, summarisers - almost every AI tool you've used recently is powered by an LLM. The term is everywhere in 2026, but rarely explained clearly. This guide answers it plainly: what a large language model is, how it actually works, what it's genuinely good at, and - just as important - what it can't do.

What an LLM is

An LLM (large language model) is a neural network trained on enormous amounts of text to understand and generate human-like language. Its core job is deceptively simple: predict the next token (a word or word-piece) given everything before it. Do that over and over and you get coherent answers, essays, translations and code.
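The "predict, append, repeat" loop can be sketched in a few lines. This is a toy illustration, not how a real LLM is built: the probability table below is made up by hand and only looks at the last two tokens, whereas a real model computes its probabilities with a neural network over the whole preceding context. The names NEXT_TOKEN_PROBS and generate are hypothetical.

```python
# Hand-written toy "model": for each pair of preceding tokens, the
# probability of each possible next token. The numbers are invented.
NEXT_TOKEN_PROBS = {
    ("<s>", "<s>"): {"the": 1.0},
    ("<s>", "the"): {"cat": 0.7, "dog": 0.3},
    ("the", "cat"): {"sat": 0.8, "ran": 0.2},
    ("cat", "sat"): {"on": 1.0},
    ("sat", "on"): {"the": 1.0},
    ("on", "the"): {"mat": 0.9, "cat": 0.1},
    ("the", "mat"): {"<end>": 1.0},
}


def generate(max_tokens=10):
    """Repeatedly pick the most likely next token and append it."""
    tokens = ["<s>", "<s>"]  # padding so there is always a two-token context
    for _ in range(max_tokens):
        context = tuple(tokens[-2:])
        probs = NEXT_TOKEN_PROBS[context]
        next_token = max(probs, key=probs.get)  # greedy: take the top choice
        if next_token == "<end>":
            break
        tokens.append(next_token)
    return tokens[2:]  # drop the padding


print(" ".join(generate()))  # the cat sat on the mat
```

Each pass through the loop is one prediction; the sentence only exists because the predictions are chained together.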

"Large" refers to both the training data (much of the public web and more) and the parameters - often billions of internal values that store what the model learned. ChatGPT, Claude, Gemini and Llama are all LLMs.

How it works

Almost every modern LLM uses the transformer architecture. Training happens in stages:

Pretraining - the model reads vast text and learns patterns by repeatedly predicting the next token and correcting itself when wrong. This is where most of its knowledge forms.
Fine-tuning & RLHF - the model is then refined with curated examples and reinforcement learning from human feedback (RLHF) to be more helpful, follow instructions, and avoid harmful output.
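To make "predicting the next token and correcting itself when wrong" concrete, here is a minimal sketch of the pretraining idea only. It assumes a drastically simplified model: a table of scores for each (previous token, next token) pair, trained with plain gradient descent on a six-word corpus. Real LLMs use a transformer, far longer contexts and far more data; the corpus, learning rate and function names are all illustrative choices.

```python
import math

corpus = "the cat sat on the mat".split()
vocab = sorted(set(corpus))
index = {tok: i for i, tok in enumerate(vocab)}
V = len(vocab)

# Parameters: one score (logit) per (previous token, next token) pair.
weights = [[0.0] * V for _ in range(V)]


def softmax(row):
    """Turn a row of scores into probabilities that sum to 1."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def train_epoch(lr=0.5):
    """One pass over the corpus; returns the average prediction loss."""
    total_loss = 0.0
    for prev, target in zip(corpus, corpus[1:]):
        p, t = index[prev], index[target]
        probs = softmax(weights[p])          # predict the next token
        total_loss += -math.log(probs[t])    # penalty for a poor prediction
        for j in range(V):
            # Gradient of the loss: raise the true token's score,
            # lower the others in proportion to their probability.
            grad = probs[j] - (1.0 if j == t else 0.0)
            weights[p][j] -= lr * grad
    return total_loss / (len(corpus) - 1)


losses = [train_epoch() for _ in range(50)]
print(f"loss: {losses[0]:.3f} -> {losses[-1]:.3f}")
```

The loss falls as the table learns that "sat" follows "cat", but it never reaches zero: "the" is followed by both "cat" and "mat", so the best the model can do there is split its probability between them.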

At inference (when you use it), you give a prompt and it generates a response one token at a time, each sampled from the probabilities the model assigns to every possible next token. Crucially, it isn't looking things up - it's predicting plausible text from patterns.
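Choosing a token "from the probabilities" can be sketched as follows. The vocabulary and scores here are invented for illustration, and the temperature setting is an assumption borrowed from common practice rather than something this guide covers: it is a knob many tools expose to make sampling more or less adventurous.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Convert raw scores into probabilities that sum to 1."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_next_token(vocab, logits, rng, temperature=1.0):
    """Pick one token at random, weighted by its probability."""
    probs = softmax(logits, temperature)
    return rng.choices(vocab, weights=probs, k=1)[0]


# Hypothetical scores for the token after "The capital of France is".
vocab = ["Paris", "Lyon", "London"]
logits = [4.0, 1.5, 0.5]

rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [sample_next_token(vocab, logits, rng) for _ in range(1000)]
for token in vocab:
    print(token, samples.count(token))
```

Most samples land on the most likely token, but not all of them. That small share of less likely picks is one reason the same prompt can produce different answers, and why a confident-sounding answer can still be wrong.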

Tokens and parameters

Tokens - the unit of text an LLM reads and writes, roughly a word or word-piece. Limits like the context window are measured in tokens.
Parameters - the billions of internal weights adjusted during training that store what the model learned.

More parameters and data can mean more capability, but architecture, data quality and fine-tuning matter just as much as raw size.
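As a rough illustration of what "word-piece" means, the sketch below splits text by always taking the longest matching piece from a tiny, hand-picked vocabulary. Real tokenizers learn tens of thousands of pieces from data and use more refined algorithms, so treat VOCAB, tokenize and fits_in_context as hypothetical names in a simplified model.

```python
# Hypothetical word-piece vocabulary, chosen by hand for this example.
VOCAB = {"token", "iz", "ation", "un", "believ", "able", "is", " "}


def tokenize(text, vocab=VOCAB):
    """Greedy longest-match split of text into known pieces."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest piece that matches at position i first.
        for j in range(len(text), i, -1):
            if text[i:j] in vocab:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No known piece matches: fall back to a single character.
            tokens.append(text[i])
            i += 1
    return tokens


def fits_in_context(tokens, context_window):
    """Context limits count tokens, not characters or words."""
    return len(tokens) <= context_window


text = "tokenization is unbelievable"
tokens = tokenize(text)
print(tokens)
print(len(text.split()), "words,", len(tokens), "tokens")
print(fits_in_context(tokens, context_window=8))
```

Three words become nine tokens here, which is why a limit expressed in tokens is reached sooner than a word count would suggest.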

What LLMs can and can't do

Strong at: drafting and summarising, answering questions, translating, explaining, and writing and debugging code.

Real limits:

Hallucination - they can state false things confidently. They predict plausible text, which is not the same as correct text.
Knowledge cutoff - they don't inherently know about events that happened after their training data was collected.
No true understanding - no beliefs or grounding, just learned patterns.
Bias - they can reflect biases in their training data.

The usual remedy for facts and freshness is to give them real sources at answer time - that's exactly what RAG (retrieval-augmented generation) does.
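The idea of handing the model sources at answer time can be sketched without any model at all. This toy version ranks documents by how many words they share with the question and pastes the best match into the prompt. Production RAG systems typically use embeddings and a vector index instead of word overlap; the documents, prompt wording and function names below are invented for illustration.

```python
import re

# Hypothetical knowledge base the model was never trained on.
DOCUMENTS = [
    "The office is closed on public holidays.",
    "Support tickets are answered within two business days.",
    "The VPN must be used when working remotely.",
]


def words(text):
    """Lowercase word set, ignoring punctuation."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question, documents, k=1):
    """Return the k documents sharing the most words with the question."""
    q = words(question)
    ranked = sorted(documents, key=lambda d: len(q & words(d)), reverse=True)
    return ranked[:k]


def build_prompt(question, documents):
    """Place the retrieved sources in front of the question."""
    sources = retrieve(question, documents)
    context = "\n".join(f"- {s}" for s in sources)
    return (
        "Answer using only the sources below.\n"
        f"Sources:\n{context}\n"
        f"Question: {question}"
    )


question = "How fast are support tickets answered?"
prompt = build_prompt(question, DOCUMENTS)
print(prompt)
```

The resulting prompt is what gets sent to the LLM, so the model predicts its answer from text that contains the relevant fact rather than from memory alone.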

LLM vs AI

AI is the broad field; an LLM is one prominent kind of AI specialised in language. Every LLM is AI, but image generators, recommenders and game agents are AI too, built differently. "AI" today often means an LLM chatbot - but the terms aren't interchangeable.

Running and choosing one

You can run open LLMs privately on your own machine with Ollama. For development specifically, see our guide to the best coding LLMs. The same fundamentals - tokens, parameters, next-token prediction - apply whether the model runs in the cloud or on your laptop.
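For a sense of what talking to a local model looks like, here is a minimal sketch that sends a prompt to Ollama's local HTTP API using only the Python standard library. It assumes Ollama is running on its default port (11434) and that a model has already been downloaded; the model name "llama3" is a placeholder for whichever model you have pulled.

```python
import json
import urllib.request

# Default address of a locally running Ollama server (assumed setup).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="llama3"):
    """Build the HTTP request; "llama3" is a placeholder model name."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt, model="llama3"):
    """Send the prompt and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, ask("Explain what a token is in one sentence.") returns the model's reply as a string. Nothing leaves your machine, because the request goes to localhost.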

The bottom line

An LLM is a neural network that generates language by predicting the next token, trained on huge amounts of text and refined with human feedback. It's remarkably capable at language and code, and genuinely limited by hallucination, a knowledge cutoff and the absence of real understanding. Use it for what it's good at, verify what matters, and add retrieval when you need current, grounded facts.



FAQ

What is an LLM?

An LLM, or large language model, is a type of artificial-intelligence system trained on enormous amounts of text to understand and generate human-like language. At its core it predicts the most likely next 'token' (a word or word-piece) given everything before it, and by doing this repeatedly it writes coherent sentences, answers questions, summarises, translates and writes code. The 'large' refers to both the training data and the number of parameters - often billions - that store what the model learned. ChatGPT, Claude, Gemini and Llama are all built on LLMs.

How does an LLM work?

An LLM is a neural network, almost always based on the transformer architecture. During training it reads vast text and learns statistical patterns by repeatedly predicting the next token and adjusting its parameters when it's wrong. After this pretraining, it's often refined with fine-tuning and human feedback (RLHF) to be more helpful and safe. At use time ('inference'), you give it a prompt and it generates a response one token at a time, each token chosen based on the probabilities it learned. It isn't looking anything up - it's predicting from patterns.

What can LLMs do - and what can't they?

They're strong at language tasks: drafting and summarising text, answering questions, translating, explaining concepts, and writing and debugging code. Their limits are real: they can 'hallucinate' (state false things confidently), they have a knowledge cutoff and don't inherently know recent events, they have no true understanding or beliefs, and they can reflect biases in their training data. They predict plausible text, which is not the same as being correct - always verify facts that matter.

What's the difference between an LLM and AI?

AI is the broad field of making machines do things that seem intelligent. An LLM is one specific, currently very prominent kind of AI - a model specialised in language. So every LLM is AI, but not all AI is an LLM: image generators, recommendation systems, game-playing agents and spam filters are AI too, built with different techniques. When people say 'AI' today they often mean an LLM-powered chatbot, but the terms are not interchangeable.

What are tokens and parameters in an LLM?

A token is the unit of text an LLM processes - roughly a word or part of a word; models read and generate text token by token, and limits like 'context window' are measured in tokens. Parameters are the internal numerical values (weights) the model adjusts during training to store what it learned; modern LLMs have billions of them. Loosely, more parameters and more training can mean more capability, but architecture, data quality and fine-tuning matter just as much as raw size.
