Store JSON in PostgreSQL: json vs jsonb (2026)

Use jsonb, not json, for almost anything you store in PostgreSQL. PostgreSQL has had two JSON column types since 9.4 (December 2014): json keeps an exact text copy of what you wrote (whitespace, key order, and duplicate keys all preserved) and reparses that string on every access, while jsonb stores a decomposed binary representation that processes faster, drops the cosmetic detail, and (this is the big one) can be indexed with a GIN index so containment and key-existence queries hit an index instead of scanning every row. Below are the differences, a comparison against normalizing into real columns, a worked schema with a GIN index, and how this compares to storing JSON in MySQL.

Short answer: attributes jsonb for a column you query into; json only when you must preserve the input text verbatim (exact whitespace, original key order, duplicate keys) and you only ever read the document whole. The reason jsonb wins almost every time is the index story: PostgreSQL can put a GIN index directly over the whole document, with no generated-column gymnastics. More on that below.

Why jsonb beats json

Both types accept and emit the same JSON, and both validate on insert (malformed JSON is rejected by either). The difference is entirely in how the value is stored and what you can do with it after.

json is the raw text. PostgreSQL stores the exact characters you sent. Semantically-insignificant whitespace survives, the order of keys in each object is preserved, and if you wrote the same key twice both copies are kept (functions that process it treat the last value as the live one). Every operation that reaches into the document has to reparse the text from scratch.
jsonb is a parsed binary tree. On insert PostgreSQL decomposes the document into an internal binary format. Whitespace is gone, object keys are reordered (and stored sorted), and duplicate keys are collapsed to the last value. Reads do not reparse: the server walks the binary structure directly, so pulling a field out is cheap.

The conversion to binary makes inserts marginally more expensive and the stored value can be a touch larger than the raw text, but everything you do afterwards is faster. And only jsonb supports the indexing and containment operators that make a JSON column actually queryable at scale. That is the trade, and for data you query into it is not close.

When json is the right choice

json earns its place in exactly one situation: you need to give back byte-for-byte what you stored. If the exact formatting matters (you are caching a third-party API response and a downstream consumer checks a signature over the raw bytes), or the original key order is semantically meaningful to some other system, or you genuinely need to round-trip duplicate keys, then jsonb will quietly rewrite the document and json will not.

That is a narrow case. A cached payload you only ever read whole, where formatting is load-bearing, is the honest use for json. The one other place it can pay off is a write-heavy, append-only sink (think a raw event or log table you ingest fast and rarely query), where skipping the decompose-to-binary step on every insert shaves real cost. The moment you want to filter on a field inside the document, or index it, you want jsonb.

jsonb vs normalizing into real columns

The harder question is not json versus jsonb. It is whether the data should be JSON at all, or whether it belongs in ordinary columns.

If the shape is known and stable, normal columns are almost always better. You get typed columns, NOT NULL and foreign-key constraints, plain B-tree indexes, and clean joins. A product has a price, a SKU, and a weight: those are columns, not keys in a JSON blob. (The price in particular wants a typed numeric column, not a JSON string, for the reasons in storing money in PostgreSQL.) Burying stable fields in JSON throws away everything a relational database is good at.

jsonb earns its place when the data is genuinely sparse, dynamic, or ad-hoc: attributes that differ per row, optional metadata you cannot enumerate up front, a settings document whose keys you do not control. A products catalog where a book has author and pages, a shirt has size and color, and a cable has length and gauge is the textbook case. Modelling that as columns gives you a wide table full of nulls. A single attributes jsonb column holds whatever each row needs.

My rule: stable, shared fields are columns; the long tail of per-row variation is one jsonb column alongside them. You choose per field, not globally.

Comparison: json vs jsonb vs normalized columns

Aspect	`json`	`jsonb`	Normalized columns
Storage	Exact text copy	Decomposed binary	Typed per-column values
Preserves text/key order/dupes	Yes, verbatim	No (reordered, dupes dropped)	N/A
Reads	Reparse on every access	Walk binary, no reparse	Native column access
Indexing	None useful	GIN over the whole document	B-tree per column
Best for	Verbatim round-trip, read whole	Sparse / dynamic data you query	Known, stable, shared fields

A worked schema: products with a jsonb attributes column

Here is a products table. The stable fields (sku, name, price) are real columns. The per-product variation lives in a single attributes jsonb column.

sql

CREATE TABLE products (
  id         bigint GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
  sku        text NOT NULL UNIQUE,
  name       text NOT NULL,
  price      numeric(10,2) NOT NULL,
  attributes jsonb NOT NULL,
  created_at timestamptz NOT NULL DEFAULT now()
);

A live PostgreSQL session comparing JSON and JSONB, with the create statement and the real query output. — A live psql session running this schema in PostgreSQL 16: real output, not illustrative.

Insert a document. PostgreSQL validates it on the way in and decomposes it to the binary form, so a typo in the JSON fails loudly here, not three reads later:

sql

INSERT INTO products (sku, name, price, attributes)
VALUES (
  'TSHIRT-RED-L',
  'Cotton T-Shirt',
  19.99,
  '{"color": "red", "size": "L", "material": "cotton", "tags": ["summer", "sale"]}'
);

Pull a single field out with the ->> operator. It returns the value as text (unquoted), so you get red, not "red":

sql

SELECT name, attributes->>'color' AS color
FROM products
WHERE attributes->>'size' = 'L';

Here -> returns a field as jsonb (strings stay quoted, arrays stay intact) and ->> returns it as text. For a nested path, #> and #>> take a path array, for example attributes#>>'{dimensions,length}'. Those operators work fine on their own, but the WHERE above with no index scans every row. That is what GIN fixes.

Indexing a jsonb column with GIN

This is where PostgreSQL pulls decisively ahead of MySQL. You can put a GIN index directly on the whole jsonb column and it indexes every key and value inside the document:

sql

CREATE INDEX idx_products_attributes
  ON products USING GIN (attributes);

That single index now serves containment (@>) and key-existence (?, ?|, ?&) queries against any field in the document. The containment query "find products whose attributes contain this fragment" uses the index:

sql

-- products that are red size L, index-backed
SELECT name FROM products
WHERE attributes @> '{"color": "red", "size": "L"}';

psql creating a GIN index named idx_attrs on a jsonb attributes column, then EXPLAIN showing a containment query on that column planned as a Bitmap Index Scan on idx_attrs instead of a sequential scan. — A GIN index on the jsonb attributes column turns a containment query into a Bitmap Index Scan on idx_attrs, proven by EXPLAIN. Real output from PostgreSQL 16.

@> asks "does the left document contain the right one as a subset?" The key-existence operators ask whether a top-level key is present: attributes ? 'material' (this key exists), attributes ?| array['color','colour'] (any of these), attributes ?& array['size','color'] (all of these). All four ride the same default GIN index.

If you only ever run containment queries and never the ? family, use the jsonb_path_ops operator class instead. It indexes only the @> (and jsonpath) path, producing a smaller, faster index:

sql

CREATE INDEX idx_products_attributes_path
  ON products USING GIN (attributes jsonb_path_ops);

For richer querying, PostgreSQL 12 added the SQL/JSON path language: the @? and @@ operators and functions like jsonb_path_query, for example attributes @? '$.tags[*] ? (@ == "sale")'. Those also use the GIN index. They are worth reaching for when a flat containment check is not expressive enough.

Updating a single key in a jsonb column

A common follow-up: how do you change one field inside the document without rewriting the whole JSON string? Use jsonb_set, which returns a new jsonb with one path replaced:

sql

-- bump the size on one product, leave the rest of the document untouched
UPDATE products
SET attributes = jsonb_set(attributes, '{size}', '"XL"')
WHERE sku = 'TSHIRT-RED-L';

The path is a text array ('{size}' here, '{dimensions,length}' for a nested key). By default jsonb_set only updates a key that already exists; pass true as the fourth argument to create it if missing. To merge a fragment in (add or overwrite several keys at once) the || concatenation operator is simpler than nested jsonb_set calls:

sql

-- add a "clearance" flag and overwrite the price tier in one statement
UPDATE products
SET attributes = attributes || '{"clearance": true, "tier": "B"}'
WHERE sku = 'TSHIRT-RED-L';

To delete a key, use the - operator (attributes - 'tier') or #- for a nested path. None of this works on a plain json column: the update operators and jsonb_set are jsonb-only, which is one more reason a column you mutate in place should be jsonb.

How this compares to MySQL

If you have indexed JSON in MySQL, the contrast is the whole point. MySQL's JSON type is also a binary format, and it also gives you path operators (its -> and ->> use $.path syntax). But MySQL cannot index a JSON column directly: to make a path searchable you add a generated column over that one path and index that, or use a functional index on the extraction expression. You index one path at a time, by hand.

PostgreSQL's GIN index indexes the entire document at once. One CREATE INDEX ... USING GIN (attributes) covers containment and key-existence against every field, including keys you had not thought to index when you wrote the DDL. That is a genuinely more powerful model for sparse, unpredictable data, and it is the main reason I default to jsonb in PostgreSQL where I would think harder about JSON-vs-columns in MySQL. For the MySQL side of this trade in full, see storing JSON in MySQL with the native JSON type.

What to do next

For the MySQL equivalent and its generated-column indexing route, see How to Store JSON in MySQL.
When the data is just a list of scalars (tags, role names) rather than a nested document, a native PostgreSQL array column is often a better fit than jsonb, with its own GIN-indexed containment operators.
For the broader schema-design judgement call on JSON versus real columns, the same normalize-the-stable-shape rule applies across engines.

FAQ

Use jsonb for almost everything. It stores a decomposed binary format that reads without reparsing and, crucially, can be indexed with a GIN index so containment (@>) and key-existence (?) queries hit an index. Use json only when you must return the input text byte-for-byte, preserving exact whitespace, original key order, and duplicate keys, and you only ever read the document whole.

The json type stores an exact text copy of what you inserted: whitespace, key order, and duplicate keys are all preserved, and every access reparses the string. The jsonb type decomposes the document into an internal binary tree on insert: whitespace is dropped, keys are reordered, duplicate keys collapse to the last value, and reads walk the binary structure without reparsing. Both validate JSON on insert.

Create a GIN index over the whole column: CREATE INDEX ... USING GIN (attributes). The default operator class supports containment (@>) and the key-existence operators (?, ?|, ?&) against any field in the document. If you only run containment and jsonpath queries (no key-existence ?), use USING GIN (attributes jsonb_path_ops) for a smaller, faster index that supports @>, @?, and @@.

Both extract a field from a JSON value. attributes->'color' returns the field as jsonb (strings stay quoted, arrays stay intact). attributes->>'color' returns it as text, so a string comes back as red rather than "red". Use ->> when comparing against a plain SQL string. For nested paths, #> and #>> take a path array.

Use jsonb_set(attributes, '{size}', '"XL"') in an UPDATE to replace one path without rewriting the whole document; the path is a text array, so a nested key looks like '{dimensions,length}'. By default it only updates an existing key, pass true as the fourth argument to create it if missing. To add or overwrite several keys at once, merge a fragment with the || operator. To remove a key, use - (or #- for a nested path). All of these are jsonb-only and do not work on a plain json column.

When the shape is known and stable. Stable, shared fields belong in typed columns where you get constraints, foreign keys, plain B-tree indexes, and clean joins. Reserve a jsonb column for genuinely sparse, dynamic, or per-row data you cannot enumerate up front. A common pattern is both: real columns for the shared fields and one jsonb column for the long tail of variation.

For sparse, unpredictable documents, yes. MySQL's JSON is also binary but you cannot index a JSON column directly: you add a generated column over each path you want searchable, one at a time. PostgreSQL's GIN index covers the entire document at once, so a single index serves containment and key-existence queries against every field, including keys you did not anticipate. See storing JSON in MySQL for the MySQL side in full.

How to Store JSON in PostgreSQL: json vs jsonb

Why jsonb beats json

When json is the right choice

jsonb vs normalizing into real columns

Comparison: json vs jsonb vs normalized columns

A worked schema: products with a jsonb attributes column

Indexing a jsonb column with GIN

Updating a single key in a jsonb column

How this compares to MySQL

What to do next

FAQ

See also

Sources

Ishan Karunaratne

Related posts

How to Store Money in PostgreSQL: numeric vs the money Type

How to Store JSON in MySQL: The JSON Type vs TEXT

How to Store an Array in PostgreSQL

Should I use json or jsonb in PostgreSQL?

What is the difference between json and jsonb storage?

How do I index a jsonb column in PostgreSQL?

What do the -> and ->> operators do in PostgreSQL?

How do I update a single key inside a jsonb column?

When should I normalize into columns instead of using jsonb?

Is PostgreSQL jsonb indexing better than MySQL's JSON indexing?

Sources

Ishan Karunaratne