[RFC] SLIM Data Type - Compact JSON Alternative (17-62% smaller)

Started by Marco Matteucci14 days ago4 messages
#1Marco Matteucci
marco.matteucci.1972@gmail.com

Hi hackers,

I'd like to propose a new data type for PostgreSQL: SLIM (Structured
Lightweight Interchange Markup), a compact alternative to JSONB that
achieves 17-62% storage reduction while remaining human-readable.

== The Problem ==

JSONB is great, but has inherent inefficiencies:
- Redundant field names in arrays of objects
- Verbose syntax (quotes, colons, brackets)
- Boolean representation uses 4-5 bytes each
- No format-level compression

For databases with millions of JSON documents, this adds up.

== SLIM Format ==

JSON:
{"users":[{"id":1,"name":"Alice","active":true},{"id":2,"name":"Bob","active":false}]}
SLIM: {users:|2|id#,name,active?|1,Alice,T|2,Bob,F}

Savings: 45%

Key optimizations:
- Table format for arrays: schema once, data in rows
- Type markers: # (number), ? (boolean), ! (null)
- Compact booleans: T/F instead of true/false
- No quotes for simple strings

== Real-World Benchmarks ==

Tested on a production TimescaleDB database (74.9 million rows, 74 GB):

Format | Sample | JSON | SLIM | Savings
--------------------------|-----------|---------|---------|--------
Object (token_metrics) | 100K rows | 16 MB | 13 MB | 18.7%
Object (wallet_activity) | 100K rows | 20 MB | 16 MB | 17.0%
Table format (arrays) | 10K rows | 580 KB | 222 KB | 61.7%

Extrapolated on 74 GB: 12.5-45.9 GB savings depending on data structure.

== Implementation ==

I've built a working PostgreSQL extension (pg_slim) that provides:
- Native SLIM data type with TOAST support
- Implicit casts to/from JSONB and TEXT
- Full operator support: ->, ->>, @>, <@, ?, ?|, ?&
- B-tree and hash indexing
- Comparison operators for sorting

Example usage:

CREATE TABLE documents (id SERIAL PRIMARY KEY, data SLIM);
INSERT INTO documents (data) VALUES ('{name:Alice,age:#30}'::slim);
INSERT INTO documents (data) VALUES
(slim_encode('{"name":"Bob"}'::jsonb));
SELECT data->>'name' FROM documents; -- Works like JSONB
SELECT * FROM documents WHERE data @> '{name:Alice}'::slim;

The extension compiles and passes tests on PostgreSQL 15.

== Questions for Discussion ==

1. Is there community interest in a SLIM type?
2. Should this remain an extension or be considered for contrib/core?
3. Which index types should be prioritized (GIN support is planned)?
4. Any concerns about the format design?

== Links ==

- Extension: https://github.com/matteuccimarco/pg-slim
- Full RFC:
https://github.com/matteuccimarco/pg-slim/blob/main/RFC-SLIM-TYPE.md
- SLIM spec: https://github.com/matteuccimarco/slim-protocol-core
- Benchmarks:
https://github.com/matteuccimarco/pg-slim/blob/main/BENCHMARK-RESULTS.md

Looking forward to your feedback.

Best regards,
Marco Matteucci

--
Marco Matteucci
+39 340 7063047
Vuoi un appuntamento con me? Clicca qui
<https://calendar.app.google/SaCBAJfiJywVfQJb7&gt;

#2Andreas Karlsson
andreas.karlsson@percona.com
In reply to: Marco Matteucci (#1)
Re: [RFC] SLIM Data Type - Compact JSON Alternative (17-62% smaller)

On 1/10/26 5:28 PM, Marco Matteucci wrote:

I'd like to propose a new data type for PostgreSQL: SLIM (Structured
Lightweight Interchange Markup), a compact alternative to JSONB that
achieves 17-62% storage reduction while remaining human-readable.

Unless this gets much wider adoption in the general dev ecosystem I do
not think it belongs in core. This is better as a third dparty extension.

Andreas

#3Marco Matteucci
marco.matteucci.1972@gmail.com
In reply to: Andreas Karlsson (#2)
Re: [RFC] SLIM Data Type - Compact JSON Alternative (17-62% smaller)

Hi Andreas,

thank you for taking the time to read and consider my proposal.

That makes sense — I understand the concern around inclusion in core
without broad ecosystem adoption. I appreciate the clear guidance, and I’m
happy to treat this as an extension-level effort for now.

Thanks again for the feedback.

Best regards,
Marco

Il giorno mer 14 gen 2026 alle ore 02:53 Andreas Karlsson <andreas@proxel.se>
ha scritto:

On 1/10/26 5:28 PM, Marco Matteucci wrote:

I'd like to propose a new data type for PostgreSQL: SLIM (Structured
Lightweight Interchange Markup), a compact alternative to JSONB that
achieves 17-62% storage reduction while remaining human-readable.

Unless this gets much wider adoption in the general dev ecosystem I do
not think it belongs in core. This is better as a third dparty extension.

Andreas

--
Marco Matteucci
+39 340 7063047
Vuoi un appuntamento con me? Clicca qui
<https://calendar.app.google/SaCBAJfiJywVfQJb7&gt;

#4Andreas Karlsson
andreas.karlsson@percona.com
In reply to: Marco Matteucci (#1)
Re: [RFC] SLIM Data Type - Compact JSON Alternative (17-62% smaller)

On 1/14/26 9:41 AM, Marco Matteucci wrote:

That makes sense — I understand the concern around inclusion in core
without broad ecosystem adoption. I appreciate the clear guidance, and
I’m happy to treat this as an extension-level effort for now.

Thanks again for the feedback.

Welcome to the PostgreSQL community and good luck with your extension!
Have you added it to PGXN[1]?

1. https://pgxn.org/

Andreas