Introduction to PostgreSQL physical storage

scosman · on Jan 13, 2016

This is really awesome. I've spent a ton of time optimizing high throughput PG instances. Every time one of my optimizations failed it was from an incomplete understanding of the physical storage layer and/or MVCC.

These 2 docs really help with a deeper understanding if you want to learn more: http://momjian.us/main/writings/pgsql/internalpics.pdf http://momjian.us/main/writings/pgsql/mvcc.pdf

pilif · on Jan 13, 2016

>First, TOAST stands for The Oversized-Attribute Storage Technique, probably the best acronym in the history of Computer Science.

Actually, that price IMHO goes to GIN (Generalized Inverted Index) which I actually believed to be a coincidence until the people behind Postgres' GIN index announced their new VODKA index format

dexterdog · on Jan 14, 2016

Um, how about TWAIN?

dspillett · on Jan 14, 2016

TWAIN isn't actually an acronym - it comes from "and never the twain shall meet", replacing east and west in that prose with scanners, printers and other digital image sources/sinks, representing the difficulty people had getting examples of such technology to cooperate with each other.

dexterdog · on Jan 14, 2016

TIL - thank you.

farresito · on Jan 14, 2016

I took a database class in college which had a decent amount of theory behind it and found pretty interesting the way Postgres (and I presume other databases) handle multiple transactions simultaneously, including the isolation level of each transaction (read uncommited, read commited, repeatable read, and serializable).

daigoba66 · on Jan 13, 2016

Coming from pretty heavy background in MSSQL internals, this article is really great.

I keep telling myself that I need to invest time into learning PostreSQL as a (far less expensive) alternative to MSSQL for some use cases.

jud_white · on Jan 14, 2016

> Coming from pretty heavy background in MSSQL internals, this article is really great.

Do you know of any good resources for the undocumented function fn_dblog? I'm looking to understand the structure of RowLog Contents and Log Record in different Operations/Contexts to reconstruct DDL/DML.

http://www.sqlskills.com/blogs/paul/inside-the-storage-engin... is a good example but is really just an introduction.

jl6 · on Jan 14, 2016

Does anybody have a view on whether the 1GB limit on TOAST data could ever be raised, thereby making the large object facility further obsolete?

anarazel · on Jan 14, 2016

It'd not be particularly hard to raise it, although it'd require some code restructuring/duplication (for on-disk backward compat). I doubt it's really worth it anytime soon, such large datums imo have biggers problems than the length limits.