3.4 Azure Cosmos DB for PostgreSQL

Key Takeaways

**Azure Cosmos DB for PostgreSQL** is distributed PostgreSQL built on the open-source **Citus** extension; it is the rebranded successor to the former Hyperscale (Citus) deployment option of Azure Database for PostgreSQL.
It is **fully relational** and 100% PostgreSQL — despite the Cosmos DB name, it is not the multi-model NoSQL Cosmos DB service covered in the next chapter.
Citus shards tables across many worker nodes so a single logical database can scale to **hundreds of terabytes** with parallel query execution.
Best fits include **multi-tenant SaaS**, **real-time operational analytics (HTAP)**, and **very large OLTP** workloads that have outgrown a single PostgreSQL server.
Applications connect with standard PostgreSQL drivers; Citus distributed-table syntax (`create_distributed_table`, `create_reference_table`) is the main difference from a single-node Postgres.

Last updated: June 2026

What Azure Cosmos DB for PostgreSQL Is

Quick Answer: Azure Cosmos DB for PostgreSQL is a managed, distributed PostgreSQL service built on the open-source Citus extension. It is relational — it speaks ordinary PostgreSQL — but it scales horizontally across many nodes by sharding tables. It is the renamed product that replaced the older Hyperscale (Citus) deployment option of Azure Database for PostgreSQL.

Despite sharing the Cosmos DB brand, this is not the non-relational, multi-model Cosmos DB service that supports NoSQL APIs (covered in Chapter 4). Microsoft groups it under the Cosmos DB family because it is globally distributed, but the engine, query language, and data model are pure PostgreSQL.

How Citus Distributes Data

A Cosmos DB for PostgreSQL cluster has one coordinator node and one or more worker nodes. When you mark a table as distributed, Citus:

Hashes a chosen distribution column (often a tenant ID or customer ID).
Splits the table into shards by that hash.
Places shards across worker nodes.
Routes queries through the coordinator, which executes them in parallel on workers and combines results.

Reference tables that need to be available everywhere (small lookup data) can be replicated to every node using create_reference_table. Large fact tables that drive throughput use create_distributed_table.

When to Choose It

Workload Pattern	Why Cosmos DB for PostgreSQL Fits
Multi-tenant SaaS	Distribute on `tenant_id` so each tenant's data lives on a small set of nodes; the platform scales linearly as tenants are added.
Real-time operational analytics (HTAP)	Parallel query execution lets dashboards run analytical queries directly on operational data without an ETL hop.
Large-scale OLTP	Throughput and storage scale by adding worker nodes; single-server PostgreSQL caps are removed.
Time-series and IoT	Combine Citus with PostgreSQL's range partitioning and `timescaledb`-style patterns for high-ingest workloads.

When Not to Use It

Small workloads that fit comfortably on a single PostgreSQL server — use Azure Database for PostgreSQL Flexible Server instead.
Applications that require document, key-value, or graph APIs — use Azure Cosmos DB (NoSQL, covered in Chapter 4).
Workloads that depend on the Microsoft SQL Server engine or T-SQL — use the Azure SQL family.

Compatibility

Azure Cosmos DB for PostgreSQL is 100% PostgreSQL — you connect using standard PostgreSQL drivers (libpq, JDBC, psycopg, Npgsql) and tools (psql, pgAdmin, Azure Data Studio). The main code change versus a single-node Postgres is choosing which tables to distribute and on which column. ACID transactions are supported within a single node and across distributed tables when the transaction touches the same distribution key.

Why It Matters for DP-900

The exam expects you to know that:

Cosmos DB for PostgreSQL is relational, not NoSQL.
It is built on the Citus open-source extension.
It is the right answer for very large PostgreSQL workloads and multi-tenant SaaS.
It replaces the former Hyperscale (Citus) deployment option of Azure Database for PostgreSQL.

Architecture in More Detail

A cluster has a fixed node configuration you choose at provisioning: a coordinator node size, a worker node size, and a worker node count. You scale out by adding workers and up by resizing nodes. Because the coordinator holds the metadata and plans queries, it is sized for connection handling and planning, while workers are sized for storage and parallel execution. The platform also offers a single-node (basic) configuration for small workloads or development, which can later grow into a multi-node cluster without an application rewrite.

Sharding Strategy and Co-location

The power of Citus comes from co-location. If two distributed tables share the same distribution column type and value range (for example, orders and order_items both distributed on tenant_id), Citus places matching shards on the same worker node. Joins between co-located tables then run locally on each worker with no cross-node data shuffle, which is what makes multi-tenant SaaS queries fast. Picking a single, consistent distribution column across the tenant's tables is therefore the central design decision, mirroring how a Cosmos DB partition key or a SQL Server partition scheme drives performance.

Performance and Indexing

Because it is real PostgreSQL, you keep the full PostgreSQL toolbox: B-tree, GIN, GiST, and BRIN indexes, the EXPLAIN planner, materialized views, common table expressions, window functions, and extensions such as postgis and pg_stat_statements. Citus simply distributes these features across the cluster. Columnar storage is available for analytical tables, helping the HTAP (hybrid transactional/analytical) scenario where the same cluster serves both operational writes and dashboard reads.

Exam Contrast Table

Service	Engine	Model	Scale model
Azure Database for PostgreSQL Flexible Server	PostgreSQL	Relational	Single node (scale up)
Azure Cosmos DB for PostgreSQL	PostgreSQL + Citus	Relational, distributed	Scale out across workers
Azure SQL Database Hyperscale	SQL Server	Relational	Scale storage via page servers
Azure Cosmos DB (NoSQL/Mongo/etc.)	Cosmos engine	Non-relational, multi-model	Scale out across partitions

The recurring DP-900 trap is to assume anything branded "Cosmos DB" is NoSQL. Cosmos DB for PostgreSQL is the exception: it is relational PostgreSQL that happens to share the globally distributed Cosmos DB brand and management surface.

Test Your Knowledge

A SaaS company runs PostgreSQL for a multi-tenant application and is approaching the storage and CPU limits of a single Flexible Server. They want to keep using PostgreSQL drivers and SQL while scaling horizontally across many nodes. Which Azure service should they choose?

Azure Cosmos DB for PostgreSQL

Azure Cosmos DB for NoSQL

Azure SQL Database Hyperscale

Azure SQL Managed Instance

Test Your Knowledge

Which statement about Azure Cosmos DB for PostgreSQL is TRUE?

It is a non-relational document database that supports SQL-like queries.

It is a distributed, fully relational PostgreSQL service built on the Citus extension.

It is the same engine as Azure SQL Database and supports T-SQL.

It only supports key-value and graph APIs.

Up Next

3.5 Relational Data Tasks and Tools

Continue learning

Microsoft Azure Data Fundamentals

Azure DP-900

3.4 Azure Cosmos DB for PostgreSQL

Key Takeaways

What Azure Cosmos DB for PostgreSQL Is

How Citus Distributes Data

When to Choose It

When Not to Use It

Compatibility

Why It Matters for DP-900

Architecture in More Detail

Sharding Strategy and Co-location

Performance and Indexing

Exam Contrast Table

Microsoft Azure Data Fundamentals

1Chapter 1: Introduction & Exam Overview

2Chapter 2: Core Data Concepts

3Chapter 3: Relational Data on Azure

4Chapter 4: Non-Relational Data on Azure

5Chapter 5: Analytics Workloads on Azure

Azure DP-900

3.4 Azure Cosmos DB for PostgreSQL

Key Takeaways

What Azure Cosmos DB for PostgreSQL Is

How Citus Distributes Data

When to Choose It

When Not to Use It

Compatibility

Why It Matters for DP-900

Architecture in More Detail

Sharding Strategy and Co-location

Performance and Indexing

Exam Contrast Table