Snowflake -> ClickHouse Equivalent Concepts by dhtclk · Pull Request #6244 · ClickHouse/clickhouse-docs

dhtclk · 2026-05-18T21:19:15Z

Summary

Snowflake -> ClickHouse equivalent concepts page to strengthen our migration story.

Checklist

Delete items not relevant to your PR
URL changes should add a redirect to the old URL via https://github.com/ClickHouse/clickhouse-docs/blob/main/docusaurus.config.js
If adding a new integration page, also add an entry to the integrations list here: https://github.com/ClickHouse/clickhouse-docs/blob/main/docs/integrations/index.mdx

vercel · 2026-05-18T21:19:22Z


| Project | Deployment | Actions | Updated (UTC) |
|---|---|---|---|
| clickhouse-docs | Error | Comment | Jun 11, 2026 8:39pm |

4 Skipped Deployments

| Project | Deployment | Actions | Updated (UTC) |
|---|---|---|---|
| clickhouse-docs-jp | Ignored | | Jun 11, 2026 8:39pm |
| clickhouse-docs-ko | Ignored | Preview | Jun 11, 2026 8:39pm |
| clickhouse-docs-ru | Ignored | Preview | Jun 11, 2026 8:39pm |
| clickhouse-docs-zh | Ignored | Preview | Jun 11, 2026 8:39pm |


Blargian

@dhtclk structure looks great - left some comments around accuracy of some statements. We will need a few more pairs of eyes on this as well.

amychen1776 · 2026-06-09T18:21:12Z

+| Account | [Warehouse](/cloud/reference/warehouses) | Each service scales compute independently; storage is shared at the warehouse level. Tier and billing are set at the organization level, not per warehouse. |
+| Database | [Database](/sql-reference/statements/create/database) | Logical container for tables. Snowflake uses a Database → Schema → Table hierarchy; ClickHouse flattens this to Database → Table. See [Schemas](#schemas) below. |
+
+:::note[Warehouse terminology]


def let's keep this and link to the warehouses page

amychen1776 · 2026-06-09T19:54:41Z

+| Row access policy | [Row policy](/sql-reference/statements/create/row-policy) — a `WHERE`-style expression evaluated per user | Row policies apply transparently to every query against the table. |
+| Sequence | [`generateSerialID`](/sql-reference/functions/other-functions#generateSerialID) for a Keeper-backed sequential counter; [`generateSnowflakeID`](/sql-reference/functions/uuid-functions#generateSnowflakeID) or [`generateUUIDv7`](/sql-reference/functions/uuid-functions#generateUUIDv7) for distributed unique IDs | `generateSerialID` is the closest match to an auto-incrementing sequence: a named, monotonic counter coordinated through ClickHouse Keeper. The UUID functions suit high-throughput unique IDs that don't need a shared counter. |
+
+:::note[Time Travel and backups]


Have we thought about keeping just this note and leaving these features out of the table, to cut down on the mentions?

amychen1776 · 2026-06-09T19:55:26Z

+
+| Snowflake | ClickHouse | Notes |
+|---|---|---|
+| Primary key (advisory) | Primary key — drives the on-disk sort order and the [sparse primary index](/guides/best-practices/sparse-primary-indexes) | Where Snowflake's PK is advisory only, ClickHouse's PK is load-bearing — it determines physical layout and is used to prune granules, avoid re-sorts, and short-circuit `LIMIT`. Neither system enforces uniqueness. |


we should explicitly call out the fact that our primary key does not have to be unique. That's like an industry standard (that PKs are unique)
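To make the non-uniqueness point concrete, here is a toy Python model of a sparse primary index over a sorted key column that contains duplicates. It is only an illustrative sketch, not ClickHouse's implementation: the function names and the granule size of 4 are made up for the example, and it assumes a single integer key with point lookups.

```python
# Toy model of a sparse primary index over a sorted, NON-unique key column.
GRANULE_SIZE = 4  # illustrative only; ClickHouse's default index_granularity is 8192 rows


def build_marks(sorted_keys, granule_size=GRANULE_SIZE):
    """Record the first key of each granule; this list is the whole 'index'."""
    return [sorted_keys[i] for i in range(0, len(sorted_keys), granule_size)]


def candidate_granules(marks, key):
    """Return the granules that could contain `key`; all others are pruned."""
    hits = []
    for g, first_key in enumerate(marks):
        is_last = g == len(marks) - 1
        # A granule may hold `key` if it starts at or before `key` and the
        # next granule starts at or after `key`.
        if first_key <= key and (is_last or key <= marks[g + 1]):
            hits.append(g)
    return hits


# Duplicate key values are accepted as-is: nothing deduplicates or rejects them.
keys = [1, 1, 2, 2, 2, 3, 3, 5, 5, 5, 8, 9]
marks = build_marks(keys)
print(marks)                         # [1, 2, 5]
print(candidate_granules(marks, 2))  # [0, 1]
print(candidate_granules(marks, 3))  # [1]
```

The key value 2 spans two granules, so both are scanned. The index narrows the search range without ever identifying a single row, which is why uniqueness is not required.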

amychen1776 · 2026-06-09T19:56:12Z

+| Snowflake | ClickHouse | Notes |
+|---|---|---|
+| Primary key (advisory) | Primary key — drives the on-disk sort order and the [sparse primary index](/guides/best-practices/sparse-primary-indexes) | Where Snowflake's PK is advisory only, ClickHouse's PK is load-bearing — it determines physical layout and is used to prune granules, avoid re-sorts, and short-circuit `LIMIT`. Neither system enforces uniqueness. |
+| Foreign key (advisory) | Wide tables or [dictionaries](/dictionary) for lookups | ClickHouse doesn't accept foreign-key declarations even as advisory hints. |


Are we talking about foreign key constraints or...? I'm confused by this because foreign keys to me are just the join key

amychen1776 · 2026-06-09T19:57:20Z

+| Search Optimization Service | Secondary indexes — [bloom-filter](/engines/table-engines/mergetree-family/mergetree#bloom-filter), token-bloom, [minmax](/engines/table-engines/mergetree-family/mergetree#minmax) | ClickHouse asks you to pick the index type per column and tune its parameters; there's no automatic equivalent. |
+| Cortex Search / Snowflake Cortex Search | [Full-text index](/engines/table-engines/mergetree-family/textindexes) | Token index over string columns for in-database search. |
+| `VECTOR` data type and vector search | [`Array(Float32)`](/sql-reference/data-types/array) or [`Array(BFloat16)`](/sql-reference/data-types/float#bfloat16) with a [vector ANN index](/engines/table-engines/mergetree-family/annindexes); or [`QBit`](/sql-reference/data-types/qbit) for tunable-precision search | ClickHouse has no dedicated `VECTOR` type. Embeddings store as `Array(Float32)`, or `Array(BFloat16)` to halve storage, with an ANN index accelerating approximate nearest-neighbor lookups. `QBit` keeps full precision while letting you trade bits for speed at query time. |
+| Materialized view | [Incremental MV](/materialized-view/incremental-materialized-view) — updates on each insert into a base table | Source-shape rules differ; review both before porting an existing MV. Cost is paid at insert time in ClickHouse. |


fun fact - Snowflake views are extremely limited and don't even support joins :)

amychen1776 · 2026-06-09T20:02:30Z

+| Network policies (IP allowlist) | IP allowlists and [private connectivity](/cloud/security/connectivity/private-networking) — PrivateLink (AWS, Azure) and Private Service Connect (GCP) for ingress restriction | Private connectivity is available across the three major clouds. |
+| Tri-Secret Secure (customer-managed keys) | [CMEK](/cloud/security/cmek) on the service | Supports key rotation and revocation. See the CMEK page for the current list of supported cloud providers. |
+| Object tagging (governance metadata) | — | ClickHouse exposes metadata via `system.*` tables rather than user-defined tags. |
+| Data classification (sensitive-data detection) | — | Not a managed feature; external tools (e.g. DataHub) cover this layer. |


We do support tagging, but it's definitely not at the level of Snowflake's.


morsapaes

Didn't manage to review the whole PR yet, but adding some suggestions for what I was able to review this week.

morsapaes · 2026-06-10T15:22:16Z

+
+## Schemas {#schemas}
+
+A Snowflake schema serves multiple roles and has no single equivalent in ClickHouse.


A schema in Snowflake is technically equivalent to a database in ClickHouse.

morsapaes · 2026-06-10T15:29:50Z

+in Snowflake.
+:::
+
+## Schemas {#schemas}


Not sure this section makes sense as is. It should probably be a subsection of the one above that explains how to map the Snowflake namespace hierarchy to the more restrictive hierarchy we use in ClickHouse. There are known issues when users migrate, e.g. with integrations like dbt.
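One possible way to illustrate that mapping is a small Python sketch. The naming convention below (folding the Snowflake database and schema into a single ClickHouse database name) is purely hypothetical, as is the `to_clickhouse_name` helper; it is not something the PR or the ClickHouse docs prescribe, and it ignores quoted Snowflake identifiers that contain dots.

```python
def to_clickhouse_name(snowflake_fqn, separator="_"):
    """Map 'DATABASE.SCHEMA.TABLE' to a (database, table) pair.

    Hypothetical convention: fold the Snowflake database and schema into one
    ClickHouse database name, so same-named tables in different schemas do
    not collide after the hierarchy is flattened.
    """
    parts = snowflake_fqn.split(".")
    if len(parts) != 3:
        raise ValueError(f"expected DATABASE.SCHEMA.TABLE, got {snowflake_fqn!r}")
    # Lowercasing is an assumption made for this example: unquoted Snowflake
    # identifiers are stored uppercase, while ClickHouse names are case-sensitive.
    database, schema, table = (p.strip().lower() for p in parts)
    return f"{database}{separator}{schema}", table


print(to_clickhouse_name("ANALYTICS.PUBLIC.ORDERS"))   # ('analytics_public', 'orders')
print(to_clickhouse_name("ANALYTICS.STAGING.ORDERS"))  # ('analytics_staging', 'orders')
```

Whatever convention is chosen, tools that assume a three-level namespace need the same rule applied consistently, which is where the migration issues tend to surface.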

morsapaes · 2026-06-10T17:33:42Z

+| Warehouse size (XS through 6X-Large) | Vertical [autoscaling](/cloud/features/autoscaling/vertical) bounds | Sizing is configured as min/max memory and CPU bounds rather than discrete t-shirt sizes; setting min = max effectively fixes the size. |
+| Multi-cluster warehouse | Manual [horizontal scaling](/cloud/features/autoscaling/horizontal) | ClickHouse scales replica count rather than cluster count. There's no direct equivalent to Snowflake's auto-scaling policies (`Standard`/`Economy`); horizontal replica count is set manually. |
+| Auto-suspend / auto-resume | Service [idling](/cloud/features/autoscaling/idling) | Compute stops when there's no work, restarts on the next query. |
+| Resource monitors (credit-quota spend caps) | [Workloads](/operations/workload-scheduling) for runtime scheduling; per-query limits (memory, threads, execution time) | ClickHouse workloads cover runtime resource scheduling but not spend caps; there's no primitive that suspends a service on hitting a credit threshold. |


We do have billing thresholds and threshold-based notifications, might be worth mentioning. These are only informational, though; we don't cap or restrict usage.

morsapaes · 2026-06-10T17:42:25Z

+
+## Billing and pricing model {#billing}
+
+ClickHouse Cloud meters compute as per-minute [compute units (8 GiB RAM, 2 vCPU)](/cloud/manage/billing/overview#how-is-compute-metered) rather than as credits scaled by warehouse size, charges for storage as compressed bytes without Time Travel or Fail-safe overhead, and bills backups as a separate line item rather than bundling them into retention windows. Most Snowflake "serverless compute" features (Snowpipe, Search Optimization, Auto-clustering, materialized view refresh, Cortex) are bundled into service compute on ClickHouse; [ClickPipes](/integrations/clickpipes) is the explicit exception and is [metered separately](/cloud/reference/billing/clickpipes). As in Snowflake, ClickHouse Cloud charges for public internet egress and cross-region data transfer and offers committed-spend discounts. See [ClickHouse Cloud pricing](/cloud/manage/billing/overview) for current rates, tiers, and commitment options.


Can we just follow the current documentation? This paragraph on billing is pretty convoluted. Here, we simply say:

ClickHouse Cloud bills based on the usage of compute, storage, data transfer (egress over the internet and cross-region), and ClickPipes.

More direct to understand. Backups are lumped into storage costs, I don't think we need to (or should) mention them upfront.
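For readers who want the compute-metering arithmetic from the draft paragraph spelled out, a rough Python sketch follows. It assumes usage scales linearly with allocated memory in units of 8 GiB, as the draft states; the function names are invented for the example and the hourly price is a placeholder, not a real rate.

```python
COMPUTE_UNIT_GIB = 8  # one compute unit = 8 GiB RAM and 2 vCPU, per the draft paragraph


def compute_unit_minutes(memory_gib_per_replica, replicas, minutes):
    """Compute-unit-minutes consumed by a service running at a fixed size."""
    units_per_replica = memory_gib_per_replica / COMPUTE_UNIT_GIB
    return units_per_replica * replicas * minutes


def compute_cost(unit_minutes, price_per_unit_hour):
    """Convert unit-minutes into a cost at a placeholder hourly unit price."""
    return unit_minutes / 60 * price_per_unit_hour


# Example: 3 replicas of 16 GiB each, active for 90 minutes.
usage = compute_unit_minutes(16, replicas=3, minutes=90)
print(usage)                                         # 540.0
print(compute_cost(usage, price_per_unit_hour=1.0))  # 9.0
```

Storage, data transfer, and ClickPipes are billed separately and are left out of this sketch.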

morsapaes · 2026-06-11T07:43:29Z

+
+## Storage and tables {#storage-tables}
+
+In ClickHouse, a table's behavior is set at creation time: the engine (MergeTree family) determines merge and storage semantics, and `ORDER BY` / `PARTITION BY` / `TTL` clauses configure physical layout and retention. Many Snowflake per-feature settings map to a clause in the ClickHouse `CREATE TABLE` statement. Physical schema design also differs between platforms; see the [migration guide](./02_migration_guide.md) for design tradeoffs.


"Merge" is a ClickHouse-specific concept, +1 on simplifying this sentence. It feels like we've described this a million times, we should be able to reuse existing descriptions:

In ClickHouse, you define storage and data layout upfront at table creation time. A CREATE TABLE statement specifies not only the columns and data types, but also the table engine and the sorting and indexing strategy through an ORDER BY clause. The ORDER BY clause is equivalent to a Snowflake clustering key: it defines how data is sorted on disk and indexed. In ClickHouse, unlike Snowflake, you don't incur additional background costs for maintaining the sort order once the table is created. This gives you direct control over query performance and storage costs.

Other clauses like PARTITION BY or TTL are available for partitioning, retention, and other data management strategies, as needed. Many of the settings you configure per-feature in Snowflake map to these clauses in a single CREATE TABLE statement. See the migration guide for design tradeoffs.

Initial draft - resource hierarchy and schemas confirmed

786ff5b

vercel Bot had a problem deploying to Preview – clickhouse-docs May 18, 2026 21:20 Failure

Blargian reviewed May 19, 2026


Comment thread docs/cloud/onboard/02_migrate/01_migration_guides/04_snowflake/04_equivalent-concepts.md Outdated

another interation

7505b1a

vercel Bot had a problem deploying to Preview – clickhouse-docs May 20, 2026 21:44 Failure

Merge branch 'main' of https://github.com/ClickHouse/clickhouse-docs …

67f5f1b

…into snowflake-equivalent-concepts

vercel Bot deployed to Preview – clickhouse-docs-ko May 27, 2026 15:56 View deployment

vercel Bot deployed to Preview – clickhouse-docs-jp May 27, 2026 15:56 View deployment

vercel Bot deployed to Preview – clickhouse-docs-zh May 27, 2026 15:56 View deployment

vercel Bot had a problem deploying to Preview – clickhouse-docs May 27, 2026 15:57 Failure

fix broken link

0bd6435

vercel Bot deployed to Preview – clickhouse-docs May 27, 2026 16:32 View deployment

dropping unverifiable claims, tweaks

be0c9c1

vercel Bot deployed to Preview – clickhouse-docs May 27, 2026 18:28 View deployment

Clean up

3ae1209

vercel Bot deployed to Preview – clickhouse-docs May 27, 2026 19:51 View deployment

clean up pass

d1e6f19

vercel Bot deployed to Preview – clickhouse-docs May 28, 2026 18:09 View deployment

Blargian requested changes Jun 1, 2026


PR review feedback

a88c48c

vercel Bot deployed to Preview – clickhouse-docs June 1, 2026 16:08 View deployment

PR review feedback #2

d8b4756

vercel Bot deployed to Preview – clickhouse-docs June 1, 2026 17:21 View deployment

PR feedback #3

c1e00a5

vercel Bot deployed to Preview – clickhouse-docs June 1, 2026 18:23 View deployment

amychen1776 reviewed Jun 9, 2026


Comment thread docs/cloud/onboard/02_migrate/01_migration_guides/04_snowflake/04_equivalent-concepts.md Outdated

amychen1776 reviewed Jun 9, 2026


dhtclk added 2 commits June 10, 2026 09:55

Merge branch 'main' of https://github.com/ClickHouse/clickhouse-docs …

79b1610

…into snowflake-equivalent-concepts

Merge branch 'main' of https://github.com/ClickHouse/clickhouse-docs …

e5b7830

…into snowflake-equivalent-concepts

vercel Bot deployed to Preview – clickhouse-docs June 10, 2026 15:19 View deployment

Update docs/cloud/onboard/02_migrate/01_migration_guides/04_snowflake…

6c2c239

…/04_equivalent-concepts.md Co-authored-by: Amy Chen <46451573+amychen1776@users.noreply.github.com>

vercel Bot deployed to Preview – clickhouse-docs June 10, 2026 21:24 View deployment

Update docs/cloud/onboard/02_migrate/01_migration_guides/04_snowflake…

a875b37

…/04_equivalent-concepts.md Co-authored-by: Amy Chen <46451573+amychen1776@users.noreply.github.com>

vercel Bot deployed to Preview – clickhouse-docs June 10, 2026 21:50 View deployment

dhtclk added 2 commits June 10, 2026 16:59

PR Feedback from Amy #1

65e72a1

Merge branch 'snowflake-equivalent-concepts' of https://github.com/Cl…

08fbeca

…ickHouse/clickhouse-docs into snowflake-equivalent-concepts

vercel Bot deployed to Preview – clickhouse-docs June 10, 2026 22:11 View deployment

PR Feedback

66a2e98

vercel Bot had a problem deploying to Preview – clickhouse-docs June 11, 2026 20:39 Failure

morsapaes reviewed Jun 11, 2026


dhtclk added the "Don't Merge" label Jun 11, 2026


