
Commit f61cb22

Merge branch 'main' into redo_translation_zh
2 parents: c680440 + 6f30e6e

15 files changed: +132 additions, −45 deletions

docs/cloud/guides/index.md

Lines changed: 1 addition & 19 deletions
@@ -7,22 +7,4 @@ doc_type: 'landing-page'
 ---
 
 <!--AUTOGENERATED_START-->
-| Page | Description |
-|-----|-----|
-| [Accessing S3 Data Securely](/cloud/security/secure-s3) | This article demonstrates how ClickHouse Cloud customers can leverage role-based access to authenticate with Amazon Simple Storage Service(S3) and access their data securely. |
-| [AWS PrivateLink](/manage/security/aws-privatelink) | This document describes how to connect to ClickHouse Cloud using AWS PrivateLink. |
-| [Azure Private Link](/cloud/security/azure-privatelink) | How to set up Azure Private Link |
-| [Cloud Compatibility](/whats-new/cloud-compatibility) | This guide provides an overview of what to expect functionally and operationally in ClickHouse Cloud. |
-| [Cloud IP Addresses](/manage/security/cloud-endpoints-api) | This page documents the Cloud Endpoints API security features within ClickHouse. It details how to secure your ClickHouse deployments by managing access through authentication and authorization mechanisms. |
-| [Common Access Management Queries](/cloud/security/common-access-management-queries) | This article shows the basics of defining SQL users and roles and applying those privileges and permissions to databases, tables, rows, and columns. |
-| [Configuring organization and service role assignments within the console](/cloud/guides/sql-console/configure-org-service-role-assignments) | Guide showing how to configure org and service role assignments within the console |
-| [Configuring SQL console role assignments](/cloud/guides/sql-console/config-sql-console-role-assignments) | Guide showing how to configure SQL console role assignments |
-| [Data masking in ClickHouse](/cloud/guides/data-masking) | A guide to data masking in ClickHouse |
-| [Gather your connection details](/cloud/guides/sql-console/gather-connection-details) | Gather your connection details |
-| [GCP Private Service Connect](/manage/security/gcp-private-service-connect) | This document describes how to connect to ClickHouse Cloud using Google Cloud Platform (GCP) Private Service Connect (PSC), and how to disable access to your ClickHouse Cloud services from addresses other than GCP PSC addresses using ClickHouse Cloud IP access lists. |
-| [Inviting new users](/cloud/security/inviting-new-users) | This page describes how administrators can invite new users to their organisation and assign roles to them |
-| [Multi tenancy](/cloud/bestpractices/multi-tenancy) | Best practices to implement multi tenancy |
-| [SAML SSO Setup](/cloud/security/saml-setup) | How to set up SAML SSO with ClickHouse Cloud |
-| [Setting IP Filters](/cloud/security/setting-ip-filters) | This page explains how to set IP filters in ClickHouse Cloud to control access to ClickHouse services. |
-| [Usage limits](/cloud/bestpractices/usage-limits) | Describes the recommended usage limits in ClickHouse Cloud |
-<!--AUTOGENERATED_END-->
+<!--AUTOGENERATED_END-->

docs/deployment-guides/replication-sharding-examples/02_2_shards_1_replica.md

Lines changed: 16 additions & 14 deletions
@@ -557,11 +557,7 @@ SHOW DATABASES;
 
 ## Create a table on the cluster {#creating-a-table}
 
-Now that the database has been created, you will create a distributed table.
-Distributed tables are tables which have access to shards located on different
-hosts and are defined using the `Distributed` table engine. The distributed table
-acts as the interface across all the shards in the cluster.
-
+Now that the database has been created, you will create a table.
 Run the following query from any of the host clients:
 
 ```sql
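The `CREATE TABLE` statement itself falls outside this hunk's context. Based on the `Distributed` definition later in this file's diff, which references `uk.uk_price_paid_local` on `cluster_2S_1R`, it presumably looks roughly like the following sketch (the column list and sorting key are hypothetical, not from the diff):

```sql
-- Sketch only: columns and ORDER BY are illustrative.
CREATE TABLE uk.uk_price_paid_local ON CLUSTER cluster_2S_1R
(
    `price` UInt32,
    `date` Date,
    `town` LowCardinality(String)
    -- remaining columns omitted
)
ENGINE = MergeTree
ORDER BY (town, date);
```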
@@ -608,8 +604,6 @@ SHOW TABLES IN uk;
 └─────────────────────┘
 ```
 
-## Insert data into a distributed table {#inserting-data}
-
 Before we insert the UK price paid data, let's perform a quick experiment to see
 what happens when we insert data into an ordinary table from either host.
 
@@ -622,7 +616,7 @@ CREATE TABLE test.test_table ON CLUSTER cluster_2S_1R
 `id` UInt64,
 `name` String
 )
-ENGINE = ReplicatedMergeTree
+ENGINE = MergeTree()
 ORDER BY id;
 ```
 
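For context, the experiment described around this hunk amounts to inserting one row on each host and then reading locally; a minimal sketch (the values are hypothetical, not the rows used in the guide):

```sql
-- On the first host:
INSERT INTO test.test_table (id, name) VALUES (1, 'first row');

-- On the second host:
INSERT INTO test.test_table (id, name) VALUES (2, 'second row');

-- On either host, a local read returns only that host's row,
-- because MergeTree (unlike ReplicatedMergeTree) does not replicate:
SELECT * FROM test.test_table;
```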
@@ -654,16 +648,18 @@ SELECT * FROM test.test_table;
 -- └────┴────────────────────┘
 ```
 
-You will notice that only the row that was inserted into the table on that
+You will notice that, unlike with a `ReplicatedMergeTree` table, only the row that was inserted into the table on that
 particular host is returned and not both rows.
 
-To read the data from the two shards we need an interface which can handle queries
+To read the data across the two shards, we need an interface which can handle queries
 across all the shards, combining the data from both shards when we run select queries
-on it, and handling the insertion of data to the separate shards when we run insert queries.
+on it, or inserting data to both shards when we run insert queries.
 
-In ClickHouse this interface is called a distributed table, which we create using
+In ClickHouse this interface is called a **distributed table**, which we create using
 the [`Distributed`](/engines/table-engines/special/distributed) table engine. Let's take a look at how it works.
 
+## Create a distributed table {#create-distributed-table}
+
 Create a distributed table with the following query:
 
 ```sql
@@ -674,8 +670,12 @@ ENGINE = Distributed('cluster_2S_1R', 'test', 'test_table', rand())
 In this example, the `rand()` function is chosen as the sharding key so that
 inserts are randomly distributed across the shards.
 
-Now query the distributed table from either host and you will get back
-both of the rows which were inserted on the two hosts:
+Now query the distributed table from either host, and you will get back
+both of the rows which were inserted on the two hosts, unlike in our previous example:
+
+```sql
+SELECT * FROM test.test_table_dist;
+```
 
 ```sql
 ┌─id─┬─name───────────────┐
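The full definition of `test.test_table_dist` is truncated by the diff context. Given the engine clause shown in the hunk header, a plausible reconstruction (the `AS` clause is an assumption) would be:

```sql
-- Sketch: reuse the local table's structure for the distributed interface.
CREATE TABLE test.test_table_dist ON CLUSTER cluster_2S_1R
AS test.test_table
ENGINE = Distributed('cluster_2S_1R', 'test', 'test_table', rand());
```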
@@ -694,6 +694,8 @@ ON CLUSTER cluster_2S_1R
 ENGINE = Distributed('cluster_2S_1R', 'uk', 'uk_price_paid_local', rand());
 ```
 
+## Insert data into a distributed table {#inserting-data-into-distributed-table}
+
 Now connect to either of the hosts and insert the data:
 
 ```sql
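The insert statement after this hunk is cut off. Writing through the distributed table is an ordinary `INSERT`; the engine routes each row to a shard according to the sharding key. A hedged sketch (the source table name is hypothetical; the guide actually loads the UK price paid dataset):

```sql
-- Sketch only: 'uk.uk_price_paid_staging' is an illustrative source.
INSERT INTO uk.uk_price_paid_distributed
SELECT * FROM uk.uk_price_paid_staging;
```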

docs/deployment-guides/replication-sharding-examples/03_2_shards_2_replicas.md

Lines changed: 20 additions & 9 deletions
@@ -586,12 +586,9 @@ SHOW DATABASES;
 └────────────────────┘
 ```
 
-## Create a distributed table on the cluster {#creating-a-table}
+## Create a table on the cluster {#creating-a-table}
 
-Now that the database has been created, next you will create a distributed table.
-Distributed tables are tables which have access to shards located on different
-hosts and are defined using the `Distributed` table engine. The distributed table
-acts as the interface across all the shards in the cluster.
+Now that the database has been created, next you will create a table with replication.
 
 Run the following query from any of the host clients:
 
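The query itself sits below this hunk's context window. In a 2-shard, 2-replica topology it would create a `ReplicatedMergeTree` table `ON CLUSTER`; a sketch under those assumptions (the cluster name, columns, and sorting key are hypothetical; the table name comes from a later hunk in this file):

```sql
CREATE TABLE uk.uk_price_paid_local ON CLUSTER cluster_2S_2R
(
    `price` UInt32,
    `date` Date
    -- remaining columns omitted
)
ENGINE = ReplicatedMergeTree
ORDER BY (date);
```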
@@ -663,14 +660,16 @@ SHOW TABLES IN uk;
 
 ## Insert data into a distributed table {#inserting-data-using-distributed}
 
-To insert data into the distributed table, `ON CLUSTER` cannot be used as it does
+To insert data into the table, `ON CLUSTER` cannot be used as it does
 not apply to DML (Data Manipulation Language) queries such as `INSERT`, `UPDATE`,
 and `DELETE`. To insert data, it is necessary to make use of the
 [`Distributed`](/engines/table-engines/special/distributed) table engine.
+As you learned in the [guide](/architecture/horizontal-scaling) for setting up a cluster with 2 shards and 1 replica, distributed tables are tables which have access to shards located on different
+hosts and are defined using the `Distributed` table engine.
+The distributed table acts as the interface across all the shards in the cluster.
 
 From any of the host clients, run the following query to create a distributed table
-using the existing table we created previously with `ON CLUSTER` and use of the
-`ReplicatedMergeTree`:
+using the existing replicated table we created in the previous step:
 
 ```sql
 CREATE TABLE IF NOT EXISTS uk.uk_price_paid_distributed
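The `CREATE TABLE` statement is truncated by the hunk. Mirroring the 2-shard 1-replica example earlier in this commit, the full form is presumably along these lines (the cluster name `cluster_2S_2R` and the `AS` clause are assumptions):

```sql
CREATE TABLE IF NOT EXISTS uk.uk_price_paid_distributed
ON CLUSTER cluster_2S_2R
AS uk.uk_price_paid_local
ENGINE = Distributed('cluster_2S_2R', 'uk', 'uk_price_paid_local', rand());
```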
@@ -749,4 +748,16 @@ SELECT count(*) FROM uk.uk_price_paid_local;
 └──────────┘
 ```
 
-</VerticalStepper>
+</VerticalStepper>
+
+## Conclusion {#conclusion}
+
+The advantage of this cluster topology with 2 shards and 2 replicas is that it provides both scalability and fault tolerance.
+Data is distributed across separate hosts, reducing storage and I/O requirements per node, while queries are processed in parallel across both shards for improved performance and memory efficiency.
+Critically, the cluster can tolerate the loss of one node and continue serving queries without interruption, as each shard has a backup replica available on another node.
+
+The main disadvantage of this cluster topology is the increased storage overhead: it requires twice the storage capacity compared to a setup without replicas, as each shard is duplicated.
+Additionally, while the cluster can survive a single node failure, losing two nodes simultaneously may render the cluster inoperable, depending on which nodes fail and how shards are distributed.
+This topology strikes a balance between availability and cost, making it suitable for production environments where some level of fault tolerance is required without the expense of higher replication factors.
+
+To learn how ClickHouse Cloud processes queries, offering both scalability and fault-tolerance, see the section ["Parallel Replicas"](/deployment-guides/parallel-replicas).

docs/deployment-guides/replication-sharding-examples/_snippets/_working_example.mdx

Lines changed: 1 addition & 1 deletion
@@ -2,5 +2,5 @@
 The following steps will walk you through setting up the cluster from
 scratch. If you prefer to skip these steps and jump straight to running the
 cluster, you can obtain the example
-files from the [examples repository](https://github.com/ClickHouse/examples/tree/main/docker-compose-recipes)
+files from the examples repository ['docker-compose-recipes' directory](https://github.com/ClickHouse/examples/tree/main/docker-compose-recipes/recipes).
 :::

docs/integrations/data-ingestion/clickpipes/postgres/faq.md

Lines changed: 4 additions & 0 deletions
@@ -356,3 +356,7 @@ Yes, for a Postgres ClickPipe with replication mode as CDC or Snapshot + CDC, yo
 <Image img={failover_slot} border size="md"/>
 
 If the source is configured accordingly, the slot is preserved after failovers to a Postgres read replica, ensuring continuous data replication. Learn more [here](https://www.postgresql.org/docs/current/logical-replication-failover.html).
+
+### I am seeing errors like `Internal error encountered during logical decoding of aborted sub-transaction` {#transient-logical-decoding-errors}
+
+This error suggests a transient issue with the logical decoding of an aborted sub-transaction, and is specific to custom implementations of Aurora Postgres. Since the error comes from the `ReorderBufferPreserveLastSpilledSnapshot` routine, it suggests that logical decoding is unable to read the snapshot spilled to disk. It may be worth trying to increase [`logical_decoding_work_mem`](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-LOGICAL-DECODING-WORK-MEM) to a higher value.
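On self-managed Postgres the setting can be raised with standard SQL; on Aurora it must instead be changed through the cluster parameter group. A sketch (the `256MB` value is an illustrative starting point, not a recommendation from this FAQ):

```sql
-- Check the current value (the Postgres default is 64MB):
SHOW logical_decoding_work_mem;

-- Raise it cluster-wide and reload the configuration:
ALTER SYSTEM SET logical_decoding_work_mem = '256MB';
SELECT pg_reload_conf();
```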
docs/integrations/data-visualization/dot-and-clickhouse.md

Lines changed: 72 additions & 0 deletions
@@ -0,0 +1,72 @@
+---
+sidebar_label: 'Dot'
+slug: /integrations/dot
+keywords: ['clickhouse', 'dot', 'ai', 'chatbot', 'mysql', 'integrate', 'ui', 'virtual assistant']
+description: 'AI Chatbot | Dot is an intelligent virtual data assistant that answers business data questions, retrieves definitions and relevant data assets, and can even assist with data modelling, powered by ClickHouse.'
+title: 'Dot'
+doc_type: 'guide'
+---
+
+import Image from '@theme/IdealImage';
+import dot_01 from '@site/static/images/integrations/data-visualization/dot_01.png';
+import dot_02 from '@site/static/images/integrations/data-visualization/dot_02.png';
+import CommunityMaintainedBadge from '@theme/badges/CommunityMaintained';
+
+# Dot
+
+<CommunityMaintainedBadge/>
+
+[Dot](https://www.getdot.ai/) is your **AI Data Analyst**.
+It connects directly to ClickHouse so you can ask data questions in natural language, discover data, test hypotheses, and answer why questions — directly in Slack, Microsoft Teams, ChatGPT or the native Web UI.
+
+## Pre-requisites {#pre-requisites}
+
+- A ClickHouse database, either self-hosted or in [ClickHouse Cloud](https://clickhouse.com/cloud)
+- A [Dot](https://www.getdot.ai/) account
+- A [Hashboard](https://www.hashboard.com/) account and project.
+
+## Connecting Dot to ClickHouse {#connecting-dot-to-clickhouse}
+
+<Image size="md" img={dot_01} alt="Configuring ClickHouse connection in Dot (light mode)" border />
+<br/>
+
+1. In the Dot UI, go to **Settings → Connections**.
+2. Click on **Add new connection** and select **ClickHouse**.
+3. Provide your connection details:
+   - **Host**: ClickHouse server hostname or ClickHouse Cloud endpoint
+   - **Port**: `9440` (secure native interface) or `9000` (default TCP)
+   - **Username / Password**: user with read access
+   - **Database**: optionally set a default schema
+4. Click **Connect**.
+
+<Image img={dot_02} alt="Connecting ClickHouse" size="sm"/>
+
+Dot uses **query-pushdown**: ClickHouse handles the heavy number-crunching at scale, while Dot ensures correct and trusted answers.
+
+## Highlights {#highlights}
+
+Dot makes data accessible through conversation:
+
+- **Ask in natural language**: Get answers without writing SQL.
+- **Why analysis**: Ask follow-up questions to understand trends and anomalies.
+- **Works where you work**: Slack, Microsoft Teams, ChatGPT, or the web app.
+- **Trusted results**: Dot validates queries against your schemas and definitions to minimize errors.
+- **Scalable**: Built on query-pushdown, pairing Dot's intelligence with ClickHouse's speed.
+
+## Security and governance {#security}
+
+Dot is enterprise-ready:
+
+- **Permissions & roles**: Inherits ClickHouse user access controls
+- **Row-level security**: Supported if configured in ClickHouse
+- **TLS / SSL**: Enabled by default for ClickHouse Cloud; configure manually for self-hosted
+- **Governance & validation**: Training/validation space helps prevent hallucinations
+- **Compliance**: SOC 2 Type I certified
+
+## Additional resources {#additional-resources}
+
+- Dot website: [https://www.getdot.ai/](https://www.getdot.ai/)
+- Documentation: [https://docs.getdot.ai/](https://docs.getdot.ai/)
+- Dot app: [https://app.getdot.ai/](https://app.getdot.ai/)
+
+Now you can use **ClickHouse + Dot** to analyze your data conversationally — combining Dot's AI assistant with ClickHouse's fast, scalable analytics engine.
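Step 3 of the connection guide above assumes a ClickHouse user with read access. If you need to create one for Dot, a minimal sketch (the user name, password, and database are hypothetical):

```sql
-- Dedicated read-only user for Dot:
CREATE USER dot_reader IDENTIFIED WITH sha256_password BY 'choose-a-strong-password';

-- Allow Dot to list and read tables in the target database:
GRANT SHOW TABLES, SELECT ON analytics.* TO dot_reader;
```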

docs/integrations/data-visualization/index.md

Lines changed: 2 additions & 0 deletions
@@ -29,6 +29,7 @@ Now that your data is in ClickHouse, it's time to analyze it, which often involv
 - [Astrato](./astrato-and-clickhouse.md)
 - [Chartbrew](./chartbrew-and-clickhouse.md)
 - [Deepnote](./deepnote.md)
+- [Dot](./dot-and-clickhouse.md)
 - [Draxlr](./draxlr-and-clickhouse.md)
 - [Embeddable](./embeddable-and-clickhouse.md)
 - [Explo](./explo-and-clickhouse.md)
@@ -53,6 +54,7 @@ Now that your data is in ClickHouse, it's time to analyze it, which often involv
 | [AWS QuickSight](./quicksight-and-clickhouse.md) | MySQL interface | ✅ | ✅ | Works with some limitations, see [the documentation](./quicksight-and-clickhouse.md) for more details |
 | [Chartbrew](./chartbrew-and-clickhouse.md) | ClickHouse official connector | ✅ | ✅ | |
 | [Deepnote](./deepnote.md) | Native connector | ✅ | ✅ | |
+| [Dot](./dot-and-clickhouse.md) | Native connector | ✅ | ✅ | |
 | [Explo](./explo-and-clickhouse.md) | Native connector | ✅ | ✅ | |
 | [Fabi.ai](./fabi-and-clickhouse.md) | Native connector | ✅ | ✅ | |
 | [Grafana](./grafana/index.md) | ClickHouse official connector | ✅ | ✅ | |
