Integrating MySQL with ClickHouse
This page covers two options for integrating MySQL with ClickHouse:
- using the
MySQL
table engine, for reading from a MySQL table - using the
MaterializedMySQL
database engine, for syncing a database in MySQL with a database in ClickHouse
Connecting ClickHouse to MySQL using the MySQL Table Engine
The MySQL
table engine allows you to connect ClickHouse to MySQL. SELECT and INSERT statements can be made in either ClickHouse or in the MySQL table. This article illustrates the basic methods of how to use the MySQL
table engine.
1. Configure MySQL
Create a database in MySQL:
CREATE DATABASE db1;
Create a table:
CREATE TABLE db1.table1 (
id INT,
column1 VARCHAR(255)
);Insert sample rows:
INSERT INTO db1.table1
(id, column1)
VALUES
(1, 'abc'),
(2, 'def'),
(3, 'ghi');Create a user to connect from ClickHouse:
CREATE USER 'mysql_clickhouse'@'%' IDENTIFIED BY 'Password123!';
Grant privileges as needed. (For demonstration purposes, the
mysql_clickhouse
user is granted admin prvileges.)GRANT ALL PRIVILEGES ON *.* TO 'mysql_clickhouse'@'%';
If you are using this feaure in ClickHouse Cloud, you may need the to allow the ClickHouse Cloud IP addresses to access your MySQL instance. Check the ClickHouse Cloud Endpoints API for egress traffic details.
2. Define a Table in ClickHouse
Now let's create a ClickHouse table that uses the
MySQL
table engine:CREATE TABLE mysql_table1 (
id UInt64,
column1 String
)
ENGINE = MySQL('mysql-host.domain.com','db1','table1','mysql_clickhouse','Password123!')The minimum parameters are:
parameter Description example host hostname or IP mysql-host.domain.com database mysql database name db1 table mysql table name table1 user username to connect to mysql mysql_clickhouse password password to connect to mysql Password123! noteView the MySQL table engine doc page for a complete list of parameters.
3. Test the Integration
In MySQL, insert a sample row:
INSERT INTO db1.table1
(id, column1)
VALUES
(4, 'jkl');Notice the existing rows from the MySQL table are in the ClickHouse table, along with the new row you just added:
SELECT
id,
column1
FROM mysql_table1You should see 4 rows:
Query id: 6d590083-841e-4e95-8715-ef37d3e95197
┌─id─┬─column1─┐
│ 1 │ abc │
│ 2 │ def │
│ 3 │ ghi │
│ 4 │ jkl │
└────┴─────────┘
4 rows in set. Elapsed: 0.044 sec.Let's add a row to the ClickHouse table:
INSERT INTO mysql_table1
(id, column1)
VALUES
(5,'mno')Notice the new row appears in MySQL:
mysql> select id,column1 from db1.table1;
You should see the new row:
+------+---------+
| id | column1 |
+------+---------+
| 1 | abc |
| 2 | def |
| 3 | ghi |
| 4 | jkl |
| 5 | mno |
+------+---------+
5 rows in set (0.01 sec)
Summary
The MySQL
table engine allows you to connect ClickHouse to MySQL to exchange data back and forth. For more details, be sure to check out the documentation page for the MySQL table engine.
Replicate a MySQL Database in ClickHouse
This page is not applicable to ClickHouse Cloud. The feature documented here is not yet available in ClickHouse Cloud services. See the ClickHouse Cloud Compatibility guide for more information.
The MaterializedMySQL
database engine allows you to define a database in ClickHouse that contains all the existing tables in a MySQL database, along with all the data in those tables. On the MySQL side, DDL and DML operations can continue to made and ClickHouse detects the changes and acts as a replica to MySQL database.
This article demonstrates how to configure MySQL and ClickHouse to implement this replication.
1. Configure MySQL
Configure the MySQL database to allow for replication and native authentication. ClickHouse only works with native password authentication. Add the following entries to
/etc/my.cnf
:default-authentication-plugin = mysql_native_password
gtid-mode = ON
enforce-gtid-consistency = ONCreate a user to connect from ClickHouse:
CREATE USER clickhouse_user IDENTIFIED BY 'ClickHouse_123';
Grant the needed permissions to the new user. For demonstration purposes, full admin rights have been granted here:
GRANT ALL PRIVILEGES ON *.* TO 'clickhouse_user'@'%';
noteThe minimal permissions needed for the MySQL user are RELOAD, REPLICATION SLAVE, REPLICATION CLIENT and SELECT PRIVILEGE.
Create a database in MySQL:
CREATE DATABASE db1;
Create a table:
CREATE TABLE db1.table_1 (
id INT,
column1 VARCHAR(10),
PRIMARY KEY (`id`)
) ENGINE = InnoDB;Insert a few sample rows:
INSERT INTO db1.table_1
(id, column1)
VALUES
(1, 'abc'),
(2, 'def'),
(3, 'ghi');
2. Configure ClickHouse
Set parameter to allow use of experimental feature:
set allow_experimental_database_materialized_mysql = 1;
Create a database that uses the
MaterializedMySQL
database engine:CREATE DATABASE db1_mysql
ENGINE = MaterializedMySQL(
'mysql-host.domain.com:3306',
'db1',
'clickhouse_user',
'ClickHouse_123'
);The minimum parameters are:
parameter Description example host:port hostname or IP and port mysql-host.domain.com database mysql database name db1 user username to connect to mysql clickhouse_user password password to connect to mysql ClickHouse_123 noteView the MaterializedMySQL database engine doc page for a complete list of parameters.
3. Test the Integration
In MySQL, insert a sample row:
INSERT INTO db1.table_1
(id, column1)
VALUES
(4, 'jkl');Notice the new row appears in the ClickHouse table:
SELECT
id,
column1
FROM db1_mysql.table_1The response looks like:
Query id: d61a5840-63ca-4a3d-8fac-c93235985654
┌─id─┬─column1─┐
│ 1 │ abc │
└────┴─────────┘
┌─id─┬─column1─┐
│ 4 │ jkl │
└────┴─────────┘
┌─id─┬─column1─┐
│ 2 │ def │
└────┴─────────┘
┌─id─┬─column1─┐
│ 3 │ ghi │
└────┴─────────┘
4 rows in set. Elapsed: 0.030 sec.Suppose the table in MySQL is modified. Let's a column to
db1.table_1
in MySQL:alter table db1.table_1 add column column2 varchar(10) after column1;
Now let's insert a row to the modified table:
INSERT INTO db1.table_1
(id, column1, column2)
VALUES
(5, 'mno', 'pqr');Notice the table in ClickHouse now has the new column and the new row:
SELECT
id,
column1,
column2
FROM db1_mysql.table_1The previous rows will have
NULL
forcolumn2
:Query id: 2c32fd15-3c83-480b-9bfc-cba5d932d674
Connecting to localhost:9000 as user default.
Connected to ClickHouse server version 22.2.2 revision 54455.
┌─id─┬─column1─┬─column2─┐
│ 3 │ ghi │ ᴺᵁᴸᴸ │
└────┴─────────┴─────────┘
┌─id─┬─column1─┬─column2─┐
│ 2 │ def │ ᴺᵁᴸᴸ │
└────┴─────────┴─────────┘
┌─id─┬─column1─┬─column2─┐
│ 1 │ abc │ ᴺᵁᴸᴸ │
│ 5 │ mno │ pqr │
└────┴─────────┴─────────┘
┌─id─┬─column1─┬─column2─┐
│ 4 │ jkl │ ᴺᵁᴸᴸ │
└────┴─────────┴─────────┘
5 rows in set. Elapsed: 0.017 sec.
Summary
That's it! The MaterializedMySQL
database engine will keep the MySQL database synced on ClickHouse. There are a few details and limitations, so be sure to read the doc page for MaterializedMySQL for more details.