update materialized view with join statement

Tags:

clickhouse

Suppose I have 2 tables A and B. I create a MV(materialized view) with a join query of two tables, psuedo like:

create materialized view a_b engine = Memory as 
select * from(
    select * from A
) all inner join (
    select * from B
) using some_col;

I known that a_b is only updated when inserting data into table A and nothing else happen when push data to B. I want my MV have to update when both table are updated.

My workaround is to create another MV that change postition of A, B and point to a_b like

create materialized view a_b_2 to a_b as 
select * from(
    select * from B
) all inner join (
    select * from A
) using same_col;

I have some questions about this approach:
1. Are there any more legal way to archive same effect in clickhouse?
2. Suppose I have 2 incoming batches data BD_A and BD_B are going to insert to A and B simultaneously. Some data of 2 batches themself (BD_A_B) is fit join condition . Is there any chance that the MV lost those BD_A_B because MV a_b processes BD_A with before-inserted B and MV a_b_2 processes BD_B with before-inserted A.

694

asked Jul 08 '18 15:07

Thang Nguyen

1 Answers

As far as I understand, you are trying to have a workaround of a limitation.

Clickhouse does not support multiple source tables for a MV and they have quite good reasons for this. I actually asked this to devs and got this answer:

In ClickHouse materialized view behaves more like BEFORE INSERT TRIGGER, each time processing new block arrived with insert.

So that is quite natural limitation as inserts to 2 different table will come asynchronously and you usually expect to see in JOINs whole table not only newly arrived blocks.

answered Oct 10 '22 21:10

Ramazan Polat

Related questions
                            
                                How make JOIN table in ClickHouse DB faster?
                            
                                How to setup an admin account for Clickhouse?
                            
                                How to create primary keys in ClickHouse
                            
                                How to search the string in query with case insensitive on Clickhouse database?
                            
                                Code: 210. DB::NetException: Connection refused (localhost:9000)
                            
                                Select only rows with max date
                            
                                Clickhouse JDBC driver class name
                            
                                How to create database in database docker container?
                            
                                Change column name in a table in Clickhouse
                            
                                How to avoid duplicates in clickhouse table?
                            
                                Clickhouse string field disk usage: null vs empty
                            
                                Clickhouse as time-series storage
                            
                                Can I query per hour increment of a accumulation column in clickhouse?
                            
                                Inserting String Array through CSV format in ClickHouse db
                            
                                ClickHouse: How to store JSON data the right way?
                            
                                How understand the granularity and block in ClickHouse?
                            
                                How to kill a process (query) in ClickHouse
                            
                                How to group by time bucket in ClickHouse and fill missing data with nulls/0s
                            
                                Import JSON into ClickHouse

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With