Replies: 1 comment
-
In Doris, if the dictGet functionality in Clickhouse needs to be replaced, the Join operation is usually used to implement the association of dimensional data. For the scenario you mentioned, because the IP data in the latitude table is relatively dispersed, the IP data in the fact table may not be equally dispersed, which puts some restrictions on using Colocation Join. Colocation Join has the advantage of avoiding data shuffling, which makes the Join more efficient, but it relies on the data being bucketed in a specific way so that the Join can be performed directly on the corresponding node. If the IP data distribution of the latitude and fact tables does not fit the Colocation Join requirements, you may want to consider the following solutions: Use Broadcast Join: If the latitude table is not large, a Broadcast Join can be used to broadcast the latitude table to all nodes, so that each node can perform a Join operation locally. However, be aware that if the latitude table is large, this approach may incur higher network overhead. |
Beta Was this translation helpful? Give feedback.
-
DictGet
is a function in Clickhouse to quickly get the value mapped in theDict table
by key. With functionDictGet
, I can complete the operation of labelling the data in the query SQL.I checked the official Doris documentation and there doesn't seem to be a similar feature.
I currently have some Clickhouse SQL which uses
DictGet
. i currently want to migrate to Doris, best practice is to change toJoin
?Beta Was this translation helpful? Give feedback.
All reactions