How to ingest 1B rows/s in ClickHouse
tinybird.co

Minor quibble, but: you'd probably get another 5-15% by upgrading from oldoldstable Debian 11/Bullseye with kernel 5.10 to something modern like Debian 13/Trixie.
Edit: oh, these are containers, right; the difference will be far smaller since the kernel is determined by the host. Still, it's sad to see software versions this old in use.
> probably get another 5-15% by upgrading from oldoldstable Debian 11/Bullseye with kernel 5.10 to something modern like Debian 13/Trixie
For the on-prem/homelab crowd: I upgraded my desktop from Debian 12 to 13 yesterday only to discover my Mellanox ConnectX-3 NIC wasn't supported by the Trixie kernel. From 10GbE to 0GbE in one reboot :/
I've ordered a ConnectX-4, so that should get me going again. Of course, 100GbE is severe overkill for my needs, but since I already have a 100GbE switch I figured why not...
Thanks.
I reused all the scripts and didn't even notice the Linux version.
nice one
I don't know about ClickHouse specifically, but these days it isn't that hard to ingest >10M rows/sec per server, including parsing, processing, indexing, and storage. You can just throw hardware at it.
yeah, I mean, that's basically what Javi talks about in the post... if you can throw hardware at it you can scale it (ingestion scales linearly with shards)
but the post also has some interesting thoughts on how you sustain high-scale ingestion while also handling background merges, reads, etc.
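For anyone wondering what "scales linearly with shards" looks like in practice, here's a minimal ClickHouse sketch (my own illustration, not from the post; the cluster name my_cluster, the default database, and the table/columns are all made up). Inserts into the Distributed table fan out across shards by the sharding key, so each shard you add buys more ingest capacity:

    -- One local table per shard; MergeTree handles indexing and merging
    CREATE TABLE events_local ON CLUSTER my_cluster
    (
        ts      DateTime,
        user_id UInt64,
        payload String
    )
    ENGINE = MergeTree
    ORDER BY (user_id, ts);

    -- Front door: writes are split across shards by rand()
    CREATE TABLE events ON CLUSTER my_cluster AS events_local
    ENGINE = Distributed(my_cluster, default, events_local, rand());

Large batches per INSERT (or async_insert = 1 to let the server batch for you) keep the part count down, so those background merges can actually keep up.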
Is it just me, or is there a noticeable uptick in ClickHouse-related posts that are borderline marketing?
ClickHouse is just awesome and underrated. I tried TimescaleDB for 1-minute stock prices, and it was super slow and inefficient. On ClickHouse, the same workload is blazingly fast.
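For reference, a typical ClickHouse layout for 1-minute bars looks something like this (a sketch with assumed names, not the commenter's actual schema). Sorting by (symbol, ts) is what makes per-symbol range scans fast, and the monthly partitioning prunes old data:

    CREATE TABLE stock_bars
    (
        symbol LowCardinality(String),
        ts     DateTime,
        open   Float64,
        high   Float64,
        low    Float64,
        close  Float64,
        volume UInt64
    )
    ENGINE = MergeTree
    PARTITION BY toYYYYMM(ts)
    ORDER BY (symbol, ts);

    -- e.g. daily highs per symbol over the last 30 days;
    -- the monthly partitions mean only recent parts get scanned
    SELECT symbol, toDate(ts) AS day, max(high) AS day_high
    FROM stock_bars
    WHERE ts >= now() - INTERVAL 30 DAY
    GROUP BY symbol, day
    ORDER BY symbol, day;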