Exploring Kafka Streams Partitioning, Scaling, and Fault Tolerance

Exploring Kafka Steams Partitioning, Scaling, and Fault Tolerance – we’ll build a simple streaming app and inspect the contents of the repartition topic, RocksDB state store, and the Kafka internal changelog topic. Also, we’ll see how the workload gets distributed between the running app instances.

Continue Reading

Integration Tests for your Kafka Producer with “Testcontainers” in C#

An article about integration tests for your Kafka workflow. Using C# and the “Testcontainers” framework, we first run Kafka in Docker, call an API that will push a message to Kafka, consume the produced Kafka message ensuring it’s correct, and finally drop the created containers.

Continue Reading

Monitoring Public Transportation with the Kafka Ecosystem (Demo Project)

Monitoring Public Transportation with the Kafka Ecosystem – a demo project part of the Udacity “Data Streaming” Nanodegree. It contains data streaming workflows using Kafka Producers and Consumers, JSON and Avro serialization, Kafka Connect (JDBC), REST Proxy, Robinhood Faust, and KSQL.

Continue Reading

Should You Migrate Your Reporting Queries From a “General Purpose” DB (MongoDB) to a Data Warehouse (ClickHouse)? (Performance Overview)

Migrating Your Reporting Queries From a “General Purpose” DB to a Data Warehouse – Performance Overview. The article focuses on MongoDB and ClickHouse, but most takeaways apply to other “traditional” DBs and “data warehouse” solutions.

Continue Reading

Site Footer