5 docs tagged with "real-time"

Apache Druid: Real-Time Analytics Database

Apache Druid is a high-performance, distributed analytics database designed for real-time ingestion and fast querying of large-scale data. It's optimized for time-series data and provides sub-second query performance for analytical workloads.

Change Data Capture (CDC): Real-time Data Synchronization

Change Data Capture (CDC) is a design pattern that identifies and captures changes made to data in a database, then delivers those changes in real time to downstream systems. CDC enables real-time data integration, synchronization, and streaming analytics across heterogeneous systems.

Elasticsearch Overview

Elasticsearch is a distributed, RESTful search and analytics engine built on top of Apache Lucene. It provides near real-time search and analytics and is commonly used for log analytics, full-text search, security intelligence, business analytics, and operational intelligence. It is a key component of the Elastic Stack (formerly known as the ELK Stack), which includes Elasticsearch, Logstash, and Kibana.

Kafka CDC Integration: Real-time Data Streaming

This guide covers integrating Change Data Capture (CDC) with Apache Kafka to create real-time data streaming pipelines. We'll explore Kafka Connect, Debezium, and custom implementations for building robust CDC systems.

StarRocks: High-Performance OLAP Database

StarRocks is a high-performance, distributed OLAP database designed for real-time analytics and sub-second query performance. It features an MPP (Massively Parallel Processing) architecture optimized for analytical workloads with high concurrency support.