Master key concepts with interactive flashcards
What is Amazon Kinesis Data Streams and when should you use it for ML pipelines?
Click to reveal answer
Kinesis Data Streams is a real-time data streaming service that captures and processes data continuously. Use it for ML pipelines when you need: real-time feature computation from event streams, sub-second data ingestion for online learning, ordered processing of time-series data, and replay capability for reprocessing. Key settings: shard count controls throughput (1 MB/s in, 2 MB/s out per shard), retention period (24 hours default, up to 365 days).
Click to show question