Diverse data access solutions
Tinybird provides a comprehensive data access solution that supports real-time import of data from a variety of modern data sources. The platform natively integrates a Kafka connector, allowing direct mapping of Kafka topics to Tinybird data sources for event processing with sub-second latency. For batch data scenarios, the platform supports timed or triggered import of data files from object storage such as S3, Google Cloud Storage, etc., supporting JSON, CSV, Parquet and other formats. The special Events API provides a channel to push event data directly via HTTP protocol, which facilitates mobile applications and front-ends to report user behavior data directly.
A typical case of an e-commerce recommendation system is that user clickstreams flow through Kafka into Tinybird in real time, while catalog information is periodically updated in bulk from S3. The platform's data version control mechanism ensures that API availability is not affected during data updates. The incremental processing mode consumes only the new data, which greatly improves processing efficiency. For scenarios that require compliance auditing, all data changes have complete metadata records, which meets the security requirements of SOC 2 Type 2.
Supported data source types
- Real-time streaming data: Kafka, Events API
- Object Storage: S3, Google Cloud Storage
- Local files: uploaded via CLI
- Database change log: support for CDC mode
This answer comes from the articleTinybird: a platform for rapidly building real-time data analytics APIsThe