Project Metamorphosis: Wir präsentieren die Event-Streaming-Plattform der nächsten Generation. Mehr erfahren

Dublin Apache Kafka Meetup by Confluent - Recording

Watch recording

Joined by an Apache Kafka community member

Watch a recording from the Dublin Apache Kafka Meetup by Confluent from July 4, 2017.

Rethinking Stream Processing with Apache Kafka: Applications vs. Clusters, Streams vs. Databases

Speaker: Michael Noll, Product Manager, Confluent

Den Kern moderner Unternehmen bilden Daten, und diese Daten befinden sich in stetigem Wandel. Wie können wir diese Flut an Informationen in Echtzeit erfassen und nutzen? Die Antwort lautet Datenstromverarbeitung – und Apache Kafka ist die Technologie, die sich zur Kernplattform für Daten-Streaming entwickelt hat. Zu den tausenden von Unternehmen, die Kafka nutzen, um ihre jeweiligen Branchen zu transformieren und neu zu definieren, gehören Netflix, Uber, PayPal und AirBnB, aber auch etablierte Größen wie Goldman Sachs, Cisco und Oracle.

Unfortunately, today’s common architectures for real-time data processing at scale suffer from complexity: there are many technologies that need to be stitched and operated together, and each individual technology is often complex by itself. This has led to a strong discrepancy between how we, as engineers, would like to work vs. how we actually end up working in practice.

In this session we talk about how Apache Kafka helps you to radically simplify your data architectures. We cover how you can now build normal applications to serve your real-time processing needs — rather than building clusters or similar special-purpose infrastructure — and still benefit from properties such as high scalability, distributed computing, and fault-tolerance, which are typically associated exclusively with cluster technologies. We discuss common use cases to realize that stream processing in practice often requires database-like functionality, and how Kafka allows you to bridge the worlds of streams and databases when implementing your own core business applications (inventory management for large retailers, patient monitoring in healthcare, fleet tracking in logistics, etc), for example in the form of event-driven, containerized microservices.


Real-time Data Integration at Scale with Kafka Connect**

Speaker: Robin Moffat, Partner Technology Evangelist, Confluent

Apache Kafka is a streaming data platform. It enables integration of data across the enterprise, and ships with its own stream processing capabilities. But how do we get data in and out of Kafka in an easy, scalable, and standardised manner? Enter Kafka Connect. Part of Apache Kafka since 0.9, Kafka Connect defines an API that enables the integration of data from multiple sources, including MQTT, common NoSQL stores, and CDC from relational databases such as Oracle. By "turning the database inside out" we can enable an event-driven architecture in our business that reacts to changes made by applications writing to a database, without having to modify those applications themselves. As well as ingest, Kafka Connect has connectors with support for numerous targets, including HDFS, S3, and Elasticsearch.

This presentation will briefly recap the purpose of Kafka, and then dive into Kafka Connect, with practical examples of data pipelines that can be built with it and are in production at companies around the world already. We'll also look at the Single Message Transform (SMT) capabilities introduced with Kafka and how they can make Kafka Connect even more flexible and powerful.