Featured

How to use OpenTelemetry to Trace and Monitor Apache Kafka Systems



Published
https://cnfl.io/podcast-episode-255 | How can you use OpenTelemetry to gain insight into your Apache Kafka® event systems? Roman Kolesnev, Staff Customer Innovation Engineer at Confluent, is a member of the Customer Solutions & Innovation Division Labs team working to build business-critical OpenTelemetry applications so companies can see what’s happening inside their data pipelines. In this episode, Roman joins Kris to discuss tracing and monitoring in distributed systems using OpenTelemetry. He talks about how monitoring each step of the process individually is critical to discovering potential delays or bottlenecks before they happen; including keeping track of timestamps, latency information, exceptions, and other data points that could help with troubleshooting.

Tracing each request and its journey to completion in Kafka gives companies access to invaluable data that provides insight into system performance and reliability. Furthermore, using this data allows engineers to quickly identify errors or anticipate potential issues before they become significant problems. With greater visibility comes better control over application health - all made possible by OpenTelemetry's unified APIs and services.

As described on the OpenTelemetry.io website, "OpenTelemetry is a Cloud Native Computing Foundation incubating project. Formed through a merger of the OpenTracing and OpenCensus projects." It provides a vendor-agnostic way for developers to instrument their applications across different platforms and programming languages while adhering to standard semantic conventions so the traces/information can be streamed to compatible systems following similar specs.

By leveraging OpenTelemetry, organizations can ensure their applications and systems are secure and perform optimally. It will quickly become an essential tool for large-scale organizations that need to efficiently process massive amounts of real-time data. With its ability to scale independently, robust analytics capabilities, and powerful monitoring tools, OpenTelemetry is set to become the go-to platform for stream processing in the future.

Roman explains that the OpenTelemetry APIs for Kafka are still in development and unavailable for open source. The code is complete and tested but has never run in production. But if you want to learn more about the nuts and bolts, he invites you to connect with him on the Confluent Community Slack channel. You can also check out Monitoring Kafka without instrumentation with eBPF - Antón Rodríguez to learn more about a similar approach for domain monitoring.

EPISODE LINKS
► OpenTelemetry java instrumentation: https://github.com/open-telemetry/opentelemetry-java-instrumentation
► OpenTelemetry collector: https://github.com/open-telemetry/opentelemetry-collector
► Distributed Tracing for Kafka with OpenTelemetry—Kafka London 2022: https://cnfl.io/distributed-tracing-for-kafka-with-opentelemetry-episode-255
► Monitoring Kafka without instrumentation with eBPF: https://cnfl.io/monitoring-extreme-scale-apache-kafka-using-ebpf-at-new-relic-episode-255
► Join the Confluent Community: https://cnfl.io/confluent-community-episode-255
► Learn more with Kafka tutorials, resources, and guides at Confluent Developer: https://cnfl.io/confluent-developer-episode-255
► Use PODCAST100 to get an additional $100 of free Confluent Cloud usage: https://cnfl.io/try-cloud-episode-255
► Promo code details: https://cnfl.io/podcast100-details-episode-255

TIMESTAMPS
0:00 - Inro
4:14 - What is OpenTelemetry?
7:52 - Tracing vs. Logs
11:26 - Three ways to do application-level tracing with OpenTelemetry
15:47 - What can you do if OpenTelemetry's agent doesn't support a specific API?
17:57 - What's missing in OpenTelemetry's native Kafka support?
32:29 - What can you see when using OpenTelemetry?
36:10 - Getting started with OpenTelemetry for event-level tracing
39:14 - Synchronous vs. Asynchronous processes
48:13 - It's a wrap!

CONNECT
Subscribe: https://youtube.com/c/confluent?sub_confirmation=1
Site: https://confluent.io
GitHub: https://github.com/confluentinc
Facebook: https://facebook.com/confluentinc
Twitter: https://twitter.com/confluentinc
LinkedIn: https://www.linkedin.com/company/confluent
Instagram: https://www.instagram.com/confluent_inc

ABOUT CONFLUENT
Confluent is pioneering a fundamentally new category of data infrastructure focused on data in motion. Confluent’s cloud-native offering is the foundational platform for data in motion – designed to be the intelligent connective tissue enabling real-time data, from multiple sources, to constantly stream across the organization. With Confluent, organizations can meet the new business imperative of delivering rich, digital front-end customer experiences and transitioning to sophisticated, real-time, software-driven backend operations. To learn more, please visit www.confluent.io.

#streamprocessing #apachekafka #kafka #confluent
Category
Audio
Be the first to comment