This book teaches you how to implement scalable observability, how to improve engineering efficiency by leveraging AI, and how to expand observability practices from production all the way into development by integrating them into modern internal development platforms!
You’ll start with foundational concepts of observability, log analytics, and metrics, then explore how AIOps enhances signal correlation, noise reduction, anomaly detection, and root cause analysis. Using examples and architectural guidance, the book shows you how to integrate AIOps components into existing systems and build pipelines that proactively detect and resolve issues before they impact users.
You will learn best practices for implementing observability in operations and, from there, extending it left into the software development lifecycle, giving engineers AI observability as a self-service capability through internal development platforms (IDPs).
Through practical use cases and examples, you’ll learn to use tools such as OpenTelemetry, Prometheus, Elasticsearch, and Grafana alongside machine learning models for automated diagnostics and remediation.
By the end of this book, you’ll be able to design and implement AIOps-enabled observability solutions to make your systems more resilient, responsive, and efficient. You will learn how modern observability has evolved from monitoring static systems to leveraging AI in today’s dynamic cloud-native environments.
If you want to read the first chapter for free, click here. Accept the invitation and you will receive a private message from Andi.
You can pre-order the book on Amazon or the Packt website!
The Authors
Hilliary Lipsig is an autodidact and start-up veteran who has frequently learned and applied technologies to get a job done. She’s had her hand in every part of the application delivery process, honing her skills originally as a Quality Engineer. Hilliary is an IT polyglot able to talk the lingo of both the Operations and Development teams.
She’s currently a Senior Principal SRE at Red Hat, and she’s passionate about process, consistency in tooling, and scalability.
Andreas Grabner is a technical advocate for making distributed systems observable and making automated data-driven decisions across the software development lifecycle. In his capacity as a CNCF ambassador and a DevRel at Dynatrace, he connects and educates global software engineering communities on building and continuously validating digital services for resiliency, high availability, and security.
Since his early days, he has been passionate about software quality and performance engineering, which has resulted in building excellent digital products. Andi uses his advocacy platforms to share best practices on topics such as observability, progressive delivery, DevOps, site reliability engineering, platform engineering, and digital business operations!
Robert Rati is a platform engineering veteran of small, medium, and large corporations in regulated industries ranging from wireless communications to the financial sector. He is passionate about reducing noise and enabling teams to focus on creating business value. He emphasises maintainability, consistency, user friendliness, and productivity when planning projects. Robert is currently a Senior Software Engineer with Second Front.
