From On-call to Non-call: Resolving Incidents Before They Even Happen
Artificial intelligence has captured the attention of the world, with tools like ChatGPT and large language models (LLMs) driving the conversation. But you don’t need to wait for the future or for new LLM-powered features to start working smarter: the tech industry has been investing in intelligent, automated tools for years, and they’re ready for production now. In this talk, you’ll learn how the engineering teams at Toyota Connected use tools like Datadog Watchdog, Anomaly Detection, and Workflows to make our lives easier and keep our platform stable. I’ll discuss how we configure and use these tools to detect and triage issues earlier, so we can respond before they turn into outages. You’ll leave with an understanding of what these tools do and how you might use them in your own systems.