Latest News

How To Troubleshoot Missing Performance Data in Netreo

Missing performance data or statistics on dashboards or reports is always troublesome and can be critical. Let’s say you and your IT team recently added a new server to handle your growing graphics department. First thing in the morning, you hop on your IT operations dashboard to check CPU utilization. Yikes! No performance data. You check your recent server report and find nothing there, either.

Guide to Monitoring Your Apache Zipkin Environment Using Telegraf

Using Apache Zipkin is important because it provides detailed, end-to-end tracing of requests across distributed systems, helping to identify latency issues and performance bottlenecks. Monitoring your Zipkin environment is crucial to ensure the reliability and performance of your tracing system, allowing you to quickly detect and address any anomalies or downtime.

Platform Engineering Best Practices: Data Security and Privacy

Security is and always will be a huge concern, and Platform Engineering is here to stay. So what are some Platform Engineering best practices that can support your data security and privacy efforts? You’d be surprised where they overlap, and what you can learn about putting security and productivity together. We’ll explain.

Bridging the gap: Integrating network and application monitoring for complete visibility

As technology progresses and applications become more intertwined, sticking to the old ways of monitoring networks separately just doesn’t cut it anymore. Network and application teams often work in silos, using different tools and focusing on different goals. This split approach frequently leaves both sides with a piecemeal understanding of issues, making it challenging to pinpoint and fix problems that span both areas.

Independent, Involved, Informed, and Informative: The Characteristics of a CoPE

As our Field CTO Liz Fong-Jones says, production excellence is important for cloud-native software organizations because it ensures a safe, reliable, and sustainable system for an organization’s customers and employees. A CoPE helps organizations cultivate the practices and tools necessary to achieve that consistently. In part one of our CoPE series, we compared the CoPE to safety departments.

Announcing HAProxy 3.0

Here we are in our twenty-third year, and open source HAProxy is going strong. HAProxy is the world’s fastest and most widely used software load balancer, with over one billion downloads on Docker Hub. It is the G2 category leader in API management, container networking, DDoS protection, web application firewall (WAF), and load balancing.

Grafana Loki query acceleration: How we sped up queries without adding resources

As we discussed when we rolled out the latest major release of Grafana Loki, we’ve grown the log aggregation system over the past five years by balancing feature development with supporting users at scale. A big part of the latter has been making queries much faster — and that was a major focus with Loki 3.0 too. We’ve seen peak query throughput grow from 10 GB/s in our Loki 1.0 days to greater than 1 TB/s even before 3.0.

Removing ad trackers and cookies - the technical perspective

Sentry recently completed a multi-month project to remove all non-essential cookies and trackers from our public websites. For more context, see three related blog posts: two that offer differing perspectives on the project, one from our marketing team and one from our legal team, and a third that explains our privacy values and our ultimate motivation.