Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Optimizing ClickHouse Performance: Diagnosing and Resolving Common Bottlenecks

ClickHouse, a columnar database designed for high-performance real-time analytics, is excellent at handling large datasets with speed and efficiency. However, performance issues can occur due to factors like unoptimized queries, resource contention, or improper configuration. As data and query complexity grow, keeping ClickHouse fast can be challenging. This blog will explore common bottlenecks, how to diagnose and resolve them, and include a Python script for automating diagnostics. Lets get started!

Catching Flaky Tests Before It's Too Late

This is a guest post from Artem Zakharchenko, creator of MSWJS, an API mocking library for Javascript. He also writes about testing for EpicWeb and on his personal blog. Test flakiness is a big issue. Not only can it be a colossal time investment to detect and fix, but it hurts perhaps the biggest value you get from your tests—their trustworthiness. A test you cannot trust is a useless test. Time spent maintaining a useless test is time wasted; time that could have been spent building.

Performing for the holidays: Look beyond uptime for season sales success

With the holiday shopping season in full swing, poor web performance can have a big impact on revenue. There’s intense competition for online shoppers, and customers will quickly bounce to another site instead of slogging through a bad experience. The best way to track and achieve your web performance goals is through experience-based SLOs (Experience Level Objectives, or XLOs).

Top 5 outages detected by StatusGator in November 2024

StatusGator continues to demonstrate its value by providing early warning alerts for service disruptions, often detecting issues before official acknowledgment. Below, we highlight key incidents from November 2024 where StatusGator’s monitoring helped users stay ahead.

The Leading Synthetic Monitoring Tools

For accurate and effective performance testing, synthetic monitoring has become a staple and this is only going to continue in the coming years. This is mainly due to the fact that this process is beneficial and offers numerous advantages to organizations. With synthetic monitoring, your organization can identify performance issues before they affect real users. By continuously simulating user interactions, your team can highlight and rectify performance bottlenecks and infrastructure issues in real time.

Leveraging AWS Private Image Build for a Compliant Cribl Deployment

In today’s data-driven world, ensuring the security and compliance of your data pipelines is paramount. Cribl Stream and Cribl Edge offer powerful telemetry data management and enrichment solutions. However, deploying these tools within your environment often requires careful consideration of security and compliance standards.

Top AWS monitoring best practices

AWS powers countless businesses with its vast services and unmatched scalability, but managing such a dynamic environment comes with challenges. Effective monitoring isn’t an option—it’s essential for ensuring performance, controlling costs, and maintaining compliance. Without a strategic approach, issues can escalate quickly, impacting customer experiences and business outcomes.

The why and how of network availability monitoring

You might be familiar with the following scenario: You have a monitor displaying 20 open applications to oversee multiple networks or various aspects of your network infrastructure. Your inbox is steadily filling up with emails—many of which you can't seem to open and respond to in a timely manner. Outstanding tasks are accumulating, all due to an unexpected outage in a data center. If this resonates with you, it's likely that you are a network administrator or someone who works closely with them.