Operations | Monitoring | ITSM | DevOps | Cloud

swampUP Recap: "EveryOps" is Trending as a Software Development Requirement

swampUP 2024, the annual JFrog DevOps conference, was unique in addressing not only the more familiar DevOps and DevSecOps issues but also specific operational challenges stemming from the explosive growth of GenAI. That growth has created a need for specialized capabilities for handling AI models and datasets, along with support for new personas such as AI/ML engineers, data scientists, and MLOps professionals.

IT Infrastructure Support: Complete Guide And Best Practices

Ensuring that systems run smoothly is a key priority for any organization that relies on technology. To achieve this, effective IT infrastructure support is essential. This involves implementing a robust process for managing and maintaining hardware, software, networks, and other critical components that keep operations running seamlessly.

Managing Software Acquisition: Best Practices For Your Business

When it comes to acquiring new software for your organization, it's easy to focus on finding the right product and getting it implemented as quickly as possible. But without a structured software acquisition process, that can lead to overspending, compliance risks, or ending up with tools that don’t fit your needs. In this article, we’ll take a closer look at the software acquisition process and explore practical ways to improve it.

Flaky tests: their hidden costs and how to address flaky behavior

Flaky tests are bad—this is a fact implicitly understood by developers, platform and DevOps engineers, and SREs alike. When tests flake (i.e., generate conflicting results across test runs, without any changes to the code or test), they can arbitrarily fail builds, requiring developers to re-run the test or the full pipeline. This process can take hours—especially for large or monolithic repositories—and slow down the software delivery cycle.
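
The definition above (conflicting results across runs with no change to the code or test) can be turned into a simple check over recorded test results. The following is a minimal sketch, not a method from the article: the function name `find_flaky_tests` and the `(test_name, revision, passed)` record shape are assumptions made for illustration.

```python
from collections import defaultdict


def find_flaky_tests(runs):
    """Return the names of tests that both passed and failed on the same revision.

    `runs` is an iterable of (test_name, revision, passed) tuples. This record
    shape is hypothetical; adapt it to whatever your CI system exports.
    """
    outcomes = defaultdict(set)
    for test_name, revision, passed in runs:
        # Group outcomes by test AND revision, so a result that changes
        # because the code changed is not mistaken for flakiness.
        outcomes[(test_name, revision)].add(bool(passed))

    # A test is flagged when one revision has seen both True and False.
    return sorted({name for (name, _rev), seen in outcomes.items() if len(seen) == 2})


if __name__ == "__main__":
    history = [
        ("test_login", "abc123", True),
        ("test_login", "abc123", False),   # same revision, different result
        ("test_cart", "abc123", False),
        ("test_cart", "def456", True),     # result changed along with the code
        ("test_search", "abc123", True),
    ]
    print(find_flaky_tests(history))
```

Grouping by revision is the key design choice here: `test_cart` changes outcome only after the code changes, so it is treated as a genuine fix or regression rather than a flake.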

Beyond Their Intended Scope: Uzing into Russia

The first installment of our new blog series, Beyond Their Intended Scope, covers BGP mishaps that may have escaped the community’s attention but are worthy of analysis. In this post, we review a recent BGP leak that redirected internet traffic through Russia and Central Asia as a result of a path error leak by Uztelecom, the incumbent service provider of Uzbekistan.

Key Metrics to Monitor for a Healthy Kafka Cluster

Maintaining a healthy Kafka cluster is critical to ensuring your real-time data pipelines run smoothly. However, keeping your Kafka environment in tip-top shape isn’t just about setting it up and letting it run. Regular monitoring of key metrics is essential to catch issues before they escalate, optimize performance, and keep everything humming along. So, what should we be looking at when it comes to Kafka metrics? Let’s break down the most important ones and how to interpret them.

AWS X-Ray vs Jaeger - Choosing the Right Distributed Tracing Tool

Distributed tracing has become an essential part of any application's performance monitoring strategy. As businesses adopt distributed architectures, choosing the right tracing tool is crucial for efficient troubleshooting. Two prominent choices are AWS X-Ray and Jaeger, each offering unique features and advantages. AWS X-Ray, a managed service by Amazon, simplifies tracing for applications running on AWS.