Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

AI in the enterprise: Avoid hitting the infrastructure performance wall

“It’s nearly impossible to manage the growing complexity for corporate on-prem and Cloud infrastructure,” says Tim Conley, Principal at The ATS Group & Galileo Suite. “Most IT teams use a mix of tools to monitor and measure the health of their environment. However, this delays incident resolution, contributes to silos within an IT organization, and slows down your business.”

Funding update: $840k secured and more to come

As with all start-ups, and especially for a cloud provider, access to funds is imperative to build and scale quickly – after all, building out new data centre regions doesn't come cheap! So in recent months we quietly opened a seed round to raise $2.8m in funding, giving Civo a pre-money valuation of $16,800,000. Since launching into beta nearly 2 years ago, we’ve had tons of VC companies knocking on our door, but at this stage we decided not to take VC money.

Bulk Update Multiple WebLogic WLSDM Settings via WL-OPC

When you need to change WLSDM settings across many WebLogic domains, use the “WLSDM Configuration” page to standardize those settings in bulk. WL-OPC spares you from juggling numerous tabs, avoids confusion, and saves you time. The “WLSDM Configuration” page is rich in content and simple to use.

Qovery goes beyond app deployment - The Future of Qovery - Week #5

During the next six weeks, our team will work to improve the overall experience of Qovery. We gathered all your feedback (thank you to our wonderful community 🙏), and we decided to make significant changes to make Qovery a better place to deploy and manage your apps. This series will reveal all the changes and features you will get in the next major release of Qovery. Let's go!

Reduce Toil with Better Alerting Systems

If not tackled early, increasing toil can affect the morale and productivity of your SRE team. In this blog we look at some of the ways you can counter toil with better alerting systems. Are you an SRE or on-call engineer struggling to manage toil? Toil is any repetitive or monotonous activity that can lead to frustration within an incident management team. At the business level, too, toil doesn't add any functional value towards growth and productivity.

Logz.io Debuts Multiple Tracing Accounts and Jaeger Architecture Visualization

Logz.io has pressed hard to align our tracing and metrics analytics capabilities over the past year. And as our technology advances, so does our service. We are announcing Multiple Tracing Accounts with Logz.io Distributed Tracing, aligning it with our logging and metrics tools. Complementing multiple data sources for metrics and logs, Logz users can segment their data according to sources and teams for better organization.

HAProxy Forwards Over 2 Million HTTP Requests per Second on a Single Arm-based AWS Graviton2 Instance

For the first time, a software load balancer exceeds 2 million RPS on a single Arm instance. A few weeks ago, while I was working on an HAProxy issue related to thread locking contention, I found myself running some tests on a server with an 8-core, 16-thread Intel Xeon W2145 processor that we have in our lab. Although my intention wasn’t to benchmark the proxy, I observed HAProxy reach 1.03 million HTTP requests per second.

How we use metamonitoring Prometheus servers to monitor all other Prometheus servers at Grafana Labs

One of the big questions in monitoring can be summed up as: Who watches the watchers? If you rely on Prometheus for your monitoring, and your monitoring fails, how will you know? The answer is a concept known as metamonitoring. At Grafana Labs, a handful of geographically distributed metamonitoring Prometheus servers monitor all other Prometheus servers and each other cross-cluster, while their alerting chain is secured by a dead-man’s-switch-like mechanism.
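To make the dead-man's-switch idea concrete, here is a minimal Python sketch. It is not Grafana Labs' implementation: the class name `DeadMansSwitch`, its methods, and the 60-second timeout are hypothetical and chosen only for illustration. The idea is that the monitored system sends a heartbeat while it is healthy, and an independent watcher raises the alarm when the heartbeat stops arriving.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DeadMansSwitch:
    """Hypothetical watcher that trips when heartbeats stop arriving."""

    timeout_seconds: float
    last_heartbeat: Optional[float] = None

    def heartbeat(self, now: float) -> None:
        # Called by the monitored system on every healthy cycle.
        self.last_heartbeat = now

    def is_tripped(self, now: float) -> bool:
        # Assumption: a switch that has never received a heartbeat
        # is treated as tripped, so a silent start is not mistaken for health.
        if self.last_heartbeat is None:
            return True
        return now - self.last_heartbeat > self.timeout_seconds


# Timestamps are passed in explicitly (seconds) so the logic is easy to test.
switch = DeadMansSwitch(timeout_seconds=60)
switch.heartbeat(now=100.0)
healthy = switch.is_tripped(now=150.0)   # False: 50 s since last heartbeat
silent = switch.is_tripped(now=161.0)    # True: 61 s exceeds the timeout
```

The key design point is the inversion: the alert fires on the absence of a signal rather than on its presence, so a monitoring stack that dies silently is still noticed by whatever holds the switch.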