Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Why 87% of AI/ML Projects Never Make It Into Production-And How to Fix It

Going from prototype to production is perilous when it comes to artificial intelligence (AI) and machine learning (ML). However, many organizations struggle moving from a prototype on a single machine to a scalable, production-grade deployment. In fact, research has found that the vast majority—87%—of AI projects never make it into production. And for the few models that are ever deployed, it takes 90 days or more to get there.

Your First Dagger Kubernetes Deployment with Shipa

The DevOps and Platform Engineering space certainly is one that evolves fast. As new development paradigms get consumed, supporting the development pipeline is crucial. Pushing a public release of v0.2.x on March 30th, 2022, Dagger, from the creators of Docker, is another approach in portability and consistency in CI/CD pipelines. What the Docker Container has done applications, Dagger is hoping to achieve that with CI/CD pipelines.

[UPDATED] We're changing the way development environment URLs are generated

UPDATE 6 April 2022: There are times where we have to revisit our plan, and today is one of those times. Last week, we released a fix to solve some inconsistencies in how our development URLs are generated. Even though this change was not impacting any production environments, many customers reported that it was breaking their test integrations with third-party services, or that they were reaching a provisioning limit with our Let’s Encrypt certificates.

How to automate verification of deployments with Argo Rollouts and Elastic Observability

Shipping complex applications at high velocity lead to increased failures. Longer pipelines, scattered microservices, and more code inherently lead to bigger complexity where small mistakes may cost you big time.

New StackPod Episode: Implementing an SRE Practice with Yousef Sedky of Axiom/Hyke

For our latest StackPod episode, we invited Hyke’s DevOps team lead and AWS Cloud architect: Yousef Sedky. Axiom Telecom is one of the largest telephone retailers in the United Arab Emirates and Saudi Arabia and Hyke, its sister company, is a distribution platform for mobile products.

Getting started with DNS attacks

Whenever an online service goes down, you're likely to hear three words: "it was DNS!" Blaming DNS might be a running joke among network admins and engineers, but it's one rooted in experience. DNS problems are known for causing massive, Internet-wide outages such as the 2021 Akamai outage that temporarily made the websites for Delta Air Lines, American Express, Airbnb, and others unreachable.