Operations | Monitoring | ITSM | DevOps | Cloud


Join Ken on SMC Journal - Scaling Kubernetes, Microservices, and Ephemeral Environments

Check out Ken Ahrens and Scott Moore as they discuss some blockers of developer productivity when building in Kubernetes, and how removing environment and data challenges can reduce toil and frustration! You can catch the full podcast on Scott’s page.

Scott Moore: Hey everybody out there in internet meme land. It’s time to hide your kids and hide your wife because it’s time for the SMC Journal podcast. Some of you will get that joke. Others will not.

The Impact of MTTR on Customer Satisfaction and Business Success

Today, businesses are increasingly reliant on their ability to provide uninterrupted service and respond swiftly to any disruptions. Whether it's a website outage, a malfunctioning application, or hardware failure, downtime can significantly affect a company's operations. Customers expect quick resolutions, and delays can result in dissatisfaction, loss of trust, and ultimately, business failure.

Benefits as Well as Drawbacks of AI in the Industry - Kellyn Gorman | Redgate

Kellyn Gorman, Director of Data and AI at Silk, shares the biggest benefits as well as drawbacks of AI in the industry. Test Data Management is the process of providing DevOps teams with test data to evaluate the performance and functionality of applications.

Battle-Tested Reliability Strategies - Incidentally Reliable with Abhishek Ghosh

We dive into the trenches with Abhishek Ghosh, a veteran who has led SRE teams at Pinterest, and now at Cribl. He shares gripping war room stories from Pinterest, strategies for maintaining uptime, insights into the role of AI in observability, and more! Discover the future of SRE and learn how to navigate the challenges of digital reliability. Tune in to gain valuable lessons from one of the industry's leading experts.

What Is Five 9s in Availability Metrics?

What comes to mind when you hear that an IT component has “five 9s availability”? Five 9s availability of >= 99.999% is the peak metric for IT availability. Five 9s predicts that a measured component — whether it is a server, communication line, app, service, or any other item — will be available at least 99.999% of the time during a specific period.
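To make the percentage concrete, the downtime budget implied by an availability target can be worked out with simple arithmetic. The snippet below is a minimal sketch, not part of the original article: the function name `allowed_downtime_seconds` and the choice of a 365-day year and a 30-day month as measurement periods are assumptions made for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60
YEAR_SECONDS = 365 * SECONDS_PER_DAY      # assumed measurement period: 365-day year
MONTH_SECONDS = 30 * SECONDS_PER_DAY      # assumed measurement period: 30-day month


def allowed_downtime_seconds(availability_percent: float, period_seconds: float) -> float:
    """Return the maximum downtime (in seconds) a component may accumulate
    over the period while still meeting the availability target."""
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability_percent must be between 0 and 100")
    unavailable_fraction = 1 - availability_percent / 100
    return period_seconds * unavailable_fraction


if __name__ == "__main__":
    # Compare three, four, and five 9s over a year and a month.
    for target in (99.9, 99.99, 99.999):
        per_year_min = allowed_downtime_seconds(target, YEAR_SECONDS) / 60
        per_month_s = allowed_downtime_seconds(target, MONTH_SECONDS)
        print(f"{target}%: {per_year_min:.2f} min/year, {per_month_s:.2f} s/month")
```

Under these assumptions, five 9s leaves a budget of roughly 5.26 minutes of downtime per year, or about 26 seconds in a 30-day month, which is why it is treated as such a demanding target.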

Capitalizing on the Potential of Automation in Network Operations: Why Integration is Key

In many organizations, network teams are experiencing a significant skills shortage. The network operations center (NOC) requires expertise in various emerging technologies, which makes it increasingly challenging to find qualified candidates with the right skills. A recent survey revealed that in 2022, only 26% of companies found it somewhat to very difficult to hire networking professionals. By 2024, this figure had risen to 41%.

Enhancing IT Monitoring with DX UIM 23.4 Cumulative Update 2

In the ever-evolving landscape of IT infrastructure, staying ahead of potential issues and ensuring optimal performance is crucial. Broadcom’s DX Unified Infrastructure Management (DX UIM) has been a trusted solution for comprehensive monitoring and management. With the release of DX UIM 23.4 Cumulative Update 2, users can expect a host of new features and improvements designed to enhance their monitoring capabilities.

Six ways Australian local government IT teams can benefit from AIOps in monitoring

Running IT operations in an Australian city council is a complex role that faces a unique set of challenges and opportunities. Typically, a city council in an advanced country like Australia runs its IT on a hybrid model, with existing on-premises installations working in tandem with modern cloud platforms, such as Azure.