Operations | Monitoring | ITSM | DevOps | Cloud

What can SREs do to make holiday season's peak traffic less chaotic?

Holiday season's peak traffic is the most challenging period for SREs and on-call engineers. In this blog, we have highlighted the things that SREs can do to make the holiday season less chaotic. The recently concluded Black Friday weekend could have potentially been the most challenging shift for on-call engineers working in the Retail or E-Commerce sector. Since such peak-traffic events push the system to the limits, engineering teams are engulfed in a lot of tension preparing for it.

How to Use Rapid Experimentation To Improve Big Data Adaptability?

Big Data, a serious shift toward rapid experimentation, is the need of the day for most firms who are interested in reaping its potential benefits and constructing a wise and clear path to the changeover. Big Data has been the topic of discussion for a few years and is a method for businesses to acquire large amounts of data on their customers in order to deal with that data while respecting customer privacy and adhering to ethical guidelines.

State of IT Management Survey Report 2020-21

As we continue to adapt following the pandemic, which has impact us all both personally and professionally, we take this moment to commemorate the IT veterans we've lost to the pandemic. With the pandemic drastically changing the way we do business, we have conducted a study to understand the state of IT management at the height of these radical changes and analyzed how to offer a holistic approach to changing IT management needs to prepare for the post-pandemic IT world.

7 Trends in Database DevOps & Monitoring - Download the infographic

Earlier this year, we surveyed over 5,700 global IT professionals and asked them what the most pressing challenges they faced in Database DevOps and Monitoring are. We also asked specific questions to gauge what trends we could spot in the industry and compared the responses to the last 3-5 years of data we have.

An Introduction to Log Analysis

If you think log files are only necessary for satisfying audit and compliance requirements, or to help software engineers debug issues during development, you’re certainly not alone. Although log files may not sound like the most engaging or valuable assets, for many organizations, they are an untapped reservoir of insights that can offer significant benefits to your business.

New in the Kubernetes integration for Grafana Cloud: curated dashboards, built-in alerts, and more

Back in May, we announced the Kubernetes integration to help users easily monitor and alert on core Kubernetes cluster metrics using the Grafana Agent, our lightweight observability data collector optimized for sending metric, log, and trace data to Grafana Cloud. The integration allows Grafana Cloud users to monitor and alert on Kubernetes cluster metrics. Since the original release, we’ve added new features and enhancements to help our users go even further.

Network AF, Episode 6: Cat Gurinski on mentorship and the shared languages of network engineering

In the latest episode of the Network AF podcast, your host Avi Freedman welcomes his friend and networking pro Cat Gurinski to the show. As a senior network engineer with loads of experience, Cat is most passionate about automation and troubleshooting, and especially loves to use Python and Arista’s pyeapi frameworks in her pursuits. She’s also the current chair of the NANOG Program Committee, and previously worked for companies like Best Buy, Switch and Data, and Equinix.