Operations | Monitoring | ITSM | DevOps | Cloud

%term

What site reliability engineering (SRE) and how is it different from DevOps?

Site reliability engineering (SRE) is Google’s approach to service management where software engineers run production systems using a software engineering approach. It’s clear that Google is unique, and they usually need to tackle software bugs and errors in different and non-conventional ways. But having software engineers doing a job that is traditionally done by professionals with a systems administration background sounds impractical.

How to Stay Ahead of Data Retention Requirements - Part 2

In part 1 of this series, we tried to outline what data retention is and why it is needed to overcome increasing requirements for various regulatory standards. As detailed, there are some clear guidelines for organizations to take what we called a “data retention approach for compliance”. In this follow up post, outline some specific technological and procedural challenges you might face as well as some practical guidelines and strategies to overcome them.

Opsgenie Actions: Sign Up For Early Access

When operating always-on services, engineers need to quickly respond to alerts and prevent issues from becoming outages. Fortunately, many alerts can be resolved through easy changes to systems or network infrastructure. However, these tasks still require manual intervention and cause interruptions for on-call responders.

DNS Hijacking: What You Need to Know

Crashed websites and slow loading pages can be devastating for any site owner. But there’s another type of threat that often goes undetected. A report published by FireEye on Thursday details a particular type of DNS hijacking that allows hackers to easily steal information. These attacks have been going on for approximately two years and involve three different methods that compromise websites without alarming users.