The latest News and Information on Service Reliability Engineering and related technologies.
Site Reliability Engineering (SRE) is playing an increasingly pivotal role in supporting hybrid-cloud, DevOps environments, where Dev teams need to release updates fast and Ops need to avoid errors and failures in production. Powered by integrations to monitoring, orchestration, provisioning and ITSM tools, Interlink’s SRE solution brings improved understanding of where threats to the health of your IT services might lurk within DevOps workflows.
As companies today are racing to build site reliability engineering(SRE) practices within their engineering teams, site reliability engineering has become one of the hottest and highest paying jobs in tech. Site reliability engineering was a term coined by Google engineer Benjamin Treynor in 2003 when he was tasked with making sure that Google services were reliable, secure and functional.
Many organizations are transitioning toward a DevOps operational model, where software developers are responsible for operating the applications they develop, instead of a centralized IT operations group. In this “CTO Perspective” interview we talk to BigPanda’s CTO Elik Eizenberg about the challenges in that transition, and what it takes to make it easier. Lean back and watch the interview, or if you prefer reading, take a few minutes to read the transcript.