Operations | Monitoring | ITSM | DevOps | Cloud

Blog

Building a more reliable infrastructure with new Stackdriver tools and partners

Every software organization faces challenges in keeping applications available and running reliably. At Google, we’ve developed and practiced a discipline known as Site Reliability Engineering (SRE). Following SRE practices lets us build and operate services reliably for our billions of users. Google has about 2,500 Site Reliability Engineers who support both internal and external services.

Pivotal Cloud Foundry architecture

Pivotal Cloud Foundry (PCF) is a multi-cloud platform for the deployment, management, and continuous delivery of applications, containers, and functions. PCF is a distribution of the open source Cloud Foundry developed and maintained by Pivotal Software, Inc. PCF is aimed at enterprise users and offers additional features and services—from Pivotal and from other third parties—for installing and operating Cloud Foundry as well as to expand its capabilities and make it easier to use.

Psychological Theories Behind Agile Software Development

With an almost 20-year career in social services—including working in institutions such as The University of Chicago, the United States Peace Corps, Chicago public schools, Child Protective Services, the Catholic Church, and for the U.S. federal government in probation and parole programs—my leap into Silicon Valley was as much a culture shock as the 2.5 years I lived in the Andes Mountains of Ecuador.

Must-Have Features Of Every Effective Website

Every reliable web development company understands that the website of a company has a direct impact on the company’s business either positively or negatively. So, it is your duty to engage your web development company on ways by which your website will boost your lead generation and conversion rate. On that note, here are some of the features your web development company must include in the design and development of your website.

How StatusHub Complements and Extends Your Incident Management Process?

Although the main focus of StatusHub is incident communication, it compliments each 5 activities of Incident Management: Identification, Categorization, Prioritization, Response and Communication with the user community through the life of the incident.

AWS CloudWatch Configuration Guide: Getting Started

If you remember getting an Erector Set as a kid, I’m sorry. In a stocking full of toy building systems, an Erector Set is the proverbial lump of coal. The instructions are complicated, and the pieces are made of metal, connected together with tiny screws. Few children have ever completed one of these sets successfully.

Exploring Network Monitoring? 3 Things to Look For

Network performance monitoring and diagnostics (commonly referred to as NPM or NPMD) tools are valuable for IT Ops teams that want to maintain visibility into the health and performance of their networking infrastructure. They provide this visibility in two major ways: by retrieving diagnostic data from network infrastructure components (such as routers and switches) and by analyzing network traffic flow and quality of service through various techniques (such as deep packet inspection).