Operations | Monitoring | ITSM | DevOps | Cloud

October 2020

The Future of Ops Careers

Have you seen Lambda: A Serverless Musical? If not, you really have to. I love Hamilton, I love serverless, and I’m not trying to be a crank or a killjoy or police people’s language. BUT, unfortunately, the chorus chose to double-down on one of the stupidest and most dangerous tendencies the serverless movement has had from day one: misunderstanding and trash-talking operations.

New Product Updates What Does it Mean to Observe and Debug in 'Hi Res'

A number of Honeycomb features have been released throughout spring 2019 that, collectively, we like to say deliver “hi-res” across the Engineering and DevOps lifecycle. What do we mean? First, hi-res with Honeycomb means you get clearer visibility about how your production is behaving in real time, as you release new code. Secondly, it means once you have those insights (thanks to granular event data stored in Honeycomb), you can debug and resolve more efficiently. So, how do we do it?

Eaze into Observability

On-call teams use Honeycomb’s analytics to discover exactly what is happening with code in production. While incident response is a key reason engineers rely on Honeycomb, observability also delivers unique value during the development process. Eaze takes observability a step further and uses Honeycomb to prioritize what’s needed to stabilize their existing service while informing how they build their new Go and Node.js microservices platform all at the same time.

Handle Unruly Outliers with Log Scale Heatmaps

We often say that Honeycomb helps you find a needle in your haystack. But how exactly is that done? This post walks you through when and how to visualize your data with heatmaps, creating a log scale to surface data you might otherwise miss, and using BubbleUp to quickly discover the patterns behind why certain data points are different.

Honeycomb Learn Ep. 4: Bubble-Up to Spot Outliers in Production

The power of Honeycomb lies in the way you analyze production data using different interactive views. See what's happening across many dimensions (fields) in your system with BubbleUp. Pick the timeframe, breakdown by any field, such as customer name or ID, then filter by a specific dataset or where any errors occur. The query results are heatmap that highlight events over the baseline, over time. Use BubbleUp to select outliers on the heatmap and drill down to all related fields in that data. It will help you understand which part of the code is misbehaving.

Honeycomb Learn Ep 5 Never Alone On Call

In this webinar, we’ll discuss and show how: Honeycomb's query history gives rich meaningful context, Honeycomb’ers dogfood and learn from each others' compound wisdom, benefits span engineering cycles and use-cases when debugging and maintaining, & to build a culture of observability and why you should do it now.

Honeycomb Learn Ep 1 Instrument Better for a Happy Debugging Team

Nathan LeClaire, Sales Engineer @honeycombio knows first-hand that the key to instrumenting code is to start with baby steps. With Honeycomb, a little instrumentation will give vast insights as soon as you ingest your data. With Honeycomb Beelines, we take the heavy lifting out of instrumenting. Listen to learn: See Honeycomb in action, hear best practices, and learn how fast and painless instrumentation can be.

Honeycomb Learn Ep. 2: De-stress Debugging -Triggers, Feature Flags, & Fast Query

This episode in our Honeycomb Learn series looks at how to cut stress levels when debugging issues in production. Starting with a hypothesis, run fast queries, and then navigate to the code where the problem lies. Be proactive and set triggers to let you know if something needs attention. When engineering is about to ship a new release, set a feature flag to watch how production behaves in real-time. Curtail performance issues and reduce customer impact with the right tools to better understand production systems, right now.

Honeycomb Learn Ep. 3: See The Trace? Discover Errors, Latency & More across Distributed Systems

Distributed systems bring complexity for developer and ops teams. When incidents occur in production, expected and unexpected, you want to pinpoint which part of the service is giving problems. Distributed tracing illuminates distributed systems, making your logs easier to navigate. Quickly identify where there are errors or latency in your code or service, even within 3rd party services you use. Instrumentation is the key to the best tracing experience possible.

A User Journey: Setting Up the Node Beeline on Lambda

Nic Wise at Tend Health recently wrote a series of blog posts exploring how they moved away from logs and metrics, toward adopting observability with Honeycomb. In that series, he shares lessons learned as they got their NodeJS app instrumented in an AWS environment making use of CloudFront, API Gateway, Lambda, and a few other services. Tend is a New Zealand-based healthcare platform launching in 2020.

Outreach Engages Their Production Code with Honeycomb

Outreach is the number one sales engagement platform with the largest customer base and industry-leading usage. Outreach helps companies dramatically increase productivity and drive smarter, more insightful engagement with their customers. Outreach is a privately held company based in Seattle, Washington. Tech Environment & Tools: Millions of Logs and Limited View Metrics.

Observability 101: Terminology and Concepts

When I first started following Charity on Twitter back in early 2019, I was quickly overwhelmed by the new words and concepts she was discussing. I liked the results she described: faster debugging, less alert fatigue, happier users. Those are all things I wanted for my team! But I was hung up on these big polysyllabic words, which stopped me from taking those first steps toward improving our own observability.

Ask an SRE Panel Talk

Our SRE Leaders Panel series gathers leading minds in the SRE and resilience community to share their insights. In this edition, we are so excited to have an amazing all-women panel who will be diving deep into testing in production: The event will consist of 40 minutes of roundtable discussion with Shelby and Talia facilitated by Blameless' Staff SRE Amy Tobey, followed by 20 minutes of Q&A from the audience. This is an open and candid discussion so come with your questions. We look forward to seeing you there!