Operations | Monitoring | ITSM | DevOps | Cloud

DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Advice for building an incident management program

On this weeks' episode of The Debrief, we chatted with Jeff Forde, an Architect on the Platform Engineering team at Collectors. With a background spanning finance, healthcare, and various product-led startups, Forde has honed his expertise in DevOps, site reliability, and platform engineering. Beyond his professional life, he's also a dedicated volunteer first responder and certified fire instructor in Connecticut, offering him a unique perspective on managing incidents of all typesz.

Ubuntu AI | S2E3 | GPU utilisation optimisation at KubeconEU 2024

Maciej is not only the host of our podcast, but also an experienced keynote speaker. After a joint keynote at KubeconEU 2023 about highly sensitive data, in 2024, Maciej goes to Paris to talk about the GPU utilisation. During our podcast, we cover a lot of aspects of GPU utilisation. From best practices to existing tooling, there are different angles that Maciej talk about, giving a sneak-peak into his keynote. Are you curious how open source tooling plays a role in optimising the GPU utilisation? Listen to our podcast!

Changelog Breakdown: Focus Tab, GitKraken.dev, & more

Dive into the latest GitKraken Client updates – starting with Focus View, helping you prioritize all PRs, Issues, and WIPs so you waste less time wondering, "What's next?" and more time coding. Worried about security? We've got new customizable protections to ensure that your work (and your mind) stays safe and at ease. Whether you're managing your Workspaces or sharing code with Cloud Patches, GitKraken brings everything you care about into one accessible, secure, and efficient place.

Cloud threat detection and response

Google Security Command Center (SCC) Enterprise is the industry’s first cloud risk management solution that fuses cloud security and enterprise security operations - supercharged by Mandiant expertise and AI at Google scale. Watch and learn how to detect threats to your cloud resources and automate attack response.

SOC 2 Compliance Requirements: Examples, Use Cases + More

SOC 2 compliance requirements (Service Organization Controls Type 2) ensure that customer data stays private and secure — essential for any business that stores or processes sensitive data. In this blog, we’ll explore the specifics of SOC 2 compliance, and provide a solution to help you automate and enforce SOC 2 compliance going forward.

Release Roundup March 2024: More ways to discover and test your services

2024 is off to a fast start here at Gremlin. Since our last release roundup, we’ve released new experiment types, new features to improve integration with cloud platforms, and improvements to our auto-detection processes. Now you can push processes to their limits, find dependencies even easier, limit when tests can be run, and much more. We also introduced a slew of platform improvements to improve efficiency, performance, and user experience in the Gremlin web application.

Schedule Cron Jobs in Node.js with Node-Cron

Cron jobs are tasks set to run by themselves at certain times or intervals. They help with doing repetitive tasks automatically, like backing up data, sending emails, and updating systems. In Node.js, cron jobs can make tasks in applications run by themselves, making things more efficient and reliable. Node.js gives a good way to set these tasks through different libraries and tools.

Secure External Document Sharing in SharePoint

SharePoint, a product of Microsoft’s suite of office tools, has revolutionized the way organizations collaborate and manage documents. At its core, SharePoint is designed to facilitate the seamless sharing of information, both within an organization and with external partners. The ability to share documents externally is particularly valuable in today’s global business environment, where collaboration with vendors, clients, and contractors across geographical boundaries is commonplace.

How to Monitor ClickHouse With Telegraf and MetricFire

Monitoring your ClickHouse database is a proactive measure that helps maintain its health and ensure that it continues to meet the needs of your applications and users efficiently. It allows you to address issues before they become critical, ensuring that your database environment is secure, reliable, and performing optimally. In this article, we'll detail how to use the Telegraf agent to collect performance metrics from your ClickHouse clusters, and forward them to a datasource.

The Value Hosted Graphite brings to the Heroku Marketplace

Hosted Graphite is a time-series metrics monitoring tool used for application, systems, infrastructure and network monitoring. HostedGraphite is a Hosted Graphite service that offers the full capabilities and benefits of Graphite, without any of the hassle of trying to set up your own open-source Graphite installation.