%term

Building your AI infra, our tips

Aug 1, 2025 By Canonical Ubuntu In Canonical

Modular architecture: Decouple compute from storage so each can scale independently. This makes it easier to adapt to growing or shifting workloads over time. Future-ready hardware: Select GPUs and CPUs not just for current workloads but with an eye on scalability, including support for newer accelerator types. Scalable design: Ensure the system allows seamless addition of compute nodes or storage without a full redesign.

View Video

Canonical

Read more about Building your AI infra, our tips

Running AI without blowing up your storage

Aug 1, 2025 By Canonical Ubuntu In Canonical

Storage is often underestimated: In infrastructure discussions, compute and networking get most of the attention, while storage is treated as secondary. For AI workloads, that can be a costly oversight. Data throughput for specialized hardware: AI infrastructure powered by GPUs can process massive volumes of data at unprecedented speeds. This puts immense pressure on the storage system to keep up. Scale-out performance: An on-prem, scale-out, software-defined storage setup allows you to meet high performance demands, grow capacity as needed, and stay in control of infrastructure costs.

View Video

Canonical

Read more about Running AI without blowing up your storage

CapCut for Real Estate: AI Voice Narration for Property Tours

Aug 1, 2025 By OpsMatters In OpsMatters

Listing videos have proved a potent display of property available on the internet; however, not all videos with good frames cut through the market. The CapCut Desktop Video Editor has been designed as an all-in-one editing tool that enables real estate professionals to design a property tour with AI voiceover, action transitions, and high-definition pictures. CapCut gives the opportunity to create high-quality, compelling virtual tours even in the case of absence of a professional narrator and a studio where it is possible to shoot.

Read Post

OpsMatters

Read more about CapCut for Real Estate: AI Voice Narration for Property Tours

When AI Becomes the Judge: Understanding "LLM-as-a-Judge"

Jul 31, 2025 By Alin Dobra In Bunnyshell

Imagine building a chatbot or code generator that not only writes answers - but also grades them. In the past, ensuring AI quality meant recruiting human reviewers or using simple metrics (BLEU, ROUGE) that miss nuance. Today, we can leverage Generative AI itself to evaluate its own work. LLM-as-a-Judge means using one Large Language Model (LLM) - like GPT-4.1 or Claude 4 Sonnet/Opus - to assess the outputs of another. Instead of a human grader, we prompt an LLM to ask questions like "Is this answer correct?" or "Is it on-topic?" and return a score or label. This approach is automated, fast, and surprisingly effective.

Read Post

Bunnyshell

Read more about When AI Becomes the Judge: Understanding "LLM-as-a-Judge"

Can Agentic AI Fix the Chatbot Fatigue in the CX Industry? A Strategic Analysis for CXOs

Jul 31, 2025 By iOPEX Technologies , In iOPEX

Belinda Parmar, CEO of The Empathy Business, in a recent article with Financial Times, said, Customer service has undergone a significant transformation in recent years. Where success was once measured by resolution speed and cost efficiency, today’s customers expect far more. They seek personalized interactions, contextual awareness, and a genuine human touch, delivered alongside fast, reliable support.

Read Post

iOPEX

Read more about Can Agentic AI Fix the Chatbot Fatigue in the CX Industry? A Strategic Analysis for CXOs

With AI, You're Gonna Have to Manage Your (Massive) Energy Use in SPM

Jul 31, 2025 By Jason Kotlinski In Broadcom

Forget boring spreadsheets. Strategic portfolio management (SPM) isn't just about ticking boxes. It’s the big boss plan that makes sure every penny spent and every project your company starts points towards the main goal. It's your company's smart GPS, guiding you through the AI energy maze. When it comes to AI's power hunger, SPM is a knight in shining armor. It helps leaders get smart, making sure they grab all the fancy tech without trashing the world.

Read Post

Broadcom

Read more about With AI, You're Gonna Have to Manage Your (Massive) Energy Use in SPM

Smarter debugging with Sentry MCP and Cursor

Jul 31, 2025 By Cody De Arkland In Sentry

Debugging a production issue with Cursor? Your workflow probably looks like this: Alt-Tab to Sentry, copy error details, switch back to your IDE, paste into Cursor. By the time you’ve context-switched three times, you’ve lost your flow and you’re looking at generic suggestions that don’t show any understanding of your actual production environment or codebase.

Read Post

Sentry

Read more about Smarter debugging with Sentry MCP and Cursor

Semantic Caching: What We Measured, Why It Matters

Jul 31, 2025 By Rahul Raj In Catchpoint

Semantic caching promises to make AI systems faster and cheaper by reducing duplicate calls to large language models (LLMs). But what happens when it doesn’t work as expected? We built a test environment to find out. Through a caching system, we evaluated how semantically similar queries would behave. When the cache worked, response times were fast. When it didn’t, things got expensive. In fact, a single semantic cache miss increased latency by more than 2.5x.

Read Post

Catchpoint

Read more about Semantic Caching: What We Measured, Why It Matters

Is on-prem the top choice to run AI?

Jul 31, 2025 By Canonical Ubuntu In Canonical

‎‎Subscribe. Fuel your curiosity. In this episode, we break down what we’ve learned from teams running AI at scale, and why on-premises infrastructure is making a strong comeback. We’re seeing a shift: performance, cost control, data sovereignty, and platform flexibility are driving conversations about on-prem strategies for AI. No one-size-fits-all answers, but if you’re building or scaling AI, this might help you think a few steps ahead.

View Video

Canonical

Read more about Is on-prem the top choice to run AI?

Are you running AI the smart way?

Jul 31, 2025 By Canonical Ubuntu In Canonical

Data locality: AI models often rely on large datasets. Locating compute close to the data reduces transfer times and improves training performance. Latency sensitivity: Real-time AI applications, like recommendation systems or edge analytics, depend on low-latency environments. This can be more easily tuned in private or hybrid setups. Hardware specialization: Some AI workloads benefit from custom hardware like GPUs or TPUs. Private cloud allows more control over this, while public cloud offers broader access but less customization.

View Video