Tech Week Singapore 2025

Loading

From GPU Waste to Smart Scaling: Building Cost-Effective Private AI Infrastructure

08 Oct 2025
Productivity Optimisation & AI Adoption Theatre
From GPU Waste to Smart Scaling: Building Cost-Effective Private AI Infrastructure

While GPU hardware dominates AI infrastructure costs, most private deployments suffer from chronically low utilization rates due to static resource allocation. This session demonstrates how open-source elastic inference technology transforms GPU pools to serve multiple models dynamically, significantly reducing infrastructure costs while maintaining production-grade performance.

Speaker(s)
Yanzhen Yu, R&D Manager - Arcfra

2025 Sponsors

Platinum Sponsors



 

Silver Sponsor



 

Bronze Sponsor



 

VIP Lunch Partner




 

2027 Partners

Official Press Release Distribution Partner


 

Association Partner


 

Media Partner