Graphics Processing Unit (GPU) stories
Red Hat launches llm-d, an open source project for scalable generative AI inference, backed by Google Cloud, IBM, NVIDIA and others to support any model, hardware or cloud.
NVIDIA launches NVLink Fusion, a silicon tech enabling partners to build bespoke AI infrastructure, integrating CPUs from Fujitsu and Qualcomm with NVIDIA GPUs.
NVIDIA and Foxconn partner with Taiwan to build an AI supercomputer using 10,000 Blackwell GPUs, boosting research and innovation across industries.
Google Cloud has launched significant updates to its Kubernetes Engine, enhancing capabilities for managing large-scale AI workloads and improving resource efficiency.
CMTG has revealed an AUD $1 million upgrade to its Morley data centre, enhancing infrastructure to meet growing technological demands and sustainability goals.
Pure Storage has integrated the NVIDIA AI Data Platform into its FlashBlade platform, boosting capabilities for enterprises seeking scalable AI solutions.
NVIDIA has launched its DGX SuperPOD infrastructure with Blackwell Ultra GPUs, revolutionising AI processing for businesses amid fierce competition for AI capabilities.
MSI has unveiled new AI servers, the CG480-S5063 and CG290-S3063, leveraging NVIDIA MGX architecture to enhance enterprise and cloud data centre performance.
NVIDIA has unveiled Dynamo, an open-source tool aimed at enhancing AI reasoning model efficiency, optimising GPU resources and reducing operational costs.
NVIDIA has launched its DGX Spark and DGX Station AI desktops, bringing advanced capabilities for AI research directly to users' workspaces.
Pure Storage has unveiled its new FlashBlade//EXA data storage platform, aiming to resolve legacy system bottlenecks for AI and HPC workloads across Asia Pacific.
Compuware Technology has launched the CPR-6622-1M1, a powerful 6,600W dual-output power supply designed for AI and GPU server applications.
Akamai Technologies has unveiled a Managed Container Service, enhancing user experiences by running workloads closer to users in more than 700 cities.
SambaNova has achieved a breakthrough in AI deployment, reporting a speed of 198 tokens per second for the DeepSeek-R1 model and improving processing efficiency.
As US export limits on GPUs ignite concern, a LIAN Group co-founder believes EU leaders must bolster their semiconductor industry for AI growth by 2025.
Haoyuan Li, CEO of Alluxio, predicts that by 2025, multi-modal training and innovative data strategies will reshape AI infrastructure and its deployment.
Aligned partners with Lambda to build AI-ready DFW-04 data centre in Plano, Texas, featuring advanced NVIDIA GPU cooling and Lambda's AI Cloud platform.
NVIDIA has unveiled a new research centre in Boston focused on integrating quantum computing with AI supercomputers, promising advancements in the field.
HighPoint Technologies has launched its exclusive RocketStor PCIe GPU Expansion chassis, offering unprecedented support for Gen5 and Gen4 solutions.
Vultr has become the first cloud provider to deploy AMD Instinct MI325X GPUs at its Chicago data centre, enhancing AI capabilities for businesses.