Zyphra Launches AI Cloud Platform Powered by AMD Instinct™ MI355X GPUs
Zyphra has officially introduced Zyphra Cloud, a full-stack AI platform designed to bring cutting-edge innovations from Zyphra Research into real-world production. Built for developers, enterprises, and next-generation AI hyperscalers, the platform combines high-performance computing with advanced AI capabilities to streamline the development and deployment of intelligent systems.
Powered by AMD Instinct™ MI355X GPUs and running on TensorWave’s purpose-built infrastructure, Zyphra Cloud integrates model serving, agent infrastructure, and scalable compute into a unified ecosystem. This enables organizations to build, scale, and deploy AI applications efficiently—all within a single platform.
Zyphra Inference: Serverless AI for High-Performance Workloads
At launch, Zyphra Cloud introduces Zyphra Inference, a serverless inference service that provides seamless access to leading open-weight AI models such as DeepSeek V3.2, Kimi K2.6, and GLM 5.1.
Zyphra Inference is engineered for performance, leveraging:
- Custom AI kernels
- Advanced long-context inference algorithms
- Optimized parallelism techniques
These innovations deliver high-throughput and low-latency performance, making the platform ideal for demanding use cases like:
- Agentic coding
- Deep research workflows
- Long-horizon automation tasks
Bridging Research and Production AI
According to Zyphra CEO and Founder Krithik Puthalath, Zyphra Cloud represents the culmination of years of AI research and optimization on AMD infrastructure.
“Zyphra Cloud is the natural extension of our research. We’ve spent years building and validating AI systems on AMD, and now we’re bringing that capability to developers and enterprises through a production-ready platform.”
The platform is designed to deliver the performance, efficiency, and scalability required for real-world AI deployments.
AMD and TensorWave Enable Next-Gen AI Infrastructure
AMD continues to strengthen its position in AI infrastructure through open platforms and strategic collaborations.
“Zyphra Inference on AMD Instinct MI355X GPUs showcases how optimized AI software combined with our accelerator architecture delivers leading inference performance,” said Negin Oliver, Corporate VP of AI Business Development at AMD.
TensorWave plays a critical role by providing dedicated, high-performance compute infrastructure tailored for AI-native companies.
“Our mission is to give companies like Zyphra the AMD-powered infrastructure they need to scale without compromise,” said Jeff Tatarchuk, Co-Founder and Chief Growth Officer at TensorWave.
Why Zyphra Cloud Matters
Zyphra Cloud stands out in the rapidly evolving AI landscape by offering:
- A unified AI development and deployment platform
- Serverless inference for scalability and flexibility
- Support for frontier open-weight models
- Infrastructure optimized for long-context and high-performance AI workloads
As demand for scalable AI solutions continues to grow, Zyphra Cloud positions itself as a powerful platform for organizations looking to accelerate innovation and deploy advanced AI systems at scale.

