VAST Undivided Attention – storage for AI and HPC

News: VAST revolutionizes AI performance – and Aixia delivers the solution

VAST Data has just launched VUA (VAST Undivided Attention) – a new open software technology that greatly improves the speed and efficiency of AI processing. As a VAST partner, Aixia is proud to offer this groundbreaking solution to our customers.

What is it all about?

When AI models such as large language models (LLMs) generate text and analytics, they create large amounts of intermediate data for every token they process, in real time. This per-token data (the model’s key-value cache) normally needs to be kept in the server’s GPU memory to avoid time-consuming recalculations. The problem is that GPU memory fills up quickly, which slows down the whole process.
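To get a feel for why GPU memory fills up, the per-token data (usually called the key-value or KV cache) can be estimated with simple arithmetic: one key vector and one value vector per token, per layer, per attention head. The sketch below is only an illustration; the function name and the model shape are assumptions, not figures from VAST.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, num_tokens, bytes_per_value=2):
    """Estimate KV cache size in bytes.

    The factor 2 accounts for storing both a key and a value vector
    per token, per layer, per KV head. bytes_per_value=2 assumes 16-bit floats.
    """
    return 2 * num_layers * num_kv_heads * head_dim * num_tokens * bytes_per_value


# Hypothetical model shape, chosen only for illustration.
size = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128, num_tokens=128_000)
print(f"{size / 1e9:.2f} GB for a single 128,000-token context")
```

With these assumed numbers, a single long context needs on the order of 16 GB, so a handful of concurrent users can consume the memory of an entire GPU.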

VAST’s VUA solves this by storing the cached token data on fast NVMe-connected SSDs. This gives GPU servers access to significantly more “virtual” memory without sacrificing performance. As a result, AI services can scale up faster and handle more complex queries while reducing both response times and hardware costs.
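The general idea of spilling cached data to a larger, slower tier instead of throwing it away can be sketched in a few lines. This is a toy model and not VUA’s implementation or API: the class `TieredKVCache`, its methods, and the in-memory dictionary standing in for NVMe storage are all hypothetical.

```python
from collections import OrderedDict


class TieredKVCache:
    """Toy two-tier cache: a small fast tier (stand-in for GPU memory)
    backed by a larger slow tier (stand-in for NVMe SSDs)."""

    def __init__(self, fast_capacity):
        self.fast_capacity = fast_capacity
        self.fast = OrderedDict()  # least recently used entry comes first
        self.slow = {}             # hypothetical stand-in for NVMe storage
        self.recomputed = 0        # how often the expensive path was taken

    def get(self, prefix, compute):
        if prefix in self.fast:            # hit in the fast tier
            self.fast.move_to_end(prefix)
            return self.fast[prefix]
        if prefix in self.slow:            # hit in the slow tier: promote it
            value = self.slow.pop(prefix)
        else:                              # miss in both tiers: recompute
            value = compute(prefix)
            self.recomputed += 1
        self._put_fast(prefix, value)
        return value

    def _put_fast(self, prefix, value):
        self.fast[prefix] = value
        self.fast.move_to_end(prefix)
        while len(self.fast) > self.fast_capacity:
            old_key, old_value = self.fast.popitem(last=False)
            self.slow[old_key] = old_value  # spill instead of discarding


# Usage: the fast tier holds two entries, yet "a" is not recomputed
# on its second request because it was spilled to the slow tier.
cache = TieredKVCache(fast_capacity=2)
for prompt in ["a", "b", "c", "a"]:
    cache.get(prompt, lambda p: f"kv({p})")
print(cache.recomputed)
```

Without the slow tier, the fourth request would trigger a fourth recomputation; with it, the cached data is read back. That is the trade the article describes: a read from fast storage in place of a recalculation on the GPU.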

What does it mean for a CFO?

  • Shorter response times = better user experience and competitive advantage.

  • Less need to buy more expensive GPUs.

  • Increased efficiency and lower TCO (Total Cost of Ownership) of AI infrastructure.

What does it mean for a technician?

  • VUA creates a new cache layer between GPU, CPU and NVMe, integrated with GPUDirect.

  • Global, shared cache that can handle billions of tokens and minimize cache misses.

  • 292% faster token generation in tests – and support for the increasingly large AI models of the future.

With VUA from VAST, companies can take their AI initiatives to the next level, while optimizing their infrastructure investments. Aixia helps you implement and customize the solution to your needs – from consulting to full operation.

Want to know more about how VUA can accelerate your AI strategy? Contact us at Aixia!
