VAST Undivided Attention

News: VAST revolutionizes AI performance – and Aixia delivers the solution

VAST Data has just launched VUA (VAST Undivided Attention) – a new open software technology that greatly improves the speed and efficiency of AI processing. As a VAST partner, Aixia is proud to offer this groundbreaking solution to our customers.

What is it all about?

When AI models, such as large language models (LLMs), generate text and analytics, huge amounts of data (so-called tokens) are created in real time. These tokens normally need to be stored in the server’s GPU memory to avoid time-consuming recalculations. The problem is that the GPU memory quickly becomes full – slowing down the whole process.

VAST’s VUA solves this by cleverly storing these tokens on lightning-fast NVMe-connected SSDs. This gives GPU servers access to significantly more “virtual” memory, without sacrificing performance. This means AI services can scale up faster, handling more complex queries while reducing both response times and hardware costs.

What does it mean for a CFO?

  • Shorter response times = better user experience and competitive advantage.

  • Less need to buy more expensive GPUs.

  • Increased efficiency and lower TCO (Total Cost of Ownership) of AI infrastructure.

What it means for a technician:

  • VUA creates a new cache layer between GPU, CPU and NVMe, integrated with GPUDirect.

  • Global, shared cache that can handle billions of tokens and minimize cache misses.

  • 292% faster token generation in tests – and support for the increasingly large AI models of the future.

With VUA from VAST, companies can take their AI initiatives to the next level, while optimizing their infrastructure investments. Aixia helps you implement and customize the solution to your needs – from consulting to full operation.
Want to know more about how VUA can accelerate your AI strategy? Contact us at Aixia!

Latest News

The five mistakes we see over and over again when organizations run AI in the cloud

Five mistakes we see over and over again when organizations run AI in the cloud – from TCO calculations that…
Read more

AI in manufacturing: the pilot projects are over

Fictiv and MISUMI’s new report shows that AI adoption in manufacturing has jumped from 87% to 93%. But the pilot…
Read more

Pentagon invests $13.4 billion in AI – and it’s not just about autonomous weapons

The Pentagon is investing $13.4 billion in AI – but it’s not about drones. It’s about decision-making, sensor fusion and…
Read more

AiQu: the infrastructure that takes AI from promising pilot to actual production

Scaling AI is more about infrastructure than algorithms. AiQu doesn’t lock you to one vendor – supporting NVIDIA, AMD, Intel…
Read more