At NVIDIA GTC, Hewlett Packard Enterprise announced updates to its comprehensive AI-native portfolios to advance the operationalization of generative AI (GenAI), deep learning, and machine learning (ML) applications.
Several notable updates have been introduced in this release. Firstly, users now have access to two cutting-edge solutions co-engineered by HPE and NVIDIA, encompassing the full GenAI stack. Additionally, a preview of the HPE Machine Learning Inference Software is available, offering a glimpse into its capabilities.
Furthermore, an enterprise-ready retrieval-augmented generation (RAG) reference architecture has been unveiled, providing a robust framework for enhanced performance. Lastly, support has been extended for the development of forthcoming products leveraging the new NVIDIA Blackwell platform, promising advancements in future offerings.
Advertisement
“To deliver on the promise of GenAI and effectively address the full AI lifecycle, solutions must be hybrid by design,” said Antonio Neri, president and CEO at HPE. “From training and tuning models on-premises, in a colocation facility or the public cloud, to inferencing at the edge, AI is a hybrid cloud workload. HPE and NVIDIA have a long history of collaborative innovation, and we will continue to deliver co-designed AI software and hardware solutions that help our customers accelerate the development and deployment of GenAI from concept into production.”
“Generative AI can turn data from connected devices, data centers and clouds into insights that can drive breakthroughs across industries," said Jensen Huang, founder and CEO at NVIDIA. "Our growing collaboration with HPE will enable enterprises to deliver unprecedented productivity by leveraging their data to develop and deploy new AI applications to transform their businesses.”
Advertisement
Announced at SC23, HPE’s supercomputing solution for generative AI is now available to order for organizations seeking a preconfigured and pretested full-stack solution for the development and training of large AI models. Purpose-built to help customers accelerate GenAI and deep learning projects, the turnkey solution is powered by NVIDIA and can support up to 168 NVIDIA GH200 Grace Hopper Superchips.
The solution enables large enterprises, research institutions, and government entities to streamline the model development process with an AI/ML software stack that helps customers accelerate GenAI and deep learning projects, including LLMs, recommender systems, and vector databases. Delivered with services for installation and set-up, this turnkey solution is designed for use in AI research centers and large enterprises to realize improved time-to-value and speed up training by 2-3X.
Previewed at Discover Barcelona 2023, HPE’s enterprise computing solution for generative AI is now available to customers directly or through HPE GreenLake with a flexible and scalable pay-per-use model. Co-engineered with NVIDIA, the pre-configured fine-tuning and inference solution is designed to reduce ramp-up time and costs by offering the right compute, storage, software, networking, and consulting services that organizations need to produce GenAI applications.
Featuring a high-performance AI compute cluster and software from HPE and NVIDIA, the solution is ideal for lightweight fine-tuning of models, RAG, and scale-out inference. The fine-tuning time for a 70 billion parameter Llama 2 model running this solution decreases linearly with node count, taking six minutes on a 16-node system.
Advertisement
The speed and performance enable customers to realize faster time-to-value by improving business productivity with AI applications like virtual assistants, intelligent chatbots, and enterprise search.
To address the AI skills gap, HPE Services experts will help enterprises design, deploy, and manage the solution, which includes applying appropriate model tuning techniques. For more information or to order it today, visit HPE’s enterprise computing solution for generative AI.