Skip to Content Facebook Feature Image

WEKA Debuts New Solution Blueprint to Simplify AI Inferencing at Scale

Business

WEKA Debuts New Solution Blueprint to Simplify AI Inferencing at Scale
Business

Business

WEKA Debuts New Solution Blueprint to Simplify AI Inferencing at Scale

2024-11-20 03:06 Last Updated At:03:25

WARRP Reference Architecture Provides Comprehensive Modular Solution That Accelerates the Development of RAG-based Inferencing Environments

ATLANTA and CAMPBELL, Calif., Nov. 20, 2024 /PRNewswire/ -- From Supercomputing 2024: WEKA, the AI-native data platform company, debuted a new reference architecture solution to simplify and streamline the development and implementation of enterprise AI inferencing environments. The WEKA AI RAG Reference Platform (WARRP) provides generative AI (GenAI) developers and cloud architects with a design blueprint for the development of a robust inferencing infrastructure framework that incorporates retrieval-augmented generation (RAG), a technique used in the AI inference process to enable large language models (LLMs) to gather new data from external sources.

The Criticality of RAG in Building Safe, Reliable AI Operations
According to a recent study of global AI trends conducted by S&P Global Market Intelligence, GenAI has rapidly emerged as the most highly adopted AI modality, eclipsing all other AI applications in the enterprise.[1]

A primary challenge enterprises face when deploying LLMs is ensuring they can effectively retrieve and contextualize new data across multiple environments and from external sources to aid in AI inference. RAG is the leading technique for AI inference, and it is used to enhance trained AI models by safely retrieving new insights from external data sources. Using RAG in the inferencing process can help reduce AI model hallucinations and improve output accuracy, reliability and richness, reducing the need for costly retraining cycles.

However, creating robust production-ready inferencing environments that can support RAG frameworks at scale is complex and challenging, as architectures, best practices, tools, and testing strategies are still rapidly evolving.

A Comprehensive Blueprint for Inferencing Acceleration
With WARRP, WEKA has defined an infrastructure-agnostic reference architecture that can be leveraged to build and deploy production-quality, high-performance RAG solutions at scale.

Designed to help organizations quickly build and implement RAG-based AI inferencing pipelines, WARRP provides a comprehensive blueprint of modular components that can be used to quickly develop and deploy a world-class AI inference environment optimized for workload portability, distributed global data centers and multicloud environments.

The WARRP reference architecture builds on WEKA® Data Platform software running on an organization's preferred cloud or server hardware as its foundational layer. It then incorporates class-leading enterprise AI frameworks from NVIDIA — including NVIDIA NIMâ„¢ microservices and NVIDIA NeMoâ„¢ Retriever, both part of the NVIDIA AI Enterprise software platform — advanced AI workload and GPU orchestration capabilities from Run:ai and popular commercial and open-source data management software technologies like Kubernetes for data orchestration, and Milvus Vector DB for data ingestion.

"As the first wave of generative AI technologies began moving into the enterprise in 2023, most organizations' compute and data infrastructure resources were focused on AI model training. As GenAI models and applications have matured, many enterprises are now preparing to shift these resources to focus on inferencing but may not know where to begin," said Shimon Ben-David, chief technology officer at WEKA. "Running AI inferencing at scale is extremely challenging. We are developing the WEKA AI RAG Architecture Platform on leading AI and cloud infrastructure solutions from WEKA, NVIDIA, Run:ai, Kubernetes, Milvus, and others to provide a robust production-ready blueprint that streamlines the process of implementing RAG to improve the accuracy, security and cost of running enterprise AI models."

WARRP delivers a flexible, modular framework that can support a variety of LLM deployments, offering scalability, adaptability, and exceptional performance in production environments. Key benefits include:

"As AI adoption accelerates, there is a critical need for simplified ways to deploy production workloads at scale. Meanwhile, RAG-based inferencing is emerging as an important frontier in the AI innovation race, bringing new considerations for an organization's underlying data infrastructure," said Ronen Dar, chief technology officer at Run:ai. "The WARRP reference architecture provides an excellent solution for customers building an inference environment, providing an essential blueprint to help them develop quickly, flexibly and securely using industry-leading components from NVIDIA, WEKA and Run:ai to maximize GPU utilization across private, public and hybrid cloud environments. This combination is a win-win for customers who want to outpace their competition on the cutting edge of AI innovation."

"Enterprises are looking for a simple way to embed their data to build and deploy RAG pipelines," said Amanda Saunders, director of Enterprise Generative AI software, NVIDIA. "Using NVIDIA NIM and NeMo with WEKA, will give enterprise customers a fast path to develop, deploy and run high-performance AI inference and RAG operations at scale."

The first release of the WARRP reference architecture is now available for free download. Visit https://www.weka.io/resources/reference-architecture/warrp-weka-ai-rag-reference-platform/ to obtain a copy.

Supercomputing 2024 attendees can visit WEKA in Booth #1931 for more details and a demo of the new solution. 

Supporting AI Cloud Service Provider Quotes

Applied Digital
"As companies increasingly harness advanced AI and GenAI inferencing to empower their customers and employees, they recognize the benefits of leveraging RAG for greater simplicity, functionality and efficiency," said Mike Maniscalco, chief technology officer at Applied Digital. "WEKA's WARRP stack provides a highly useful reference framework to deliver RAG pipelines into a production deployment at scale, supported by powerful NVIDIA technology and reliable, scalable cloud infrastructure."

Ori Cloud
"Leading GenAI companies are running on Ori Cloud to train the world's largest LLMs and achieving maximum GPU utilization thanks to our integration with the WEKA Data Platform," said Mahdi Yahya, founder and chief executive officer at Ori Cloud. "We look forward to working with WEKA to build robust inference solutions using the WARRP architecture to help Ori Cloud customers maximize the benefits of RAG pipelines to accelerate their AI innovation."

Yotta
"To run AI effectively, speed, flexibility, and scalability are required. Yotta's AI solutions, powered by NVIDIA GPUs and built on the WEKA Data Platform, are helping organizations to push the boundaries of what's possible in AI, offering unparalleled performance and flexible scale," said Sunil Gupta, chief executive officer at Yotta. "We look forward to collaborating with WEKA to further enhance our Inference-as-a-Service offerings for natural-language processing, computer vision, and generative AI leveraging the WARRP reference architecture and NVIDIA NIM microservices."

About WEKA 
WEKA is architecting a new approach to the enterprise data stack built for the AI era. The WEKA® Data Platform sets the standard for AI infrastructure with a cloud and AI-native architecture that can be deployed anywhere, providing seamless data portability across on-premises, cloud, and edge environments. It transforms legacy data silos into dynamic data pipelines that accelerate GPUs, AI model training and inference, and other performance-intensive workloads, enabling them to work more efficiently, consume less energy, and reduce associated carbon emissions. WEKA helps the world's most innovative enterprises and research organizations overcome complex data challenges to reach discoveries, insights, and outcomes faster and more sustainably – including 12 of the Fortune 50. Visit www.weka.io to learn more or connect with WEKA on LinkedIn, X, and Facebook.

WEKA and the WEKA logo are registered trademarks of WekaIO, Inc. Other trade names used herein may be trademarks of their respective owners. 

[1] 2024 Global Trends in AI, September 2024, S&P Global Market Intelligence

** The press release content is from PR Newswire. Bastille Post is not involved in its creation. **

WEKA Debuts New Solution Blueprint to Simplify AI Inferencing at Scale

WEKA Debuts New Solution Blueprint to Simplify AI Inferencing at Scale

NINGBO, China, Nov. 20, 2024 /PRNewswire/ -- On November 13, the 2024 United Nations International Procurement Seminar (IPS) commenced in Ningbo, marking the first time this authoritative, high-profile and professional conference has been held in Asia. The event brought together 50 senior procurement officials and representatives from 16 UN agencies, alongside over 200 supplier representatives from both domestic and international markets.

The selection of Ningbo as the venue prompts the question of why the UN chose Asia, and specifically China, for this significant event. A representative from the UN Development Programme (UNDP) office in China encapsulated the sentiment with the phrase "mutual engagement."

In recent years, the total value of UN procurement has steadily climbed, reaching as high as $29.6 billion. Traditionally, the IPS has been held primarily in Europe three times a year. As the UN seeks to innovate its procurement methods and achieve sustainable development goals, it has increasingly turned its attention to China.

"China has a diverse industrial landscape, offering high-quality products and services at competitive prices. By increasing its procurement of Chinese goods, the UN can enhance the efficiency of its funding use, allowing more developing countries to benefit from UN development aid projects. This aligns with our mutual interests," noted an official from the Ministry of Commerce's Department of International Trade and Economic Relations.

Ningbo has consistently emphasized the importance of UN procurement, organizing various initiatives for local enterprises to engage in UN procurement activities in recent years. In September last year, the city hosted the China (Ningbo) UN Procurement Promotion Conference, helping nearly 110 companies successfully register on the official United Nations Global Marketplace (UNGM) website.

Following this event, the UNDP office in China expressed gratitude, recognizing Ningbo's thriving manufacturing and foreign trade service sectors as having substantial potential to provide innovative solutions for UN procurement. "Ningbo serves as both a catalyst and a bridge, linking China with the world and the UN," stated Neris M. Baez, Director of the UN Secretariat Procurement Division.

The UN procurement system offers several advantages, including duty exemptions, low exchange rate risks, market stability, and avoidance of trade barriers. These factors can assist in opening markets in countries and regions involved in the Belt and Road Initiative, generating potential benefits and spillover effects.

"China possesses a comprehensive and robust industrial and supply chain and is a globally influential logistics hub. However, we currently lack the necessary channels to connect with international high standards. The key to expanding institutional openness lies in mastering international high-standard trade regulations," emphasized a representative from the Ningbo Committee of the China Council for the Promotion of International Trade.

If this seminar enables participants to become familiar with international high-standard trade regulations, it could establish a robust support system for engaging in UN procurement. This would create new opportunities in the international public procurement sector, empowering enterprises to secure more international orders through the UNGM website. From this perspective, the seminar represents an important "icebreaking journey."

Contact: Sun Jiali
Tel.: 0086-18069269225
E-mail: 409189530@qq.com

** The press release content is from PR Newswire. Bastille Post is not involved in its creation. **

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

UN International Procurement Seminar Held in Asia for the First Time: A Collaborative Initiative Between the UN and Ningbo

Recommended Articles