NVIDIA and Hugging Face’s Latest Offer: “Training Cluster as a Service”

Dateline SIGGRAPH: GPU manufacturer NVIDIA joined open-source machine learning specialist Hugging Face to announce “Training Cluster as a Service.” Since 1974, the Association for Computing Machinery (ACM) has convened the annual conference of its Special Interest Group on Computer Graphics and Interactive Techniques (abbreviated SIGGRAPH). Attendance is approaching pre-pandemic levels (over 14,000), with attendees drawn by presentations of academic papers and by a trade show and exhibition of the latest developments in computer graphics, animation, visual effects, and interactive experiences.

The partnership leverages NVIDIA’s DGX Cloud, an “AI Supercomputing Instance” now running on Oracle Cloud Infrastructure, to offer enterprises a platform for fine-tuning large language models (LLMs) using data from their proprietary knowledge bases. We have written in previous posts about the trend toward “Be Your Own LLM.” Fine-tuning an existing model like OpenAI’s ChatGPT or Meta’s open-source Llama 2 is one strategy, though it requires significant expertise and computing resources. The Hugging Face / NVIDIA partnership makes the fine-tuning process more accessible.

Addressing Complexity in Fine-tuning LLMs

Hugging Face’s “Training Cluster as a Service” introduces a service-oriented approach to AI model customization by integrating NVIDIA’s DGX Cloud infrastructure. The platform gives developers working in Hugging Face’s console and tools access to optimized AI computing resources, dramatically streamlining the training and fine-tuning of LLMs. For enterprises with substantial proprietary data, the chief attraction is the ability to fine-tune LLMs to meet specific business requirements. The service aims to simplify the integration of in-house data, enabling companies to adapt AI models to industry-specific applications such as chatbots, search algorithms, and content summarization.
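To make the data-integration step concrete, here is a minimal sketch, assuming the open-source Hugging Face datasets and transformers libraries rather than the service’s own console, of how in-house text might be packaged for fine-tuning. The file internal_kb.jsonl and its text field are hypothetical placeholders, and the Llama 2 checkpoint shown requires gated access on the Hugging Face Hub.

```python
# Minimal sketch: packaging proprietary documents for fine-tuning.
# "internal_kb.jsonl" and its "text" field are hypothetical placeholders.
from datasets import load_dataset
from transformers import AutoTokenizer

# Load in-house documents stored as JSON Lines, one record per document.
dataset = load_dataset("json", data_files="internal_kb.jsonl", split="train")

# Tokenize with the base model's tokenizer so inputs match its vocabulary.
# (meta-llama/Llama-2-7b-hf is a gated checkpoint on the Hugging Face Hub.)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
tokenized = dataset.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=1024),
    batched=True,
    remove_columns=dataset.column_names,
)
```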

The process of fine-tuning LLMs entails a complex interplay of factors, including data preprocessing, hyperparameter optimization, and validation. Proficiency in these areas is essential to achieving optimal results. The collaboration between Hugging Face and NVIDIA acknowledges these challenges and provides a platform supported by the AI supercomputing capabilities of DGX Cloud. This support can potentially alleviate the expertise and resource constraints associated with LLM customization.
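Continuing the sketch above, the snippet below illustrates that interplay using Hugging Face’s Trainer: a held-out validation split, a handful of hyperparameters, and per-epoch evaluation. The model choice and hyperparameter values are illustrative starting points, not settings prescribed by the service.

```python
# Hedged sketch of the fine-tuning interplay: validation split,
# hyperparameters, and causal-LM training. Builds on `tokenized` above.
from transformers import (
    AutoModelForCausalLM,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

# Hold out 10% of the data so fine-tuning quality can be validated.
splits = tokenized.train_test_split(test_size=0.1, seed=42)

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

args = TrainingArguments(
    output_dir="finetuned-llm",
    num_train_epochs=3,               # illustrative values; real runs need tuning
    learning_rate=2e-5,
    per_device_train_batch_size=4,
    evaluation_strategy="epoch",      # validate once per epoch
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=splits["train"],
    eval_dataset=splits["test"],
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

In practice, values like the learning rate and epoch count typically require several tuning rounds against the validation set, which is exactly the kind of iteration that dedicated DGX Cloud capacity is meant to accelerate.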

The Product of a Year of Collaboration

Collaboration between Hugging Face and NVIDIA dates back to May 2022, when the two companies hosted a joint workshop on “AI for Natural Language Processing.” It was one of the first opportunities for developers on both platforms to discuss how to make it easier to build and deploy AI models. It is also worth noting that RIVA, NVIDIA’s packaged resource supporting Automated Speech Recognition (ASR), Text-to-Speech (TTS), and Conversational AI, has figured prominently in the partnership since mid-2022. Together, NVIDIA and Hugging Face offer enterprise developers an avenue to explore refined AI model development without requiring extensive technical knowledge or dedicated AI supercomputing resources. The service intends to facilitate the integration of proprietary data into AI model development pipelines, potentially enhancing the applicability of LLMs in a range of industry-specific contexts, including both voicebots and chatbots.


