NVIDIA brings large language AI models to enterprises
NVIDIA is enabling enterprises to build their own domain-specific chatbots, personal assistants and other sophisticated AI applications. The company has unveiled the NVIDIA NeMo Megatron framework for training language models with trillions of parameters. The technology includes the Megatron 530B customisable large language model (LLM) that can be trained for new domains and languages, and the NVIDIA Triton Inference Server with multi-GPU, multinode distributed inference functionality. Combined with NVIDIA DGX systems, these tools provide a production-ready, enterprise-grade solution to simplify the development and deployment of large language models. The framework is optimised to scale out across the large-scale accelerated computing offered by the NVIDIA DGX SuperPOD. “Large language models have proven to be flexible and capable, able to answer deep domain questions, translate languages, comprehend and summarise documents, write stories and compute programs, all without ...