首页 › News › 正文

Nvidia Open Source 340 Billion Parameter Model Nemotron-4 340B

海角七号发表于 2024-6-15 16:13:55

284 0 0

On June 14th local time, Nvidia opened up the Nemotron-4 340B (340 billion parameter) series model. According to NVIDIA, developers can use this series of models to generate synthetic data for training Large Language Models (LLMs) for commercial applications in healthcare, finance, manufacturing, retail, and other industries.
The Nemotron-4 340B includes the base model, instruction model, and reward model. Nvidia used 9 trillion tokens (text units) for training. In common sense reasoning tasks such as ARC-c, MMLU, and BBH benchmark tests, Nemotron-4 340B-Base can be comparable to Llama-3 70B, Mixture 8x22B, and Qwen-2 72B models.

CandyLake.com 系信息发布平台，仅提供信息存储空间服务。
声明：该文观点仅代表作者本人，本文不代表CandyLake.com立场，且不构成建议，请谨慎对待。

支持

反对

转播

Nvidia Open Source 340 Billion Parameter Model Nemotron-4 340B

浏览过的版块