Meta releases open-source big model Llama 3.1 with strong support from Nvidia
网事大话每
发表于 2024-7-24 13:05:30
1239
0
0
Science and Technology Innovation Board Daily, July 24th (Reporter Zhang Yangyang) Zuckerberg will continue to open source big models to the end.
Early this morning, Meta officially released the new generation of open-source big model Llama 3.1 series, which includes three versions: 8B, 70B, and 405B, with a maximum context increase of 128k.
Meta founder Mark Zuckerberg also posted on the official website to strongly endorse his own model. He said that most leading technology companies and scientific research today are built on open source software, which is the direction for AI to move forward, and Meta is moving towards becoming the industry standard for open source AI.
It should be emphasized that in the technology industry, the dispute over open source and closed source has a long history. Critics argue that open source conceals a lack of technological originality and only makes simple adjustments to the open source model, rather than substantive innovation. Robin Lee, the founder of Baidu, even said that the open source model has value in academic research, teaching and other specific scenarios, but it is not applicable to most application scenarios. Supporters believe that customized improvements based on mature open source architectures are the norm of technological development, which can drive rapid innovation and progress in technology.
In the field of big models, there is often a comparison of the advantages and disadvantages between open source and closed source big models. So far, open-source models have mostly lagged behind closed models in terms of functionality and performance. But with the release of Llama 3.1, there may be a new round of intense competition between open source and closed source big models.
According to benchmark data provided by Meta, Llama 3.1 has 405 billion parameters, making it one of the largest large-scale language models in recent years. This model is trained on 15 trillion tokens and over 16000 H100 GPUs, making it the first Llama model in Meta's history to be trained on this scale. Meta states that in terms of advanced features such as common sense, manipulability, mathematics, tool usage, and multilingual translation, Llama 3.1 is sufficient to benchmark top closed source big models such as GPT-4o and Claude 3.5Sonnet.
Llama 3.1 is now available for download on the Meta official website and Hugging Face. The latest data shows that the total download volume of all Llama versions has exceeded 300 million times.
At the same time on the same day, Nvidia also launched a combination training service, providing strong assists for Llama 3.1.
The reporter from the Science and Technology Innovation Board Daily learned from Nvidia that Nvidia has officially launched new NVIDIA AI Foundry services and NVIDIA NIM inference microservices. NVIDIA AI Foundry is driven by the NVIDIA DGX Cloud AI platform, which is jointly designed by NVIDIA and public cloud and can provide enterprises with a large amount of computing power resources.
NVIDIA AI Foundry and NVIDIA NIM are used together with the Llama 3.1 series open source models, allowing enterprises to create custom "super models" for their specific industry use cases. Enterprises can also use their own data and synthetic data generated by Llama 3.1 405B and NVIDIA Nemotron Reward models to train these super models.
Nvidia founder and CEO Huang Renxun stated that Meta's Llama 3.1 open-source model marks a critical moment for global enterprises to adopt generative AI. Llama 3.1 will ignite a wave of enterprises and industries creating advanced generative AI applications. NVIDIA AI Foundry has integrated Llama 3.1 throughout the entire process and is able to assist enterprises in building and deploying custom Llama hypermodels.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Over 10000 Nvidia Blackwell chips have been delivered to Huang Renxun in response to tariff issues
- Financial analysis: Nvidia's Q4 performance guidance falls short of the highest expectations, and the stock price fell more than 5% after the market closed
- Global Finance: Market pays attention to Nvidia's performance. The three major stock indexes of the New York Stock Exchange fluctuated on the 20th
- Nvidia's Q4 performance guidance fell short of the highest expectations, and its stock price fell more than 5% after hours
- Alibaba CEO Wu Yongming: AI development requires a batch of open-source models of different scales and fields
- Nvidia's third quarter revenue reached $35.082 billion
- NVIDIA's performance growth slows down, Huang Renxun steps in to 'appease' the market! Analyst: Investors Underestimate Demand for Blackwell Chips
- Nvidia's Q4 performance guidance falls short of the highest expected stock price, with a drop of over 5% after the market closed
- The stock price has skyrocketed by 33%! Snowflakes overshadow Nvidia analysts: AI software outperforms semiconductors or trends
- The three major US stock indices collectively closed higher, while the Dow Jones Industrial Average rose more than 1%. Nvidia's stock price hit a new intraday high
-
11月21日、2024世界インターネット大会烏鎮サミットで、創業者、CEOの周源氏が大会デジタル教育フォーラムとインターネット企業家フォーラムでそれぞれ講演、発言したことを知っている。周源氏によると、デジタル教 ...
- 不正经的工程师
- 3 小时前
- 支持
- 反对
- 回复
- 收藏
-
アリババは、26億5000万ドルのドル建て優先無担保手形と170億元の人民元建て優先無担保手形の定価を発表した。ドル債の発行は2024年11月26日に終了する予定です。人民元債券の発行は2024年11月28日に終了する予定だ ...
- SOGO
- 前天 09:05
- 支持
- 反对
- 回复
- 收藏
-
スターバックスが中国事業の株式売却の可能性を検討していることが明らかになった。 11月21日、外国メディアによると、スターバックスは中国事業の株式売却を検討している。関係者によると、スターバックスは中国事 ...
- 献世八宝掌
- 昨天 16:29
- 支持
- 反对
- 回复
- 收藏
-
【意法半導体CEO:中国市場は非常に重要で華虹と協力を展開】北京時間11月21日、意法半導体(STM.N)は投資家活動の現場で、同社が中国ウェハー代工場の華虹公司(688347.SH)と協力していると発表した。伊仏半導体 ...
- 黄俊琼
- 昨天 14:29
- 支持
- 反对
- 回复
- 收藏