Meta releases the strongest open-source model Llama 3.1, Zuckerberg: it will become a turning point in the industry
胡胡胡美丽_ss
发表于 2024-7-24 09:38:16
232
0
0
On the evening of July 23rd Beijing time, Meta officially released the latest open-source model Llama 3.1 series, further narrowing the gap between open-source models and closed source models. Llama 3.1 includes three parameter scales of 8B, 70B, and 450B, with the 450B parameter model surpassing OpenAI's GPT-4o in multiple benchmark tests and comparable to leading closed source models such as Claude 3.5 Sonnet.
Meta founder and CEO Mark Zuckerberg also posted a blog on the official website at the same time to promote the release. He stated that Llama 3.1 version will be a turning point in the industry, and most developers will begin to primarily use open source. Open source AI is the future direction of development.
NVIDIA Senior Research Scientist Jim Fan congratulated the Meta team on X, stating, "The power of GPT-4 is in our hands, and this is a truly historic moment
In terms of specific details, the context windows of the three versions of Llama 3.1 have increased from 8K to 128K, a 16 fold expansion, and support 8 languages simultaneously. The Llama 3.1-405B model was trained using over 15 trillion tokens, and in order to achieve this training scale, the team used 16000 H100 GPUs. Officially, the 405B model is the first Llama model trained at this scale.
Open source large-scale language models often lag behind closed source models in terms of functionality and performance, but now we are ushering in a new era led by open source
In the official blog, Meta evaluated the performance of over 150 benchmark datasets and compared the performance of Llama 3.1 with other models. The flagship model Llama 3.1-405B is comparable to GPT-4, GPT-4o, and Claude 3.5 Sonnet in a range of tasks such as common sense, operability, and mathematics. In addition, the 8B and 70B small models are competitive with closed source and open source models with similar numbers of parameters.
In real-world scenarios, Llama 3.1 405B performed better overall than GPT-4o and Claude 3.5 Sonnet compared to manual evaluations.
Meta has also updated its open source license this time, allowing developers to use the output of the Llama model (including 405B) for the first time to improve other models. Compared to GPT-4o, the official statement states that they will also use a combination approach to integrate image, video, and voice functions into Llama 3, enabling the model to recognize images and videos and support interaction through voice. However, this feature is still under development and is not yet ready for release.
In the official blog, Meta stated that the total download volume of all Llama versions has exceeded 300 million times so far.
In addition to this model release, Zuckerberg also posted a long article on the official website titled "Open Source AI Is the Path Forward", which mentioned the importance of open source. He believes that open source is good for all developers, Meta, and the world.
Zuckerberg used the example of open-source system Linux defeating closed source system Unix, believing that artificial intelligence will develop in a similar way. Several technology companies are developing leading closed models, but open source is quickly narrowing the gap. He mentioned that last year, Llama 2 could only be compared to the old generation models. And this year, Llama 3 has competitiveness in some fields, even leading the most advanced models in some aspects.
Zuckerberg believes that open source can promote innovation, reduce costs, and improve security. For developers, using open source can train, fine tune, and distill their own models. Each organization has different needs, and it is best to use models of different sizes to meet these needs, which are trained or fine tuned with specific data.
Meanwhile, developers can avoid being locked into closed vendors to protect data security. Open source software is often more secure because its development is more transparent and can be widely reviewed, "said Zuckerberg.
Zuckerberg also mentioned that open-source models have lower costs and higher efficiency, allowing developers to run inference on Llama 3.1 405B on their own infrastructure at a cost of approximately 50% of using closed models like GPT-4o, suitable for user interface and offline inference tasks.
Open source artificial intelligence represents the world's best opportunity. In Zuckerberg's view, utilizing this technology can create the greatest economic opportunity and security.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Company Review | BeiGene has suffered continuous losses in the past 7 years, and its stock price is under pressure. Can the new CFO bring a turning point?
- Robin Lee's internal speech exposes that the open source model is not efficient enough to solve the problem of computing power
- Zuckerberg 'Explodes' AI Wearable Devices
- Multiple teams at Meta have reported layoffs, and Zuckerberg's' efficiency year 'is still ongoing
- Alibaba Tongyi Qianwen Code Model Qwen2.5-Coder Full Series Officially Open Source
- Alibaba CEO Wu Yongming: AI development requires a batch of open-source models of different scales and fields
- Relationship easing? Trump and Meta CEO Zuckerberg have dinner at Mar-a-Lago Estate
- Foreign media: Meta CEO Zuckerberg has dinner with Trump
- Meta CEO Zuckerberg invited to meet with Trump
- Open source securities: AI leads the rapid development of the education industry
-
生成式人工知能(AI)が巻き起こす技術の波の中で、電力会社は意外にも資本市場の寵児になった。 今年のスタンダード500割株の上昇幅ランキングでは、Vistraなどの従来の電力会社が注目を集め、株価が2倍になってリ ...
- xifangczy
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
隔夜株式市場 世界の主要指数は金曜日に多くが下落し、最新のインフレデータが減速の兆しを示したおかげで、米株3大指数は大幅に回復し、いずれも1%超上昇した。 金曜日に発表されたデータによると、米国の11月のPC ...
- SNT
- 前天 12:48
- 支持
- 反对
- 回复
- 收藏
-
長年にわたって、昔の消金大手の捷信消金の再編がようやく地に着いた。 天津銀行の発表によると、同行は京東傘下の2社、対外貿易信託などと捷信消金再編に参加する。再編が完了すると、京東の持ち株比率は65%に達し ...
- SNT
- 前天 12:09
- 支持
- 反对
- 回复
- 收藏
-
グーグルは現地時間12月19日、新しい「推理」モデルとしてGemini 2.0 Flash Thinkingを発売すると発表した。紹介によると、このモデルはまだ実験段階であり、訓練を経た後、モデルが反応を起こした時に経験した「思 ...
- 地下水
- 3 天前
- 支持
- 反对
- 回复
- 收藏