Meta Releases 'Strongest Open Source Model', Opening a New Chapter in the Battle Between Open Source and Closed Source; Large Models May Face a Reshuffle
邹高清
Posted on 2024-7-28 10:40:19
On July 23 local time, Meta officially released Llama 3.1, the latest version of its language model. The AI community sees the release as a strong rebuttal to the view that open source models inevitably lag behind, and Meta founder and CEO Mark Zuckerberg stated at the release that "open source AI is the path to the future".
OpenAI has long been criticized for the closed nature of ChatGPT, with critics saying that although it is called "Open", what it actually does is "closed". Yet the strength of closed source large models, represented by GPT-4o, has often daunted the industry, as if the idea that "closed source large models must outperform open source ones" had become the default assumption.
The release of Llama 3.1 looks set to change this pattern. Meta has released Llama 3.1 in three sizes, 8B, 70B, and 405B, with 405B as the flagship version. Meta claims that its performance is comparable to that of the best closed source models.
The Strongest Open Source Model
Why can Llama 3.1 405B compete with the best closed source models? Alongside the release, Meta published a paper titled 'The Llama 3 Herd of Models', which describes the development of the Llama 3 models in detail.
First, in terms of usage, Llama 3.1 supports 8 languages, and the context window of all three versions has been extended to 128K tokens, the same as GPT-4 Turbo. Llama 3.1 405B has 405 billion parameters, was trained at a scale about 50 times that of Llama 2, and adopts a dense Transformer architecture for greater stability. With a 128K-token window, Llama can process up to roughly 96,000 words of text at once and can handle both long and short texts with ease.
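The 96,000-word figure is consistent with a common rule of thumb of about 0.75 English words per token. The short sketch below reproduces that arithmetic. The ratio and the function name `approx_words` are illustrative assumptions, not figures or names from Meta; the real ratio depends on the tokenizer and the language of the text.

```python
# Rough conversion from a context window measured in tokens to English words.
# WORDS_PER_TOKEN is an assumed rule of thumb, not a figure published by Meta.
WORDS_PER_TOKEN = 0.75


def approx_words(context_tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words fit in a window of `context_tokens` tokens."""
    return round(context_tokens * words_per_token)


if __name__ == "__main__":
    # "128K" read as 128,000 tokens, which matches the article's figure.
    print(approx_words(128_000))
    # Read as 128 * 1024 tokens instead, the estimate is slightly higher.
    print(approx_words(128 * 1024))
```

Either reading of "128K" lands close to the article's number, so the word count is best treated as an order-of-magnitude estimate rather than a hard limit.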
In the paper, Meta also published performance comparisons between Llama 3.1 405B and closed source models such as GPT-4o and Claude 3.5 Sonnet. The results show Llama 3.1 405B leading in several areas, including general performance, long-text processing, and multilingual processing. For example, on the ZeroSCROLLS benchmark, Llama 3.1 405B scored 95.2, while the latter two both scored 90.5.
Its strong performance and large training scale have earned Llama 3.1 the title of "the strongest open source large model". However, Llama 3.1 is currently still a model focused mainly on language processing and does not support images, video, or speech. This means that ChatGPT still holds an advantage in multimodal tasks.
Open Source AI Is the Path to the Future
The actual user experience of Llama may not yet be perfect, but the release of Llama 3.1 405B is of great significance to AI practitioners around the world, as it opens a new chapter in the contest between open source and closed source large models.
On Meta's official website, Zuckerberg published an open letter firmly proclaiming that "open source AI is the path to the future". In the letter, he stated that although several companies are developing leading closed source models, open source is rapidly narrowing the gap. Taking Llama as an example, last year Llama 2 could only match an older generation of models, but this year Llama 3 is competitive with the most advanced models and leads in some areas.
Zuckerberg therefore hopes to turn Llama into the Linux of the large-model era and make it the industry standard for open source AI. As he wrote: "In the early days of high-performance computing, major technology companies invested heavily in developing their own closed source versions of Unix... Today, open-source Linux has become the industry standard foundation for cloud computing and the operating systems that run most mobile devices, and I believe artificial intelligence will develop in a similar way."