Crush all opponents? Google releases a lightweight open source model that can run on laptops
老蟹2017
发表于 2024-2-22 13:16:00
1213
0
0
The open source big model track welcomes a heavyweight new product.
On February 21st local time, Google announced the official launch of a new open source big language model (LLM) called Gemma, aimed at helping developers and researchers responsibly build artificial intelligence.
It is reported that the Gemma big model shares technology and infrastructure with Google's largest and most powerful artificial intelligence model, Gemini. "Inspired by Gemini, Google DeepMind collaborated with other Google teams to develop Gemma, which is named after Gemma, meaning 'gem' in Latin."
However, compared to Gemini, Gemma is more lightweight. Meanwhile, Gemma remains free to use, its model weights are also open-source, and commercial use is allowed.
Google has released two models with different weight scales, Gemma 2B (2 billion parameters) and Gemma 7B (7 billion parameters). Each scale has pre trained and instruction fine-tuning versions, allowing all organizations (regardless of size) to responsibly conduct commercial and distribution.
On the same day that Google released Gemma, the popular chip manufacturer Nvidia also announced a partnership with Google to ensure the smooth operation of the Gemma model on its chips. Nvidia also stated that its chatbot software Chat With RTX will soon support Gemma.
It is worth noting that Google also emphasizes that Gemma can surpass larger models on key benchmarks. What's even more impressive is that Google Gemma can run on laptops.
Google has stated that Gemini is the largest and most powerful AI model widely used today. Compared to other open models, Gemma 2B and 7B can achieve the best performance in their class within their scope. The Gemma model can run directly on developers' laptops or desktops, "It's worth noting that Gemma surpasses larger models on key benchmarks while adhering to our strict standards of safe and responsible output."
Along with the open source model, Google also released a technical report on Gemma's performance, dataset composition, and modeling methods in detail. Researchers have found in a technical report that Gemma supports a vocabulary size of 256K, which means it can provide better and faster support for languages other than English.
Comparison of Llama 2 parameters released by Gemma and Meta, from Google's official website
Gemma was also launched as soon as possible on the well-known open-source model libraries HuggingFace and HuggingChat. Shortly after its launch, both Gemma 2B and 7B models have reached the top of HuggingFace's "Big Language Model List".
AI industry expert and author of the deep learning framework Keras, Franois Chollet, further stated that the position of the strongest open source big model has now changed ownership.
Gemma's competitor Llama 3 is also about to be released. On January 19th, Meta co-founder and CEO Zuckerberg announced that Meta is training Llama 3 and will continue to open source in a responsible manner.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Nvidia Open Source 340 Billion Parameter Model Nemotron-4 340B
- Nvidia suddenly opens up!
- Nvidia Open Source 340 Billion Parameter Model Nemotron-4 340B
- Meta releases the strongest open-source model Llama 3.1, Zuckerberg: it will become a turning point in the industry
- Meta releases "industry-leading" open-source artificial intelligence (AI) model Llama 3.1
- Meta releases open-source big model Llama 3.1 with strong support from Nvidia
- Huang Renxun, Zuckerberg supports AI big model open source, two people exchange jackets to express brotherly love
- Robin Lee's internal speech exposes that the open source model is not efficient enough to solve the problem of computing power
- Alibaba Tongyi Qianwen Code Model Qwen2.5-Coder Full Series Officially Open Source
- Alibaba CEO Wu Yongming: AI development requires a batch of open-source models of different scales and fields
-
11月21日、2024世界インターネット大会烏鎮サミットで、創業者、CEOの周源氏が大会デジタル教育フォーラムとインターネット企業家フォーラムでそれぞれ講演、発言したことを知っている。周源氏によると、デジタル教 ...
- 不正经的工程师
- 4 小时前
- 支持
- 反对
- 回复
- 收藏
-
アリババは、26億5000万ドルのドル建て優先無担保手形と170億元の人民元建て優先無担保手形の定価を発表した。ドル債の発行は2024年11月26日に終了する予定です。人民元債券の発行は2024年11月28日に終了する予定だ ...
- SOGO
- 前天 09:05
- 支持
- 反对
- 回复
- 收藏
-
スターバックスが中国事業の株式売却の可能性を検討していることが明らかになった。 11月21日、外国メディアによると、スターバックスは中国事業の株式売却を検討している。関係者によると、スターバックスは中国事 ...
- 献世八宝掌
- 昨天 16:29
- 支持
- 反对
- 回复
- 收藏
-
【意法半導体CEO:中国市場は非常に重要で華虹と協力を展開】北京時間11月21日、意法半導体(STM.N)は投資家活動の現場で、同社が中国ウェハー代工場の華虹公司(688347.SH)と協力していると発表した。伊仏半導体 ...
- 黄俊琼
- 昨天 14:29
- 支持
- 反对
- 回复
- 收藏