Crush all opponents? Google releases a lightweight open source model that can run on laptops
老蟹2017
发表于 2024-2-22 13:16:00
1218
0
0
The open source big model track welcomes a heavyweight new product.
On February 21st local time, Google announced the official launch of a new open source big language model (LLM) called Gemma, aimed at helping developers and researchers responsibly build artificial intelligence.
It is reported that the Gemma big model shares technology and infrastructure with Google's largest and most powerful artificial intelligence model, Gemini. "Inspired by Gemini, Google DeepMind collaborated with other Google teams to develop Gemma, which is named after Gemma, meaning 'gem' in Latin."
However, compared to Gemini, Gemma is more lightweight. Meanwhile, Gemma remains free to use, its model weights are also open-source, and commercial use is allowed.
Google has released two models with different weight scales, Gemma 2B (2 billion parameters) and Gemma 7B (7 billion parameters). Each scale has pre trained and instruction fine-tuning versions, allowing all organizations (regardless of size) to responsibly conduct commercial and distribution.
On the same day that Google released Gemma, the popular chip manufacturer Nvidia also announced a partnership with Google to ensure the smooth operation of the Gemma model on its chips. Nvidia also stated that its chatbot software Chat With RTX will soon support Gemma.
It is worth noting that Google also emphasizes that Gemma can surpass larger models on key benchmarks. What's even more impressive is that Google Gemma can run on laptops.
Google has stated that Gemini is the largest and most powerful AI model widely used today. Compared to other open models, Gemma 2B and 7B can achieve the best performance in their class within their scope. The Gemma model can run directly on developers' laptops or desktops, "It's worth noting that Gemma surpasses larger models on key benchmarks while adhering to our strict standards of safe and responsible output."
Along with the open source model, Google also released a technical report on Gemma's performance, dataset composition, and modeling methods in detail. Researchers have found in a technical report that Gemma supports a vocabulary size of 256K, which means it can provide better and faster support for languages other than English.
Comparison of Llama 2 parameters released by Gemma and Meta, from Google's official website
Gemma was also launched as soon as possible on the well-known open-source model libraries HuggingFace and HuggingChat. Shortly after its launch, both Gemma 2B and 7B models have reached the top of HuggingFace's "Big Language Model List".
AI industry expert and author of the deep learning framework Keras, Franois Chollet, further stated that the position of the strongest open source big model has now changed ownership.
Gemma's competitor Llama 3 is also about to be released. On January 19th, Meta co-founder and CEO Zuckerberg announced that Meta is training Llama 3 and will continue to open source in a responsible manner.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Nvidia suddenly opens up!
- Nvidia Open Source 340 Billion Parameter Model Nemotron-4 340B
- Meta releases the strongest open-source model Llama 3.1, Zuckerberg: it will become a turning point in the industry
- Meta releases "industry-leading" open-source artificial intelligence (AI) model Llama 3.1
- Meta releases open-source big model Llama 3.1 with strong support from Nvidia
- Huang Renxun, Zuckerberg supports AI big model open source, two people exchange jackets to express brotherly love
- Robin Lee's internal speech exposes that the open source model is not efficient enough to solve the problem of computing power
- Alibaba Tongyi Qianwen Code Model Qwen2.5-Coder Full Series Officially Open Source
- Alibaba CEO Wu Yongming: AI development requires a batch of open-source models of different scales and fields
- Open source securities: AI leads the rapid development of the education industry
-
生成式人工知能(AI)が巻き起こす技術の波の中で、電力会社は意外にも資本市場の寵児になった。 今年のスタンダード500割株の上昇幅ランキングでは、Vistraなどの従来の電力会社が注目を集め、株価が2倍になってリ ...
- xifangczy
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
隔夜株式市場 世界の主要指数は金曜日に多くが下落し、最新のインフレデータが減速の兆しを示したおかげで、米株3大指数は大幅に回復し、いずれも1%超上昇した。 金曜日に発表されたデータによると、米国の11月のPC ...
- SNT
- 前天 12:48
- 支持
- 反对
- 回复
- 收藏
-
長年にわたって、昔の消金大手の捷信消金の再編がようやく地に着いた。 天津銀行の発表によると、同行は京東傘下の2社、対外貿易信託などと捷信消金再編に参加する。再編が完了すると、京東の持ち株比率は65%に達し ...
- SNT
- 前天 12:09
- 支持
- 反对
- 回复
- 收藏
-
グーグルは現地時間12月19日、新しい「推理」モデルとしてGemini 2.0 Flash Thinkingを発売すると発表した。紹介によると、このモデルはまだ実験段階であり、訓練を経た後、モデルが反応を起こした時に経験した「思 ...
- 地下水
- 3 天前
- 支持
- 反对
- 回复
- 收藏