Google releases Gemini, its most powerful AI model; top securities firms quickly weigh in and remain optimistic about the AI industry's prospects
因醉鞭名马幌
Posted on 2023-12-7 13:03:38
On December 7, Caixin News Agency reported that US technology giant Google has announced the launch of Gemini, its largest and most capable AI model to date.
The newly released Gemini model is multimodal and delivers significantly improved performance. Built on a Transformer decoder, it can process information in different forms, such as video, audio, and text. Compared with earlier models, Gemini can perform more complex reasoning and understand more nuanced information. It can extract key points from hundreds of thousands of documents by reading, filtering, and understanding them, which is expected to help achieve new breakthroughs in fields ranging from science to finance.
The Gemini family comes in three sizes: Gemini Ultra, Gemini Pro, and Gemini Nano, all of which support a 32K context length. Among them:
1) Ultra: the most capable version, designed to run with high efficiency on TPU infrastructure. In multiple tests, Ultra outperforms GPT-4V;
2) Pro: a version optimized for cost-effectiveness, with strong reasoning and multimodal capabilities. It scales well and can complete pre-training within a few weeks. In multiple tests it ranks second only to GPT-4V and ahead of mainstream large models such as PaLM 2, Claude 2, LLaMA 2, and GPT-3.5;
3) Nano: a 4-bit model distilled from larger models, available in two sizes, 1.8B and 3.25B parameters, targeting low-memory and high-memory devices respectively and supporting on-device deployment.
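As a rough illustration of why the two Nano sizes map to low-memory and high-memory devices, the weight storage at 4 bits per parameter can be estimated as below. This is a back-of-the-envelope sketch under stated assumptions: the function name is hypothetical, and it counts weights only, ignoring activations, caches, and any quantization metadata.

```python
def weight_memory_gb(num_params: float, bits_per_param: int = 4) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at the same bit width.
    """
    total_bytes = num_params * bits_per_param / 8
    return total_bytes / 1e9


# Parameter counts taken from the article; labels are illustrative only.
nano_sizes = {"Nano 1.8B": 1.8e9, "Nano 3.25B": 3.25e9}

for label, params in nano_sizes.items():
    print(f"{label}: about {weight_memory_gb(params):.3f} GB of weights at 4-bit")
```

By this estimate the smaller model needs a little under 1 GB for its weights and the larger one a little over 1.6 GB, which is consistent with targeting devices with different memory budgets.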
Gemini, which Google bills as its first model built to be multimodal from the ground up, supports both cloud and edge (on-device) deployment. According to the published test data, Gemini Ultra outperforms human experts on MMLU (Massive Multitask Language Understanding), and in side-by-side comparisons it surpasses GPT-4 on multiple tasks.
Minsheng Securities noted that, across evaluations on more than 50 benchmarks, the quality of the Gemini family in reasoning, mathematics/science, and long-text tasks keeps improving as model size increases. Gemini Ultra is the best model across all six capability areas. Gemini Pro, the second-largest model in the family, is also highly competitive in performance while being more efficient to serve.
Minsheng Securities pointed out that Gemini's training process also involved innovations in infrastructure, algorithms, and datasets:
In terms of infrastructure: Gemini was trained on Google's TPU v5e and TPU v4, and the training process demonstrated engineering innovation. For example, 4096 TPU v4 chips are connected to a dedicated optical switch, which can dynamically reconfigure 4x4x4 chip cubes into a super node with an arbitrary 3D torus topology in about 10 seconds; for Gemini Ultra, part of this capacity was set aside for hot standby and rolling maintenance. To meet the high inter-chip interconnect speed required by the Ultra version, Google applied multiple patented technologies such as OCS (optical circuit switching), but the final speed is not disclosed.
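The figures in the infrastructure paragraph imply some simple arithmetic: a 4x4x4 cube holds 64 chips, so 4096 chips correspond to 64 such cubes. The sketch below works that out and shows what neighbors look like in a 3D topology with wraparound links (commonly called a 3D torus). The helper name is hypothetical, and this only illustrates the topology; it is not Google's reconfiguration or scheduling logic.

```python
CUBE_DIM = 4          # chips per edge of one cube (from the article)
TOTAL_CHIPS = 4096    # chips per super node (from the article)

chips_per_cube = CUBE_DIM ** 3
cubes_per_supernode = TOTAL_CHIPS // chips_per_cube


def torus_neighbors(coord, dims):
    """Return the 6 neighbors of a chip in a 3D grid with wraparound links."""
    neighbors = []
    for axis in range(3):
        for step in (-1, 1):
            nxt = list(coord)
            # Modulo arithmetic closes each axis into a ring.
            nxt[axis] = (nxt[axis] + step) % dims[axis]
            neighbors.append(tuple(nxt))
    return neighbors


print("chips per cube:", chips_per_cube)
print("cubes per super node:", cubes_per_supernode)
print("neighbors of (0, 0, 0):", torus_neighbors((0, 0, 0), (CUBE_DIM,) * 3))
```

Because every axis wraps around, a chip on the edge of the grid still has six direct neighbors, which is what distinguishes a torus from a plain 3D mesh.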
In terms of algorithms: techniques such as a single-controller programming model and the XLA compiler are used to optimize the training process, and stable training is achieved by guarding against silent data corruption (SDC) and similar issues.
In terms of datasets: tokenization improves Gemini's training and inference speed, and a series of filtering methods is used to ensure the high quality of the training data.
Google's latest computing chip, TPU v5p, was released at the same time. TPU v5p improves on the earlier TPU v4: it delivers twice the floating-point performance and trains large language models 2.8 times faster. CITIC Securities believes that the official release of the multimodal Gemini model can expand application scenarios and drive continued growth in computing power demand. Minsheng Securities remains optimistic about the future prospects of the AI industry and believes that the release of models such as GPT-5 will act as a further catalyst.
CITIC Securities stated that in the current search scenario, Gemini can reduce latency by approximately 40%. For the industry as a whole, Google's push toward productization and commercialization will also drive broader change. Meanwhile, with the launch of models such as GPT-5, two trends are expected: 1) rising demand for computing power driven by multimodal models; 2) a growing number of AI scenarios and products.
The release of Gemini will raise expectations for multimodal models, which will drive up the industry's demand for computing power. In the medium to long term, upgrades to multimodal models are expected to enrich the usage scenarios of related products, while hardware upgrades and algorithm optimization bring down costs. Progress in consumer-facing (2C) products is worth looking forward to.
CITIC Securities stated that it remains optimistic about the long-term impact of this round of generative AI on the technology industry, and continues to focus on leading companies in areas such as computing power, algorithms, data, and applications.
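CandyLake.com is an information publishing platform and provides information storage space services only.
Disclaimer: The views in this article are solely those of the author. This article does not represent the position of CandyLake.com and does not constitute advice; please treat it with caution.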