Challenge OpenAI, Google's new move! Significantly updated generative AI, launching video model VEO 2 and the latest version Imagen3
wdx5566
发表于 前天 09:32
3091
0
0
Google DeepMind, the flagship AI research laboratory of Google (GOOGL, stock price $196.66, market value $2407.3 billion), significantly upgraded its AI driven content generation tool on Monday, launching the Veo 2 video generation model and an enhanced version of the Imagen 3 image model, challenging OpenAI's leading position in AI image and video generation. Google stated that these updates are expected to completely change the creative workflow, providing video and image creators with higher realism and customized experiences.
According to Google, Veo 2 is a video generation tool that can generate high-quality videos with diverse themes and styles. Google stated in its blog that this model excels in realism, capturing details such as human expressions and movie effects. Its enhanced understanding of physics and film enables users to generate stunning content, including tracking shots and wide-angle compositions.
For example, Veo 2 is familiar with the language of movie shooting, and users can request a certain type of style, specify the lens, and suggest movie effects. Veo 2 will present videos at up to 4K resolution and extended to several minutes in length. It is worth noting that this resolution is 4 times that of the OpenAI Sora model, and the video duration is more than 6 times longer.
However, these advantages are still theoretical at present. In Google's experimental video creation tool VideoFX, videos generated by Veo 2 are limited to 720p resolution and 8 seconds in length. (In contrast, Sora's maximum output is 1080p, 20 second short films.)
Google stated that although video generation models often "hallucinate" unnecessary details such as extra fingers or unexpected objects, Veo 2 performs more realistically in this regard with a lower frequency of generation errors. In addition, the videos generated by Veo 2 include invisible SynthID watermarks to mark them as AI generated content, thereby reducing the risk of misuse or incorrect attribution.
DeepMind's Vice President of Product, Eli Collins, told the media that as the model gradually becomes ready for large-scale use, Google will provide Veo 2 through its Vertex AI developer platform.
Developers and creators can currently access the tool through Google Labs, and it is expected to be widely integrated into platforms such as YouTube Shorts by 2025. Meanwhile, the Imagen 3 model has been enhanced in terms of image composition and detail accuracy, supporting various styles from realistic to abstract, generating richer textures, and responding more faithfully to user prompts.
Currently, Imagen 3 has been launched in over 100 countries through Google Labs' ImageFX tool, allowing global users to experiment with its cutting-edge features.
In addition, Google has also launched Whisk, a creative tool that combines the visual analysis capabilities of Imagen 3 and Gemini. Users can input images, generate detailed text descriptions, remix styles, or design personalized works such as digital dolls or enamel badges.
Google introduced that Whisk combines the Imagen 3 model with Gemini's visual understanding and descriptive capabilities. The Gemini model will automatically generate detailed textual descriptions for the user's images and pass these descriptions to Imagen 3. This process allows users to remix themes, scenes, and styles in interesting new ways.
On December 10th Beijing time, Google announced the development of its new quantum chip Willow. This powerful chip has achieved a crucial breakthrough in the field of quantum computing over the past 30 years, completing tasks that today's computers take 10 years to complete in just 5 minutes. The research results were published in the journal Nature on December 9th.
After the news came out, the quantum information industry cheered and the AI circle was also greatly shocked.
Willow's major breakthroughs are reflected in two aspects: one is the significant increase in performance, that is, computing power. 5 minutes of computation is equivalent to a task that the fastest computer currently can complete in 10 years. 10& sup2; Years are much older than the age of the universe (about 13 billion years). 5 minutes and 10& sup2; In the year, this comparison shows that the leap in computing speed is very terrifying.
The second is the powerful quantum error correction capability. Willow's significant progress in the field of quantum error correction is that, based on a scalable square grid, the number of logical qubits (currently 105 qubits) increases while the error rate rapidly decreases. It expands from 3x3 encoded qubits to 5x5 grids, and then to 7x7 grids, with each expansion halving the error rate. Moreover, Willow can perform real-time error correction, making it possible to scale to higher order qubits (such as 1050) in a short period of time.
The above two major breakthroughs, compared to performance improvement, have attracted more attention from scientists in terms of error correction capability.
Quantum chips are the core of quantum computers. Willow's research and development team is the Google Quantum AI Laboratory led by Hartmut Neven. Hartmut stated that Willow is a big step towards large-scale, self correcting quantum computers, whose error correction capabilities and beyond classical computing power bring us closer to a system that can provide commercial applications, from helping discover new drugs, to designing more efficient electric vehicle batteries, to accelerating progress in nuclear fusion and new energy alternatives.
Daily Economic News Comprehensive Google, Public Information
Disclaimer: The content and data in this article are for reference only and do not constitute investment advice. Please verify before use. Based on this operation, the risk is borne by oneself.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- ERNIE Bot has more than 400 million users, Baidu Wu Tian: the big model is reshaping the industrial intelligence engine
- In October of this year, Tesla Model Y won the sales championship for first tier and new first tier city models
- Alibaba CEO Wu Yongming: AI development requires a batch of open-source models of different scales and fields
- Baidu's Q3 core net profit increased by 17%, exceeding expectations. Wenxin's large model daily usage reached 1.5 billion
- The delivery fee pricing has been lowered to 6 yuan, and McDonald's has adjusted the McDonald's delivery fee model
- Ideal Automobile implements a limited time zero interest policy for all models for the first time
- OpenAI launches full health version of the o1 big model and $200 per month ChatGPT Pro
- OpenAI has Rocket again! Officially launched Sora, an AI video generation model
- Google releases its most powerful model to attack OpenAI, shifting focus to AI agents
- Is it increasingly difficult to distinguish between truth and falsehood? Google launches new generation video generation model Veo 2
-
量子計算会社は年内に狂った。 現地時間12月17日、米株3大指数は下落した。ダウ平均は9営業日連続で下落し、1978年以来の最長連続下落を記録した。 人気のある株では、テスラとアップルの株価が再び高値を更新した ...
- SOHU
- 昨天 21:33
- 支持
- 反对
- 回复
- 收藏
-
上海新天地の公式情報によると、フランスの高級ジュエリーブランドBoucheron宝詩龍は2025年に上海新天地石庫門街区の入り口(太倉路と馬当路の境界)に入居する。この店舗は現在スターバックス臻選門店となっている ...
- 内托体头
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
12月17日、インタフェースニュースは空腹なのか、空腹なのか、今年8月に全国のオンライン騎手の休憩措置を取ったことを明らかにした。連続走行単時間が長すぎると、小休の要求があり、関連措置は継続的に整備されて ...
- 就放荡不羁就h
- 前天 20:01
- 支持
- 反对
- 回复
- 收藏
-
現地時間12月17日、英偉ダミアン株は2%超下落し、これまで3営業日連続で下落した。
- 不正经的工程师
- 前天 19:16
- 支持
- 反对
- 回复
- 收藏