Google's' Her 'rushes to land OpenAI voice AI still' holds on '
六月清晨搅
发表于 2024-8-14 20:39:29
183
0
0
On the early morning of August 14th Beijing time, Google officially released its intelligent voice assistant Gemini Live at the "Made by Google" conference. This feature directly challenges OpenAI's GPT-4o voice mode and marks another step towards more natural, universal, and user-friendly AI interaction.
According to Google, users can have free and smooth conversations with Gemini Live instead of relying on traditional input and output settings.
During the conversation, users can interrupt to inquire about more details or pause for a period of time before resuming.
In order to make conversations more natural, Google also offers ten voices for users to choose from. Google said, "It's like having a companion in your pocket that you can talk to about new ideas or practice important conversations with
The GPT-4o advanced voice mode previously released by Open AI also allows users to interrupt during conversations and perceive and respond to emotional fluctuations. In terms of voice settings, Open AI offers four types of voices, all produced in collaboration with professional voice actors.
In addition, Google will also connect Gemini Live with other applications and tools. Google has announced that it will launch extension features such as Keep, Tasks, Utilities, Calendar, YouTube Music, etc. in the coming weeks.
Google described the specific application scenarios of these features. For example, if a user needs to host a dinner party, Gemini Live can find specific recipes and add ingredients to the Keep shopping list, as well as customize a playlist that "reminds people of the late 1990s"; For example, by taking a photo of a concert poster, Gemini Live can answer whether the user is available on the day and remind them to buy tickets.
However, during the live demonstration of Gemini Live features at the "Made by Google" conference, there was a small incident. Google executive Dave Citron asked Gemini Live if there were any events on his schedule, but he tried Gemini Live twice in a row without any response until he changed his device for the third time before successfully demonstrating.
Currently, Google has provided an English version to Gemini premium subscribers on Android phones and will expand to iOS in the coming weeks, offering more language modes. The latest Pixel 9 series phones released by Google also feature Gemini Live functionality.
Industry insiders believe that the release of Gemini Live is an important milestone in the development of artificial intelligence interaction. By introducing voice interruption and selection functions, Google is not only competing with OpenAI, but also promoting human-computer interaction, thereby changing the competitive landscape of the artificial intelligence chatbot market and forcing other companies to create more natural, practical, and attractive artificial intelligence assistants.
At the same time, the innovative development of human-computer interaction has also brought new problems and challenges. For example, how will artificial intelligence quickly handle topic changes while maintaining contextual unity and relevance? How to handle interference information without losing important clues? More importantly, with the deepening development of artificial intelligence, where is its boundary with real life?
However, GPT-4o, which OpenAI publicly introduced three months ago, has not yet been fully implemented. On August 9th, OpenAI released a blog post about security, detailing the company's security efforts in developing GPT-4o and exploring the potential risks these technologies may pose to society.
OpenAI pointed out in the report the risks that artificial intelligence's humanoid social model may pose. OpenAI believes that users may establish social relationships with artificial intelligence and reduce the need for human interaction. This is beneficial for lonely individuals, but it can affect healthy interpersonal relationships.
OpenAI revealed that during the early testing of GPT-4o, they observed subtle changes in the interaction language between users and models, such as "This is our last day together" and so on. This seemingly harmless expression may hide bigger problems behind it.
In addition, OpenAI also mentioned that GPT-4o sometimes unintentionally generates outputs that mimic user voices, which means that AI speech engines may be used for fraud.
And these security issues are also one of the reasons why OpenAI controls the landing pace of GPT-4o. As for whether Google Gemini Live has addressed similar security risks, it has not been disclosed.
All security related risks, whether we are aware of them or the additional possibilities attached to Pandora's Box, are issues that need to be further addressed in the field of artificial intelligence to ensure that technological progress serves humanity.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- 谷歌“Her”抢跑落地 OpenAI语音AI仍在“hold on”
- グーグル「Her」フライング着地OpenAI音声AIは「ホールドオン」のまま
- 구글"Her"가 앞다투어 착지하는 OpenAI 음성AI는 여전히"hold on"에 있다
- 突发!高管离职、计划重组!OpenAI怎么了?
- 苹果公司据悉不再参与OpenAI融资轮谈判
- Apple reportedly no longer participates in OpenAI funding round negotiations
- アップルはOpenAI融資ラウンド交渉に参加しないという
- 애플은 OpenAI 융자 라운드 협상에 더 이상 참여하지 않는 것으로 알려졌다
-
9月が終わり、映画・テレビ業界が暗躍している。最近、愛奇芸の創始者でCEOのGONG宇氏は、映画・テレビ業界が長短の変化、AIの変化、中国映画・テレビドラマの海外進出の変化の3つの変化を経験していると発表した。 ...
- 寒郁轩良
- 昨天 14:17
- 支持
- 反对
- 回复
- 收藏
-
8月のトヨタ自動車(ダイハツ自動車と日野自動車を除く、レクサスを含む)の世界生産台数は前年同月比11.2%減の709571台、世界販売台数は前年同月比3.1%減の826863台だった。 日本本土市場では、トヨタ自動車の8月 ...
- SOGO
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
ネットワーク状況監視サイトDownDetectorによると、オーディオストリーミングプラットフォームSpotifyは日曜日に約3時間にわたる障害を経験した後、正常に回復し、ピーク時には米国の4万人以上のユーザーに影響を与 ...
- hecgdge4
- 昨天 09:33
- 支持
- 反对
- 回复
- 收藏
-
百済神州(688235.SH)はA株の有名な革新薬企業で、2017-2024年上半期、同社の各期の研究開発投資はA株の化学製薬会社(申万二級)の中で最も高く、研究開発費用は合計600億元を超えた。 しかし、百済神州にとって ...
- wylz8473
- 前天 18:36
- 支持
- 反对
- 回复
- 收藏