OpenAI Technology Live Episode 6: ChatGPT "Open Your Eyes and See the World" AI Companion/AI Education New Benchmark?
Katlyn30590
发表于 4 天前
1131
0
0
On the sixth day of the technology sharing day, OpenAI provided something closer to the "heart" - ChatGPT opens advanced voice mode: real-time video calls, screen sharing, and image uploading.
Why is it said to be closer to the 'heart'?
OpenAI CEO Altman previously revealed in an interview with Salesforce that his favorite AI movie is "Her" (the story of a man falling in love with his AI virtual assistant), and "the idea of a conversational language interface has incredible foresight." The Information reported that Altman hopes to eventually develop a virtual assistant that can respond quickly like the AI assistant in the movie.
The robot girlfriend in Her represents the ultimate form of embodied intelligence, which can interact with humans without barriers.
Previously, ChatGPT's DAN mode (short for Do anything now) allowed AI to converse with users in a more casual way, and its emphasis on "human touch" has been stunning. It not only enables low latency communication, but also imitates human tone and provides emotional value. This time, ChatGPT not only enables listening and speaking, but also unlocks visual abilities, allowing users to "open their eyes and see the world" through the camera.
In this live sharing session, CEO Sam Altman did not appear. Instead, four employees including Kevin Weil, OpenAI's Chief Product Officer, Jackie Shannon, OpenAI's Product Manager, Michelle Qin, and Rowan Zellers, members of OpenAI's multimodal technology team, introduced the updated features.
The real-time video call function in advanced voice mode is the most outstanding. After the OpenAI team members greeted ChatGPT video and got to know each other, someone asked: What is the name of the colleague with reindeer antlers? ChatGPT provided accurate answers using Santa Claus's limited voice, demonstrating their "memory" ability.
Next, the team demonstrated how ChatGPT can teach people how to operate a hand brewed coffee device. Just make a "video call" to ChatGPT, and it can teach you step by step based on the equipment in front of you. Throughout the entire demonstration, ChatGPT's voice was natural and friendly, adjusting its tone and even laughing like a human.
The screen sharing function allows ChatGPT to "see" your screen through screen sharing, which is also a real-time video understanding ability. Users only need to click on the advanced voice mode icon in the bottom right corner and select Share Screen from the drop-down menu to receive targeted assistance.
After successfully sharing with OpenAI team members, ChatGPT browsed their messages and requested guidance to reply. ChatGPT showed a "high emotional intelligence" side and suggested praising the other party's Christmas decorations.
It is reported that the advanced voice mode supports over 50 languages, 9 realistic output voice options, and each voice has its own unique tone and features. And the GPT-4o behind it can not only convert speech into text, but also understand and label other functions of audio, such as breathing and emotion.
ChatGPT, which supports over 50 languages, is able to understand real-world scenarios in real-time. This not only greatly enhances the experience of ChatGPT as an AI companion tool, but also demonstrates a more efficient and powerful AI education tool.
The above features will be launched in the ChatGPT mobile app from today onwards, and will be open to all team users as well as most Plus and Pro users in the next week.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- AI Agents: A New Track for Technology Companies to Compete in
- 45 billion education technology giant invites Shen Teng to sell equipment
- What do you think of Trump's return to power for American technology companies?
- Most of the "Seven sisters of Science and Technology" rose, and Nvidia's market value increased by 1.2 trillion yuan overnight! Trump Media Technology Falls Over 8%! Microsoft releases ultra convenient cloud PC
- Apple Pro Display XDR 2 may adopt the same quantum dot display technology as MacBook Pro
- Microchip Technology suspends application for chip bill related subsidies
- NIO Technologies increases capital to 18 billion yuan, with a growth rate of 200%
- NIO Technologies increases capital to 18 billion yuan, with a growth rate of 200%
- Hesai Technology's Q3 revenue increased by 21.1% year-on-year
- The tech industry is shaking! OpenAI whistleblower killed himself by explosion!
-
現地時間の金曜日、米デラウェア州地方裁判所のキャサリン・サンジョン・マコミック判事は、マスク氏と2018年の報酬案を承認したテスラ取締役会が1月の裁定に対して上訴することを許可し、30日以内にデラウェア州最 ...
- 湖塘
- 前天 12:13
- 支持
- 反对
- 回复
- 收藏
-
【極越自動車の前ユーザー開発部責任者がスターバックスに首席成長官に就任】スターバックス中国はこのほど、初めて首席成長官(CGO)のポストを設置し、楊振(Tony Yang)を同社首席成長官(CGO)に任命した。記者 ...
- 不正经的工程师
- 前天 09:59
- 支持
- 反对
- 回复
- 收藏
-
上海新天地の公式情報によると、フランスの高級ジュエリーブランドBoucheron宝詩龍は2025年に上海新天地石庫門街区の入り口(太倉路と馬当路の境界)に入居する。この店舗は現在スターバックス臻選門店となっている ...
- 内托体头
- 昨天 20:24
- 支持
- 反对
- 回复
- 收藏
-
12月16日、市場ではアリ氏が銀泰百貨と交渉しており、銀泰百貨の関連株式を紡績アパレル業界のトップ企業ヤゴールに売却する予定だという噂が流れている。この噂に対し、「国際金融報」の記者は最初に関係者に証明 ...
- 东的天下
- 昨天 17:22
- 支持
- 反对
- 回复
- 收藏