首页 News 正文

Shanghai Securities News (reporter Yang Xiangfei) At the 10th Real Time Internet Conference held recently, Zhao Bin, founder and CEO of VoiceNet, said that generative AI is driving major changes in the IT industry. Zhao Bin believes that this trend is mainly reflected in four levels: terminals, software, cloud, and human-machine interface. On the terminal, the ability of large models will drive PC and Phone to evolve towards AI PC and AI Phone. In terms of software, all software can and will be re implemented through large models, and will evolve from Software with AI to AI Native Software. At the cloud level, all clouds need to have the ability to train and reason for large models, and AI Native Cloud will become mainstream. In addition, the mainstream interaction methods of human-computer interfaces will also shift from keyboards, mice, and touchscreens to natural language dialogue interfaces.
As generative AI becomes the theme of the evolution of the IT industry in the next era, RTE (real-time interactive technology) has also become a key part of multimodal applications and infrastructure. In early October, Agora, a sister company of Soundnet, appeared as a voice API collaborator in the Realtime API public beta released by OpenAI. At this conference, Zhao Bin stated that Soundnet MiniMax is polishing China's first Realtime API. Zhao Bin also showcased the artificial intelligence agent developed by Soundnet based on MiniMax Realtime API. In the demonstration video, humans and intelligent agents engage in real-time voice conversations with ease and fluency. When humans interrupt the intelligent agent and ask new questions, the agent can also respond very sensitively and quickly, achieving a natural and smooth dialogue with humans.
Under the wave of generative AI, RTE will provide a broader space. Zhao Bin introduced that Shengwang has officially released the RTE+AI capability panorama. In the panoramic view, Soundnet presents the current technical capabilities and application solutions of combining RTE and AI from five dimensions: real-time AI infrastructure, RTE+AI ecological capabilities, Soundnet AI Agent, real-time multimodal conversational AI solutions, and RTE+AI application scenarios. The scenario innovation brought by the combination of generative AI and RTE will become the theme of the next decade.
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

白云追月素 注册会员
  • 粉丝

    0

  • 关注

    0

  • 主题

    39