2024 年 5 月 21 日,OpenAI 執行長奧特曼在華盛頓州雷德蒙德微軟總部舉行的微軟 Build 大會上發表講話。
Jason Redmond|法新社|蓋蒂圖片社
OpenAI on Thursday launched a new AI model, “GPT-4o mini,” the artificial intelligence startup’s latest effort to expand use of its popular chatbot.
人工智慧新創公司 OpenAI 於週四推出了一款名為「GPT-4o mini」的新 AI 模型,這是該公司為擴大其熱門聊天機器人使用率所做的最新努力。
The company called the new release “the most capable and cost-efficient small model available today,” and it plans to integrate image, video and audio into it later.
該公司稱這個新版本是「目前功能最強大、成本效益最高的小型模型」,並計劃在之後整合圖像、影片和音訊功能。
The mini AI model is an offshoot of GPT-4o, OpenAI’s fastest and most powerful model yet, which it launched in May during a livestreamed event with executives. The o in GPT-4o stands for omni, and GPT-4o has improved audio, video and text capabilities, with the ability to handle 50 different languages with improved speed and quality, according to the company.
這個迷你 AI 模型是 GPT-4o 的分支版本,GPT-4o 是 OpenAI 於 5 月份與高層主管進行直播活動時推出的迄今為止速度最快、功能最強大的模型。GPT-4o 中的 o 代表 omni(全方位),根據該公司的說法,GPT-4o 提升了音訊、影片和文字處理能力,能夠以更快的速度和更高的品質處理 50 種不同的語言。
OpenAI, backed by Microsoft, has been valued at more than $80 billion by investors. The company, founded in 2015, is under pressure to stay on top of the generative AI market while finding ways to make money as it spends massive sums on processors and infrastructure to build and train its models.
OpenAI 獲得微軟的支持,投資者對其估值超過 800 億美元。這家成立於 2015 年的公司正面臨著保持在生成式 AI 市場領先地位的壓力,同時也需要找到賺錢的方法,因為它在處理器和基礎設施上投入了巨額資金來構建和訓練其模型。
The mini AI model announced Thursday is part of OpenAI’s push to be at the forefront of “multimodality,” or the ability to offer a wide range of types of AI-generated media, like text, images, audio and video, inside one tool: ChatGPT.
週四發布的迷你 AI 模型是 OpenAI 推動「多模態」發展的一部分,即在一個工具 ChatGPT 中提供各種 AI 生成媒體的能力,例如文字、圖像、音訊和影片。
Last year, OpenAI COO Brad Lightcap told CNBC, “The world is multimodal. If you think about the way we as humans process the world and engage with the world, we see things, we hear things, we say things – the world is much bigger than text. So to us, it always felt incomplete for text and code to be the single modalities, the single interfaces that we could have to how powerful these models are and what they can do.”
去年,OpenAI 的營運長 Brad Lightcap 告訴 CNBC:「世界是多模態的。想想我們人類處理和參與世界的方式,我們會看到東西、聽到東西、說出東西——世界比文字大得多。因此,對我們來說,僅僅將文字和程式碼作為單一模態、單一介面來展現這些模型的強大功能和作用,總是讓我們覺得不夠完整。」
GPT-4o mini is available starting Thursday to free users of ChatGPT, along with ChatGPT Plus and Team subscribers, and it will be available to ChatGPT Enterprise users next week, the company said in a release.
該公司在一份新聞稿中表示,從週四開始,ChatGPT 的免費用戶、ChatGPT Plus 和團隊訂閱用戶可以使用 GPT-4o mini,而 ChatGPT 企業用戶將在下週可以使用。