OpenAI Researchers: AI Systems Could Handle Most Research Work Within Two Years

Gate News message, April 29 — OpenAI researchers Sébastien Bubeck and Ernest Ryu say AI systems could perform most human research work within two years, presenting mathematics as a clear measure of AI progress. Unlike vague performance tests, mathematical problems offer precise verification: answers are either correct or incorrect, leaving no room for ambiguity.

Bubeck noted that true AI thinking requires surviving long chains of reasoning. A single error in a multi-step argument collapses the entire proof, making error detection and correction mid-process the ultimate goal for advanced models. OpenAI’s internal labs have already generated more than ten completely new theorems publishable in top-tier combinatorics journals, demonstrating that AI now produces genuinely original, groundbreaking work beyond simply recombining existing papers.

However, sustained scientific breakthroughs demand steady focus across weeks of testing. Current systems still require strict human supervision to guide and verify each shift in direction. Bubeck uses “AGI time” to measure how long a model can independently mimic human thinking; current systems operate at roughly days to one week, with the industry target being weeks or months to enable autonomous work in fields like biology.

Long-term memory is critical to this future. Standard chat windows limit depth—complex mathematical proofs often exceed 50 pages—while code repositories demonstrate how extended work sessions enable deeper problem-solving. As AI gains independence and memory, human expertise becomes more valuable, not less. Workers must retain the deep foundational knowledge to challenge and verify machine answers, and organizations will need new automated filters and reputation systems to maintain trust amid a flood of AI-assisted research.

免責聲明:本頁面資訊可能來自第三方,不代表 Gate 的觀點或意見。頁面顯示的內容僅供參考,不構成任何財務、投資或法律建議。Gate 對資訊的準確性、完整性不作保證,對因使用本資訊而產生的任何損失不承擔責任。虛擬資產投資屬高風險行為,價格波動劇烈,您可能損失全部投資本金。請充分了解相關風險,並根據自身財務狀況和風險承受能力謹慎決策。具體內容詳見聲明

相關文章

Claude's Chinese Language Tokenization Cost 65% Higher Than English, OpenAI Only 15% More

Gate News message, April 29 — AI researcher Aran Komatsuzaki conducted a comparative analysis of tokenization efficiency across six major AI models by translating Rich Sutton's seminal paper "The Bitter Lesson" into nine languages

GateNews16分鐘前

半導體分析師看好 AI 行情「至少再走三年」:先進封裝才是產業瓶頸

Bubble Boi 指 AI 投資週期仍處早期,預計至少再有三年上漲,並不打算獲利了結。他認為先進封裝才是半導體真正瓶頸,需在同封裝內整合更多HBM與更大晶片。對 NAND/Flash 看多,價格可能持續走高,未來或加入快閃供應鏈。個人策略是借入資金增持,並以工程實務背景理解技術細節,認為此為優勢。

鏈新聞abmedia29分鐘前

AWS 在 Amazon Bedrock 擴展 OpenAI 整合

亞馬遜網路服務(Amazon Web Services)於 4 月 29 日宣布,與 OpenAI 的合作夥伴關係將大幅擴展,將 OpenAI 最新的能力整合到其雲端基礎設施中。此次擴展為 Amazon Bedrock 帶來三項新增產品:OpenAI 最新的模型 (有限預覽),以及 Codex 程式

Crypto Frontier41分鐘前

查爾斯三世會見 6 位美國科技執行長,包括黃仁勳、傑夫·貝佐斯與蒂姆·庫克,討論英國新創融資

Gate News 訊息,4 月 29 日——在對美國進行國事訪問期間,英國國王查爾斯三世在華盛頓的布萊爾宮,會見了 6 位傑出的美國科技領袖:NVIDIA 執行長黃仁勳、亞馬遜創辦人傑夫·貝佐斯、蘋果執行長蒂姆·庫克、AMD 執行長蘇姿豐、Salesforce 執行長馬克·貝尼奧夫,以及 Alphabet 總裁露思·波拉特,並

GateNews1小時前

2025 年全球 AR 智慧眼鏡出貨量飆升 98%,由 Meta 的 Ray-Ban Display 與波導技術推動

門戶新聞訊息,4月29日——根據 Counterpoint Research 的數據,全球擴增實境 (AR) 智慧眼鏡出貨量在 2025 年飆升 98%,下半年出貨量則按年增長 148%。增長動能來自擴大產出

GateNews1小時前

傳奇對沖基金交易員談美股本益比:未來幾年買大盤的人要獲利很難

對沖基金經理 Paul Tudor Jones(瓊斯)接受訪談,警告 AI 監管空白可能帶來災難性後果,因其破壞—迭代模式風險前所未見。另指出美股市值占GDP比重達252%、本益比偏高,長期投資大盤難獲利;他以拳擊比喻交易機會稀少,呼籲全球協作制定 AI 監管。

鏈新聞abmedia1小時前
留言
0/400
暫無留言