Gate News message, April 24 — OpenAI engineer Clive Chan has raised detailed objections to the hardware recommendations chapter in the V4 technical report, calling it “surprisingly mediocre and error-prone” compared to the acclaimed V3 version. V3’s hardware guidance, which included Q&A sessions that became the most popular discussion topic at the ISCA academic conference, offered specific recommendations aligned with industry interconnect standards. V4, by contrast, is far more vague.

Chan systematically challenged three key recommendations. On power consumption, the report suggests that software optimization allows chips to run compute, storage, and communication at full capacity simultaneously, and recommends that chip manufacturers reserve additional power headroom. Chan argues this is counterproductive: total chip power is constrained by physical process limitations, so reserving more power margin only reduces operating frequency, ultimately decreasing computational performance. Regarding GPU-to-GPU data transfer, the report advocates a pull model—where GPUs actively fetch data—over a push model, citing high notification overhead in push operations. Chan disputes this, contending that pull is actually slower and that improved network adapter capabilities would be preferable. However, the two may be discussing different layers of the issue: the report addresses notification mechanism overhead, while Chan refers to transmission latency itself.

On activation functions, the report recommends replacing SwiGLU with simpler functions to reduce computational burden. Chan sees no merit in this, noting that Sonic MoE has already demonstrated optimal performance using SwiGLU. Chan suspects DeepSeek may have “deliberately weakened this section.”

View Source

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Cohere Acquires German AI Firm Aleph Alpha, Secures $600M Investment for European Expansion

AI Industry News

Gate News message, April 24 — Canadian AI company Cohere announced plans to acquire German AI firm Aleph Alpha to strengthen its presence in Europe. Schwarz Group, a backer of Aleph Alpha, plans to invest $600 million in Cohere's Series E funding round. The funding round is expected to close in 202

GateNews1m ago

Xpeng, Xiaomi Lead In-Car AI Push at Beijing Auto Show

AI Industry News

Gate News message, April 24 — Chinese automakers showcased advanced in-car AI systems at the Beijing Auto Show on April 24, as the country accelerates its AI Plus strategy and seeks greater independence from foreign semiconductors. Xpeng demonstrated voice-controlled parking that allows drivers to

GateNews41m ago

Former ByteDance Seed Engineer: ByteDance AI Iteration Takes Six Months vs Google's Three Months

AI Industry News

Gate News message, April 24 — Zhang Chi, a former engineer at ByteDance's Seed team and current assistant professor at Peking University, revealed on the podcast "Into Asia" that ByteDance requires approximately six months to complete one full cycle of large language model training (pretraining

GateNews57m ago

Naver Launches AI Tab Beta as Google Gemini Enters South Korea Search Market

AI Industry News

Gate News message, April 24 — Naver announced the start of a closed beta for AI Tab, its new conversational search feature, following Google's launch of Gemini in Chrome in South Korea. AI Tab will appear alongside Naver's existing search tabs, offering users a dedicated space for conversational

GateNews1h ago

India AI Engineering Hiring Surges 59.5%, Expands Beyond Tech Hubs

AI Industry News

LinkedIn's AI Labor Market Report 2026, released on April 24, found that AI engineering hiring in India rose 59.5% year on year, marking the fastest pace among the markets studied by the platform. The growth was driven by demand spreading beyond established tech centers. Cities including

CryptoFrontier2h ago

Commonwealth Bank Cuts 120 Jobs Amid AI Expansion

AI Industry News

Commonwealth Bank of Australia announced it will cut approximately 120 jobs as the nation's largest bank reviews roles and expands its use of artificial intelligence, according to Bloomberg. The cuts include 43 roles at Bankwest in Western Australia, with six positions affected by automation. This a

CryptoFrontier2h ago

Comment

0/400

No comments