Tinygrad claims that GLM5.2 can achieve 120 tok/s on a dual-machine interconnected Blackwell configuration, with a price of $150,000.

GPU0%
GLM0%

BlockBeats News, June 21st, GPU retailer Tinygrad announced that, according to reliable sources, the GLM 5.2 model can achieve a inference speed of 120 tokens per second on two networked Blackwell architecture tinyboxes.

The price of this configuration is $150,000, with the option to choose between two standard tinyboxes or one tinybox Pro, both capable of delivering the aforementioned performance. Tinygrad is promoting this as a selling point, emphasizing a private deployment route of "one-time purchase, never pay cloud fees," directly competing with pay-as-you-go cloud inference services.

As of now, this news has not been officially confirmed by the GLM team, and Tinygrad has not disclosed further technical details.

---------------------------------
Click the original text link below to join the BlockBeats · Lark AI News channel, monitoring global AI trends and news 24/7.

Kaynak:BlockBeats

Yasal Uyarı: Mevcut içerik üçüncü taraf kaynaklardan alınmış veya doğrudan yapay zeka tarafından üçüncü taraf kaynaklardan çevrilmiştir. CoinEx, içeriğin gerçekliğini, doğruluğunu ve orijinalliğini garanti etmez ve bu içerik, CoinEx tarafından herhangi bir yatırım tavsiyesi teşkil etmez. Kripto varlıkların fiyatı ciddi dalgalanmalardan geçer, lütfen potansiyel risklerin farkında olun.

İlgili HaberlerHepsine bak

Serenity: SIVE is not just a CPO laser supplier, but a key player in the next-generation optical interconnect architecture

Analysis: The gap between open-source AI and cutting-edge models has shrunk from 12 months to 4 months. By the end of the year, top-tier intelligence may be available for free download.

JPMorgan Chase: AI Capital Spending Raised to $5.5 Trillion, Broadcom Expected to Exceed $150 Billion in AI Revenue by 2027

En Çok Arananlar

Coinler
Fiyat
24sa Değişim