- GPU0%
- GLM0%
BlockBeats News, June 21st, GPU retailer Tinygrad announced that, according to reliable sources, the GLM 5.2 model can achieve a inference speed of 120 tokens per second on two networked Blackwell architecture tinyboxes.
The price of this configuration is $150,000, with the option to choose between two standard tinyboxes or one tinybox Pro, both capable of delivering the aforementioned performance. Tinygrad is promoting this as a selling point, emphasizing a private deployment route of "one-time purchase, never pay cloud fees," directly competing with pay-as-you-go cloud inference services.
As of now, this news has not been officially confirmed by the GLM team, and Tinygrad has not disclosed further technical details.
---------------------------------
Click the original text link below to join the BlockBeats · Lark AI News channel, monitoring global AI trends and news 24/7.
Disclaimer: Konten ini berasal dari pihak lain atau diterjemahkan oleh AI dari pihak lain. CoinEx tidak menjamin konten ini benar, asli, atau akurat, dan tidak memberikan saran investasi. Harga aset kripto sangat tidak stabil, jadi harap berhati-hati terhadap risiko yang ada.
- KriptoHargaPerubahan 24J