BlockBeats News, June 21st: GPU retailer Tinygrad announced that, according to reliable sources, the GLM 5.2 model can reach an inference speed of 120 tokens per second on two networked Blackwell-architecture tinyboxes.
The configuration costs $150,000 and can be built from either two standard tinyboxes or one tinybox Pro; both options are said to deliver the stated performance. Tinygrad is promoting this as a selling point, pitching private deployment as "one-time purchase, never pay cloud fees" in direct competition with pay-as-you-go cloud inference services.
As of now, this news has not been officially confirmed by the GLM team, and Tinygrad has not disclosed further technical details.