Nvidia previews Rubin CPX graphics card for disaggregated inference

Nvidia Corp. today previewed an upcoming chip, the Rubin CPX, that will power artificial intelligence appliances with 8 exaflops of performance. AI inference involves two main steps. First, an AI model analyzes the information on which it will draw to answer the user’s prompt. Once the analysis is complete, the algorithm generates its prompt response one token […]

The post Nvidia previews Rubin CPX graphics card for disaggregated inference appeared first on SiliconANGLE.