How is Groq able to achieve throughputs of more than 400 Tokens/Sec and will their technology be available to the open-source community?
Share this post
Real-Time AI with Groq's LPU
Share this post
How is Groq able to achieve throughputs of more than 400 Tokens/Sec and will their technology be available to the open-source community?