How does Groq achieve throughput of more than 400 tokens per second, and will its technology be available to the open-source community?
Real-Time AI with Groq's LPU