February 18Feb 18 The company said the model is optimised for “efficient thinking”, delivering stronger responses while using fewer tokens — a key factor in reducing inference costs in production environments. View the full article
Create an account or sign in to comment