On "Tool Confusion": You mentioned the Gorilla benchmark shows models struggle when given more than one tool. I'm curious about your take on this: Is the core challenge truly the number of tools, or is it the agent's underlying reasoning and planning ability in complex, multi-step workflows? Newer benchmarks seem to focus more on testing this multi-turn reasoning capability.
On the 32k Token "Limit": You highlighted that model accuracy can drop significantly after 32,000 tokens, which is a critical warning for production systems. Since some of the latest models (like GPT-4o and Claude 3.5 Sonnet) are showing strong performance well beyond this point, how do you see this "soft limit" evolving? Is it a moving target that engineers need to constantly re-evaluate for each specific model they use?
These are all great questions. To be honest, these are all dimensions that you should be aware of, but it’s extremely hard to know how they perform in your particular use case.
That’s why building AI evaluations that test your features is critical. Benchmarks are good reference points, but they often don’t reflect the real world. That’s why I don’t bother with them; I build my own AI evaluations, test, make changes such as adding/removing tools, and reiterate.
So net net we are back to client-agent server-LLM architecture design considerations for systems optimization and user experience which are loved to be the good 😊
fantastic piece Paul!
An honor to hear that from you, man 🥂
On "Tool Confusion": You mentioned the Gorilla benchmark shows models struggle when given more than one tool. I'm curious about your take on this: Is the core challenge truly the number of tools, or is it the agent's underlying reasoning and planning ability in complex, multi-step workflows? Newer benchmarks seem to focus more on testing this multi-turn reasoning capability.
On the 32k Token "Limit": You highlighted that model accuracy can drop significantly after 32,000 tokens, which is a critical warning for production systems. Since some of the latest models (like GPT-4o and Claude 3.5 Sonnet) are showing strong performance well beyond this point, how do you see this "soft limit" evolving? Is it a moving target that engineers need to constantly re-evaluate for each specific model they use?
These are all great questions. To be honest, these are all dimensions you should be aware of, but it’s extremely hard to know how they play out in your particular use case.
That’s why building AI evaluations that test your features is critical. Benchmarks are good reference points, but they often don’t reflect the real world. Instead of relying on them, I build my own AI evaluations, test, make changes such as adding or removing tools, and iterate.
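To make that concrete, here is a minimal sketch of what such a homegrown eval could look like: a few hand-written test cases scored on whether the model calls the expected tool, so you can add or remove tools and re-run. The test cases, tool definitions, and model name are illustrative assumptions (not from the article), and the client follows the OpenAI Python SDK's chat-completions interface.

```python
# Minimal sketch of a homegrown eval loop for tool selection.
# The test cases, tools, and model name below are illustrative placeholders.
from openai import OpenAI

client = OpenAI()

# Each case pairs a user query with the tool we expect the agent to pick.
TEST_CASES = [
    {"query": "What's the weather in Berlin tomorrow?", "expected_tool": "get_weather"},
    {"query": "Summarize the attached PDF report.", "expected_tool": "summarize_document"},
]

TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the weather forecast for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "summarize_document",
            "description": "Summarize an uploaded document.",
            "parameters": {
                "type": "object",
                "properties": {"document_id": {"type": "string"}},
                "required": ["document_id"],
            },
        },
    },
]


def run_eval(model: str = "gpt-4o") -> float:
    """Return the fraction of test cases where the model called the expected tool."""
    hits = 0
    for case in TEST_CASES:
        response = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": case["query"]}],
            tools=TOOLS,
        )
        calls = response.choices[0].message.tool_calls or []
        picked = calls[0].function.name if calls else None
        hits += picked == case["expected_tool"]
    return hits / len(TEST_CASES)


if __name__ == "__main__":
    print(f"Tool-selection accuracy: {run_eval():.0%}")
```

In practice you would swap in queries and tools from your own feature, track the score across runs, and only then decide whether pruning a tool or trimming context actually helps.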
A great read.
Thanks 🥰
So, net net, we are back to client-agent and server-LLM architecture design considerations for systems optimization and user experience, which is good to see 😊
Yes 🤟
😄
Thanks for the article!
Why USER_QUERY goes in the system prompt?
That’s what describes the user intent. It is not always required, but it often is.
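For illustration, here is a minimal sketch of what embedding the user query in the system prompt can look like. The template text, tags, and function name are assumptions for the example, not the article's exact prompt.

```python
# Minimal sketch: the user query is interpolated into the system prompt
# so the model sees the user's intent alongside its instructions.
# The template text and names are illustrative assumptions.
SYSTEM_PROMPT_TEMPLATE = """You are a helpful research assistant.

The user's intent is described by the following query:
<user_query>
{user_query}
</user_query>

Use the available tools only when they help answer this query."""


def build_system_prompt(user_query: str) -> str:
    return SYSTEM_PROMPT_TEMPLATE.format(user_query=user_query)


print(build_system_prompt("Compare GPT-4o and Claude 3.5 Sonnet on long-context accuracy."))
```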
Amazing article!
Thanks, Shah! This is the new format I would like to adopt for Decoding ML. What do you think?
I like the format. Just an FYI, this link from the article is broken: https://www.nature.com/articles/s41593-023-01496-2
This article is gold Paul! (and thanks a lot for the shoutout!)
Thanks, man 🥰 Haha, you deserve it 🤟