I gave the tasks to my agent running on gemma4 26b via openclaw on llamacpp to research products that fulfill my need. It was a rather long description of the use case, of what I don’t want and so on.
My expectation was that the agent is spending lots of loops in searching, analyzing etc to find suitable products.
He was done in 1 minute. Found exactly what I don’t need and gave me some shallow general product categories to look into.
It’s exactly what I not want. I wanted my agent to find the products not to tell me where I should search.
I tried than with Claude sonnet 4.6. It behaved better, searched longer and produced also a a very general list of manufacturers that might be interesting.
After I told sonnet that I don’t care for manufacturers who do not have a product in their portfolio that meets my criteria and I want concrete products not just collections/manufactures, I got a list of candidates.
But this was a bit frustrating. This is the kind of research task that I would love to hand over to my agent. But I don’t see that they are capable of doing this. But why? They can search the internet, interpret pictures, navigate pdf catalogs etc. What is stopping them?
submitted by /u/Gold-Drag9242
[link] [comments]
Originally published at reddit.com. Curated by AI Maestro.
Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.




