The Rise of 'Agentic' Marketing and the Conceptual Fog
Every week, another startup announces a new 'coding agent' or 'AI co-scientist' that promises to work autonomously. According to a new critique published on arXiv (2606.23991v1), the industry has developed a serious definitional problem. The research argues that as Large Language Model (LLM) systems are increasingly marketed with 'agentic' labels, we have lost the plot on where simple automation ends and genuine agency begins.
The paper, released without institutional fanfare, tackles a question that has been nagging experienced developers for over a year: What exactly makes a system an 'agent'? The authors contend that current industry parlance conflates sophisticated pattern matching with independent decision-making, creating dangerous expectations, both in productivity promises and in existential fears.
What the Research Found: Three Core Problems
The critique identifies three systemic issues in how the AI community currently defines agency. First, most commercial 'agents' are simply deterministic pipelines with a chat interface. A system that calls an API, formats the response, and executes a pre-written function is not exercising agency; it is executing a program. Second, the paper warns that labeling every LLM-based workflow as 'agentic' dilutes the term to the point of meaninglessness. Third, and most critical for the existential risk debate, the authors argue that confusing automation with agency feeds unfounded fears about 'machine agency' escaping human control.
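To make the first point concrete, here is a minimal sketch of what a 'deterministic pipeline with a chat interface' might look like. It is not taken from the paper: every name in it (`fetch_ticket_status`, `format_response`, `HANDLERS`, `chat_pipeline`) is hypothetical, and the API call is stubbed so the example runs on its own.

```python
from typing import Callable, Dict


def fetch_ticket_status(ticket_id: str) -> Dict[str, str]:
    """Hypothetical stand-in for an external API call (stubbed, no network)."""
    return {"id": ticket_id, "status": "open"}


def format_response(payload: Dict[str, str]) -> str:
    """Turn the API payload into a chat-style reply."""
    return f"Ticket {payload['id']} is {payload['status']}."


# Pre-written handlers: the full set of behaviours is fixed at design time.
HANDLERS: Dict[str, Callable[[str], str]] = {
    "status": lambda ticket_id: format_response(fetch_ticket_status(ticket_id)),
}


def chat_pipeline(message: str) -> str:
    """Route a chat message to a pre-written handler.

    The system never forms or revises a goal: it looks up the first word
    of the message in a fixed table and runs whatever is registered there.
    """
    command, _, argument = message.partition(" ")
    handler = HANDLERS.get(command)
    if handler is None:
        return "Unsupported request."
    return handler(argument)


print(chat_pipeline("status T-1"))
print(chat_pipeline("refund T-1"))
```

The same message always produces the same reply, and anything outside the handler table is rejected. Swapping the lookup for an LLM call would make the routing less predictable, but under the critique's argument it would not by itself add goal formation.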
For context, the paper references the current wave of tools like Devin, GitHub Copilot Workspace, and various 'AI co-scientist' platforms that claim to operate with 'autonomous reasoning'. The critique suggests these systems, while impressive, lack the fundamental components of true agency: self-directed goal formation, counterfactual reasoning about their own actions, and the capacity to act against their programmed incentives.
Why This Matters for Developers
For the engineering community, this is not just an academic squabble. The definitional confusion has real consequences for system design and deployment. When a manager expects an 'agent' to handle an ambiguous task and the underlying system is merely a sophisticated automation script, the failure modes are unpredictable and often invisible. Developers are being asked to build safety guarantees around systems that their vendors claim are 'agentic' but which, under scrutiny, lack the introspection necessary for reliable autonomous operation.
The arXiv paper suggests we need a two-tier taxonomy: autonomous automation (AI systems that execute predefined workflows with minimal human input) versus agentic systems (those that can formulate novel goals, weigh trade-offs, and modify their own objectives based on context). Most current products fall squarely in the first category.
Implications for Business Leaders and Procurement
For business professionals evaluating these tools, the research provides a practical framework for vendor scrutiny. If a vendor claims their product is an 'AI agent', ask specific questions: Does it formulate its own sub-goals when the primary objective is ambiguous? Can it explain why it chose one action over another in terms of its own decision-making process, not just its training data? If the answer is no, you are likely buying high-end automation, not genuine agency.
The paper also has implications for risk management. The current panic about 'agentic AI' escaping control is, according to this critique, largely based on conflating automation speed with autonomous intention. A script that runs 10,000 API calls per second is not 'thinking'; it is executing. The real risk, the authors contend, is not rogue agency but brittle automation that fails in unexpected ways, taking down critical infrastructure without any malicious intent.
The Path Forward: A Call for Precision
The research does not merely critique; it offers a constructive path. The authors advocate for the AI community to adopt a formal definition of agency rooted in philosophy of mind and action theory. They propose that an agent must possess: (1) goal flexibility (the ability to revise goals based on new information), (2) causal understanding of its own actions, and (3) a capacity for self-evaluation that is not merely a reflection of training data.
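One way to read the three proposed criteria is as an audit checklist. The sketch below is an assumption about how they might be combined, not the authors' method: the article does not say whether all three must hold, so the all-or-nothing rule, the `AgencyAssessment` type, and the `classify` function are illustrative only. The two output labels reuse the 'autonomous automation' and 'agentic system' tiers described earlier.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgencyAssessment:
    """Hypothetical audit record, one flag per criterion listed above."""
    goal_flexibility: bool             # (1) revises goals based on new information
    causal_understanding: bool         # (2) models the effects of its own actions
    independent_self_evaluation: bool  # (3) self-evaluation beyond echoing training data


def classify(assessment: AgencyAssessment) -> str:
    """Assumed rule: a system counts as agentic only if it meets all three criteria."""
    meets_all = (
        assessment.goal_flexibility
        and assessment.causal_understanding
        and assessment.independent_self_evaluation
    )
    return "agentic system" if meets_all else "autonomous automation"


# Example: a RAG pipeline with a planner on top, assessed as meeting none of the criteria.
rag_with_planner = AgencyAssessment(
    goal_flexibility=False,
    causal_understanding=False,
    independent_self_evaluation=False,
)
print(classify(rag_with_planner))  # prints "autonomous automation"
```

Deciding each flag is the hard part and remains a human judgment; the code only records the outcome so that a product's label can be traced back to explicit answers.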
For practicing developers, this means we need to stop claiming 'agentic' capabilities for systems that are fundamentally retrieval-augmented generation (RAG) pipelines with a planner on top. The distinction matters for debugging, auditing, and liability. If a system is marketed as an agent but is actually an automation tool, who is responsible when it makes a decision that causes harm? The current legal framework has no answer, because the definitions are not clear.
The arXiv paper (2606.23991v1) is a timely intervention. As we move into a world where every SaaS product claims to have 'agents', this research provides the intellectual tools to separate genuine innovation from marketing hype. For developers, the takeaway is simple: build systems that do what they say, and label them honestly. For business leaders, the advice is equally direct: demand proof of agency, not just a sales pitch. The future of reliable AI depends on getting this definition right.
Source: Arxiv AI. This article was produced with AI assistance and reviewed for accuracy.