- The premise: Who is George Hotz and why his voice matters
- How Coding Agents Work: Architecture and Intrinsic Limitations
- The phenomenon of silent technical debt
- SME use cases: where risk is highest
- The speed/quality trade-off: a calculation SMEs must make
- The Divide in the AI Community: It's Not Just Hotz
- What nobody considers: the cost of supervision
- The recommended decision: an operational framework for SMEs
George Hotz, one of the most well-known programmers on the international tech scene, has publicly stated that LLM-based coding agents risk becoming one of the most expensive mistakes in the history of software development. Its evaluation comes after six months of direct testing. Therefore, its judgment is not speculative: it is empirical.
The central problem isn't about prototyping speed, which remains a recognized strength. However, language models exhibit significant fragility in handling technical details. Consequently, the bugs they produce tend to accumulate in deep layers of the code, becoming increasingly difficult to detect over time. This phenomenon has a specific name in the industry: technical debt silent.
For Italian SMEs considering the adoption of AI tools in software development, this perspective calls for strategic reflection. We at SHM Studio We believe that intelligent automation should be integrated with qualified human supervision, not replaced by it. In summary, adopting a coding agent without a control framework is a concrete operational risk, not a simple theoretical matter.
The premise: Who is George Hotz and why his voice matters
George Hotz is not an outsider to the software world. He is the first hacker to have independently unlocked an iPhone. He founded comma.ai, a company specializing in autonomous driving. Additionally, he briefly worked at OpenAI. His technical profile, therefore, is anything but generic.
In May 2026, Hotz made a stark statement: LLM-powered coding agents will become one of the most expensive mistakes in the entire history of software development. The original source is an analysis published by The Decoder, who accurately reported his words. Therefore, it is a public and documented position.
This stance is part of a profound debate. The AI community is currently divided between those who see coding agents as productivity multipliers and those, like Hotz, who consider them a structural source of technical risk. Therefore, ignoring this debate would be a mistake for any organization considering investments in code automation.
How Coding Agents Work: Architecture and Intrinsic Limitations
A coding agent is an AI system capable of autonomously writing, editing, and sometimes executing code. It is based on a Large Language Model, which is a large language model trained on enormous corpora of text and source code. In essence, the model predicts the most probable sequence of tokens given a context.
This mechanism works well for tasks with high repeatability and low contextual complexity. For example, generating a CRUD function, writing a standard unit test, or completing boilerplate code blocks. However, the problem emerges when the context becomes deep and interdependent.
LLMs do not have a true understanding of the semantics of the system they operate on. Consequently, they can produce syntactically correct but logically flawed code. These bugs do not immediately generate errors. Instead, they are embedded in deep layers and only manifest under specific conditions, often in production. According to research by Gartner, The quality of AI-generated code requires systematic human review to maintain acceptable reliability standards.
The phenomenon of silent technical debt
The concept of technical debt It's not new. It indicates the future cost an organization pays for suboptimal technical choices made in the present. However, coding agents introduce a particularly insidious variation: the silent technical debt.
In this scenario, the debt is not visible at the time of writing the code. In fact, the generated code often passes automatic linting checks and first-level tests. The problem emerges weeks or months later, when the system scales or when new features interact with the AI-generated code.
Hotz stressed that after six months of testing, bugs become progressively more difficult to find. This is not a problem of quantity, but of depth. Therefore, the cost of correction grows exponentially over time. For a small and medium-sized enterprise with limited development resources, this scenario can become critical.
Studies of Harvard Business Review They have documented how the average cost of fixing a bug in production is up to 100 times higher than fixing it during development. Therefore, the silent accumulation of defects has concrete and measurable economic implications.
SME Use Cases: Where Risk is Highest
Not all use contexts present the same level of risk. It is useful to distinguish between high-impact and low-impact scenarios for Italian SMEs operating in B2B or retail.
High-risk contexts include:
- Custom management systems and ERPBusiness logic is complex, and any error has a direct impact on operations.
- Third-party API IntegrationsAn AI coding agent can generate seemingly correct API calls but with inadequate error handling.
- E-commerce modules with payment logicAny bug in this context has immediate legal and reputational consequences.
- Authentication and User Management SystemsAI-generated security vulnerabilities are among the hardest to detect.
Conversely, low-risk contexts include generating HTML templates, writing non-critical automation scripts, and producing technical documentation. In these areas, coding agents offer a real advantage without exposing the organization to significant risks.
For SMEs managing complex web projects, we at SHM Studio We recommend accurately mapping the usage boundaries of AI tools before integrating them into development workflows. A AI strategy Well-defined is the prerequisite for any responsible adoption.
The speed/quality trade-off: a calculation SMEs must make
The main argument in favor of coding agents is speed. A prototype that would take a senior developer three days can be sketched out in a few hours with the assistance of an LLM. This advantage is real and should not be underestimated.
However, the economic calculation must include the cost of accumulated technical debt. If the prototype becomes production without an adequate review cycle, the initial savings quickly turn into amplified costs. Furthermore, in SMEs with small development teams, the capacity to handle complex bugs is structurally limited.
The trade-off, therefore, is not simply speed versus quality. It's immediate savings versus future operational risk. For this reason, the decision to adopt coding agents requires an explicit risk assessment, not just a performance evaluation.
Research of MIT Technology Review indicate that organizations integrating AI into software development without structured governance processes show significantly higher technical incident rates compared to those adopting hybrid oversight frameworks.
The Divide in the AI Community: It's Not Just Hotz
Hotz's stance is not isolated. The AI community is deeply divided on this issue. On one hand, companies like GitHub with Copilot and Cursor are actively promoting the adoption of coding agents as productivity tools. On the other hand, a growing segment of the technical community is reporting structural problems.
The debate fundamentally concerns the nature of LLMs. These models are optimized for statistical plausibility, not logical correctness. Therefore, they produce outputs that seem right much more often than they actually are. This gap between appearance and substance is particularly dangerous in a technical context.
Furthermore, the growing reliance on coding agents risks eroding the internal skills of development teams. If programmers stop writing code from scratch, they gradually lose the ability to read and understand AI-generated code. This creates a vicious cycle that increases dependence and reduces oversight capabilities.
What nobody considers: the cost of supervision
An often overlooked element in the debate about coding agents is the real cost of supervision. Many vendors present these tools as solutions that reduce the need for senior developers. In reality, the opposite often happens.
To detect bugs produced by an LLM, a high level of technical expertise is required. A junior developer is not capable of identifying deep logical errors in AI-generated code. Consequently, a coding agent does not replace a senior developer: it requires their constant presence as a supervisor.
This radically changes the ROI calculation. The expected savings on code writing are partially or fully absorbed by the cost of qualified review. For SMEs that do not have senior internal figures, this means outsourcing supervision, adding a cost that is rarely included in initial projections.
Who is responsible for digital marketing and digital transformation for SMEs knows this pattern well: underestimating hidden costs is one of the main causes of failure in technology adoption projects. For this reason too, a structured consulting approach is essential before any investment in AI applied to development.
The recommended decision: an operational framework for SMEs
In light of these considerations, what approach is recommended for an Italian SME evaluating the use of coding agents in 2026?
First, it's useful to distinguish between assisted and autonomous use. Assisted use, where AI suggests and the developer decides, presents an acceptable risk profile. Autonomous use, where AI writes and the developer approves without deep review, is what Hotz rightly criticizes.
Secondly, clear perimeters must be defined. Coding agents should be explicitly enabled for specific types of tasks, not as a general tool. Every output must go through a structured code review process, regardless of the source.
Third, code quality metrics should be monitored over time. Indicators such as bug density per sprint, average incident resolution time, and test coverage are early warning signs of technical debt accumulation.
Finally, team training remains a non-negotiable investment. A team that understands the limitations of LLMs can use them productively. A team that considers them infallible is exposed to significant risks.
For SMEs that wish to delve deeper into these topics, the team at SHM Studio is available for a structured consultation. It is possible to explore our AI services, the solutions of web development, the strategies of SEO e digital marketing. For a direct comparison, the page contacts is the starting point. Furthermore, our blog regularly publishes analyses on these topics, including reflections on AI-assisted copywriting, Google Ads campaigns e LinkedIn campaign.
Related articles
Discover other articles that explore similar topics in depth, selected to give you a more complete and stimulating view. Each piece of content is carefully chosen to enrich your experience.