- Security without intelligence is misplaced investment: Enterprises are over-prioritizing data residency and perimeter security while underinvesting in domain intelligence, creating AI systems that are secure but ineffective.
- Generic AI models fail in regulated enterprise contexts: Large language models that lack industry-specific training introduce material risk in compliance-heavy environments where contextual accuracy is non-negotiable.
- Small, domain-trained models deliver superior business outcomes: Specialized language models outperform general-purpose models in accuracy, efficiency, and cost by aligning directly with enterprise data, vocabulary, and workflows.
- AI security must be architected, not assumed: True protection comes from system design, spanning air-gapped inference, differential privacy, and auditability, not from where data is stored.
- Privacy-by-design will become a baseline enterprise requirement: Evolving regulatory frameworks will mandate technical safeguards, making privacy-preserving architectures central to future AI adoption.
Asked why he robbed banks, the American bank robber Willie Sutton is supposed to have answered, “because that’s where the money is.”
It’s a limited point of view, but logical.
I’m seeing a trend among enterprise executives today that would leave Willie shaking his head: many executives are intensely focused on finding a nice neighborhood for their Large Language Model—completely obsessed with data residency and perimeter security, but much less interested in the treasure they want to protect.
In effect, they are trying to build a secure vault with no money inside. An empty safe wouldn't interest Willie, and it shouldn't interest executives either.
In fact, it's worse than that: a large language model may actually create outsized risks for your firm, whatever its domicile, if it hasn't been trained specifically in the particulars of your industry.
If it can't tell you what you need to know, if it can't spot Basel III covenant violations for a bank, detect CAPA deviations in pharma manufacturing, or understand what force majeure means in the specific context of an energy contract, it's not going to be much help to you, wherever it lives.
What enterprises need is a custom language model that provides detailed and accurate analysis of the sensitivities that it must be vigilant about, not a glib overview that could well be wrong. The regulators won’t want to hear that your noncompliance stemmed from your GPT winging an answer on a mission-critical issue.
Small Models, Big Advantages
Besides three to five times greater accuracy, focusing a domain- and company-specific language model on specialized information has further advantages Willie would approve of: it saves you money, and you can run it in your private environment.
General-purpose models require massive computing power to retain knowledge about everything from 18th-century poetry to quantum physics. A Small Language Model (SLM) in the 1-billion to 13-billion parameter range may be less than 1 percent of the size of one of the industry giants.
This focus means each prompt uses much less energy, and the model can be deployed more easily, whether on-premises or in a sovereign cloud.
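To make that size gap concrete, here is a back-of-envelope sketch in Python; the two parameter counts are illustrative assumptions (a mid-range SLM versus a frontier-scale model), not vendor figures.

```python
# Rough memory math behind the "less than 1 percent" size claim.
# Both model sizes below are illustrative assumptions, not published specs.

def fp16_weight_gb(params_billions: float) -> float:
    """Approximate weight memory in GB at 16-bit precision (2 bytes/param)."""
    return params_billions * 2  # 1e9 params * 2 bytes = 2 GB per billion

for name, size_b in [("7B domain SLM", 7.0), ("1,800B frontier model", 1800.0)]:
    print(f"{name}: ~{fp16_weight_gb(size_b):,.0f} GB of weights")

# 7B domain SLM: ~14 GB            -> fits on a single on-prem GPU
# 1,800B frontier model: ~3,600 GB -> requires a multi-node serving cluster
```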
Consider what this looks like in practice:
For an insurance company, a financial SLM trained on the firm’s own underwriting language and risk vocabulary can handle credit covenant analysis in ways that large language models simply cannot do reliably.
For a pharmaceutical manufacturer, an SLM can be used to detect CAPA deviations and note drug interaction risks in the specific terminology your regulatory submissions require.
For an automotive supplier, your SLM can be trained to decode predictive maintenance signals and review supply chain anomalies, then communicate that information in plain language, not just to your data scientists’ dashboards but straight to the shop floor.
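As a minimal sketch of what running such a model entirely inside your own environment might look like, the snippet below uses the Hugging Face Transformers pipeline; the model name is hypothetical, standing in for whatever domain-tuned checkpoint you host locally.

```python
# A minimal sketch of on-prem inference with a domain SLM.
# The checkpoint name is hypothetical; substitute your own hosted weights.
from transformers import pipeline

analyst = pipeline(
    "text-generation",
    model="my-org/finance-slm-7b",  # hypothetical locally hosted checkpoint
    device_map="auto",              # place weights on available local GPUs
)

clause = "Borrower shall maintain a Tier 1 capital ratio of no less than 8%."
out = analyst(
    f"Flag any covenant risks in the following clause:\n{clause}",
    max_new_tokens=128,
)
print(out[0]["generated_text"])  # stays on-prem: no third-party API, no logging
```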
No-fault Vault
Of course, security remains a critical priority, even with a highly specialized SLM. Even so, the question of sovereignty matters less than architecture. Your bank may be on Main Street, but what keeps it safe from Willie is the burglar alarm, the thickness of the vault walls, and the complexity of the lock, not geography.
If there is one thing I learned in my two decades in finance IT, it is that security needs to be designed into your architecture.
Wherever your data lives, you need to design your systems so that IP can’t leak, and your query data can’t be retained by third-party API providers, made vulnerable to model inversion attacks, or injected into agentic pipelines.
You need air-gapped inference for tier-one sensitive workloads, differential privacy in training pipelines—mathematical guarantees, not consent forms—and cryptographically signed audit trails for every AI decision.
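For the audit-trail piece, below is a minimal sketch of what a hash-chained, signed record of each AI decision could look like, using only the Python standard library. The key handling, field names, and record format are illustrative assumptions; a production system would likely use asymmetric signatures (for example Ed25519) backed by a hardware security module.

```python
# A minimal sketch of a signed, hash-chained audit trail for AI decisions.
# Illustrative only: key source, fields, and format are assumptions.
import hashlib
import hmac
import json
import time

SIGNING_KEY = b"replace-with-key-from-your-kms"  # hypothetical key source

def sign_decision(prev_hash: str, model_id: str, prompt_hash: str,
                  decision: str) -> dict:
    record = {
        "ts": time.time(),
        "model_id": model_id,
        "prompt_sha256": prompt_hash,  # hash, not raw text: no query retention
        "decision": decision,
        "prev": prev_hash,             # chaining makes silent edits detectable
    }
    payload = json.dumps(record, sort_keys=True).encode()
    record["sig"] = hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()
    return record
```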
You want to be able to ask your team, "If our model's weights were stolen tomorrow, what would an adversary learn?" and have the answer be "not much."
Securing customer privacy in the AI era favors a similar strategy. There is a version of data privacy in enterprise AI that is imagined in legal documents, and then there is the version that works.
Policy-level controls do not prevent model memorization of private materials during training, inference-time re-identification, or the logging of queries by a third-party API provider.
To protect data in practice—not just in theory—enterprises need security by design: federated learning, which trains models across distributed nodes without raw data ever moving; differential privacy, which provides mathematical guarantees against reverse-engineering individual records; and synthetic data generation, which replaces sensitive training data with statistically equivalent proxies.
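Here is a minimal sketch of how the first two of those techniques can combine, assuming simple weight vectors and noise parameters chosen purely for illustration rather than a calibrated privacy budget.

```python
# Federated averaging with a Gaussian differential-privacy step: each node
# trains locally, clips its update, adds noise, and only the noisy update
# leaves the node. Clip norm and sigma are illustrative assumptions.
import numpy as np

def private_update(local_w, global_w, clip=1.0, sigma=0.5, rng=None):
    """Clip a node's weight delta and add Gaussian noise before it leaves."""
    rng = rng or np.random.default_rng()
    delta = local_w - global_w
    delta = delta * min(1.0, clip / (np.linalg.norm(delta) + 1e-12))
    return delta + rng.normal(0.0, sigma * clip, size=delta.shape)

def federated_round(global_w, node_ws):
    """The server averages only noisy, clipped updates; raw data never moves."""
    updates = [private_update(w, global_w) for w in node_ws]
    return global_w + np.mean(updates, axis=0)

# Toy usage: three nodes, each with its own locally trained weights.
g = np.zeros(4)
nodes = [np.array([0.2, -0.1, 0.3, 0.0]) + 0.05 * i for i in range(3)]
g = federated_round(g, nodes)
```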
Finally, it goes without saying that keeping an eye on changing regulations is as important as ever.
For now, whether or not you implement these measures depends on your own appetite for risk, but soon, EU AI Act Article 10, India's DPDP Act, and a growing patchwork of US state laws will require technical controls, not just policies. By 2027, “privacy-preserving by design” will appear in enterprise AI RFPs as standard.
Getting It Right
Designed correctly and deployed securely, small language models should outperform their larger competitors in all the ways that matter most to stakeholders: efficiency, predictability, and commitment to success.
And that’s good news for your business, because Willie Sutton was wrong—taking care of your stakeholders is where the money really is.
Disclaimer: This article was originally published on TechRadar on 4th May 2026. It has been republished here with permission. You can read the original version here.
Frequently Asked Questions
Our FAQ section is designed to guide you through the most common topics and concerns.
What is a small language model (SLM)?
A small language model is a domain-focused AI model trained on industry- and company-specific data. It is designed to deliver higher accuracy for specialized tasks, such as compliance checks, risk analysis, and operational monitoring, while using significantly fewer computing resources than general-purpose models.
Why are small language models well suited to regulated industries?
Small language models understand industry-specific terminology, controls, and regulatory frameworks. This enables them to identify compliance risks, anomalies, and exceptions more reliably, making them suitable for sectors such as banking, pharmaceuticals, insurance, and energy.
How do small language models address security and privacy?
These models support security-by-design approaches, including air-gapped inference, differential privacy, and federated learning. Such mechanisms reduce the risk of data leakage, unauthorized retention of queries, and exposure from model inversion attacks.
Can small language models be deployed on-premises?
Yes. Due to their compact size and lower infrastructure requirements, small language models can be deployed on-premises or in sovereign cloud environments, helping organizations maintain greater control over sensitive data and intellectual property.
What business advantages do small language models offer?
Small language models offer higher efficiency, better predictability, and lower operational costs. According to Tech Mahindra, their focused training enables more reliable decision support for mission-critical enterprise workloads without the overhead of large, general-purpose models.