A client handed me a manila folder of W-2s and 1099s last spring and asked me a question I had not heard a client ask before. She wanted to know where her data was about to go.
Not the contractual answer. She had already read the privacy policy. She wanted the technical answer. When I scanned her W-2 into the document automation tool I was using at the time, what physical infrastructure did the bytes traverse, who could read them along the way, and where did the extracted values rest before they reached her tax return?
I did not know.
I knew what the vendor’s privacy policy said, which is a different thing. I knew the marketing language, which is a different thing again. I did not know — and could not have told her, with the technical precision a working tax preparer ought to be able to summon — where her tax data lived during the thirty seconds between her W-2 entering the tool and the extracted numbers landing back in my workpaper.
That gap, between the contractual answer a tax preparer is willing to give and the technical answer they actually possess, is where the next decade of tax practice is going to be won or lost. The architectural question my client asked me is becoming the load-bearing buying criterion for tax software in 2026 — and most preparers are not ready for it.
The architectural distinction every tax firm should know in 2026
Tax software in 2026 generally falls into one of two architectural patterns: shared cloud or private cloud. The names are doing real work, and the difference is not cosmetic.
Shared cloud tax software is the dominant architecture. The firm uploads documents to vendor-operated infrastructure. The vendor’s servers process the documents — running OCR, running classification, running automation — and return the extracted data to the firm. The firm’s documents are encrypted at rest, but on hardware the firm does not own. The decryption keys belong to the vendor. The firm’s tax data is co-resident with thousands of other firms’ returns inside a multi-tenant database. The contract describes the privacy controls. The contract can also be amended.
Private cloud tax software is the architectural alternative. The firm’s tax content lives inside infrastructure the firm controls — typically the firm’s own Google Workspace or equivalent. Documents are processed inside the firm’s environment. The vendor stores authentication and audit metadata, never tax content. The privacy promise is enforced by the architecture itself, not by contract language that can change.
Both architectures are functional. Both meet the technical bar of “stored encrypted, transmitted encrypted, processed by software.” The difference is where the data physically lives and who has the technical ability to access it. That difference is invisible at the marketing layer and load-bearing at the regulatory and reputational layer.
What actually happens to a W-2 in each architecture
In a shared cloud system, when I scan a client’s W-2:
- The image leaves my computer and travels to the vendor’s servers, often in a data center I cannot name and on hardware I cannot inspect.
- The vendor’s OCR engine reads the document — including the SSN, employer EIN, wages, withholdings, and any other data on the form.
- The extracted values are stored in the vendor’s database, typically in fields I can see when I log in but the vendor’s engineers can also see when they need to debug.
- The data may be retained for a period defined in the privacy policy — a period the vendor can update.
- When my session ends, the data does not disappear. It persists in vendor infrastructure under the terms of the contract.
In a private cloud system built on the firm’s own Google Workspace:
- The image is uploaded into the firm’s own Google Drive, inside the firm’s Google Workspace tenant.
- OCR processing runs against the document transiently. The extracted data is written back into the firm’s own Google Sheets and Drive.
- The vendor — Sophicor in our case — stores authentication tokens, user metadata, and audit log entries in vendor infrastructure. Tax content never persists in vendor storage.
- When my session ends, the tax data is exactly where it was at the start: in the firm’s own Workspace, encrypted at rest by Google’s standard encryption, governed by the firm’s own access policies.
- Sensitive fields are sanitized from API responses on the workflow path, so even the data flowing between firm and vendor systems does not carry the full sensitive content.
The two architectures produce identical preparer-facing experiences. They produce radically different answers when a client asks: where does my data live?
Verification, not replacement: the AI dimension
The shared-cloud question is being amplified right now by a second architectural shift: AI tax tools.
A generation of automation products is racing to remove manual data entry from the tax preparation workflow. The pitch is touchless: upload, extract, populate, file. The implicit promise is efficiency. The unspoken cost is that the human practitioner moves further from the data, not closer to it.
This is where the AI safety conversation in tax practice gets serious — and where the brand of automation matters more than its speed. There are two distinct AI postures available to a tax software vendor in 2026.
Auto-pilot AI treats the tax preparer as the bottleneck. The system extracts, classifies, populates, and files with minimal human review. The audit trail records the system’s decisions, not the practitioner’s verifications. The selling point is throughput. The structural risk is that the practitioner has signed something they did not actually verify, on data that may have been processed in infrastructure they do not control.
Co-pilot AI treats the tax preparer as the authority. The system extracts and proposes. The practitioner reviews and approves. Every document is reviewed by a human preparer before its extracted data becomes part of the return. Approval events are audit-logged with preparer attribution and timestamp. The audit trail is the evidence that the practitioner did what they were licensed to do.
AI in tax preparation is dangerous when it is shared-cloud, opaque, unverified, and treated as a substitute for human judgment. AI in tax preparation is safe when it is private-cloud, transparent, verification-required, and treated as co-pilot for the trained practitioner. Both versions ship as “AI tax automation” in 2026 marketing language. They are radically different products.
Why the data question becomes the buying criterion
Until recently, the load-bearing buying criterion for tax software was speed. Then it became automation breadth — how many forms, how many integrations. The criterion shifts again in 2026, and the new criterion is the architectural answer to the data question.
Three forces are converging.
Taxpayer awareness is rising. The general public is becoming sophisticated about where their data lives. The same conversation reshaped consumer healthcare, consumer banking, and consumer streaming. Tax is next. Taxpayers are beginning to ask their preparers the kind of question I got from my client last spring. The preparers without an answer are about to feel the silence.
Regulatory pressure is rising. State Boards of Accountancy, IRS Office of Professional Responsibility, and state attorneys general are paying closer attention to how tax data is handled. IRC §7216 has been on the books for decades, but the enforcement posture is sharpening. The architectural question is no longer purely commercial.
Insurance underwriters are paying attention. Professional liability insurers writing tax preparer policies are starting to ask about data architecture in their underwriting. The firms that can answer “we operate on private cloud architecture, our clients’ data lives in our own Workspace, our audit logs are firm-owned” are going to underwrite cleaner than the firms whose answer is a vendor’s privacy policy.
The combined effect is that the data question becomes the question. The firms ready with one defensible technical sentence will hold their clients, retain their underwriting, and meet rising regulatory scrutiny. The firms whose answer is a shrug will lose all three.
What changes inside a tax practice when the architecture answers the question
I run a small EA practice in Phoenix. When my client asked me where her data was going, the experience was professionally embarrassing. Not because I was negligent — because the tools I was using at the time had quietly removed my ability to answer my client’s question with technical precision. The tools assumed I would defer to the vendor’s privacy policy. So I deferred to it. So did most of my peers.
When the architecture changes — when the firm’s tax content lives in the firm’s own Google Workspace, when OCR processing happens transiently and writes results back into firm infrastructure, when SSN values are stored in dedicated secret management infrastructure rather than in the firm’s working data — the answer to the data question becomes one sentence.
That sentence is the difference between a preparer who is a customer of their tax software and a preparer who is an operator of it. Customers defer to vendor policies. Operators answer their clients’ questions in plain English.
The taxpayer’s data should never leave the firm that prepared it. That is the architectural commitment. Private cloud tax automation is the category of software that delivers it. Verification, not replacement, is the AI posture that protects it.
These three sentences are the working spine of professional tax practice in the AI era. They will be the floor before they are the ceiling.
The load-bearing question
A practitioner who cannot answer where their client’s data lives is operating without a key piece of the professional toolkit they will be expected to carry by 2030. The question is small enough to fit in a single sentence and large enough to redefine which tax software earns the load-bearing position in the next decade.
When a client asks me now, I have one sentence. Your tax data lives in our firm’s own Google Workspace. The software we use processes it inside our environment. The vendor stores authentication and audit logs, not your tax content.
That is the answer the AI tax era is going to require of every working preparer. The firms that have the architecture to deliver it will define the next era. The firms that don’t will spend the next five years answering questions in front of clients, examiners, and underwriters who have already moved on to the firms that can.
The taxpayer’s data should never leave the firm that prepared it. The tax practitioner is the one who has to stand behind that promise — and stand behind it with technical precision, not a privacy policy.
That is the quiet architectural question becoming the defining criterion for AI-era tax practice. It is not quiet for much longer.
— Yatin Miglani
Enrolled Agent · Phoenix, Arizona
Founder, Sophicor · sophicor.com