Today’s Lesson
Security for Legal SaaS — Episode 37: Governed Writes and Human-in-the-Loop
AI Proposes, Humans Dispose
In Episode 36, we examined how models can leak training data through inference attacks. This episode addresses a different risk entirely: what happens when AI doesn't just analyse — it acts.
The principle is simple: AI systems in legal practice should propose, never dispose. They can draft a contract clause, suggest edits to a brief, classify a document, or flag a compliance risk. But filing a court document, sending a client communication, modifying a matter record, or executing a financial transaction — those actions must require a human professional to review, approve, and take responsibility.
This isn't just good security practice. It's a professional obligation.
Why Legal AI Must Not Autonomously Write
Professional Responsibility Demands Human Judgment
ABA Formal Opinion 512, issued in July 2024, is the American Bar Association's first comprehensive ethics guidance on generative AI in legal practice. It makes the obligation explicit: "GAI tools lack the ability to understand the meaning of the text they generate or evaluate its context, and therefore are not a substitute for the independent professional judgment a lawyer must exercise."1
The Opinion addresses several Model Rules:
| Model Rule | Requirement | AI Implication |
|---|---|---|
| Rule 1.1 (Competence) | Lawyer must provide competent representation | Lawyer must understand AI tool's capabilities and limitations |
| Rule 1.4 (Communication) | Keep client informed of case status | Client must know when AI is being used in their matter |
| Rule 1.6 (Confidentiality) | Protect client information | AI tool's data handling must preserve confidentiality |
| Rule 3.3 (Candor) | Duty of candor toward the tribunal | Lawyer must verify all AI-generated citations and analysis |
| Rule 5.1/5.3 (Supervision) | Partners must supervise subordinates | AI tool use requires supervisory frameworks |
The EU AI Act reinforces this globally. Article 14 requires that high-risk AI systems — which includes AI used in legal decision-making — "be designed and developed in such a way that they can be effectively overseen by natural persons during the period in which they are in use."2 The oversight must enable humans to understand the system's capabilities, correctly interpret its output, and decide not to use the system or disregard its output.
The Automation Bias Problem
Automation bias — the tendency to trust automated outputs without critical evaluation — is the practical reason governed writes matter. Research consistently shows that humans over-rely on AI suggestions, especially when those suggestions are presented with apparent confidence.3
In legal AI, this manifests as:
- Accepting AI-drafted clauses without verifying they reflect the client's negotiated position
- Filing AI-generated briefs without checking cited authorities (the phenomenon that produced the Mata v. Avianca sanctions in 2023)
- Approving AI-classified documents without reviewing edge cases
Case study: Mata v. Avianca (S.D.N.Y., 2023). Attorney Steven Schwartz submitted a brief containing six fabricated case citations generated by ChatGPT. When opposing counsel could not locate the cases, Schwartz asked ChatGPT to confirm they were real — and it did. The court sanctioned Schwartz and his firm. The failure was not in using AI; it was in treating AI output as a final product rather than a draft requiring verification.4
Technical Enforcement: The Draft State Pattern
The "governed writes" principle must be enforced technically, not just by policy. Telling lawyers "always review AI output" is insufficient — the system architecture should make unreviewed AI writes impossible.
Architecture Pattern: AI → Draft → Review → Production
AI generates output → Stored in DRAFT state
↓
Human reviewer receives notification
↓
Reviewer approves / edits / rejects
↓
If approved → Promoted to PRODUCTION state
If rejected → Returned to AI with feedback
Every AI-generated artifact — a contract clause, a document classification, a research memo, a billing entry — enters the system in a draft state that cannot reach production without human approval. The approval event is logged with the reviewer's identity, timestamp, and the specific version reviewed.
Implementation Patterns
| Pattern | Description | Use Case |
|---|---|---|
| Approval queue | AI outputs land in a review queue; nothing progresses without explicit approval | Court filings, client communications, regulatory submissions |
| Confidence threshold | Low-confidence AI outputs require review; high-confidence outputs may proceed with lighter oversight | Document classification, email triage |
| Escalation rules | Certain output types always require senior review regardless of confidence | Privilege designations, conflict checks, financial transactions |
| Four-eyes principle | Two independent reviewers must approve before promotion to production | High-stakes filings, M&A document production |
Key distinction: A confidence threshold does NOT mean "skip human review for confident outputs." It means "route high-confidence outputs to a faster review track and low-confidence outputs to a more thorough one." Even at 99% confidence, a human must see the output before it becomes official.5
What "Review" Actually Means
Effective human oversight requires more than a rubber stamp. The IAPP's analysis of human-in-the-loop requirements notes that oversight fails when reviewers lack the time, training, or technical understanding to meaningfully evaluate AI outputs.6 For legal AI, meaningful review means:
- The reviewer can see the AI's reasoning — not just the output, but what inputs it relied on and how confident it is
- The reviewer has domain expertise — a junior associate reviewing a complex derivatives clause is not meaningful oversight
- The reviewer has time — if the approval queue contains 500 items and the reviewer has 30 minutes, oversight is theatrical
- The reviewer can reject without friction — if rejecting an AI output requires more effort than approving it, approval becomes the default
Specific Governed Write Scenarios in Legal SaaS
Court Filing Systems
An AI that drafts a motion should produce a reviewable document with tracked changes, citations flagged for verification, and a summary of the legal reasoning. The "File" button must require attorney authentication and a certification that the filing has been reviewed. The system should log: who reviewed it, when, which version, and whether they made edits.
Client Communication
AI-drafted emails to clients should enter an outbox that requires explicit send approval. The system should prevent scheduled auto-send of AI-generated content — every communication must pass through a human checkpoint. As ABA Formal Opinion 512 notes, boilerplate consent in engagement letters is not sufficient to authorise unrestricted AI use in client communications.1
Document Classification and Privilege Review
AI can accelerate privilege review by pre-classifying documents, but the privilege designation must be confirmed by a qualified attorney. Incorrect privilege designations have discovery consequences — a document wrongly marked "not privileged" and produced to opposing counsel cannot be unproduced.
Contract Review and Redlining
AI-suggested redlines should appear as tracked changes, not direct edits. The reviewing attorney must be able to accept, reject, or modify each suggestion individually. The final document should record which changes originated from AI and which from the attorney.
Audit Trail Requirements
Every governed write must produce an audit trail that answers:
- Who generated the output? (Which AI model, which version)
- Who reviewed it? (Authenticated identity of the human reviewer)
- When was it reviewed? (Timestamp, with time zone)
- What did the reviewer see? (The exact version presented for review)
- What did the reviewer decide? (Approve, reject, modify — and the specific modifications)
- Why was it escalated? (If applicable — confidence threshold, document type, matter sensitivity)
This audit trail is not optional. It is the evidence that human oversight actually occurred, and it must be tamper-evident — a topic we will cover in Episode 42.
The Harvard Standard for AI Oversight Liability
A 2024 Harvard Journal of Law & Technology analysis proposed redefining the standard of human oversight for AI negligence. The argument: if a professional claims to have "overseen" an AI system but the audit trail shows they approved 200 outputs in 15 minutes without opening any of them, the oversight was illusory, and the professional bears the same liability as if no oversight occurred.7 Meaningful oversight leaves a forensic trail that can withstand scrutiny.
What's Next
Episode 38 covers LLM API Key Isolation and Inference Gateways — how to manage the API keys that connect your legal AI to cloud providers like OpenAI and Anthropic, and why a single leaked key can cost you more than a data breach.
Sources & Further Reading
Sources & references
- ABA, Formal Opinion 512: Generative Artificial Intelligence Tools (July 2024).
- EU AI Act, Article 14: Human Oversight.
- Strata.io, Human-in-the-Loop: A 2026 Guide to AI Oversight.
- Mata v. Avianca, Inc., No. 22-cv-1461 (S.D.N.Y. June 22, 2023) — sanctions for AI-fabricated citations.
- IBM, What Is Human In The Loop (HITL)?.
- IAPP, 'Human in the Loop' in AI Risk Management — Not a Cure-All Approach.
- Harvard Journal of Law & Technology, Redefining the Standard of Human Oversight for AI Negligence.
- Trilateral Research, Human-in-the-Loop AI Balances Automation and Accountability.
- Kiteworks, Human in the Loop: AI Compliance and Oversight Requirements.
- NYC Bar, Formal Opinion 2024-5: Generative AI in the Practice of Law.
- Small Wars Journal, Human-in-the-Loop or Loophole? Targeting AI and Legal Accountability.