Security for Legal SaaS

Episode 37 · Module 8 · AI Security

Governed Writes and Human-in-the-Loop

19 May 2026 · 8:02 · Security for Legal SaaS

8:02 8:02

In Episode 36, we examined how models can leak training data through inference attacks. This episode addresses a different risk entirely: what happens when AI doesn't just analyse — it acts. The principle is simple: AI systems in legal practice should propose, never dispose. They can draft a contract clause, suggest edits to a brief, classify a document, or flag a compliance risk. But filing a court document, sending a client communication, modifying a matter record, or executing a financial transaction — those actions must require a human professional to review, approve, and take responsibility.

Today’s Lesson

Security for Legal SaaS — Episode 37: Governed Writes and Human-in-the-Loop

AI Proposes, Humans Dispose

In Episode 36, we examined how models can leak training data through inference attacks. This episode addresses a different risk entirely: what happens when AI doesn't just analyse — it acts.

The principle is simple: AI systems in legal practice should propose, never dispose. They can draft a contract clause, suggest edits to a brief, classify a document, or flag a compliance risk. But filing a court document, sending a client communication, modifying a matter record, or executing a financial transaction — those actions must require a human professional to review, approve, and take responsibility.

This isn't just good security practice. It's a professional obligation.

Why Legal AI Must Not Autonomously Write

Professional Responsibility Demands Human Judgment

ABA Formal Opinion 512, issued in July 2024, is the American Bar Association's first comprehensive ethics guidance on generative AI in legal practice. It makes the obligation explicit: "GAI tools lack the ability to understand the meaning of the text they generate or evaluate its context, and therefore are not a substitute for the independent professional judgment a lawyer must exercise."1

The Opinion addresses several Model Rules:

Model Rule Requirement AI Implication
Rule 1.1 (Competence) Lawyer must provide competent representation Lawyer must understand AI tool's capabilities and limitations
Rule 1.4 (Communication) Keep client informed of case status Client must know when AI is being used in their matter
Rule 1.6 (Confidentiality) Protect client information AI tool's data handling must preserve confidentiality
Rule 3.3 (Candor) Duty of candor toward the tribunal Lawyer must verify all AI-generated citations and analysis
Rule 5.1/5.3 (Supervision) Partners must supervise subordinates AI tool use requires supervisory frameworks

The EU AI Act reinforces this globally. Article 14 requires that high-risk AI systems — which includes AI used in legal decision-making — "be designed and developed in such a way that they can be effectively overseen by natural persons during the period in which they are in use."2 The oversight must enable humans to understand the system's capabilities, correctly interpret its output, and decide not to use the system or disregard its output.

The Automation Bias Problem

Automation bias — the tendency to trust automated outputs without critical evaluation — is the practical reason governed writes matter. Research consistently shows that humans over-rely on AI suggestions, especially when those suggestions are presented with apparent confidence.3

In legal AI, this manifests as:

Case study: Mata v. Avianca (S.D.N.Y., 2023). Attorney Steven Schwartz submitted a brief containing six fabricated case citations generated by ChatGPT. When opposing counsel could not locate the cases, Schwartz asked ChatGPT to confirm they were real — and it did. The court sanctioned Schwartz and his firm. The failure was not in using AI; it was in treating AI output as a final product rather than a draft requiring verification.4

Technical Enforcement: The Draft State Pattern

The "governed writes" principle must be enforced technically, not just by policy. Telling lawyers "always review AI output" is insufficient — the system architecture should make unreviewed AI writes impossible.

Architecture Pattern: AI → Draft → Review → Production

AI generates output → Stored in DRAFT state
                      ↓
Human reviewer receives notification
                      ↓
Reviewer approves / edits / rejects
                      ↓
If approved → Promoted to PRODUCTION state
If rejected → Returned to AI with feedback

Every AI-generated artifact — a contract clause, a document classification, a research memo, a billing entry — enters the system in a draft state that cannot reach production without human approval. The approval event is logged with the reviewer's identity, timestamp, and the specific version reviewed.

Implementation Patterns

Pattern Description Use Case
Approval queue AI outputs land in a review queue; nothing progresses without explicit approval Court filings, client communications, regulatory submissions
Confidence threshold Low-confidence AI outputs require review; high-confidence outputs may proceed with lighter oversight Document classification, email triage
Escalation rules Certain output types always require senior review regardless of confidence Privilege designations, conflict checks, financial transactions
Four-eyes principle Two independent reviewers must approve before promotion to production High-stakes filings, M&A document production
Key distinction: A confidence threshold does NOT mean "skip human review for confident outputs." It means "route high-confidence outputs to a faster review track and low-confidence outputs to a more thorough one." Even at 99% confidence, a human must see the output before it becomes official.5

What "Review" Actually Means

Effective human oversight requires more than a rubber stamp. The IAPP's analysis of human-in-the-loop requirements notes that oversight fails when reviewers lack the time, training, or technical understanding to meaningfully evaluate AI outputs.6 For legal AI, meaningful review means:

  1. The reviewer can see the AI's reasoning — not just the output, but what inputs it relied on and how confident it is
  2. The reviewer has domain expertise — a junior associate reviewing a complex derivatives clause is not meaningful oversight
  3. The reviewer has time — if the approval queue contains 500 items and the reviewer has 30 minutes, oversight is theatrical
  4. The reviewer can reject without friction — if rejecting an AI output requires more effort than approving it, approval becomes the default

Specific Governed Write Scenarios in Legal SaaS

Court Filing Systems

An AI that drafts a motion should produce a reviewable document with tracked changes, citations flagged for verification, and a summary of the legal reasoning. The "File" button must require attorney authentication and a certification that the filing has been reviewed. The system should log: who reviewed it, when, which version, and whether they made edits.

Client Communication

AI-drafted emails to clients should enter an outbox that requires explicit send approval. The system should prevent scheduled auto-send of AI-generated content — every communication must pass through a human checkpoint. As ABA Formal Opinion 512 notes, boilerplate consent in engagement letters is not sufficient to authorise unrestricted AI use in client communications.1

Document Classification and Privilege Review

AI can accelerate privilege review by pre-classifying documents, but the privilege designation must be confirmed by a qualified attorney. Incorrect privilege designations have discovery consequences — a document wrongly marked "not privileged" and produced to opposing counsel cannot be unproduced.

Contract Review and Redlining

AI-suggested redlines should appear as tracked changes, not direct edits. The reviewing attorney must be able to accept, reject, or modify each suggestion individually. The final document should record which changes originated from AI and which from the attorney.

Audit Trail Requirements

Every governed write must produce an audit trail that answers:

This audit trail is not optional. It is the evidence that human oversight actually occurred, and it must be tamper-evident — a topic we will cover in Episode 42.

The Harvard Standard for AI Oversight Liability

A 2024 Harvard Journal of Law & Technology analysis proposed redefining the standard of human oversight for AI negligence. The argument: if a professional claims to have "overseen" an AI system but the audit trail shows they approved 200 outputs in 15 minutes without opening any of them, the oversight was illusory, and the professional bears the same liability as if no oversight occurred.7 Meaningful oversight leaves a forensic trail that can withstand scrutiny.

What's Next

Episode 38 covers LLM API Key Isolation and Inference Gateways — how to manage the API keys that connect your legal AI to cloud providers like OpenAI and Anthropic, and why a single leaked key can cost you more than a data breach.

Sources & Further Reading

Sources & references

  1. ABA, Formal Opinion 512: Generative Artificial Intelligence Tools (July 2024).
  2. EU AI Act, Article 14: Human Oversight.
  3. Strata.io, Human-in-the-Loop: A 2026 Guide to AI Oversight.
  4. Mata v. Avianca, Inc., No. 22-cv-1461 (S.D.N.Y. June 22, 2023) — sanctions for AI-fabricated citations.
  5. IBM, What Is Human In The Loop (HITL)?.
  6. IAPP, 'Human in the Loop' in AI Risk Management — Not a Cure-All Approach.
  7. Harvard Journal of Law & Technology, Redefining the Standard of Human Oversight for AI Negligence.
  8. Trilateral Research, Human-in-the-Loop AI Balances Automation and Accountability.
  9. Kiteworks, Human in the Loop: AI Compliance and Oversight Requirements.
  10. NYC Bar, Formal Opinion 2024-5: Generative AI in the Practice of Law.
  11. Small Wars Journal, Human-in-the-Loop or Loophole? Targeting AI and Legal Accountability.