The Architects Are Working One Side of the Table
AI is redistributing power inside the Freedom of Information Act. The agency side has spent two years designing for itself. The requester side has not been designed at all.
The Architects Are Working One Side of the Table
AI is redistributing power inside the Freedom of Information Act. The agency side has spent two years designing for itself. The requester side has not been designed at all.
Twelve papers at one academic workshop. Seven federal agencies already deploying or piloting AI in FOIA processing. One architecture proposal sitting inside the official body that recommends FOIA reform to the Archivist of the United States. The agency side of the Freedom of Information Act is being designed in public. The requester side is being designed by no one.
This is an essay about that asymmetry, and about what the next decade of public records depends on someone building.
PART I / The Agency-Side Architecture, As Designed
The AIOG 2026 workshop at ICAIL in Singapore, June 8, will publish the most concentrated single drop of FOIA-AI architectural thinking in the history of the field. The headline paper is Jason R. Baron and Devanshu Kejriwal's "An AI-Orchestrated Architecture for Responding to FOIA Requests" [1], which proposes a five-stage agency pipeline: an intake chatbot, a search and preservation layer, a sensitivity-review classifier for exempt material, a generative determination-letter author, and an audit substrate. It is the most fully specified architecture for AI-mediated FOIA processing in print, and the authors acknowledge in their related-work section that they could not find prior work on multi-stage AI orchestration in this space.
It is not alone. Larooij and Graus at the University of Amsterdam show that a Qwen3.5-9B model running air-gapped on a single consumer GPU, prompted with chain-of-thought and few-shot error examples, outperforms BERT and prior classifiers on Exemption 5 deliberative-process classification [2]. Romana Afroze at Delaware Law School proposes a deontic-logic framework that beats fine-tuned BERT by 14.4 points of precision on 847 D.C. Circuit Vaughn-index entries, arguing that current agency-side ML classifiers "are operating outside the legal framework they are supposed to instantiate" [3]. Ravi Mahajan at SUNY Buffalo lifts retrieval recall on a 512-document NARA-and-MuckRock benchmark from 51 percent to 86 percent with a multimodal RAG pipeline aimed at the 78 percent of NARA's holdings that exist only as scans [4]. Jack McKechnie at Glasgow argues that sensitivity-aware search must be implemented as a cascading pipeline where every retrieval stage, not just the final classifier, is sensitivity-aware [5].
These are not careless papers. They engage live engineering problems with real benchmarks and real deployment concerns. They will be read by every serious FOIA-AI researcher for the next several years.
Every one of them assumes the agency is the operator.
In Baron and Kejriwal's architecture, the requester appears as a dialogue partner for Stage 1's chatbot and the eventual recipient of Stage 4's determination letter. They never appear as an actor with their own tooling, intent, or strategy. The clarification loop assumes the requester arrives in good faith looking for help disambiguating their request. In 2026, that is no longer a defensible baseline. Capable requesters are drafting with large language models, mapping custodians from Office of Inspector General reports, and reverse-engineering exemption patterns from agency reading rooms and prior releases aggregated on the FOIA Project's lawsuit database [10]. A chatbot whose job is to resolve ambiguities is going to meet ambiguities that are deliberate tradecraft, not error. The architecture does not account for a move it is going to see immediately.
In Baron and Kejriwal's architecture, the requester appears at two interface points.
Five internal stages, one audit substrate, one oversight pipe. Everything between the two interface points is internal to the agency. The architecture is the system. The requester is a peripheral.
- FAIL 1The clarification loop assumes good-faith ambiguity. Capable requesters bring deliberate tradecraft.
- FAIL 2The classifier decides what to withhold without consulting the case law the letter then cites. The seam between decision and justification closes.
- FAIL 3The authors concede AI cannot reliably evaluate the foreseeable-harm standard - the post-2016 legal pressure point.
Two interface points. Five internal stages. The requester is a packet that goes in and a packet that comes out.
The Stage 3 to Stage 4 chain creates a second problem. A learned classifier, in Baron and Kejriwal's specification a fine-tuned Legal-BERT model, produces the withholding decision. The Stage 4 generative model then writes the legal narrative that explains it, retrieving FOIA case law and prior agency decisions through a RAG pipeline. Today's clumsy determination letters leak. They show their seams. A RAG-narrated letter citing on-point case law will read more defensible than the underlying decision actually is, because the classifier did not consult the case law the letter cites. Requesters lose the diagnostic value that today's awkward letters quietly provide: the visible seam between the redaction call and its legal scaffolding. The append-only audit log in Baron and Kejriwal's Figure 3 gives oversight a record. It does not give the requester insight. Those are not the same accountability surface.
The third failure is the one the authors themselves flag. Section 4 item 5 of the paper concedes that AI cannot reliably evaluate the foreseeable-harm standard that Congress codified in the FOIA Improvement Act of 2016 [14]. That standard is not a small concession. The D.C. Circuit in Reporters Committee for Freedom of the Press v. FBI required "a focused and concrete demonstration" of why disclosing "the particular type of material at issue" would "actually impede" agency deliberations, "given the specific context of the agency action at issue," and held that "a perfunctory statement that disclosure of all the withheld information would jeopardize the free exchange of information will not suffice" [12]. Exemption 5 with the foreseeable-harm gate is where the post-2016 litigation lives. An architecture that automates everything except the legal pressure point has not been tested against the work that actually drives appeals.
Only Afroze, among the entire AIOG cohort, formalizes the foreseeable-harm predicate at all, and even Afroze places it inside the agency's own reasoning, not as a tool the requester can wield [3]. Larooij and Graus, with the candor that good engineers bring to public papers, write that "recall is more important than precision, as the impact of forgetting to redact sensitive information can be much larger than suggesting to redact too much" [2]. That loss function is honest. It is also a structural decision to privilege withholding. Once productionized inside an agency, it does not stop at the redaction step. It is the agency's posture, encoded.
Every one of these failures is the same failure. The architecture was designed without an adversary in the room.
PART II / The Architecture Is Also Underbuilt On Its Own Side
This is not an academic exercise.
OGIS 2024 RMSA
OMB Inventory 2025
Across status buckets
Per the National Archives Office of Government Information Services 2024 Records Management Self-Assessment, 18.6 percent of federal agencies, 50 of 269 respondents, already use artificial intelligence or machine learning in FOIA processing [8]. The Office of Management and Budget's federal AI use case inventory lists 3,611 use cases across 56 agencies in 2025.
The federal AI-in-FOIA roster is varied in maturity. In current production: Relativity eDiscovery, Microsoft Purview, FOIAXpress AI Assist, and Veritas Clearwell at the Department of Commerce [9]; and ArkCase at DOJ INTERPOL, the Office of Personnel Management, and the Equal Employment Opportunity Commission. In formal pilot: an AI/LLM-based redaction pilot at the Centers for Medicare and Medicaid Services (built consistent with Department of Justice guidance per HHS reporting), and a separate AI/LLM-based responsiveness-and-redaction pilot inside the Department of Health and Human Services Office of the Secretary [16]. Explored or partially piloted: MITRE's FOIA Assistant at the Consumer Products Safety Commission, which flags Exemption 4 monetary figures, Exemption 5 deliberative language, and Exemption 6 personally identifiable information per MuckRock's agency-disclosure investigation.
Adjacent records-AI deployment is even further along. The State Department's Machine Learning Declassification Pilot, operational since October 2022, made correct declassification decisions on 1997 diplomatic cables 99.29 percent of the time and reduced State Department labor on cable declassification by over 60 percent [15]. Declassification is not FOIA processing, but it is the federal records-AI maturity frontier. None of this is speculative. Production, pilot, and adjacent are all happening at once.
The DOJ Office of Information Policy, the official voice for federal FOIA policy, calls AI "an emerging and promising area" while emphasizing "the importance of ensuring that there is sufficient human monitoring and that appropriate safeguards are established so that the government is operating consistent with our obligations under FOIA" [7]. OGIS, whose mandate is FOIA oversight, observes that AI and machine learning "have the potential to aid in FOIA processing" and qualifies in the next sentence that they "are not a substitute for the judgment of FOIA professionals on application of exemptions and foreseeable harm" [8]. The critical academic voice in the field, Ronald Capaldi's 2025 article in the Journal of the National Association of Administrative Law Judiciary, argues that without algorithmic audits and mandatory disclosure of deployment, agency AI converts FOIA into "an empty promise of openness" [6]. He is right about the risk. He stops one move short.
So the agency side is being deployed, the official posture says human FOIA professionals must remain in the loop on the legal layer, the academic critic warns that agency AI without audits hollows the law, and in the official materials surveyed here - DOJ OIP guidance, OGIS reporting, FOIA Advisory Committee output, and the academic-policy literature - no source addresses the question of whether the requester side gets to operate AI at all. The redistribution-of-power frame appears nowhere in the official record.
Even granting the agency-side architecture its own premises, the system being sketched is not deployable at the scale the next four years will demand. Five structural omissions sit underneath the design.
First, cost. An architecture that demands GPU-accelerated inference, managed vector databases at scale, and continuous Stage 5 oversight is not a budget-neutral migration from current human-only processing. Federal AI procurement at FedRAMP-grade authorization runs in the millions to tens of millions of dollars per agency per year, with vendor lock-in to a small number of cleared LLM and infrastructure providers. Most agencies cannot afford it. The architecture as proposed quietly assumes a Treasury, Health and Human Services, or Department of Homeland Security budget.
Second, workforce. The system requires a supervisor breed that does not exist in the federal FOIA workforce today: senior FOIA professionals who can evaluate model outputs, adversarially test classifier decisions, and rule on edge cases at speed. The trilemma is hard. That person cannot be hired at General Schedule pay grades. They cannot be substituted by contractors without budgets no agency actually has. The roughly five to seven thousand federal FOIA staff across all agencies are a niche workforce already, and the AI-FOIA-supervisor specialization is narrower still.
Third, security. A Stage 3 BERT classifier trained on classified or controlled material is itself a data-handling problem. Vector embeddings of FOIA-processed documents in managed databases can be inverted, at least partially, into their source text using techniques that are well-developed in the machine learning security literature. The append-only audit log that Stage 5 maintains is a high-value target precisely because it captures input and output snapshots of the very sensitive material the system was processing. The Stage 1 chatbot is an attack surface: prompt injection in request text, jailbreaks to extract the classification rubric, side-channel inference of exemption thresholds. None of this is addressed in the published architectures.
Fourth, what might be called meta-FOIA recursion. The pipeline generates records. The append-only audit log is a federal record. The classification decisions are federal records. The chatbot transcripts are federal records. The model versions and training data are discoverable in litigation. All of them are subject to NARA retention schedules. All of them are subject to future FOIA requests. The architecture creates its own FOIA exposure. The more comprehensive the audit, the larger the exposure surface. There is no FOIA-specific NARA schedule addressing AI intermediate state in the published guidance, and no FOIA Advisory Committee recommendation addressing what happens when a requester FOIAs the audit log of the AI that processed their original request.
Fifth, demand. The 1.5 million federal FOIA requests of fiscal year 2024 are the last figure the Baron and Kejriwal paper cites [1]. The system is being designed for that volume. Per ProPublica and Gizmodo reporting on the Heritage Foundation's Oversight Project, the campaign filed on the order of one hundred thousand AI-generated FOIA requests in roughly two months, generating intake rates that some impacted FOIA offices reported as one per second and consuming a documented share of staff days at the targeted agencies [18][18a]. Some agencies subsequently lost FOIA personnel to reductions in force in mid-2025, partly in connection with the strain. That episode is the canary. A single requester-side actor with AI forced visible structural change at federal FOIA offices in months, not years. The next actor with AI will face the question of what they should be building for, not whether the demand will arrive. The architecture being designed at AIOG is being designed for the demand environment of 2025, not the demand environment of 2029.
There is one further fact that is not technical, but is the hardest fact in this essay.
The architecture, the seat, and the recommendation are held by one person.
The author of the most fully specified agency-side FOIA-AI architecture in print also co-chairs an Implementation Subcommittee whose recommendations reach the Archivist of the United States. The FOIA Advisory Committee - established by NARA, 17 members across 7 government seats and 10 non-government seats - has no comparable requester-side architectural voice.
Jason Baron is the lead author of the AIOG 2026 architecture paper. He also co-chairs the Implementation Subcommittee of the FOIA Advisory Committee - NARA's standing advisory body whose recommendations shape federal FOIA reform - in the committee's current 2024-2026 term [8b][8c]. The committee has 17 members across two structural bins: 7 government seats (representing the agencies that process FOIA requests) and 10 non-government seats (representing requesters, academics, journalists, and the commercial requester community). Audited across all 17: none of the 7 government members has published a peer-reviewed FOIA-AI architecture proposal. None of the 9 other non-government members has either. Baron is the only one, and his architecture serves the agency side from a non-government seat. The closest near-misses are Ryan Mulvey's AFPF policy essays and Nicholas Wittenberg's Armedia vendor commentary; neither is architecture-specific or peer-reviewed.
Baron's paper is not an academic exercise. The architecture being sketched at AIOG is being authored by an institutional voice on the body that recommends FOIA reform to the National Archivist. There is no equivalent requester-side architectural voice in that room.
PART III / The Other Half of the Problem Belongs to the Requester
The honest current state of the requester side is not zero. It is also not the architectural counterweight the situation demands.
MuckRock has filed 176,343 requests across 28,198 agencies, fulfilled 58,809, and released 11.9 million pages [9]. Its production platform has no AI. Its only AI build is a research prototype called Agent Moss, a retrieval system over public records retention schedules, second-stage, not productionized [9b]. The FOIA Project's lawsuit database covers 18,742 FOIA cases through March 2026, useful for litigation research and unavailable for request generation [10]. The Reporters Committee for Freedom of the Press maintains the Open Government Guide and a legal hotline; their iFOIA tool was sunset and the URL now redirects to their main site [11].
Several AI-native entrants have arrived in 2025 and 2026. FOIA Fluent, built at the NYC Data Science for Social Good civic project, is the most architecturally complete to date: corpus-search-grounded request drafting against statute and prior outcomes, a 1,600-agency federal transparency hub with success-rate/response-speed/fee/portal-availability rankings across 54 jurisdictions, per-agency exemption-pattern deep-dives, one-click appeal generation, and a daily-feed pattern engine that watches court opinions, agency enforcement actions, recalls, IG reports, and regulatory dockets over 60-day windows [19]. Open FOIA Project, built by James Scott, ships an exemption anticipator that predicts which of the nine federal exemptions an agency will invoke and offers pre-written challenge strategies, a case-law library including controlling FOIA precedent, an appeal letter generator citing case law, and 100+ agency profiles with denial rates and common exemptions across all fifty states [25]. EZFOIA offers AI document analysis on already-received releases at $75 per single request and up to $500 per month for unlimited filing [20]. FOIAflow, launched in April 2026 by a high-school student and his brother and covered respectfully by FOIA Advisor, claims autonomous filing, follow-up, appeals, and document analysis from a topic input [21]. FOIA Buddy operates a sub-fifty-cent-per-request service with a documented appellate history. Narrower legacy tools include FOIA Friend's request drafting workflow [22] and FOIA Machine's open-source IRE-lineage letter generator. AILA's FOIA Toolbox is a 735-page PDF that costs $181 and is not software [24].
These are real builds. FOIA Fluent and Open FOIA Project in particular are doing serious engineering work, between them covering partial pieces of the first four asymmetric-layer elements at marketing depth. What has not been built, by anyone at production-grade depth, is the full integrated stack - and specifically, the two elements that most precisely counter agency-side AI: per-analyst correspondence intelligence, and counter-AI vendor fingerprinting. Those remain uncontested territory.
Zero platforms cover the FULL asymmetric layer.
Five capabilities specifically counter agency-side AI deployment. The current market: one platform ships one element at production-grade depth (FOIA Fluent's agency hub). Partial coverage on three more elements is scattered across the field. Two elements have no coverage at all - exemption pattern mining and counter-AI fingerprinting.
| AgencyIntel | IntakeTradecraft | ExemptionPattern Mining | HarmRebuttal | CounterAI | Coverage | |
|---|---|---|---|---|---|---|
| 1 full + 3 partial | ||||||
| 4 / 5 partial | ||||||
| 1 / 5 partial | ||||||
| 2 / 5 partial | ||||||
| 1 / 5 partial | ||||||
| 0 / 5 | ||||||
| 0 / 5 | ||||||
| Column coverage | 1 full / 3 partial | 2 partial | 2 partial | 3 partial | 0 | 1 full total / 0 platforms cover all 5 |
The most architecturally complete requester-side platform to date. Civic-tech provenance, no commercial pressure. The only platform with documented full coverage on any single element of the asymmetric layer.
- Verified1,600-agency federal hub with success rate / response speed / fee rate / portal availability rankings across 54 jurisdictions
- VerifiedPer-agency deep-dives with exemption-pattern surfacing
- VerifiedCorpus-grounded drafting against statute + eCFR + past outcomes (model constrained against fabricated citations)
- VerifiedOne-click appeal letters generated from the request timeline
- VerifiedStatutory deadline monitoring
- VerifiedDaily-feed pattern engine: court opinions, agency enforcement, recalls, IG reports, regulatory dockets, 60-day windows
- MissingPer-analyst correspondence intelligence
- MissingCounter-AI vendor fingerprinting
- MissingAdversarial intake (interviewing requester before drafting)
Personally funded by founder James Scott. The largest single direct collision with the asymmetric-layer thesis at marketing depth. Claims partial coverage on four of five columns. Verification of actual shipping depth vs. marketing copy is the open question.
- ClaimedAI request drafting with 5 U.S.C. § 552 citations
- ClaimedExemption Anticipator: predicts which of the 9 federal exemptions agencies might invoke + pre-written challenge strategies
- ClaimedCase Law Library (named cases include NLRB v. Robbins Tire, DOJ v. Reporters Committee)
- ClaimedAppeal Letter Generator citing case law
- Claimed100+ agency profiles with denial rates and common exemptions, all 50 states
- ClaimedDeadline tracker, multi-agency filing, fee-waiver analyzer, 1,000+ templates
- MissingCounter-AI vendor fingerprinting
- MissingPer-analyst correspondence intelligence
- MissingProduction-depth verification on most modules
The largest requester-side platform by volume. 176,343 requests filed across 28,198 agencies, 58,809 fulfilled, 11.9M pages released. Zero AI features shipped in the consumer product. Internal Agent Moss / foia-coach RAG project exists in MuckRock's GitHub organization but is not productionized.
- Verified28,198-agency directory (Name + Jurisdiction visible; per-request statistics surface per request, not as structured per-office intelligence)
- VerifiedPer-request response statistics aggregated across platform history
- VerifiedFOIA filing workflow, document hosting, news/community layer
- MissingAI request drafting
- MissingExemption prediction
- MissingHarm rebuttal automation
- MissingCounter-AI
- MissingPer-analyst intelligence
Built by 17-year-old Amanuel Asfaw with his brother. Launched 7 April 2026. Site is mostly imagery; capabilities are marketing claims, not verified production depth. Pricing surfaced externally: Solo $49/mo, Pro $99/mo. Respectfully covered by FOIA Advisor on launch.
- VerifiedLaunched April 2026 with FOIA Advisor coverage
- ClaimedTopic-to-filing autonomous agent (drafts from topic input)
- ClaimedAutonomous follow-ups
- ClaimedAppeals generation
- ClaimedDocument analysis
- MissingPer-analyst intelligence
- MissingCounter-AI
- MissingProduction-depth verification across all modules
Commercial service. Sub-fifty-cent-per-request after a free first request. All 50 states. Real adversarial filer with a documented appellate history (Curry and FOIA Buddy v. Sw. Sch. Dist., Pennsylvania Office of Open Records). Traditional workflow, no AI features on landing page.
- VerifiedPer-request filing at sub-$0.50 / request
- VerifiedAll 50 states + DC coverage
- VerifiedAgency contact data + state-law summaries
- VerifiedDocumented appellate history (Curry v. Sw. Sch. Dist., PA OOR)
- MissingAI drafting (no AI features documented on landing page)
- MissingPer-analyst intelligence
- MissingCounter-AI
- MissingProduction-depth on exemption mining or harm rebuttal
Document-side AI only. Search, summarization, insight extraction on releases the user has already received. Pricing: $75 per single request, $200 for a 5-pack, $500 for unlimited monthly. A different axis from the asymmetric layer entirely.
- VerifiedPost-receipt document search and AI analysis
- VerifiedSummarization and insight extraction from released documents
- VerifiedPricing: $75 single, $200 / 5-pack, $500 unlimited monthly
- MissingAgency directory or intelligence
- MissingRequest drafting (AI or otherwise)
- MissingAppeals generation
- MissingExemption mining
- MissingCounter-AI or per-analyst intelligence
FOIA Machine is the original free letter generator with Investigative Reporters and Editors lineage. FOIA Friend is a single-line landing page offering drafting + tracking. Both pre-date the AI-native generation. Historically important; new entrants are partly competing to fill the void these tools left.
- VerifiedFOIA Machine: open-source free letter generation, IRE lineage
- VerifiedFOIA Friend: drafting + tracking workflow
- MissingAI features of any kind (neither platform documents AI augmentation)
- MissingCounter-AI
- MissingPer-analyst intelligence
- MissingModern stack across the board
Five concrete elements of the requester-side architecture are uncontested territory.
The operator at the center. Five capabilities to reach parity with the agency side.
Each module is technically achievable today. The corpus, the legal scaffolding, and the agency intelligence already exist as public data. The build window closes as the agency-side AI stack matures toward 2028.
A living per-FOIA-office knowledge graph maintained at portal depth, with per-analyst behavior signatures mined from years of agency correspondence. Updates as agencies move portals, change fee schedules, restructure sub-components, and rotate FOIA staff.
- FOIA.gov API (federal agency directory + annual performance data)
- FOIA Project (TRAC) lawsuit database
- Agency reading rooms and FOIA libraries (scheduled scraping)
- Agency FOIA portals (live state, change-detection)
- Historical request response logs (per-analyst correspondence)
- GovInfo + eCFR for regulatory citations and Privacy Act eligibility
- Postgres + JSONB per-agency knowledge graph
- Next.js scrapers on scheduled jobs (Vercel Cron / GitHub Actions)
- Claude API for unstructured agency-prose enrichment and analyst-pattern classification
- Vector embeddings (pgvector) for similar-agency matching and routing
- FOIA.gov API client with rate-limit aware backoff
- Per-agency routing card (correct office, submission method, current fee schedule)
- Per-analyst behavior fingerprint ("Analyst Y closes 73% of requests early at agency X; routinely cites Exemption 5")
- Fee waiver guidance per requester category
- Real-time portal-change alerts ("Treasury moved FOIA off portal Dec 2025")
- Sub-component routing guard for large agencies (DHS, DOJ, HHS)
A conversational LLM agent that interviews the requester about real intent before drafting. Decides when ambiguity is strategically valuable. Files Privacy Act in parallel where applicable. Refuses reflexive scope-narrowing - the exact opposite move from Baron's S1 clarification loop.
- Requester interview transcripts (multi-turn structured)
- FOIA Improvement Act 2016 statutory framework as context
- Privacy Act eligibility decision tree
- Per-agency drafting templates (loaded from Agency Intel knowledge graph)
- Case law corpus for foundational citation
- Multi-turn Claude API session with adversarial drafting system prompt
- Statutory framework + agency-specific templates as injected context
- Privacy Act parallel-filing trigger logic
- Streaming UI for live drafting reveal
- Strategically drafted FOIA request with deliberate ambiguity preserved where it widens the search
- Parallel Privacy Act request when requester is the subject of the records
- Scope justification document (for agency challenge response)
- Expedite-justification language where applicable
Mines historical agency releases at scale. Surfaces per-office withholding patterns, exemption-claim language, and the loss-on-appeal language that signals an agency's vulnerable arguments. Predicts which exemptions a given office will invoke for a given document type and topic.
- Historical agency releases (PDF + redaction patterns + reading-room exports)
- FOIA Project lawsuit database (cases lost on appeal, with opinion text)
- Determination letters from prior requester submissions
- OGIS mediation reports for agency-specific dispute patterns
- D.C. Circuit FOIA opinions for binding precedent
- PDF/OCR pipeline for historical releases (Mistral OCR or equivalent)
- Vector DB (pgvector) of release content for similarity search
- Per-agency exemption-claim classifier fine-tuned on labeled corpus
- Loss-on-appeal language detector (D.C. Circuit corpus)
- Topic + document-type taxonomy for prediction lookup
- "Agency X invokes Exemption 5 on 80% of deliberative-process documents in topic Y"
- "This determination-letter language pattern has lost on appeal in N D.C. Circuit cases"
- Exemption-claim prediction for the requester's specific submission
- Recommended request scoping to minimize anticipated exemptions
Drafts appeals against the foreseeable-harm standard codified in the FOIA Improvement Act of 2016 (5 U.S.C. § 552(a)(8)(A)). Cites controlling D.C. Circuit precedents. Identifies the agency's specific failure to meet Reporters Committee v. FBI's "focused and concrete demonstration" standard. Requests in-camera review on contested redactions.
- D.C. Circuit FOIA case law (Reporters Cmte v. FBI 3 F.4th 350 + post-2016 lineage)
- Full FOIA case-law corpus (TRAC FOIA Project lawsuit database, 18,742 cases)
- The agency's specific determination letter (parsed for exemption claims)
- The agency's prior loss-on-appeal language (from Exemption Pattern Mining module)
- 5 U.S.C. § 552 statutory text + DOJ OIP guidance
- Legal-text RAG over case law corpus (vector + BM25 hybrid)
- Determination-letter parser (extracts exemption claims by paragraph)
- Appeal template generator with citation injection
- In-camera-review request boilerplate generator
- Appeal-authority lookup (from Agency Intel) for proper submission
- Appeal letter with case-citation arguments per exemption invoked
- In-camera review request for contested redactions
- Foreseeable-harm gap analysis (where the agency's letter fails the standard)
- Deadline-tracked submission package to the agency appeal authority
Identifies which agency-side AI tool processed a given FOIA response based on determination-letter fingerprints. Looks up that tool's documented failure modes. Drafts appeals tuned to the specific tool's blind spots.
- MITRE FOIA Assistant public documentation + CPSC pilot reports
- FOIAXpress AI Assist vendor documentation
- ArkCase / Armedia federal deployment disclosures
- Microsoft Purview + Relativity public technical docs
- NIST + academic literature on classifier failure modes
- Vector DB of determination-letter linguistic patterns by tool
- Determination-letter fingerprint classifier (multi-class, per-tool)
- Per-tool failure-mode database (Postgres + structured findings)
- Tuned appeal templates per tool deployment
- Vendor-tool deployment tracker (which agency uses what)
- "This response was likely processed by MITRE FOIA Assistant - known recall problems on Exemption 4"
- Appeal language calibrated to the specific tool's blind spots
- Pattern-of-error documentation (for FOIA Advisory Committee comments or litigation)
- Counter-tool deployment tracker as a public artifact (transparency by-product)
One. An agency intelligence layer with portal-depth precision and per-analyst granularity. Not just a directory of which agency holds what kind of record, but the working knowledge of how each FOIA office actually behaves: the correct submission method this month (the Treasury Department moved its requests off its portal in December 2025), the current fee schedule, the appeal authority's mailing address and deadline, the recent reading-room releases that establish what the office considers segregable, the per-component routing inside large agencies where the wrong DHS sub-component can cost a requester six months, and the per-analyst behavior signatures that emerge from years of agency correspondence: which analyst routinely closes early, which writes substantive determination letters, which loses on appeal, which makes Glomar responses on what topics. None of the existing platforms maintains this at portal depth across the fifty states, where FOIA-equivalent statutes (state public records laws) double the addressable market and triple the agency surface. None mines correspondence at the analyst level on any jurisdiction.
Two. Intake counter-tradecraft. An agent that interviews the requester about the actual records sought before composing the request, that decides when ambiguity is strategically valuable and should be preserved, that knows when a deliberately broad request forces a more honest search than a narrow one, and that knows when to file a Privacy Act request alongside the FOIA to widen the legal lever. The opposite of Baron and Kejriwal's clarification loop, which is designed to narrow the requester into the agency's preferred scope before any record gets pulled.
Three. Per-agency exemption-pattern reverse engineering. Agencies release documents under FOIA every day. Those releases reveal, over time, which exemptions a given office invokes most aggressively, which categories of material they have historically released and then later withheld, which redactions they have lost on appeal, and which language in determination letters consistently maps to which underlying decisions. None of the existing requester-side tools mines its own past releases this way at the per-agency level. The intelligence is already in the documents. It is just not being extracted.
Four. Appeal drafting and foreseeable-harm rebuttal generation. The Reporters Committee v. FBI standard requires the agency to provide a focused, concrete, particularized demonstration that disclosure would harm a protected interest. A requester-side AI that knew the case law and could draft an appeal that named the agency's failure to meet that standard, exemption by exemption, with cited cases and a request for in-camera review on contested redactions (a procedural ask for the judge to inspect the unredacted documents privately and test the withholding against the record), is not a hypothetical capability. The underlying legal corpus is public. None of the existing requester-side platforms produces this output today.
Five. Deadline orchestration plus counter-intelligence against agency AI redaction tools. The agency side is already deploying or piloting MITRE's FOIA Assistant, FOIAXpress AI Assist, ArkCase (at DOJ INTERPOL, OPM, EEOC), and various Relativity and Purview integrations at Commerce, plus the CMS and HHS Office of the Secretary AI redaction pilots [9][16]. The general machine-learning literature documents recall-vs-precision tradeoffs, classifier bias under class imbalance, and tool-specific output fingerprinting techniques. Tool-specific failure modes for the named federal AI-in-FOIA deployments are largely undocumented in the public record today - which is itself the opening. A requester-side architecture that learned each tool's behavior in the field, surfaced its failure modes, and drafted appeals tuned to those failure modes is the asymmetric layer. No one is building it at production-grade depth.
There is a bilingual layer that none of the existing platforms touches: the sixty million Spanish-primary residents of the United States who have the same legal right to public records and zero end-to-end Spanish-language workflow available to them.
And there is a knowledge-base architecture that none of the existing platforms has attempted: a demand-weighted, user-contributed map of how agencies actually behave, updated by every request, every appeal, every release, every denial, that compounds in value as more requesters use it. Each user makes the next user's request more effective. No competitor has attempted this, and the longer it stays unbuilt, the more compounded the advantage to whoever ships it first.
The framing shift that the next decade of public records depends on is small in words and very large in consequence. Artificial intelligence does not merely help the Freedom of Information Act work better. It redistributes power inside the Freedom of Information Act. The agency side has been working that redistribution to its own advantage for two years, in academic workshops, in federal procurement pilots, and inside the official body that recommends reform to the National Archivist. The requester side has been waiting for permission to do the same work. That permission was never coming. The work starts now.
FOIA Warfare
Tips · intel · counter-arguments · dialogue joel@foiawarfare.com
Project GitHub README github.com/joelsartain/joelsartain
26 Sources Cited ▾
- Jason R. Baron and Devanshu Kejriwal, "An AI-Orchestrated Architecture for Responding to FOIA Requests," Proceedings of the AI & Open Government Workshop (AIOG 2026), ICAIL 2026, Singapore, 8 June 2026. openreview.net/pdf?id=HFogzYStIN
- Maik Larooij and David Graus, "To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification," AIOG 2026. openreview.net/forum?id=dRuSH2zX2n
- Romana Afroze, "Formalizing FOIA Exemption Standards as Computable Legal Norms: A Framework for LLM-Assisted Sensitivity Review," AIOG 2026. openreview.net/forum?id=lEz6D2PUop
- Ravi Dnyaneshwar Mahajan, "ArchiveRAG: A Multimodal Retrieval-Augmented Generation Architecture for Legacy Government Document Disclosure," AIOG 2026. openreview.net/forum?id=OetKz8x7ni
- Jack McKechnie, "Perspectives on Cascading Pipelines for Sensitivity-Aware Search," AIOG 2026. openreview.net/forum?id=Sx1KbmK542
- Ronald L. Capaldi, "FOIA and the Use of AI in Government: Freedom of Information or an Empty Promise of Openness?" 45 J. Nat'l Ass'n Admin. L. Judiciary 25 (2025). digitalcommons.pepperdine.edu
- U.S. Department of Justice, Office of Information Policy, "Chief FOIA Officers Council Meeting Showcases Use of Advanced Technologies in FOIA," DOJ OIP Blog. justice.gov/oip/blog
- National Archives and Records Administration, Office of Government Information Services, 2024 Records Management Self-Assessment Report (September 2025). archives.gov/ogis/2024-rmsa
- FOIA Advisory Committee 2024-2026 term roster. archives.gov/ogis/foia-advisory-committee/2024-2026-term
- FOIA Advisory Committee, "FOIA Advisory Committee Votes to Approve Two Recommendations," 8 May 2026. foia.blogs.archives.gov
- MuckRock, "How federal agencies responded to our requests about AI use in FOIA," 7 May 2025. muckrock.com/news/2025/may/07
- MuckRock GitHub organization, foia-coach2 repository (Agent Moss prototype). github.com/MuckRock
- The FOIA Project (TRAC), foiaproject.org
- Reporters Committee for Freedom of the Press; iFOIA sunset notice. rcfp.org/were-seeking-input-updates-ifoia
- Reporters Committee for Freedom of the Press v. FBI, 3 F.4th 350 (D.C. Cir. 2021), Millett, J. law.justia.com/cadc/20-5091
- FOIA Improvement Act of 2016, Public Law 114-185, codified at 5 U.S.C. § 552(a)(8)(A). congress.gov/PLAW-114publ185
- Office of the Historian / SRP / Center for Analytics, U.S. Department of State, "Improving Declassification: Applying Machine Learning to Diplomatic Cable Review," American Historical Association Perspectives on History (October 2023). historians.org / perspectives / oct 2023 · corroborated at Federal News Network, "State Department declassifies diplomatic cables using AI assistant" (October 2023). federalnewsnetwork.com
- U.S. Department of Health & Human Services, 2026 Chief FOIA Officer Report, Section IV: Steps Taken to Greater Utilize Technology. Discloses CMS AI/LLM redaction pilot built consistent with DOJ guidance, and HHS Office of the Secretary AI/LLM-based automated review and redaction pilot. hhs.gov/foia/reports/chief-foia-officer-reports/2026-section-4
- ProPublica coverage of Heritage Foundation FOIA campaign. propublica.org
- Gizmodo coverage of Heritage AI-generated FOIA flood and agency response. gizmodo.com
- FOIA Fluent, foiafluent.com
- EZFOIA, ezfoia.com
- FOIAflow, foaiflow.vercel.app · FOIA Advisor coverage 7 April 2026. foiaadvisor.com/foia-blog/2026/4/7
- FOIA Friend, foia-friend.com
- American Immigration Lawyers Association, FOIA Toolbox (May 2022). aila.org/shop/ailas-foia-toolbox
- Open FOIA Project, AI-native FOIA request platform built by James Scott. openfoia.ai