Date
Monday, May 25, 2026
Sources monitored
26
Back to The Wire

The Architects Are Working One Side of the Table

Today in the wire

AI is redistributing power inside the Freedom of Information Act. The agency side has spent two years designing for itself. The requester side has not been designed at all.

The Wire / Analysis

The Architects Are Working One Side of the Table

AI is redistributing power inside the Freedom of Information Act. The agency side has spent two years designing for itself. The requester side has not been designed at all.

Twelve papers at one academic workshop. Seven federal agencies already deploying or piloting AI in FOIA processing. One architecture proposal sitting inside the official body that recommends FOIA reform to the Archivist of the United States. The agency side of the Freedom of Information Act is being designed in public. The requester side is being designed by no one.

This is an essay about that asymmetry, and about what the next decade of public records depends on someone building.


PART I  /  The Agency-Side Architecture, As Designed

The AIOG 2026 workshop at ICAIL in Singapore, June 8, will publish the most concentrated single drop of FOIA-AI architectural thinking in the history of the field. The headline paper is Jason R. Baron and Devanshu Kejriwal's "An AI-Orchestrated Architecture for Responding to FOIA Requests" [1], which proposes a five-stage agency pipeline: an intake chatbot, a search and preservation layer, a sensitivity-review classifier for exempt material, a generative determination-letter author, and an audit substrate. It is the most fully specified architecture for AI-mediated FOIA processing in print, and the authors acknowledge in their related-work section that they could not find prior work on multi-stage AI orchestration in this space.

It is not alone. Larooij and Graus at the University of Amsterdam show that a Qwen3.5-9B model running air-gapped on a single consumer GPU, prompted with chain-of-thought and few-shot error examples, outperforms BERT and prior classifiers on Exemption 5 deliberative-process classification [2]. Romana Afroze at Delaware Law School proposes a deontic-logic framework that beats fine-tuned BERT by 14.4 points of precision on 847 D.C. Circuit Vaughn-index entries, arguing that current agency-side ML classifiers "are operating outside the legal framework they are supposed to instantiate" [3]. Ravi Mahajan at SUNY Buffalo lifts retrieval recall on a 512-document NARA-and-MuckRock benchmark from 51 percent to 86 percent with a multimodal RAG pipeline aimed at the 78 percent of NARA's holdings that exist only as scans [4]. Jack McKechnie at Glasgow argues that sensitivity-aware search must be implemented as a cascading pipeline where every retrieval stage, not just the final classifier, is sensitivity-aware [5].

These are not careless papers. They engage live engineering problems with real benchmarks and real deployment concerns. They will be read by every serious FOIA-AI researcher for the next several years.

Every one of them assumes the agency is the operator.

In Baron and Kejriwal's architecture, the requester appears as a dialogue partner for Stage 1's chatbot and the eventual recipient of Stage 4's determination letter. They never appear as an actor with their own tooling, intent, or strategy. The clarification loop assumes the requester arrives in good faith looking for help disambiguating their request. In 2026, that is no longer a defensible baseline. Capable requesters are drafting with large language models, mapping custodians from Office of Inspector General reports, and reverse-engineering exemption patterns from agency reading rooms and prior releases aggregated on the FOIA Project's lawsuit database [10]. A chatbot whose job is to resolve ambiguities is going to meet ambiguities that are deliberate tradecraft, not error. The architecture does not account for a move it is going to see immediately.

Fig 1  /  The Agency Model

In Baron and Kejriwal's architecture, the requester appears at two interface points.

Five internal stages, one audit substrate, one oversight pipe. Everything between the two interface points is internal to the agency. The architecture is the system. The requester is a peripheral.

INPUT Requester
FAIL 1 STAGE 01
Intake Chatbot
Clarification loop
STAGE 02
Search + Preserve
RAG over holdings
FAIL 2 STAGE 03
Sensitivity Classifier
Legal-BERT
FAIL 3 STAGE 04
Determination Letter
RAG narrative
OUTPUT Letter only
STAGE 05
Append-Only Audit Log
Aggregates every classification decision and chat turn across S1 - S4
→ OVERSIGHT
NOT THE REQUESTER
// Three architectural failures the essay names
  • FAIL 1The clarification loop assumes good-faith ambiguity. Capable requesters bring deliberate tradecraft.
  • FAIL 2The classifier decides what to withhold without consulting the case law the letter then cites. The seam between decision and justification closes.
  • FAIL 3The authors concede AI cannot reliably evaluate the foreseeable-harm standard - the post-2016 legal pressure point.

Two interface points. Five internal stages. The requester is a packet that goes in and a packet that comes out.

Fig 1  //  Baron 5-stage pipeline as published, with the requester rendered as a peripheral. Source: Baron & Kejriwal, AIOG 2026.

The Stage 3 to Stage 4 chain creates a second problem. A learned classifier, in Baron and Kejriwal's specification a fine-tuned Legal-BERT model, produces the withholding decision. The Stage 4 generative model then writes the legal narrative that explains it, retrieving FOIA case law and prior agency decisions through a RAG pipeline. Today's clumsy determination letters leak. They show their seams. A RAG-narrated letter citing on-point case law will read more defensible than the underlying decision actually is, because the classifier did not consult the case law the letter cites. Requesters lose the diagnostic value that today's awkward letters quietly provide: the visible seam between the redaction call and its legal scaffolding. The append-only audit log in Baron and Kejriwal's Figure 3 gives oversight a record. It does not give the requester insight. Those are not the same accountability surface.

The third failure is the one the authors themselves flag. Section 4 item 5 of the paper concedes that AI cannot reliably evaluate the foreseeable-harm standard that Congress codified in the FOIA Improvement Act of 2016 [14]. That standard is not a small concession. The D.C. Circuit in Reporters Committee for Freedom of the Press v. FBI required "a focused and concrete demonstration" of why disclosing "the particular type of material at issue" would "actually impede" agency deliberations, "given the specific context of the agency action at issue," and held that "a perfunctory statement that disclosure of all the withheld information would jeopardize the free exchange of information will not suffice" [12]. Exemption 5 with the foreseeable-harm gate is where the post-2016 litigation lives. An architecture that automates everything except the legal pressure point has not been tested against the work that actually drives appeals.

Only Afroze, among the entire AIOG cohort, formalizes the foreseeable-harm predicate at all, and even Afroze places it inside the agency's own reasoning, not as a tool the requester can wield [3]. Larooij and Graus, with the candor that good engineers bring to public papers, write that "recall is more important than precision, as the impact of forgetting to redact sensitive information can be much larger than suggesting to redact too much" [2]. That loss function is honest. It is also a structural decision to privilege withholding. Once productionized inside an agency, it does not stop at the redaction step. It is the agency's posture, encoded.

// Operator Note

Every one of these failures is the same failure. The architecture was designed without an adversary in the room.


PART II  /  The Architecture Is Also Underbuilt On Its Own Side

This is not an academic exercise.

Fig 2  /  The Deployment Is Already Live
Federal FOIA + Artificial Intelligence  ·  FY2025
18.6%
One in five federal agencies now processes FOIA requests using artificial intelligence or machine learning.
Federal AI-in-FOIA roster · production, pilot, explored, adjacent
EXPLORED CPSC
MITRE FOIA Assistant
Ex. 4 / 5 / 6 flagging · reported pilot
PRODUCTION Commerce
Relativity eDiscovery
+ Microsoft Purview
PRODUCTION Commerce
FOIAXpress AI Assist
+ Veritas Clearwell
ADJACENT State Dept.
MLDP
Declassification (not FOIA) · 99.29% accuracy, 1997 cables
PILOT CMS
AI/LLM Redaction Pilot
Built consistent with DOJ guidance · HHS CFO Report 2026
PILOT HHS / OS
LLM Responsiveness Pilot
HHS CFO Report 2026
PRODUCTION DOJ INTERPOL / OPM / EEOC
ArkCase
FedRAMP-cleared, AI-assisted
INVENTORY Federal-wide
OMB AI Use Case Inventory
3,611 use cases · 56 agencies
50 / 269
Agencies disclosed AI use
OGIS 2024 RMSA
3,611
Federal AI use cases
OMB Inventory 2025
9+
Named agency-side tools
Across status buckets
// The asymmetry, quantified
Agency-side AI tools named across the federal FOIA workflow
9+
Requester-side AI tools deployed at any federal agency
0
Fig 2  //  Federal agency-side AI in FOIA is already live - at varying maturity. Status taxonomy: PRODUCTION = shipped and processing real FOIA workflow per vendor or agency disclosure; PILOT = formally piloted with FOIA analysts per agency Chief FOIA Officer Report; EXPLORED = agency reported assessing or partial-piloting per third-party reporting, not currently in full production; ADJACENT = federal records-AI deployment in a non-FOIA records workflow (e.g., declassification review); INVENTORY = federal-wide reporting baseline. Sources: OGIS 2024 RMSA (18.6% / 50 of 269); HHS Chief FOIA Officer Report 2026; MuckRock 7 May 2025 agency disclosure essay; AHA Perspectives Oct 2023 (State Dept MLDP); ArkCase / Armedia federal deployments; OMB Federal AI Use Case Inventory 2025.

Per the National Archives Office of Government Information Services 2024 Records Management Self-Assessment, 18.6 percent of federal agencies, 50 of 269 respondents, already use artificial intelligence or machine learning in FOIA processing [8]. The Office of Management and Budget's federal AI use case inventory lists 3,611 use cases across 56 agencies in 2025.

The federal AI-in-FOIA roster is varied in maturity. In current production: Relativity eDiscovery, Microsoft Purview, FOIAXpress AI Assist, and Veritas Clearwell at the Department of Commerce [9]; and ArkCase at DOJ INTERPOL, the Office of Personnel Management, and the Equal Employment Opportunity Commission. In formal pilot: an AI/LLM-based redaction pilot at the Centers for Medicare and Medicaid Services (built consistent with Department of Justice guidance per HHS reporting), and a separate AI/LLM-based responsiveness-and-redaction pilot inside the Department of Health and Human Services Office of the Secretary [16]. Explored or partially piloted: MITRE's FOIA Assistant at the Consumer Products Safety Commission, which flags Exemption 4 monetary figures, Exemption 5 deliberative language, and Exemption 6 personally identifiable information per MuckRock's agency-disclosure investigation.

Adjacent records-AI deployment is even further along. The State Department's Machine Learning Declassification Pilot, operational since October 2022, made correct declassification decisions on 1997 diplomatic cables 99.29 percent of the time and reduced State Department labor on cable declassification by over 60 percent [15]. Declassification is not FOIA processing, but it is the federal records-AI maturity frontier. None of this is speculative. Production, pilot, and adjacent are all happening at once.

The DOJ Office of Information Policy, the official voice for federal FOIA policy, calls AI "an emerging and promising area" while emphasizing "the importance of ensuring that there is sufficient human monitoring and that appropriate safeguards are established so that the government is operating consistent with our obligations under FOIA" [7]. OGIS, whose mandate is FOIA oversight, observes that AI and machine learning "have the potential to aid in FOIA processing" and qualifies in the next sentence that they "are not a substitute for the judgment of FOIA professionals on application of exemptions and foreseeable harm" [8]. The critical academic voice in the field, Ronald Capaldi's 2025 article in the Journal of the National Association of Administrative Law Judiciary, argues that without algorithmic audits and mandatory disclosure of deployment, agency AI converts FOIA into "an empty promise of openness" [6]. He is right about the risk. He stops one move short.

So the agency side is being deployed, the official posture says human FOIA professionals must remain in the loop on the legal layer, the academic critic warns that agency AI without audits hollows the law, and in the official materials surveyed here - DOJ OIP guidance, OGIS reporting, FOIA Advisory Committee output, and the academic-policy literature - no source addresses the question of whether the requester side gets to operate AI at all. The redistribution-of-power frame appears nowhere in the official record.

Even granting the agency-side architecture its own premises, the system being sketched is not deployable at the scale the next four years will demand. Five structural omissions sit underneath the design.

First, cost. An architecture that demands GPU-accelerated inference, managed vector databases at scale, and continuous Stage 5 oversight is not a budget-neutral migration from current human-only processing. Federal AI procurement at FedRAMP-grade authorization runs in the millions to tens of millions of dollars per agency per year, with vendor lock-in to a small number of cleared LLM and infrastructure providers. Most agencies cannot afford it. The architecture as proposed quietly assumes a Treasury, Health and Human Services, or Department of Homeland Security budget.

Second, workforce. The system requires a supervisor breed that does not exist in the federal FOIA workforce today: senior FOIA professionals who can evaluate model outputs, adversarially test classifier decisions, and rule on edge cases at speed. The trilemma is hard. That person cannot be hired at General Schedule pay grades. They cannot be substituted by contractors without budgets no agency actually has. The roughly five to seven thousand federal FOIA staff across all agencies are a niche workforce already, and the AI-FOIA-supervisor specialization is narrower still.

Third, security. A Stage 3 BERT classifier trained on classified or controlled material is itself a data-handling problem. Vector embeddings of FOIA-processed documents in managed databases can be inverted, at least partially, into their source text using techniques that are well-developed in the machine learning security literature. The append-only audit log that Stage 5 maintains is a high-value target precisely because it captures input and output snapshots of the very sensitive material the system was processing. The Stage 1 chatbot is an attack surface: prompt injection in request text, jailbreaks to extract the classification rubric, side-channel inference of exemption thresholds. None of this is addressed in the published architectures.

Fourth, what might be called meta-FOIA recursion. The pipeline generates records. The append-only audit log is a federal record. The classification decisions are federal records. The chatbot transcripts are federal records. The model versions and training data are discoverable in litigation. All of them are subject to NARA retention schedules. All of them are subject to future FOIA requests. The architecture creates its own FOIA exposure. The more comprehensive the audit, the larger the exposure surface. There is no FOIA-specific NARA schedule addressing AI intermediate state in the published guidance, and no FOIA Advisory Committee recommendation addressing what happens when a requester FOIAs the audit log of the AI that processed their original request.

Fifth, demand. The 1.5 million federal FOIA requests of fiscal year 2024 are the last figure the Baron and Kejriwal paper cites [1]. The system is being designed for that volume. Per ProPublica and Gizmodo reporting on the Heritage Foundation's Oversight Project, the campaign filed on the order of one hundred thousand AI-generated FOIA requests in roughly two months, generating intake rates that some impacted FOIA offices reported as one per second and consuming a documented share of staff days at the targeted agencies [18][18a]. Some agencies subsequently lost FOIA personnel to reductions in force in mid-2025, partly in connection with the strain. That episode is the canary. A single requester-side actor with AI forced visible structural change at federal FOIA offices in months, not years. The next actor with AI will face the question of what they should be building for, not whether the demand will arrive. The architecture being designed at AIOG is being designed for the demand environment of 2025, not the demand environment of 2029.

There is one further fact that is not technical, but is the hardest fact in this essay.

Fig 3  /  Institutional Position

The architecture, the seat, and the recommendation are held by one person.

The author of the most fully specified agency-side FOIA-AI architecture in print also co-chairs an Implementation Subcommittee whose recommendations reach the Archivist of the United States. The FOIA Advisory Committee - established by NARA, 17 members across 7 government seats and 10 non-government seats - has no comparable requester-side architectural voice.

JRB
Position Brief
Jason R. Baron
Professor of the Practice  ·  University of Maryland College of Information
01
Lead Author, AIOG 2026 Architecture Paper
"An AI-Orchestrated Architecture for Responding to FOIA Requests"  ·  ICAIL 2026, Singapore
Academic
02
Co-Chair, Implementation Subcommittee
FOIA Advisory Committee  ·  NARA  ·  2024 - 2026 term  ·  third-term member
Institutional
03
Subcommittee recommendations to the Archivist
FAC formal recommendations are non-binding advisory output. The Archivist may adopt, modify, or decline. Historically a meaningful input to federal FOIA reform and OGIS guidance.
Policy
FOIA Advisory Committee (2024-2026 term, 17 members) - peer-reviewed FOIA-AI architecture publications, audited:
Agency-side architecture
1
Baron, AIOG 2026 peer-reviewed paper proposing the five-stage agency pipeline. Baron holds a non-government FAC seat (Univ. of Maryland) but his published architecture serves the agency side.
Requester-side architecture
0
Audited across all 17 FAC members (7 government seats: DoD, DOJ, DHS, HHS, NARA, Education, PBGC; 10 non-government seats: academics, advocacy, journalism, commercial requesters). Near-misses: Mulvey (AFPF essays), Wittenberg (Armedia vendor content), Kwoka (congressional testimony advocating AI but not proposing architecture). None is peer-reviewed and architecture-specific.
// Structural oddity The 7 government members are implementing agency-side AI in their day jobs (HHS CFO Report 2026 pilots, NARA OGIS guidance, DOJ OIP technology direction) - none publishes peer-reviewed architecture. The 10 non-government members include Baron, the only FAC member publishing peer-reviewed FOIA-AI architecture - and he publishes for the agency side, from a non-government seat.
Fig 3  //  Position brief. Sources: AIOG 2026 OpenReview; NARA OGIS FOIA Advisory Committee 2024-2026 roster; jasonrbaron.com. The "peer-reviewed + architecture-specific" claim was audited across all 16 other FAC members via Google Scholar, SSRN, and institutional faculty pages. Near-misses identified: Ryan Mulvey (AFPF) has published policy essays on AI-in-FOIA but no architecture proposal; Nicholas Wittenberg has authored Armedia vendor blog posts but no peer-reviewed work; Margaret Kwoka has testified to Congress advocating AI adoption but proposes no architecture of her own. Other federal FOIA-AI voices outside the FAC include Alina Semo (NARA OGIS Director), DOJ OIP leadership, and the broader AIOG 2026 cohort (Larooij, Graus, Afroze, Mahajan, McKechnie).

Jason Baron is the lead author of the AIOG 2026 architecture paper. He also co-chairs the Implementation Subcommittee of the FOIA Advisory Committee - NARA's standing advisory body whose recommendations shape federal FOIA reform - in the committee's current 2024-2026 term [8b][8c]. The committee has 17 members across two structural bins: 7 government seats (representing the agencies that process FOIA requests) and 10 non-government seats (representing requesters, academics, journalists, and the commercial requester community). Audited across all 17: none of the 7 government members has published a peer-reviewed FOIA-AI architecture proposal. None of the 9 other non-government members has either. Baron is the only one, and his architecture serves the agency side from a non-government seat. The closest near-misses are Ryan Mulvey's AFPF policy essays and Nicholas Wittenberg's Armedia vendor commentary; neither is architecture-specific or peer-reviewed.

// Operator Note

Baron's paper is not an academic exercise. The architecture being sketched at AIOG is being authored by an institutional voice on the body that recommends FOIA reform to the National Archivist. There is no equivalent requester-side architectural voice in that room.


PART III  /  The Other Half of the Problem Belongs to the Requester

The honest current state of the requester side is not zero. It is also not the architectural counterweight the situation demands.

MuckRock has filed 176,343 requests across 28,198 agencies, fulfilled 58,809, and released 11.9 million pages [9]. Its production platform has no AI. Its only AI build is a research prototype called Agent Moss, a retrieval system over public records retention schedules, second-stage, not productionized [9b]. The FOIA Project's lawsuit database covers 18,742 FOIA cases through March 2026, useful for litigation research and unavailable for request generation [10]. The Reporters Committee for Freedom of the Press maintains the Open Government Guide and a legal hotline; their iFOIA tool was sunset and the URL now redirects to their main site [11].

Several AI-native entrants have arrived in 2025 and 2026. FOIA Fluent, built at the NYC Data Science for Social Good civic project, is the most architecturally complete to date: corpus-search-grounded request drafting against statute and prior outcomes, a 1,600-agency federal transparency hub with success-rate/response-speed/fee/portal-availability rankings across 54 jurisdictions, per-agency exemption-pattern deep-dives, one-click appeal generation, and a daily-feed pattern engine that watches court opinions, agency enforcement actions, recalls, IG reports, and regulatory dockets over 60-day windows [19]. Open FOIA Project, built by James Scott, ships an exemption anticipator that predicts which of the nine federal exemptions an agency will invoke and offers pre-written challenge strategies, a case-law library including controlling FOIA precedent, an appeal letter generator citing case law, and 100+ agency profiles with denial rates and common exemptions across all fifty states [25]. EZFOIA offers AI document analysis on already-received releases at $75 per single request and up to $500 per month for unlimited filing [20]. FOIAflow, launched in April 2026 by a high-school student and his brother and covered respectfully by FOIA Advisor, claims autonomous filing, follow-up, appeals, and document analysis from a topic input [21]. FOIA Buddy operates a sub-fifty-cent-per-request service with a documented appellate history. Narrower legacy tools include FOIA Friend's request drafting workflow [22] and FOIA Machine's open-source IRE-lineage letter generator. AILA's FOIA Toolbox is a 735-page PDF that costs $181 and is not software [24].

These are real builds. FOIA Fluent and Open FOIA Project in particular are doing serious engineering work, between them covering partial pieces of the first four asymmetric-layer elements at marketing depth. What has not been built, by anyone at production-grade depth, is the full integrated stack - and specifically, the two elements that most precisely counter agency-side AI: per-analyst correspondence intelligence, and counter-AI vendor fingerprinting. Those remain uncontested territory.

Fig 4  /  Requester-Side Capability Map

Zero platforms cover the FULL asymmetric layer.

Five capabilities specifically counter agency-side AI deployment. The current market: one platform ships one element at production-grade depth (FOIA Fluent's agency hub). Partial coverage on three more elements is scattered across the field. Two elements have no coverage at all - exemption pattern mining and counter-AI fingerprinting.

  AgencyIntel IntakeTradecraft ExemptionPattern Mining HarmRebuttal CounterAI Coverage
1 full + 3 partial
4 / 5 partial
1 / 5 partial
2 / 5 partial
1 / 5 partial
0 / 5
0 / 5
Column coverage 1 full / 3 partial 2 partial 2 partial 3 partial 0 1 full total / 0 platforms cover all 5
Full
Partial
None
// HOVER + CLICK Each vendor row opens a capability brief with verified features, claimed-but-unverified features, and gaps. Press Esc or click outside to close.
Fig 4  //  Capability assessment per vendor public documentation, vendor websites, and FOIA Advisor coverage. Partial = vendor claims the capability but production depth is unverified or shallow. Full = capability shipped at production-grade depth (FOIA Fluent agency hub: 1,600 agencies + per-agency exemption patterns + 54 jurisdictions). Asymmetric layer = the five-element set that specifically counters agency-side AI. Sources: vendor sites; FOIA Advisor 7 April 2026 (FOIAflow); FOIA Fluent NYC DS4SG civic project documentation. Click any vendor row for the full capability brief.
Vendor Brief  /  FOIA Fluent
Civic AI from NYC Data Science for Social Good

The most architecturally complete requester-side platform to date. Civic-tech provenance, no commercial pressure. The only platform with documented full coverage on any single element of the asymmetric layer.

Capability inventory
  • Verified
    1,600-agency federal hub with success rate / response speed / fee rate / portal availability rankings across 54 jurisdictions
  • Verified
    Per-agency deep-dives with exemption-pattern surfacing
  • Verified
    Corpus-grounded drafting against statute + eCFR + past outcomes (model constrained against fabricated citations)
  • Verified
    One-click appeal letters generated from the request timeline
  • Verified
    Statutory deadline monitoring
  • Verified
    Daily-feed pattern engine: court opinions, agency enforcement, recalls, IG reports, regulatory dockets, 60-day windows
  • Missing
    Per-analyst correspondence intelligence
  • Missing
    Counter-AI vendor fingerprinting
  • Missing
    Adversarial intake (interviewing requester before drafting)
// Matrix position
1 full + 3 partial. The strongest existing competitor. Their depth is the agency hub. The counter-AI layer and per-analyst layer are not present anywhere on the platform.
Vendor Brief  /  Open FOIA Project
Free AI-Native Platform by James Scott

Personally funded by founder James Scott. The largest single direct collision with the asymmetric-layer thesis at marketing depth. Claims partial coverage on four of five columns. Verification of actual shipping depth vs. marketing copy is the open question.

Capability inventory
  • Claimed
    AI request drafting with 5 U.S.C. § 552 citations
  • Claimed
    Exemption Anticipator: predicts which of the 9 federal exemptions agencies might invoke + pre-written challenge strategies
  • Claimed
    Case Law Library (named cases include NLRB v. Robbins Tire, DOJ v. Reporters Committee)
  • Claimed
    Appeal Letter Generator citing case law
  • Claimed
    100+ agency profiles with denial rates and common exemptions, all 50 states
  • Claimed
    Deadline tracker, multi-agency filing, fee-waiver analyzer, 1,000+ templates
  • Missing
    Counter-AI vendor fingerprinting
  • Missing
    Per-analyst correspondence intelligence
  • Missing
    Production-depth verification on most modules
// Matrix position
4 of 5 partial. If marketing matches shipping, the most direct competitor in the field. Counter-AI remains uncontested. Per-analyst intelligence is also unaddressed.
Vendor Brief  /  MuckRock
176K Requests, No Consumer AI Shipped

The largest requester-side platform by volume. 176,343 requests filed across 28,198 agencies, 58,809 fulfilled, 11.9M pages released. Zero AI features shipped in the consumer product. Internal Agent Moss / foia-coach RAG project exists in MuckRock's GitHub organization but is not productionized.

Capability inventory
  • Verified
    28,198-agency directory (Name + Jurisdiction visible; per-request statistics surface per request, not as structured per-office intelligence)
  • Verified
    Per-request response statistics aggregated across platform history
  • Verified
    FOIA filing workflow, document hosting, news/community layer
  • Missing
    AI request drafting
  • Missing
    Exemption prediction
  • Missing
    Harm rebuttal automation
  • Missing
    Counter-AI
  • Missing
    Per-analyst intelligence
// Matrix position
1 of 5 partial. Agency Intel partial (directory exists but shallow per office; aggregated data is not structured per FOIA office). The puzzle: the largest volume player on the requester side has the most data and the least AI.
Vendor Brief  /  FOIAflow
Topic-to-Filing Autonomous Agent

Built by 17-year-old Amanuel Asfaw with his brother. Launched 7 April 2026. Site is mostly imagery; capabilities are marketing claims, not verified production depth. Pricing surfaced externally: Solo $49/mo, Pro $99/mo. Respectfully covered by FOIA Advisor on launch.

Capability inventory
  • Verified
    Launched April 2026 with FOIA Advisor coverage
  • Claimed
    Topic-to-filing autonomous agent (drafts from topic input)
  • Claimed
    Autonomous follow-ups
  • Claimed
    Appeals generation
  • Claimed
    Document analysis
  • Missing
    Per-analyst intelligence
  • Missing
    Counter-AI
  • Missing
    Production-depth verification across all modules
// Matrix position
2 of 5 partial (intake claimed + appeals claimed, both unverified). Scrappy, young, fast-moving. Watch closely - could mature into a real competitor with the right funding and engineering hires.
Vendor Brief  /  FOIA Buddy
Sub-Dollar Commercial Filing Service

Commercial service. Sub-fifty-cent-per-request after a free first request. All 50 states. Real adversarial filer with a documented appellate history (Curry and FOIA Buddy v. Sw. Sch. Dist., Pennsylvania Office of Open Records). Traditional workflow, no AI features on landing page.

Capability inventory
  • Verified
    Per-request filing at sub-$0.50 / request
  • Verified
    All 50 states + DC coverage
  • Verified
    Agency contact data + state-law summaries
  • Verified
    Documented appellate history (Curry v. Sw. Sch. Dist., PA OOR)
  • Missing
    AI drafting (no AI features documented on landing page)
  • Missing
    Per-analyst intelligence
  • Missing
    Counter-AI
  • Missing
    Production-depth on exemption mining or harm rebuttal
// Matrix position
1 of 5 partial. Agency Intel partial (50-state coverage + state-law summaries). The volume-and-price play. Low cost, high volume, traditional filing workflow without AI augmentation.
Vendor Brief  /  EZFOIA
Post-Receipt Document Analysis

Document-side AI only. Search, summarization, insight extraction on releases the user has already received. Pricing: $75 per single request, $200 for a 5-pack, $500 for unlimited monthly. A different axis from the asymmetric layer entirely.

Capability inventory
  • Verified
    Post-receipt document search and AI analysis
  • Verified
    Summarization and insight extraction from released documents
  • Verified
    Pricing: $75 single, $200 / 5-pack, $500 unlimited monthly
  • Missing
    Agency directory or intelligence
  • Missing
    Request drafting (AI or otherwise)
  • Missing
    Appeals generation
  • Missing
    Exemption mining
  • Missing
    Counter-AI or per-analyst intelligence
// Matrix position
0 of 5. Document analysis is post-response, a different axis from the five-element asymmetric layer. Real capability on its own axis. Could become a useful complement to a full-stack platform but does not compete with the asymmetric layer.
Vendor Brief  /  Legacy Drafting Tools
FOIA Machine + FOIA Friend

FOIA Machine is the original free letter generator with Investigative Reporters and Editors lineage. FOIA Friend is a single-line landing page offering drafting + tracking. Both pre-date the AI-native generation. Historically important; new entrants are partly competing to fill the void these tools left.

Capability inventory
  • Verified
    FOIA Machine: open-source free letter generation, IRE lineage
  • Verified
    FOIA Friend: drafting + tracking workflow
  • Missing
    AI features of any kind (neither platform documents AI augmentation)
  • Missing
    Counter-AI
  • Missing
    Per-analyst intelligence
  • Missing
    Modern stack across the board
// Matrix position
0 of 5 across both. Historically important - omitting them would be a credibility hit if any FOIA reader audits the piece. New AI-native entrants exist partly because these legacy tools never modernized.

Five concrete elements of the requester-side architecture are uncontested territory.

Fig 5  /  What Can And Should Exist

The operator at the center. Five capabilities to reach parity with the agency side.

Each module is technically achievable today. The corpus, the legal scaffolding, and the agency intelligence already exist as public data. The build window closes as the agency-side AI stack matures toward 2028.

BUILD WINDOW 2026 → 2028 Before federal agency-side AI-FOIA reaches deployment maturity
// OPERATOR
The Requester, Equipped
Five-module asymmetric layer · Parity target
// HOVER + CLICK Each capability card opens a technical brief with data sources, stack, and outputs. Press Esc or click outside to close.
Fig 5  //  Operator architecture proposal. Source: FW analysis. Build-window estimate calibrated to federal AI-FOIA procurement timelines. Click any module for technical brief.
Module 01  /  Technical Brief
Agency Intelligence - Per-Office + Per-Analyst Knowledge Graph

A living per-FOIA-office knowledge graph maintained at portal depth, with per-analyst behavior signatures mined from years of agency correspondence. Updates as agencies move portals, change fee schedules, restructure sub-components, and rotate FOIA staff.

Data sources
  • FOIA.gov API (federal agency directory + annual performance data)
  • FOIA Project (TRAC) lawsuit database
  • Agency reading rooms and FOIA libraries (scheduled scraping)
  • Agency FOIA portals (live state, change-detection)
  • Historical request response logs (per-analyst correspondence)
  • GovInfo + eCFR for regulatory citations and Privacy Act eligibility
Tech stack
  • Postgres + JSONB per-agency knowledge graph
  • Next.js scrapers on scheduled jobs (Vercel Cron / GitHub Actions)
  • Claude API for unstructured agency-prose enrichment and analyst-pattern classification
  • Vector embeddings (pgvector) for similar-agency matching and routing
  • FOIA.gov API client with rate-limit aware backoff
Outputs
  • Per-agency routing card (correct office, submission method, current fee schedule)
  • Per-analyst behavior fingerprint ("Analyst Y closes 73% of requests early at agency X; routinely cites Exemption 5")
  • Fee waiver guidance per requester category
  • Real-time portal-change alerts ("Treasury moved FOIA off portal Dec 2025")
  • Sub-component routing guard for large agencies (DHS, DOJ, HHS)
// The asymmetric edge
Agencies update portals and processes constantly; intelligence must be living, not static. No existing platform maintains this at portal depth AND mines correspondence at the per-analyst level. Pre-empts Baron Stage 1 narrowing (see Fig 1).
Module 02  /  Technical Brief
Intake Counter-Tradecraft - Adversarial Drafting Layer

A conversational LLM agent that interviews the requester about real intent before drafting. Decides when ambiguity is strategically valuable. Files Privacy Act in parallel where applicable. Refuses reflexive scope-narrowing - the exact opposite move from Baron's S1 clarification loop.

Data sources
  • Requester interview transcripts (multi-turn structured)
  • FOIA Improvement Act 2016 statutory framework as context
  • Privacy Act eligibility decision tree
  • Per-agency drafting templates (loaded from Agency Intel knowledge graph)
  • Case law corpus for foundational citation
Tech stack
  • Multi-turn Claude API session with adversarial drafting system prompt
  • Statutory framework + agency-specific templates as injected context
  • Privacy Act parallel-filing trigger logic
  • Streaming UI for live drafting reveal
Outputs
  • Strategically drafted FOIA request with deliberate ambiguity preserved where it widens the search
  • Parallel Privacy Act request when requester is the subject of the records
  • Scope justification document (for agency challenge response)
  • Expedite-justification language where applicable
// The asymmetric edge
Baron's S1 chatbot narrows the requester to the agency's preferred scope before any record is pulled. This module does the opposite - widens the request to its maximum defensible scope before submission. Counters Baron Stage 1 clarification loop (see Fig 1).
Module 03  /  Technical Brief
Exemption Pattern Mining - Historical Withholding Intelligence

Mines historical agency releases at scale. Surfaces per-office withholding patterns, exemption-claim language, and the loss-on-appeal language that signals an agency's vulnerable arguments. Predicts which exemptions a given office will invoke for a given document type and topic.

Data sources
  • Historical agency releases (PDF + redaction patterns + reading-room exports)
  • FOIA Project lawsuit database (cases lost on appeal, with opinion text)
  • Determination letters from prior requester submissions
  • OGIS mediation reports for agency-specific dispute patterns
  • D.C. Circuit FOIA opinions for binding precedent
Tech stack
  • PDF/OCR pipeline for historical releases (Mistral OCR or equivalent)
  • Vector DB (pgvector) of release content for similarity search
  • Per-agency exemption-claim classifier fine-tuned on labeled corpus
  • Loss-on-appeal language detector (D.C. Circuit corpus)
  • Topic + document-type taxonomy for prediction lookup
Outputs
  • "Agency X invokes Exemption 5 on 80% of deliberative-process documents in topic Y"
  • "This determination-letter language pattern has lost on appeal in N D.C. Circuit cases"
  • Exemption-claim prediction for the requester's specific submission
  • Recommended request scoping to minimize anticipated exemptions
// The asymmetric edge
The intelligence is sitting in the agency's own released documents. Nobody mines it at scale and structures it per-office. The agency has data on you. This module gives the requester data on the agency. Counters Baron Stage 3 Legal-BERT classifier (see Fig 1).
Module 04  /  Technical Brief
Harm Rebuttal - Foreseeable-Harm Appeal Engine

Drafts appeals against the foreseeable-harm standard codified in the FOIA Improvement Act of 2016 (5 U.S.C. § 552(a)(8)(A)). Cites controlling D.C. Circuit precedents. Identifies the agency's specific failure to meet Reporters Committee v. FBI's "focused and concrete demonstration" standard. Requests in-camera review on contested redactions.

Data sources
  • D.C. Circuit FOIA case law (Reporters Cmte v. FBI 3 F.4th 350 + post-2016 lineage)
  • Full FOIA case-law corpus (TRAC FOIA Project lawsuit database, 18,742 cases)
  • The agency's specific determination letter (parsed for exemption claims)
  • The agency's prior loss-on-appeal language (from Exemption Pattern Mining module)
  • 5 U.S.C. § 552 statutory text + DOJ OIP guidance
Tech stack
  • Legal-text RAG over case law corpus (vector + BM25 hybrid)
  • Determination-letter parser (extracts exemption claims by paragraph)
  • Appeal template generator with citation injection
  • In-camera-review request boilerplate generator
  • Appeal-authority lookup (from Agency Intel) for proper submission
Outputs
  • Appeal letter with case-citation arguments per exemption invoked
  • In-camera review request for contested redactions
  • Foreseeable-harm gap analysis (where the agency's letter fails the standard)
  • Deadline-tracked submission package to the agency appeal authority
// The asymmetric edge
The case law is public. The legal pressure point (foreseeable harm) is codified in statute. The D.C. Circuit standard is published. The work has not been automated end-to-end on the requester side. Counters Baron Stage 4 RAG-narrated letter (see Fig 1).
Module 05  /  Technical Brief
Counter-AI - Vendor Tool Fingerprinting + Tuned Appeals

Identifies which agency-side AI tool processed a given FOIA response based on determination-letter fingerprints. Looks up that tool's documented failure modes. Drafts appeals tuned to the specific tool's blind spots.

Data sources
  • MITRE FOIA Assistant public documentation + CPSC pilot reports
  • FOIAXpress AI Assist vendor documentation
  • ArkCase / Armedia federal deployment disclosures
  • Microsoft Purview + Relativity public technical docs
  • NIST + academic literature on classifier failure modes
  • Vector DB of determination-letter linguistic patterns by tool
Tech stack
  • Determination-letter fingerprint classifier (multi-class, per-tool)
  • Per-tool failure-mode database (Postgres + structured findings)
  • Tuned appeal templates per tool deployment
  • Vendor-tool deployment tracker (which agency uses what)
Outputs
  • "This response was likely processed by MITRE FOIA Assistant - known recall problems on Exemption 4"
  • Appeal language calibrated to the specific tool's blind spots
  • Pattern-of-error documentation (for FOIA Advisory Committee comments or litigation)
  • Counter-tool deployment tracker as a public artifact (transparency by-product)
// The asymmetric edge
Agency-side AI deployment is increasingly transparent (vendor docs, MuckRock disclosures, HHS CFO reports, OGIS RMSA). The requester side has done nothing with this open intelligence. Counter-AI fingerprinting is uncontested territory. Surveils Baron Stage 3 + Stage 5 substrate (see Fig 1).

One. An agency intelligence layer with portal-depth precision and per-analyst granularity. Not just a directory of which agency holds what kind of record, but the working knowledge of how each FOIA office actually behaves: the correct submission method this month (the Treasury Department moved its requests off its portal in December 2025), the current fee schedule, the appeal authority's mailing address and deadline, the recent reading-room releases that establish what the office considers segregable, the per-component routing inside large agencies where the wrong DHS sub-component can cost a requester six months, and the per-analyst behavior signatures that emerge from years of agency correspondence: which analyst routinely closes early, which writes substantive determination letters, which loses on appeal, which makes Glomar responses on what topics. None of the existing platforms maintains this at portal depth across the fifty states, where FOIA-equivalent statutes (state public records laws) double the addressable market and triple the agency surface. None mines correspondence at the analyst level on any jurisdiction.

Two. Intake counter-tradecraft. An agent that interviews the requester about the actual records sought before composing the request, that decides when ambiguity is strategically valuable and should be preserved, that knows when a deliberately broad request forces a more honest search than a narrow one, and that knows when to file a Privacy Act request alongside the FOIA to widen the legal lever. The opposite of Baron and Kejriwal's clarification loop, which is designed to narrow the requester into the agency's preferred scope before any record gets pulled.

Three. Per-agency exemption-pattern reverse engineering. Agencies release documents under FOIA every day. Those releases reveal, over time, which exemptions a given office invokes most aggressively, which categories of material they have historically released and then later withheld, which redactions they have lost on appeal, and which language in determination letters consistently maps to which underlying decisions. None of the existing requester-side tools mines its own past releases this way at the per-agency level. The intelligence is already in the documents. It is just not being extracted.

Four. Appeal drafting and foreseeable-harm rebuttal generation. The Reporters Committee v. FBI standard requires the agency to provide a focused, concrete, particularized demonstration that disclosure would harm a protected interest. A requester-side AI that knew the case law and could draft an appeal that named the agency's failure to meet that standard, exemption by exemption, with cited cases and a request for in-camera review on contested redactions (a procedural ask for the judge to inspect the unredacted documents privately and test the withholding against the record), is not a hypothetical capability. The underlying legal corpus is public. None of the existing requester-side platforms produces this output today.

Five. Deadline orchestration plus counter-intelligence against agency AI redaction tools. The agency side is already deploying or piloting MITRE's FOIA Assistant, FOIAXpress AI Assist, ArkCase (at DOJ INTERPOL, OPM, EEOC), and various Relativity and Purview integrations at Commerce, plus the CMS and HHS Office of the Secretary AI redaction pilots [9][16]. The general machine-learning literature documents recall-vs-precision tradeoffs, classifier bias under class imbalance, and tool-specific output fingerprinting techniques. Tool-specific failure modes for the named federal AI-in-FOIA deployments are largely undocumented in the public record today - which is itself the opening. A requester-side architecture that learned each tool's behavior in the field, surfaced its failure modes, and drafted appeals tuned to those failure modes is the asymmetric layer. No one is building it at production-grade depth.

There is a bilingual layer that none of the existing platforms touches: the sixty million Spanish-primary residents of the United States who have the same legal right to public records and zero end-to-end Spanish-language workflow available to them.

And there is a knowledge-base architecture that none of the existing platforms has attempted: a demand-weighted, user-contributed map of how agencies actually behave, updated by every request, every appeal, every release, every denial, that compounds in value as more requesters use it. Each user makes the next user's request more effective. No competitor has attempted this, and the longer it stays unbuilt, the more compounded the advantage to whoever ships it first.

The framing shift that the next decade of public records depends on is small in words and very large in consequence. Artificial intelligence does not merely help the Freedom of Information Act work better. It redistributes power inside the Freedom of Information Act. The agency side has been working that redistribution to its own advantage for two years, in academic workshops, in federal procurement pilots, and inside the official body that recommends reform to the National Archivist. The requester side has been waiting for permission to do the same work. That permission was never coming. The work starts now.

FOIA Warfare

Tips · intel · counter-arguments · dialogue joel@foiawarfare.com

Project GitHub README github.com/joelsartain/joelsartain

26 Sources Cited
  1. Jason R. Baron and Devanshu Kejriwal, "An AI-Orchestrated Architecture for Responding to FOIA Requests," Proceedings of the AI & Open Government Workshop (AIOG 2026), ICAIL 2026, Singapore, 8 June 2026. openreview.net/pdf?id=HFogzYStIN
  2. Maik Larooij and David Graus, "To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification," AIOG 2026. openreview.net/forum?id=dRuSH2zX2n
  3. Romana Afroze, "Formalizing FOIA Exemption Standards as Computable Legal Norms: A Framework for LLM-Assisted Sensitivity Review," AIOG 2026. openreview.net/forum?id=lEz6D2PUop
  4. Ravi Dnyaneshwar Mahajan, "ArchiveRAG: A Multimodal Retrieval-Augmented Generation Architecture for Legacy Government Document Disclosure," AIOG 2026. openreview.net/forum?id=OetKz8x7ni
  5. Jack McKechnie, "Perspectives on Cascading Pipelines for Sensitivity-Aware Search," AIOG 2026. openreview.net/forum?id=Sx1KbmK542
  6. Ronald L. Capaldi, "FOIA and the Use of AI in Government: Freedom of Information or an Empty Promise of Openness?" 45 J. Nat'l Ass'n Admin. L. Judiciary 25 (2025). digitalcommons.pepperdine.edu
  7. U.S. Department of Justice, Office of Information Policy, "Chief FOIA Officers Council Meeting Showcases Use of Advanced Technologies in FOIA," DOJ OIP Blog. justice.gov/oip/blog
  8. National Archives and Records Administration, Office of Government Information Services, 2024 Records Management Self-Assessment Report (September 2025). archives.gov/ogis/2024-rmsa
  9. FOIA Advisory Committee 2024-2026 term roster. archives.gov/ogis/foia-advisory-committee/2024-2026-term
  10. FOIA Advisory Committee, "FOIA Advisory Committee Votes to Approve Two Recommendations," 8 May 2026. foia.blogs.archives.gov
  11. MuckRock, "How federal agencies responded to our requests about AI use in FOIA," 7 May 2025. muckrock.com/news/2025/may/07
  12. MuckRock GitHub organization, foia-coach2 repository (Agent Moss prototype). github.com/MuckRock
  13. The FOIA Project (TRAC), foiaproject.org
  14. Reporters Committee for Freedom of the Press; iFOIA sunset notice. rcfp.org/were-seeking-input-updates-ifoia
  15. Reporters Committee for Freedom of the Press v. FBI, 3 F.4th 350 (D.C. Cir. 2021), Millett, J. law.justia.com/cadc/20-5091
  16. FOIA Improvement Act of 2016, Public Law 114-185, codified at 5 U.S.C. § 552(a)(8)(A). congress.gov/PLAW-114publ185
  17. Office of the Historian / SRP / Center for Analytics, U.S. Department of State, "Improving Declassification: Applying Machine Learning to Diplomatic Cable Review," American Historical Association Perspectives on History (October 2023). historians.org / perspectives / oct 2023  ·  corroborated at Federal News Network, "State Department declassifies diplomatic cables using AI assistant" (October 2023). federalnewsnetwork.com
  18. U.S. Department of Health & Human Services, 2026 Chief FOIA Officer Report, Section IV: Steps Taken to Greater Utilize Technology. Discloses CMS AI/LLM redaction pilot built consistent with DOJ guidance, and HHS Office of the Secretary AI/LLM-based automated review and redaction pilot. hhs.gov/foia/reports/chief-foia-officer-reports/2026-section-4
  19. ProPublica coverage of Heritage Foundation FOIA campaign. propublica.org
  20. Gizmodo coverage of Heritage AI-generated FOIA flood and agency response. gizmodo.com
  21. FOIA Fluent, foiafluent.com
  22. EZFOIA, ezfoia.com
  23. FOIAflow, foaiflow.vercel.app  ·  FOIA Advisor coverage 7 April 2026. foiaadvisor.com/foia-blog/2026/4/7
  24. FOIA Friend, foia-friend.com
  25. American Immigration Lawyers Association, FOIA Toolbox (May 2022). aila.org/shop/ailas-foia-toolbox
  26. Open FOIA Project, AI-native FOIA request platform built by James Scott. openfoia.ai

Get the digest

One daily FOIA intelligence brief when the public digest opens.

Next edition: tomorrow 07:00 ET
The Wire - free forever, no email required to read

Operator comments

Essay-route discussion is moderated. Anonymous comments queue for review; signed-in operators publish immediately.

// ShareXLinkedInEmail

Get the digest

One daily FOIA intelligence brief when the public digest opens.