EcoComply — EPA enforcement intelligence

EPA’s ECHO system tracks facility compliance under four separate statutes: Clean Air, Clean Water, hazardous waste (RCRA), and Safe Drinking Water. Its lookups work one facility at a time; it never rolls facilities up to the parent company or ranks operators by their combined, cross-statute enforcement risk. EcoComply does. It resolves the operating company behind every facility, attaches all four statute compliance feeds to each facility, and rolls everything up to a company-level view with severity, multi-state footprint, significant-non-compliance flags, and a deterministic 0–1 risk score. The result surfaces which operators are in significant non-compliance across multiple statutes and multiple states, a signal that flat, single-facility ECHO lookups can’t show.

Preview-only page. It shows the structure and capability of this dataset using a limited, representative sample. The sellable columns (the company-level risk score, the cross-statute coverage breadth, and the per-facility quality score) are withheld. The full dataset is not downloadable here; it is provided separately after licensing.

Coverage & headline figures

Built from one free, public EPA ECHO service — four statute compliance feeds unified per facility and cross-linked up to the parent company. Deterministic and rule-based — no LLM in the pipeline.

  • 2,160 major ECHO facilities, across 18 states
  • 1,850 distinct resolved companies (the cross-statute hub)
  • 4 EPA statutes unified: CAA · CWA · RCRA · SDWA
  • 1,353 multi-statute facilities (63% regulated under 2+ statutes)
  • 84 multi-state companies (operating across 2+ states)
  • 187 Significant Non-Compliers (SNC facilities on record)

Sector mix: Manufacturing 960 · Transportation/Utilities 782 · Mining 81 · Wholesale Trade 61. The average facility quality score is 0.927 (0–1 completeness), and 100% of facilities score above the 0.5 threshold. Flagship multi-state operators include BASF Corp (9 states, 13 facilities, all 4 statutes), Chemours, Ardagh Glass, Marathon, Alcoa, and Celanese.

Source (free, keyless, public-domain)

  • EPA ECHO unified facility service (echo_rest_services.get_facilities) → major facility compliance records, accessed via the two-step QID query flow and cached on disk.
  • Clean Air Act (CAAComplianceStatus) and Clean Water Act (CWAComplianceStatus) → air and water compliance status per facility.
  • RCRA (RCRAComplianceStatus, hazardous waste) and Safe Drinking Water Act (SDWAComplianceStatus) → waste and drinking-water compliance status — the four feeds unified per facility.
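The two-step QID flow with an on-disk cache could look roughly like the sketch below. Only `echo_rest_services.get_facilities` and the QID flow come from the source list above; the base URL, the `get_qid` endpoint, the `p_st` and `p_maj` parameters, and the `QueryID` and `Facilities` response keys are assumptions to check against the ECHO web services documentation. The `fetch` argument is a hypothetical hook so the flow can be exercised without network access.

```python
import json
from pathlib import Path
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL for the ECHO REST services; verify before use.
BASE = "https://echodata.epa.gov/echo/"


def http_get_json(url):
    """Fetch a URL and decode the JSON body."""
    with urlopen(url, timeout=60) as resp:
        return json.load(resp)


def fetch_major_facilities(state, cache_dir, fetch=http_get_json, max_pages=100):
    """Return major-facility rows for one state, reusing a disk cache if present."""
    cache_file = Path(cache_dir) / f"facilities_{state}.json"
    if cache_file.exists():
        return json.loads(cache_file.read_text())

    # Step 1: submit the search; the response is assumed to carry a QueryID.
    query = urlencode({"output": "JSON", "p_st": state, "p_maj": "Y"})
    summary = fetch(f"{BASE}echo_rest_services.get_facilities?{query}")
    qid = summary["Results"]["QueryID"]

    # Step 2: page through the stored result set until a page comes back empty.
    rows = []
    for page in range(1, max_pages + 1):
        query = urlencode({"output": "JSON", "qid": qid, "pageno": page})
        batch = fetch(f"{BASE}echo_rest_services.get_qid?{query}")
        facilities = batch["Results"].get("Facilities", [])
        if not facilities:
            break
        rows.extend(facilities)

    # Cache the assembled rows rather than the QID, since a QID may expire.
    cache_file.parent.mkdir(parents=True, exist_ok=True)
    cache_file.write_text(json.dumps(rows))
    return rows
```

Caching the assembled rows per state, instead of individual responses, keeps a second run from depending on a query ID that may no longer be valid.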

What’s in it — schema

Two cross-linked tables. companies is the headline deliverable — the resolved, cross-statute operator; facilities is the row-level ECHO detail that rolls up into it.

Tables in the dataset
Table | Grain (one row =) | Notable columns | Rows
companies | one resolved operating company | facility_count, state_count, snc_facility_count, statute_count, risk_score | 1,850
facilities | one EPA ECHO facility | caa/cwa/rcra/sdwa_status, statute_count, is_snc, quality_score | 2,160
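As a rough illustration of how facility rows might roll up into company rows, the sketch below derives a canonical key from a reported name and then aggregates per key. The normalization rules, the suffix list, and the function names are hypothetical; the actual resolution logic is not described on this page. Only the output column names are taken from the schema.

```python
import re
from collections import defaultdict

# Hypothetical suffix list; a production resolver would need a much longer one.
LEGAL_SUFFIXES = {"inc", "corp", "corporation", "co", "company", "llc", "lp", "ltd"}


def normalize_company(name):
    """Reduce a reported name to a canonical key (illustrative rules only)."""
    # Drop a trailing site qualifier such as " - Plant A".
    head = re.split(r"\s+[—–-]\s+", name, maxsplit=1)[0]
    # Lowercase, replace punctuation with spaces, and tokenize.
    tokens = re.sub(r"[^a-z0-9 ]+", " ", head.lower()).split()
    # Strip trailing legal suffixes but never empty the name entirely.
    while len(tokens) > 1 and tokens[-1] in LEGAL_SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def roll_up(facilities):
    """Aggregate facility rows into one row per normalized_company key."""
    groups = defaultdict(list)
    for row in facilities:
        groups[row["normalized_company"]].append(row)

    companies = []
    for key in sorted(groups):
        rows = groups[key]
        states = sorted({r["state"] for r in rows})
        statutes = sorted({s for r in rows for s in r["statutes"]})
        companies.append({
            "normalized_name": key,
            "facility_count": len(rows),
            "states": states,
            "state_count": len(states),
            "snc_facility_count": sum(r["is_snc"] for r in rows),
            "statutes": statutes,
            "statute_count": len(statutes),
        })
    return companies
```

Each facility row passed to `roll_up` is assumed to carry `normalized_company`, `state`, `statutes` (a list), and `is_snc`, mirroring the facilities table.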

Columns in facilities (row-level detail)

registry_id | EPA FRS Registry ID (unique)
facility_name | Facility name as reported by ECHO
normalized_company | Canonical company key (cross-link key)
city / county / state | Facility location
sector | Coarse sector from leading SIC code
caa / cwa / rcra / sdwa_status | Per-statute compliance status
statutes / statute_count | Statutes the facility is regulated under
violation_statutes | Statutes currently in violation
max_severity | Highest per-statute severity (3 = Significant)
is_snc | 1 if Significant Non-Complier
is_multi_statute | 1 if regulated under 2+ statutes (gated)
quality_score | 0–1 completeness score (gated)
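Several of these columns are rule-based derivations, which the following sketch tries to make concrete. The SIC division ranges follow the standard SIC manual. Everything else is an assumption for illustration: the severity scale below 3, the keyword matching on status text, the treatment of a blank status as "not regulated under that statute", and the function names.

```python
# Standard SIC divisions keyed by the two-digit major group.
SIC_DIVISIONS = [
    (1, 9, "Agriculture, Forestry, Fishing"),
    (10, 14, "Mining"),
    (15, 17, "Construction"),
    (20, 39, "Manufacturing"),
    (40, 49, "Transportation/Utilities"),
    (50, 51, "Wholesale Trade"),
    (52, 59, "Retail Trade"),
    (60, 67, "Finance, Insurance, Real Estate"),
    (70, 89, "Services"),
    (91, 99, "Public Administration"),
]

STATUS_COLUMNS = {"CAA": "caa_status", "CWA": "cwa_status",
                  "RCRA": "rcra_status", "SDWA": "sdwa_status"}


def sector_from_sic(sic_codes):
    """Map the leading code of a space-separated, zero-padded SIC list to a sector."""
    codes = sic_codes.split()
    if not codes or not codes[0][:2].isdigit():
        return "Unknown"
    major = int(codes[0][:2])
    for low, high, label in SIC_DIVISIONS:
        if low <= major <= high:
            return label
    return "Unknown"


def severity(status):
    """Hypothetical 0-3 scale; only '3 = Significant' is stated in the schema."""
    s = (status or "").strip().lower()
    if not s or s.startswith("no violation"):
        return 0
    if "significant" in s or "high priority" in s or "serious" in s:
        return 3
    if "violation" in s or "noncompliance" in s:
        return 2
    return 1  # some other status, e.g. unknown


def derive_flags(row):
    """Compute the statute and severity columns from the four status fields."""
    statutes = [k for k, col in STATUS_COLUMNS.items() if (row.get(col) or "").strip()]
    sev = {k: severity(row[STATUS_COLUMNS[k]]) for k in statutes}
    max_sev = max(sev.values(), default=0)
    return {
        "statutes": statutes,
        "statute_count": len(statutes),
        "violation_statutes": [k for k in statutes if sev[k] >= 2],
        "max_severity": max_sev,
        "is_snc": int(max_sev == 3),
        "is_multi_statute": int(len(statutes) >= 2),
    }
```

Keeping the status-to-severity mapping in one small function makes the rule easy to audit, which fits a pipeline described as deterministic and rule-based.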

Columns in companies (the headline table)

display_name | Resolved, de-duplicated company name
normalized_name | Canonical cross-link key (unique)
facility_count | Facilities operated by this company
states / state_count | Distinct states of operation
snc_facility_count | Facilities that are Significant Non-Compliers
statutes / statute_count | Cross-statute coverage breadth (the linkage) (gated)
violation_facility_count | Facilities in violation of any statute (gated)
risk_score | 0–1 deterministic company risk blend (gated)
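The actual risk_score formula is withheld, so the sketch below only illustrates what a deterministic 0–1 blend over these columns could look like. The components, the weights, and the `state_cap` parameter are all hypothetical and should not be read as the EcoComply method.

```python
# Hypothetical weights that sum to 1.0; the real blend is not published here.
WEIGHTS = {"snc": 0.4, "violation": 0.3, "statutes": 0.2, "states": 0.1}


def risk_score(company, state_cap=10):
    """Blend four normalized components into a 0-1 score (illustrative only)."""
    n = company["facility_count"]
    if n <= 0:
        return 0.0
    parts = {
        # Share of facilities that are Significant Non-Compliers.
        "snc": company["snc_facility_count"] / n,
        # Share of facilities in violation of any statute.
        "violation": company["violation_facility_count"] / n,
        # Breadth across the four statutes (CAA, CWA, RCRA, SDWA).
        "statutes": company["statute_count"] / 4,
        # Multi-state footprint, capped so very large operators do not exceed 1.
        "states": min(company["state_count"], state_cap) / state_cap,
    }
    score = sum(WEIGHTS[k] * parts[k] for k in WEIGHTS)
    # Clamp and round so the same inputs always give the same three-decimal output.
    return round(min(max(score, 0.0), 1.0), 3)
```

Because every input is a count from the companies table and no randomness is involved, the same row always yields the same score, which is the property "deterministic" implies.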

Sample preview (gated)

A representative slice of resolved companies — the kind of cross-statute, multi-state operators that company resolution surfaces. Facility counts, multi-state footprint, and significant-non-complier counts are shown; the cross-statute coverage and the company risk score — the proprietary, sellable columns — are redacted. Eight of 1,850 companies; ordering does not reflect the risk score.

companies: 8 sample rows (cross-statute coverage & risk score withheld)
Company | States | Facilities | SNC facilities | Cross-statute coverage | Company risk score
BASF Corp | 9 | 13 | 2 | •••• | 0.•••
Chemours — Niagara Plant | 4 | 4 | 2 | •••• | 0.•••
Ardagh Glass Inc | 4 | 4 | 2 | •••• | 0.•••
Archer Daniels Midland Company | 3 | 6 | 2 | •••• | 0.•••
ANR Pipeline Company | 5 | 17 | 0 | •••• | 0.•••
3M Co — Brownwood | 6 | 7 | 1 | •••• | 0.•••
Air Products and Chemicals, Inc. | 5 | 7 | 0 | •••• | 0.•••
Armstrong World Industries | 5 | 5 | 1 | •••• | 0.•••

Request the full live sample

Get a live, end-to-end sample of EcoComply — the resolved companies table with cross-statute coverage and risk scores, plus the full facility-level compliance detail, per-statute status, quality scores, and methodology.

No full dataset is downloadable from this page. Public-domain U.S. Government data (EPA ECHO); not affiliated with the U.S. EPA.