Parallax Studio · Perspectives™ Shift the View · No. 02
Shift the View · 02 — DC Data Catalog

Published is not
the same as public.

The District keeps one inventory of 2,633 datasets. By the headline numbers it looks healthy — 87% published, 87% active. Shift the lens four times and four different operational truths appear. One governed source, told four ways.

Where the 2,633 catalogued datasets actually live
02,633 datasets
On portal 6% Published, not on portal 81% Audit 13%

One inventory. One Enterprise Data Inventory, 2,633 datasets, one set of governed fields. The temptation is to read it once and file the headline — 87% published, 87% active, looks fine — and move on. But a catalog answers a different question depending on who is asking it. Browse it as a resident and it looks abundant. Read it as an executive and the risk is concentrated. Audit it as an operator and most of it isn’t yet reachable. Trace one field as a subject-matter expert and the word “published” turns out to mean something other than what a resident would assume. Same source. Four lenses. Four true answers.

Lens 01 — Discovery

What’s in the catalog?

The browse lens is the resident’s door in: 2,633 datasets across 25 categories and 73 agencies, filterable by type, sensitivity, and source. At first read it is a broad, well-tagged inventory. Sensitivity skews toward N/A (54%) and PII (31%); the listing is searchable and complete.

Nothing here raises an alarm — which is exactly why it shouldn’t be the only view anyone sees. Discovery tells you the catalog exists and is organized. It does not tell you whether it is maintained, where the risk sits, or whether a resident could actually open any given row.

DD — District of Columbia Data Catalog discovery dashboard, 2,633 datasets across 25 categories and 73 agencies.
DD · Discovery — the resident’s browse view. 2,633 datasets, 25 categories, sensitivity and source filters. The catalog as a card index.
Lens 02 — Executive

Where is the risk?

Shift to the executive lens and the aggregate calms before it concentrates. 13% of the inventory is inactive — modest on its own, but unevenly distributed. A handful of categories and agencies carry concentrations well above the enterprise average, which points to targeted remediation rather than broad-based intervention.

Two areas stand out: the Level 2 — For District Government Use classification (25% inactive) and Business and Economic Development (34% inactive). The clear priority is the Homeland Security and Emergency Management Agency at 74% inactive — the most pressing near-term focus for a data steward.

EXC — Inactive datasets dashboard. 13% inactive overall; HSEMA 74% inactive; Business and Economic Development 34%.
EXC · Executive — inactivity by classification and agency. The aggregate is 13%; the distribution is what matters.
Lens 03 — Operational Health

Is it being maintained?

The operator’s lens scores the inventory on its own terms. Across the 2,633 datasets, average metadata completeness lands at 61.4% — measured against description, sensitivity, URL, open-data status, and disposal reason. The score is dynamic: a record is never penalized for a field that doesn’t apply to it, so an active dataset isn’t marked down for lacking a disposal reason.

The gap that jumps out is reach. Only about 7% of records carry a URL. The inventory is, in other words, thoroughly described — and not yet thoroughly reachable. That distinction is invisible from the discovery lens and sets up the fourth.

OP — Data Completeness dashboard. 61.4% average completeness; per-field presence flags; roughly 7% of records carry a URL.
OP · Operational Health — a dynamic completeness score per record, with a per-field trace. Described, not yet reachable.
Lens 04 — Subject Matter

What does “published” actually mean?

The fourth lens follows one field to its conclusion. By Publication Status, 2,294 datasets are Published (87%) and 339 are Audit Completed (13%). Read quickly, “Published” sounds like “live on the public open-data portal.” Cross it against Is on DC.gov? and the two come apart.

Of all 2,633 datasets, only 153 (6%) are on the open-data portal. “Published” is an internal lifecycle state — the dataset has cleared its workflow — not a statement that a resident can find it on DC.gov. The same inventory is 87% published and 6% public. Both figures are correct. Only the fourth lens shows you they are answering different questions.

SME — Content by Agency dashboard. Publication Status crossed with Is on DC.gov: 2,294 published, but only 153 on the portal.
SME · Subject Matter — Publication Status crossed against Is on DC.gov. The 87%/6% split is the whole story.
Bottom line

The same 2,633 datasets are 87% published and 6% public. Both numbers are true. The only thing that changed was the lens.

The method

One governed source.
Four Perspectives™.

Every lens here is a working dashboard built from a single governed extract of the DC Enterprise Data Inventory — same rows, same definitions, no spin. Each one is tuned to the question one audience actually asks.

DD · Perspectives lens badge

DD · Discovery

Browse it as a resident. What exists, by category, agency, and sensitivity.

EXC · Perspectives lens badge

EXC · Executive

Read it as leadership. Where inactivity concentrates and which agency to act on first.

OP · Perspectives lens badge

OP · Operational Health

Audit it as an operator. A dynamic completeness score, traceable field by field.

SME · Perspectives lens badge

SME · Subject Matter

Trace one field to the end. What “published” means once you cross it with the portal.

Explore

All four, live.

Each lens is a working dashboard built from one governed source. Switch between them with the tabs, or open the full workbook on Tableau Public.

If the dashboards don’t load, open them directly on Tableau Public →