Agent 9 — Knowledge Catalog

Discover Every Asset
Across Your Enterprise

Automated discovery, classification, and cataloging of all your data, documents, and knowledge assets — fully on-premise, with AI-powered tagging and semantic search.

100%
Auto-Discovery
Asset Types
0
External APIs
AI
Semantic Tagging

One Catalog, Every Knowledge Asset

The Catalog Agent automatically discovers, classifies, and inventories data sources, documents, APIs, models, and knowledge objects across your entire infrastructure.

Data Sources

Databases & Warehouses

Automatic profiling of relational databases, data warehouses, data lakes, and streaming pipelines — with column-level lineage and schema versioning.

  • Multi-connector auto-discovery
  • Column-level data profiling
  • Schema change detection
  • Data freshness scoring
Documents & Knowledge

Document & Model Catalog

Catalogs documents, reports, AI models, notebooks, and unstructured content — classified by topic, domain, and responsible owner with semantic embeddings.

  • AI-powered topic classification
  • Ownership & stewardship tracking
  • Cross-asset relationship mapping
  • Full-text + semantic search

What the Catalog Agent Delivers

Four pillars of intelligent data governance — from discovery to trust.

Auto-Discovery

Continuously scans your infrastructure to detect new data sources, documents, and assets — no manual registration required.

AI Classification

On-premise LLM classifies every asset by business domain, sensitivity level, and quality score — automatically and without human labeling effort.

Semantic Search

Natural language search across all cataloged assets. Ask a question, get ranked results with provenance, ownership, and confidence scores.

Lineage & Impact

Traces data lineage end-to-end — from source ingestion to dashboard output — and flags downstream impact when source schemas change.

From Raw Asset to Governed Knowledge

A four-stage pipeline transforms scattered data into a trusted, queryable enterprise catalog.

1

Scan

The agent connects to configured sources — databases, file stores, APIs, cloud buckets — and inventories all detected assets with metadata extraction.

2

Classify

The on-premise LLM applies business classification: domain, sensitivity, quality grade, and recommended tags — with configurable governance rules.

3

Enrich

Assets are enriched with descriptions, ownership assignments, quality metrics, and cross-references to related catalog entries.

4

Govern

The catalog enforces data governance policies: access controls, retention rules, and compliance flags — integrated with ArcaQ Shield.

OpenMetadata-compatible catalog — plugs into your existing data governance stack

Your Catalog Never Leaves Your Infrastructure

The Catalog Agent runs entirely within your Kubernetes cluster. All connectors, classification models, and catalog data remain inside your security perimeter.

  • On-premise LLM classification (Ollama / vLLM)
  • Kubernetes-native deployment
  • Vault-encrypted catalog store
  • Keycloak RBAC — per-asset access control
  • Zero telemetry to external services
Catalog Dashboard — Conceptual View

Ready to Catalog Your Knowledge Assets?

Deploy Agent 9 inside your infrastructure. Auto-discover, classify, and govern all your data assets with complete sovereignty.

Request Demo All 9 Agents