Multi-Tenancy Roadmap

Current Status

The system has completed Phase 0: Platform Decoupling and Phases 1-4: Hospital-Agnostic Refactoring (config extraction, prompt parameterization, generic naming, DB-driven config cache). It runs in single-tenant pilot mode for ZOL. Full multi-tenant routing (subdomain resolution, per-tenant auth) is planned. See also the Multi-Tenancy Architecture page for the current implementation details.

1. Vision

The ZOL Intelligent Search system was initially built as a single-hospital solution. To enable deployment across multiple hospitals (SaaS model), the architecture must support tenant isolation — ensuring that each hospital's data, configuration, and user experience are completely separate.

The decoupling follows a phased approach: first parameterize all hospital-specific references (Phase 0, completed), then make the codebase fully hospital-agnostic (Phases 1-4, completed March 31), then add tenant routing and management (next phase, planned), and finally implement full multi-tenant operations (future).

2. Phase 0: Platform Decoupling (Completed)

Phase 0 converted all hardcoded ZOL-specific references into parameterized, configuration-driven code. This ensures the codebase can serve any hospital by changing configuration rather than code.

2.1 What Was Done

Component	Before	After
Taxonomy (`zol_taxonomy.py`)	37 module-level constants loaded at import	`HospitalTaxonomy` class with `get_taxonomy(hospital_id)` factory
Prompt templates (`prompts.py`)	70+ hardcoded "ZOL" references	`PromptContext` dataclass with hospital identity placeholders
Taxonomy tables	No tenant scoping	`tenant_id` on all entities and relationships; composite unique constraints
Redis keys	Flat key prefixes	`{tenant_id}:` prefix on all cache, rate-limit, and session keys
Site configuration	`ZOL_CONFIG` singleton	`get_site_config(hospital_id)` with per-hospital profiles
Hub page detection	Hardcoded page type patterns	Automatic hub/detail classification via LLM binary classifier
Document service	ZOL-specific title patterns	Patterns loaded from `HospitalConfig`
Query service	Hardcoded base URLs	URLs from hospital configuration
RAG service	ZOL identity in responses	Uses `PromptContext.from_hospital_config()`
Taxonomy registry	Global singleton	Per-hospital registry cache
Frozen taxonomy	Single global registry	`get_frozen_taxonomy_registry(hospital_id)`

2.2 Configuration Architecture

All hospital-specific data lives in YAML configuration files:

# backend/app/services/graph/hospital_config/zol.yaml
hospital:
  name: Ziekenhuis Oost-Limburg
  short_name: ZOL
  website: https://www.zol.be
  phone: 089/80 80 80

campuses:
  - id: zol-campus-sint-jan
    canonical_name: ZOL Genk, campus Sint-Jan
    aliases: [sint-jan, sint jan, genk, campus sint-jan, zol genk]
    address: Synaps Park 1
    city: Genk
    postal_code: '3600'
    phone: 089/80 80 80
  # ... more campuses

# Departments and golden page URLs removed (2026-03-09).
# Departments are now auto-discovered by the extraction pipeline.
# Hub/detail classification replaces golden_page_patterns and golden_page_types.

domain_knowledge:
  dept_conditions:
    slaapcentrum: [slaapapneu, slaapstoornis, insomnie, ...]
    # Hospital-specific centers only — standard departments are in
    # medical_knowledge/department_conditions.py (universal).
  dept_treatments:
    cardiologie: [pacemaker, ablatie, bypass, cardioversie, ...]
  # ... more domain knowledge mappings

search_aliases:
  universal: {}
  hospital:
    borstkanker: Borstcentrum
    ivf: Fertiliteitscentrum
    slaapkliniek: Slaapcentrum
    # ... more hospital-specific aliases

specialty_department_map:
  cardiologie: [cardiologie, hartcentrum]
  orthopedie: [orthopedie, orthopedische chirurgie]
  # ... specialty-to-department mappings

Adding a new hospital requires only a new YAML file — no code changes. Departments and page classification are handled automatically by the extraction pipeline and LLM binary classifier.

2.3 Backward Compatibility

A compatibility shim (zol_taxonomy.py) re-exports all symbols from the new hospital_taxonomy.py module, ensuring existing imports continue to work during the migration period. This shim can be removed once all imports are updated.

3. Current State: Single-Tenant Pilot

The pilot deployment serves one hospital (ZOL) with the default tenant ID. All components use hospital_id="zol" as the default parameter, making the system fully functional without explicit tenant routing.

3.1 What Works Today

Full RAG pipeline with hospital-parameterized prompts
Taxonomy tables with tenant-scoped entities and relationships
Tenant-isolated Redis caching and rate limiting
Hospital-specific safety messages and disclaimers in 8 languages
Configurable taxonomy with SNOMED-CT synonym enrichment

3.2 What Remains Single-Tenant

Frontend has no tenant routing or hospital selection
Authentication is not yet tenant-aware (Keycloak realms can provide per-tenant isolation in Phase 1)
No tenant management API
No per-tenant billing or usage tracking

Updated 2026-06-09

conversations, users, and analytics_events all have tenant_id columns in the current ORM models. Migration 081 (documents_conversations_tenant_fk) added FK constraints with ON DELETE CASCADE to documents and conversations to close the last gap from the Phase 0 rollout. The claim that those tables "lack tenant_id columns" no longer applies.

The remaining single-tenant gaps are operational, not schema-level: request routing (subdomain/path/header resolution), per-tenant auth flows, tenant management API, and usage metering.

4. Hospital-Agnostic Refactoring (Completed March 31)

Before multi-tenant routing, the codebase was made fully hospital-agnostic in a 4-phase sprint:

Phase	What	Status
Phase 1	Config extraction — `site_crawl_configs` table, admin API	Done
Phase 2	Prompt parameterization — all LLM prompts use `PromptContext` from DB	Done
Phase 3	Generic naming — `ZOLCrawler` → `HospitalCrawler`, ZOL branding removed	Done
Phase 4	DB-driven config cache — `SiteConfigCache` replaces all in-code constants	Done

Result: 259 ZOL-specific references removed. A new hospital can be onboarded with DB configuration only. See Release Notes: March 28-31 for details.

5. Tenant Routing (Planned)

The next phase adds the infrastructure to serve multiple hospitals from a single deployment.

4.1 Tenant Resolution

Tenant resolution options (ordered by complexity):

Subdomain-based: {hospital}.search.example.com — cleanest for end users
Path-based: search.example.com/{hospital}/ — simpler infrastructure
Header-based: X-Tenant-Id header — for API consumers

4.2 Database Tenancy

Approach	Isolation	Complexity	Recommended?
Shared schema, tenant_id column	Row-level	Low	Yes (for pilot scale)
Separate schemas per tenant	Schema-level	Medium	Future option
Separate databases per tenant	Full	High	Not needed

The shared-schema approach adds a tenant_id column to all PostgreSQL tables and enforces row-level filtering through a middleware or repository pattern. This is sufficient for the expected scale (5-10 hospitals).

4.3 Estimated Scope

Task	Effort	Dependencies
Add `tenant_id` to PostgreSQL tables (migration)	Medium	None
Tenant resolver middleware	Small	None
Per-tenant authentication	Medium	Tenant resolver
Frontend tenant context	Small	Tenant resolver
Tenant management API	Medium	Database migration
Per-tenant YAML configuration	Small	Phase 0 (done)

6. Full Multi-Tenant Operations (Future)

Phase 2 adds operational capabilities for managing multiple hospitals in production.

5.1 Features

Tenant onboarding workflow: Automated setup of new hospital (YAML config, database seeding, taxonomy initialization, content crawl)
Per-tenant analytics dashboard: Hospital administrators see only their own data
Content isolation verification: Automated tests confirming no cross-tenant data leakage
Per-tenant feature flags: Enable/disable pipeline components per hospital
Usage metering and billing: Track LLM API costs, storage, and query volumes per tenant

5.2 Content Pipeline

Each hospital requires its own content pipeline:

7. Storage Isolation Summary

Storage Layer	Phase 0 (Current)	Phase 1 (Planned)	Phase 2 (Future)
PostgreSQL	Single tenant	Row-level `tenant_id`	Row-level (sufficient)
Taxonomy tables	`tenant_id` on all rows	Same	Same
Redis	`{tenant_id}:` key prefix	Same	Same
MinIO	`{tenant_id}/{doc_id}` paths	Same	Same
pgvector	Single collection	Per-tenant collection or filter	Per-tenant collection

8. Risk Considerations

Risk	Mitigation
Cross-tenant data leakage	Automated integration tests verifying isolation at every storage layer
Configuration drift	YAML validation schema, CI/CD checks for required fields
Performance at scale	Per-tenant caching, connection pooling, lazy taxonomy loading
Compliance variation	Per-tenant DPIA and data retention settings (stored in tenant config)

Document version: 2.0 | Date: 2026-03-31 | Author: SOFT4U BV

1. Vision​

2. Phase 0: Platform Decoupling (Completed)​

2.1 What Was Done​

2.2 Configuration Architecture​

2.3 Backward Compatibility​

3. Current State: Single-Tenant Pilot​

3.1 What Works Today​

3.2 What Remains Single-Tenant​

4. Hospital-Agnostic Refactoring (Completed March 31)​

5. Tenant Routing (Planned)​

4.1 Tenant Resolution​

4.2 Database Tenancy​

4.3 Estimated Scope​

6. Full Multi-Tenant Operations (Future)​

5.1 Features​

5.2 Content Pipeline​

7. Storage Isolation Summary​

8. Risk Considerations​