NDP — From zero to a federated, secure dataset

Piece	What it is for	How it looks
AAI (Keycloak)	Who you are (login, users)	Login screen
Affinities	Relationships between datasets, services and endpoints	Relationships web app
NDP-EP	Your catalog: datasets, resources, storage	Endpoint web app
Federation	Central registry of all EPs	Federation web app
Python library	Do the same from code / automate	Notebook / script
NetBird (bonus)	Secure private network between machines	Network dashboard

Profile	What it starts
(none)	NDP-EP only (API + web UI)
`mongodb`	MongoDB + Mongo Express (local catalog DB)
`s3`	MinIO (S3-compatible object storage)
`kafka`	Kafka + Zookeeper + Kafka UI (streaming)

Profile	What it starts
`jupyter`	JupyterLab
`pelican`	Pelican federation (registry, director, origin, cache)
`full`	All backends above

You operate (your Endpoint)	Shared by the platform
NDP-EP — API + web UI	AAI — identity & roles
Catalog database — CKAN or MongoDB	Affinities — relationship registry
Object storage — MinIO / S3 (optional)	Federation — registry & discovery

Role	Can do
Viewer	View and search data. Read-only.
Writer	The above + create/edit datasets, resources, and S3 management.
Admin	All of the above + administration (dashboard, access requests).

Kind	What it is
Organization	Top-level group that owns datasets and services
Dataset	Logical container of related resources, owned by an organization
Service	Network-accessible service (REST API, app, etc.) owned by an organization
URL resource	Link to a file or service (CSV, JSON, NetCDF, …)
S3 resource	Object in S3-compatible storage
Kafka topic	Streaming data flow

PRESENTATION + SELF-GUIDED TUTORIAL — National Data Platform (NDP) Audience: end users and administrators (not developers). Focus: WHAT you can do and HOW it looks. Render: marp NDP-presentacion.md -o NDP-presentacion.pdf (or .pptx, .html) [📸 ...] blocks mark where to drop a screenshot (folder ./capturas). Lines after "<!-- note: ...

note: introduce in one sentence. "Today we see NDP end to end: install it, use it from the web and from code, federate it, and connect it securely."

note: avoid jargon; the message is federation + access governance.

note: narrate the diagram: the user signs in through AAI, which also carries their ROLE (roles live in AAI/Keycloak, NOT in Affinities). With that token they publish and search in the NDP-EP, backed by CKAN and S3. The EP then registers its datasets/services in Affinities (a non-blocking relationship registry) and reports to Federation. All of it can run over a private NetBird network (final bonus). Derived from the C4 view in ../ep-diagrams.

note: roles live in AAI, NOT in Affinities. Affinities is a relationship registry the EP writes into; it is non-blocking (the EP works even if it is down).

note: profiles let an admin run only the EP (connecting to the platform's shared services) or spin up local backends for development/testing.

note: this is the responsibility boundary. In the common case you only run the EP + its data backends; identity/affinities/federation are the platform's.

note: this whole sub-section is the dev/test path. Most users skip it and just run the NDP-EP from the previous slide.

note: stress: same gesture in each repo; order matters due to dependencies.

📸 screenshots/10-keycloak-login.png — NDP "Welcome back" login screen

📸 screenshots/11-keycloak-admin.png — Keycloak admin console (realm NDP)

📸 screenshots/12-affinities-frontend.png — Affinities web app (relationships graph)

📸 screenshots/13-federation-ui.png — federation web app (EP list, still empty)

📸 screenshots/15-docker-ps.png — list of containers in Up state

note: close Step 1: "installed in minutes; now let's use it". The UI and the API are the same Endpoint — same data, same permissions.

note: historically the platform onboarding used a federation config_id fed to a setup script (github.com/sci-ndp/NDP-EP); confirm the current portal/process with the platform operators.

📸 screenshots/19-keycloak-assign-ndp-admin.png — assigning the ndp_admin realm role in Keycloak

note: this is the permission model; it reappears live in Step 3.

📸 screenshots/33-create-resource.png — example: a "+ New" creation form

📸 screenshots/36-s3-management.png — S3 Management tool (buckets/objects)

note: for the non-dev audience, frame it as "for power users: everything in the web can also be automated".

📸 screenshots/40-notebook.png — Jupyter notebook running these steps

note: if time allows, run it live in a notebook and show the result.

📸 screenshots/50-federation-ep-registered.png — the EP appears in the federation

📸 screenshots/51-federation-health.png — EP health/metrics panel

National Data Platform (NDP)

From zero to a federated, secure dataset

What is NDP?

Platform components

Component interactions

Component interactions — step by step

Overview

Step 1 — Installation

Two ways to install

Before you install — prerequisites

Install the NDP-EP (the common case)

Compose profiles — core backends

Compose profiles — extras

What you operate vs. what the platform provides

Full stack (development / testing)

Only if you want the whole system locally

Startup order (full stack)

1) Start AAI (identity)

2) Start Affinities (relationship registry)

3) Start Federation (central registry)

4) Start the NDP-EP (+ backends)

Check: everything is up

Step 2 — Identity and permissions

A user signs in and gets a role

Bootstrap the first admin

Bootstrap the first admin — full stack

Where users come from (AAI)

Requesting access (user)

Approving access (admin)

The three roles

Step 3 — The Endpoint in action

Search, publish and manage from the web

Search — the landing page

Search — options

The "+ New" menu

"+ New" — Organization

"+ New" — Dataset (required)

"+ New" — Dataset (optional, 1/2)

"+ New" — Dataset (optional, 2/2)

"+ New" — Service (required)

"+ New" — Service — service_type (optional)

"+ New" — Service (other optional)

"+ New" — URL resource (required)

"+ New" — URL resource (file_type & processing)

"+ New" — URL resource (other optional)

"+ New" — S3 resource (identification)

"+ New" — S3 resource (S3 details)

"+ New" — Kafka topic (required)

"+ New" — Kafka topic (broker)

"+ New" — Kafka topic (options)

Storage management (S3) — writers only

S3 Management — buckets

S3 Management — objects

Step 4 — Automate with Python

The same operations, from code

The ndp-ep library

Example: in a few lines

Web and code: a unified interface

Step 5 — Federation

The data is discovered elsewhere

The Endpoint registers

Health and metrics

What the Endpoint reports

What the Endpoint reports — infrastructure flags

Bonus — NetBird

Resources

Appendix

Obtaining the EP_UUID — Affinities web app

Obtaining the EP_UUID — Affinities API

Creating a user — Keycloak

Assigning groups & roles — AAI API

Assigning groups & roles — AAI API (cont.)

Tokens stored at NDP-managed onboarding

"+ New" — Service — `service_type` (optional)

"+ New" — URL resource (`file_type` & processing)

The `ndp-ep` library