Transparency Notice for Users¶

Audience: you, the user of Helpmefindthejob. This notice explains how the system works, what data flows where, and your rights. Article: AI Act Article 13 (transparency to deployers — adapted here as the user-facing surface) and Article 50 (transparency to natural persons interacting with AI systems). Surface: this notice is shown at first run and is always available from the Settings screen. Status: living document. The user-facing rendered version is at /transparency; the source of truth lives here in the repo.

What this is¶

Helpmefindthejob is an open-source civic-employment copilot. It helps you navigate the European job market — including parts of that market that are often hard to navigate alone: foreign-credential recognition (Anerkennung), CV conventions that vary by country, multilingual application processes, employment-services bureaucracy, and the differences between sectors.

Who is Helpmefindthejob built for? Anyone facing structural friction between what they can actually do and what the European labor market is set up to recognise. Migrants and EU-mobile workers face this friction most acutely — language barriers, foreign-credential opacity, residency complexities make it densest there — and they are the strongest example users the system was designed around. The same friction also affects, for example, someone returning to clinical nursing after twelve years out of practice for childcare; someone pivoting from commercial tech to public-sector civic-tech work; someone re-entering the labour market after long-term unemployment; someone moving back to their home country after years working abroad; or anyone working outside the bureaucratic system they grew up navigating. If you recognise yourself in any of those situations — or in something structurally similar — this tool is built for you.

You do not need to be a migrant for Helpmefindthejob to be useful, and migrants are not the only users. The tool is built around the friction you face in connecting your capability to a role, not around who you are.

What Helpmefindthejob does¶

The tool combines:

Job discovery from career pages, public job aggregators, and bookmarklet captures you make.
A structured conversation that walks you through a 12-phase journey (greet → discover → CV check → inspiration → preferences → search → review → drill-down → tailoring → letter → CV consult → done).
AI-assisted suggestions for job-to-CV fit, CV tailoring, motivation-letter drafting, and application-outcome analysis.

At every step, the suggestions are suggestions — never decisions. You confirm each consequential action before it happens. The system never submits an application, never accepts an offer, never makes any binding decision on your behalf without your explicit click-to-confirm.

How AI is used¶

Some actions in Helpmefindthejob invoke an AI model. These are:

Action	What the AI does	What the AI does not do
Compute a fit score for a job	Produces a 0–100 fit score from four per-criterion sub-scores (skills, experience, location/language, friction-fit; each 0–25), with a one-line rationale and gap list	The AI's raw output is clamped to 0–100; malformed or out-of-range output yields "no fit score available" rather than a fabricated number
Tailor your CV for a specific role	Proposes edits to existing CV sections you've selected	Does not invent CV facts or experience you haven't entered
Draft a motivation letter	Composes a first-draft letter grounded in your confirmed CV bullets	Does not send the letter; you read, edit, confirm, and send
Analyse your application history	Surfaces patterns: which CV variant gets replies, which sectors respond, which application timings work	Does not decide what you should do next; offers observations
Generate a skill-gap brief	Compares your skills to a role's required ESCO skills and surfaces gaps	Does not enrol you in any course or programme

Other actions in Helpmefindthejob are deterministic — no AI involved. Examples: job discovery from configured sources, locale-aware yes/no parsing in chat, profile-field editing, application persistence, calendar reminders.

What data is sent to the AI¶

When an AI action runs, the system sends only the slice of data needed for that action to the AI provider. Specifically:

Fit-scoring sends the job posting and the relevant CV section that matches the job's ESCO codes — not your full profile, not your name, not your email, not your application history.
Motivation-letter drafting sends the job posting and the CV bullets you confirmed for this role — not your job history with other employers.
CV tailoring sends the job posting and the CV sections you select — not your other CV sections.

The full minimisation rules are documented in data-governance.md §4 and live in the project's open-source codebase. You can verify them.

Friction-class inference (deterministic, internal)¶

When you paste your CV during the guided job-search journey, the system runs a deterministic, keyword-based classifier over the text to infer a "friction class" — one of seven archetype labels (aicha, yusuf, olga, mahmoud, maria, kaethe, tobias) or the empty string when no match is confident enough. The classifier does NOT call any AI provider and does NOT leave your device's process boundary.

What is inferred: one of the seven labels or "" (unclassified). The labels are internal codes that correspond to documented persona archetypes covering the project's friction-class panel (recognition-process candidate, EU-Blue-Card holder, §24-protected, §4-AsylG-protected, EU citizen, returning-to-workforce, career-changer).

From what: only the CV text you voluntarily pasted at the cv_check phase. The classifier reads the text once at paste-time; no later re-reads.

Why: to tailor the job-search recovery flow (persona-aware ordering of widening suggestions, plus an Ausländerbehörde caveat for users whose inferred class indicates residency-permit-tied search) AND to enrich AI prompts with friction context (residency-status framing, friction-notes, comparable scenarios) so the AI's reasoning is anchored to your actual situation rather than a generic role-and-industry sketch.

Visibility (Phase 1): internal-only field. Not exposed via any public API, not displayed in any user-visible UI. The classification operates silently and the user does not see a "we think you are in the X process" reveal. The downstream UX changes (constrained ordering, Ausländerbehörde caveat) are visible, but the classification label itself is not.

Retention: stored on the user profile alongside other fields. Cleared on account deletion via the GDPR right-to-erasure path (cascades through the profile dataclass generically). Re-classified on every CV re-paste (the classifier returns "" for unclassifiable input, which OVERWRITES any stale prior value — stale classification is worse than no classification because it drives wrong downstream UX).

Accuracy and limits: the classifier is Phase 1 best-guess, seeded from regulatory citations (§16d, §24, §4 AsylG, Blue Card, TVöD, etc.) and a set of soft multi-signal patterns. Real users may not include precise regulatory vocabulary in their CVs; the classifier deliberately returns "" rather than guess in low-confidence cases. The downstream behavior gracefully degrades to the unconstrained UX for unclassified users — that is, your experience is no worse than a user who isn't classified at all.

Telemetry: one internal analytics event is logged per classification call with the resolved label, confidence class (strong / scored / none), and match count. The event uses the same audit log as other journey events (hashed identifier, no plaintext PII). The purpose of the telemetry is to inform a future Phase 2 redesign where you would be offered a user-visible confirmation step ("Looks like X — is that right?") with the right defaults.

Phase 2 commitments (post-grant): a user-visible confirmation flow at classification time, a settings-page surface for review and correction, an opt-out for users who prefer to skip friction-aware UX entirely, and a full DPIA-equivalent privacy review with refinement of the classifier patterns based on real-world accuracy data.

Source-class hierarchy (where claims come from)¶

Every claim Helpmefindthejob makes to you has a source. The project's source-class hierarchy doctrine defines a ranking from most-authoritative to least and binds each kind of claim to a source class:

Class A — legal / regulatory authority (BAMF, Ausländerbehörde, BfA, statutes like §16d AufenthG, EU Blue Card Directive 2021/1883): used for visa, residence-permit, and recognition claims. We never have AI invent these — we either restate the statute verbatim or direct you to the authoritative body.
Class B — authoritative standardised taxonomy (ESCO 1.1, ISCO-08, EURES, CEFR): used for occupation codes, skill codes, language proficiency. Every response carries the dataset version (e.g. v1-curated-2026-05-18) and the ESCO URI where applicable.
Class C — authoritative organisation-level data (BfA 2025 shortage list, KMK, BIBB): used for shortage flags and recognition outcomes.
Class D — curated project data (reference/esco/, companies_catalog.py, the seven-persona panel): documented provenance + version.
Class E — your primary data (your CV, the JD you pasted, your chat responses): the AI is constrained to ground its outputs in your inputs.
Class F — aggregator-fetched secondary data (Adzuna, Personio, public career pages): always carries source-URL provenance.
Class G — AI inference: acceptable only when grounded in classes A–F via in-context evidence; never as standalone authority.

The doctrine matters in practice for two reasons:

Verifiability: every claim in the cover letters, motivation letters, and decision briefs we generate can be traced back to either your CV (class E), the JD you provided (class E), or a documented project source (class B–D). The cover-letter and motivation-letter generators carry inline citation markers (PART 8 Loop 25) so you can audit each claim.
Honesty about limits: when the AI lacks grounding for a claim, it is instructed to say so rather than invent. When the statute is the authority (e.g. visa compatibility for relocation), we refuse to give legal advice and direct you to the appropriate body.

This hierarchy is binding on all AI prompt sites and on all journey replies that surface authoritative claims. See docs/grant/14-source-class-hierarchy.md for the per-prompt + per-call-site audit and the cross-reference to EU AI Act Article 50 (transparency obligations).

You choose the AI provider¶

Helpmefindthejob does not force a single AI provider on you. The deployment can be configured to use:

OpenAI, Anthropic, Gemini, DeepSeek, OpenRouter (commercial cloud AI providers)
Ollama (fully offline, your own machine, no data leaves)
Manual handoff (the system constructs the prompt, you copy-paste it into your AI of choice)
Claude Code (if you have a Claude Code subscription)
Codex CLI (if you have a local Codex CLI setup)

If you are not comfortable sending your CV slice to a third-party AI provider, choose Ollama (your own machine, no egress) or manual handoff (you control exactly what is sent and where; the app invokes no AI — letter drafts use a deterministic template and other AI-assisted actions give you the prompt to run yourself).

The honest tradeoffs between local AI (Ollama) and cloud AI (OpenAI, Anthropic, Google, DeepSeek, etc.) are documented in the AI provider honesty matrix — quality, latency, privacy, cost, EU AI Act surface, per-use-case recommendations. The project explicitly does not pick a "best" provider for you; the matrix lets you weigh the tradeoffs against your own priorities. Switch provider any time in Settings; the system never silently reroutes to a different provider than the one you picked.

You can change your AI provider at any time. You can revoke AI consent at any time. The system records each consent change in the audit log so you have a record of what was active when.

What is logged¶

Helpmefindthejob keeps an audit log for compliance with EU AI Act Article 12. The log records:

Every AI invocation (timestamp, purpose, AI provider used, tokens in/out, response duration)
Every system event (MCP tool calls, kill-switch activations, AI-provider failover)
Every confirmation event when you save, update, or delete data
Every export of your data

By default, the log uses hashed identifiers for you — not your name, not your email. The hash is salted per deployment so it can be used to trace a single user's events across the log (for incident review or your right-to-explanation requests) without being a plaintext personal identifier.

The default retention is 180 days. The deployer can configure this; in some legal contexts (specific incident investigation, regulatory request) the deployer may retain longer or in a different form. The deployer should disclose their specific retention policy in their deployer-managed addendum to this notice.

Your rights¶

Right to know you are interacting with AI (Article 50)¶

This notice exists to give you that knowledge. Every AI-assisted action in the interface is labelled. The journey state machine never auto-routes you through an AI invocation without you seeing it.

If an AI-assisted output affects your situation in a way you want to understand, you can request a structured explanation. For fit-scoring, each score is shown with a one-line AI rationale and a list of gaps; for a fuller explanation, contact your deployer's oversight person. For other outputs, contact your deployer's oversight person (see [Deployer-managed addendum] below).

Right to opt out of AI features¶

You can disable AI-assisted features entirely at any time from the Settings screen. The app then invokes no AI: letter drafts use a deterministic template, and other AI-assisted actions give you the ready-to-run prompt to run with your own AI. You can re-enable later if you change your mind.

You retain all your GDPR rights:

Access — /api/data/export returns your full profile in structured JSON.
Rectification — edit any profile field from the chat or the Settings screen.
Erasure — request deletion from /api/account/deletion-request. Your profile, CV facts, and application history are deleted; audit-log entries are unlinked from your identifier.
Portability — your profile export uses open standard formats (schema.org, ESCO).
Objection / withdrawal of consent — revoke AI-provider consent at any time.

To exercise any of these rights, use the Settings screen or contact your deployer (see below).

Right to lodge a complaint¶

You can lodge a complaint with your jurisdiction's data-protection authority (in Germany: the relevant Landesdatenschutzbeauftragte for the state where your deployer operates, or the Bundesbeauftragte für den Datenschutz und die Informationsfreiheit for federal-level deployers). For AI-Act-specific concerns, your jurisdiction's AI Office / market-surveillance authority is the relevant body.

Limitations you should know about¶

We list these here because honesty about limitations matters more than marketing.

AI outputs are suggestions, not decisions. Even when the system says "fit score: 87/100," that is a recommendation. Your judgement overrides the system.
Foreign-credential recognition is slow. The system helps you find recognition-friendly employers and present your credentials clearly; it cannot expedite the actual Anerkennung process.
The job market is incomplete. We see job postings that are publicly discoverable (career pages, public aggregators) or that you bookmark. We do not see closed-employer applicant-tracking system internals.
AI-provider quality varies. Offline-only operation via Ollama produces lower-quality fit scoring than commercial cloud AI providers. You can switch back and forth; the trade-off is your call.
Language coverage: English and German are shipped at first release. Arabic, Ukrainian, Turkish, and Romanian are on the roadmap as native-speaker contributors join. If you speak a language not yet shipped, you can still use the system in EN or DE; some bureaucratic-context examples will read less naturally to you.

Where the system lives¶

Helpmefindthejob is open-source under Apache License 2.0. The source code is at https://github.com/maksodf/helpmefindthejob. The project has applied to become a Programme of The Commons Conservancy (a Dutch foundation co-founded by NLnet); the application is submitted and a response is pending, and admission has not yet been granted. You can fork, self-host, audit, or contribute. No vendor lock-in. No commercial gate.

[Deployer-managed addendum]¶

Your specific deployer fills in the deployment-specific information below. If this section is empty, please contact your deployer to ask them to complete it.

Deployer organisation: [TBD: deployer to fill]
Deployer contact for AI-Act / data-protection questions: [TBD: deployer to fill]
Deployer's appointed human-oversight person: [TBD: deployer to fill]
Deployer's specific audit-log retention period: [TBD: deployer to fill — default is 180 days unless changed]
Deployer's chosen AI provider(s): [TBD: deployer to fill]
Deployer's data-protection officer (DPO) contact, if applicable: [TBD: deployer to fill]
Deployer's data-protection-authority complaint-channel: [TBD: deployer to fill]

Last updated¶

This notice is currently dated 2026-05-18 (v1, Week 2 of the NLnet NGI Zero Commons Fund grant sprint). The deployer should re-publish on their domain with their addendum complete before going live.