Skip to content
All posts
AI7 min read

Enterprise Knowledge Chatbot 2026: RAG, GDPR, Cost

"ChatGPT with our own data" is the most common enterprise AI request in 2026. We show what turns a RAG chatbot from a demo into a production system: permission-aware architecture, data sources, the EU AI Act, security and honest cost ranges.

Marius Gill

Marius Gill

Managing Director and software developer with over 10 years of experience

Updated on

Share

7 min read

Enterprise AI is no longer a pilot topic in 2026: according to Bitkom's 2026 AI study, 41 percent of companies with 20 or more employees actively use AI, and another 48 percent are planning or discussing it. The most common request behind this is "ChatGPT with our own data". Usually it does not mean a general chatbot, but a system that searches internal documents, wikis and support data and creates traceable answers from them. The technical approach is Retrieval-Augmented Generation (RAG).

The gap between a demo and a production system is large. A demo answers questions from a few PDFs. A production enterprise chatbot must know which data a user may see, when it is uncertain, how sources are cited and how quality is measured continuously.

Demo or production: the expensive difference

Very different systems are all called "AI chatbot" — and that is where most wrong decisions begin. For a public website FAQ, a standard tool is often enough. For company knowledge with permissions, SSO, sensitive data and multiple systems, a different architecture is required. The most common expensive mistake is to start with a simple demo bot and later unleash it on sensitive data without control.

TypeSuitable forLimits
Website FAQ botpublic questions, help center, simple lead qualificationlimited permissions, simple sources
Internal knowledge botpolicies, onboarding, process knowledge, product docsrequires data maintenance, roles, source ownership
Support RAG bottickets, product docs, issue database, answer draftsrequires quality assurance and human approval
Customer portal assistantcustomer-specific documents, status, contractsrequires strict tenant separation
Agentic workflowchatbot plus actions in CRM, ticketing, ERPrequires tool limits, audit logs, approvals

The honest question before you start is not "which model?" but: which data sources, which permissions, which concrete question-answer process?

How permission-aware RAG works

In an enterprise, the most important architecture point is not the prompt, but the question of which data is even allowed into the model context. RAG first searches for relevant information and then lets the language model answer based on that context. The critical step sits between search and answer: the permission check.

Permission-aware RAG: the rights filter sits before the model context — not only in the interface.

A permission-aware system checks, before every answer, who the user is, which role they hold, and which documents, tickets or records they may see — including rights from SharePoint, Google Drive, CRM or your own backend. This check belongs in the backend and must apply to every request, not only at login. For customer portals, HR, contract or financial data, this is the difference between demo and production: once an unauthorised segment reaches the model, access control has already failed. How we build such systems is shown in our AI integration.

Data sources: where the pilot starts

The data sources determine quality more than the model. Not every source should be connected first. Good first sources are current, reviewed, clearly owned and not very sensitive. Bad first sources are huge folder structures with old, contradictory or unauthorised documents.

Data sourcePilot suitabilityWhy
maintained help centerhighclear content, frequent questions, good source references
product documentationhigheasy to structure, high value for support and sales
internal process manualsmedium to highuseful when current and responsibly maintained
SharePoint/drive with legacy fileslow to mediumneeds cleanup, metadata, permission checks
CRM and ticket datamediumuseful, but personal and permission-dependent
contracts and HR documentscautioushighly sensitive, clear roles and approvals required

A good pilot does not start with "all data" but with one valuable, controllable source and a clear question type. Production then adds synchronization, change tracking, duplicate detection and clear data owners. More on this in getting started with AI use cases.

Privacy, the EU AI Act and security

Enterprise chatbots almost always touch personal or confidential data — so governance is part of the architecture, not an appendix. A few technical questions should have clear answers early: which sources contain personal data? Which content may go to model providers? Where do prompts, answers, logs and embeddings live, for how long, and who can view them?

The EU AI Act has been in force since 1 August 2024 and applies in stages. For chatbots, Article 50 matters most: the transparency obligations apply from 2 August 2026 — users must clearly recognise that they are talking to an AI system. With the Digital Omnibus proposed in late 2025, lawmakers agreed in May 2026 to postpone the obligations for high-risk Annex III systems from August 2026 to 2 December 2027. Most internal knowledge chatbots are not high-risk systems, but they still need transparency, purpose limitation, risk assessment and documentation.

Security is a separate concern. As soon as a chatbot reads external content or triggers actions, a document or user message can carry hidden instructions. In the OWASP Top 10 for LLM Applications (2025), prompt injection ranks first, with newly weighted weaknesses in vector and embedding systems relevant for RAG. Practical safeguards: separate the system instruction, retrieved content and user input; validate tool calls server-side; keep secrets out of prompts and logs; and require human approval for irreversible actions. A RAG chatbot becomes safer when it is allowed to do less — more in risks in AI software projects.

Cost and roadmap: pilot before production

An AI chatbot should be built in stages — the pilot answers whether data quality and value are viable before you invest in roles, SSO and operations. The ranges below are our typical project sizes, not list prices; the actual number depends on sources, integrations and compliance depth.

From data check to production: typical project ranges. hafencity.dev estimates, June 2026.
Project typeRealistic rangeWhat's included
Data and use-case check5,000–15,000 EURuse case, sources, privacy, success metrics
RAG prototype20,000–60,000 EURone source, simple interface, source-based answers
Pilot with real users40,000–100,000 EURuser group, evaluation, first permission checks
Production enterprise chatbot60,000–180,000 EUR and moreroles, SSO, integrations, monitoring, operations

On top come running costs, often smaller than expected: embeddings cost roughly $0.02 per million tokens with OpenAI's text-embedding-3-small; the ongoing drivers are model calls per answer, the vector database, hosting and monitoring. More important than the entry price is whether answer quality stays measurable: maintained sources, retrieval tests, source references, refusal behavior when uncertain and an evaluation set. A production chatbot should be able to say: "I cannot find a reliable answer in the approved sources." That boundary is a quality feature, not a bug.

Next steps

Three questions settle feasibility faster than any tool demo:

  1. Data situation: which source is current, approved and clearly owned — and which question type should be answered first?
  2. Permissions: is there SSO, groups or roles the chatbot must carry over per user?
  3. Risk: how close do answers get to decisions with legal, financial or HR impact?

If these points are unclear, the first step is not chatbot development but a data and permissions check. Depending on maturity, we start with an AI strategy or directly with a limited RAG pilot. Describe your target users, data sources and most critical question-answer process — then book an intro call.

Frequently asked questions

Conclusion

An enterprise chatbot does not become production-ready because of a language model. It becomes reliable through data quality, permissions, evaluation, logging and integration into real workflows. RAG is the technical core, but governance and operations decide whether it becomes a dependable system.

Marius Gill

Written by

Marius Gill

Managing Director and software developer with over 10 years of experience

Next steps

Let's talk about your project

Book a 30-minute discovery call. We'll review your goals, surface unknowns, and outline how we would run the engagement.

Schedule a call

Booking calendar (Cal.com)

This area embeds the external service Cal.com. By loading it you agree that a connection to Cal.com is established and data may be transferred to the USA.

Privacy policy