AIMay 27, 202612 min read

Building an Enterprise Knowledge Chatbot: RAG, Privacy, Cost and Integration

How companies plan an AI chatbot with their own knowledge: RAG architecture, data sources, permissions, privacy, EU AI Act, cost, quality assurance and integration.

Marius Gill

Managing Director and software developer with over 10 years of experience

Share

12 min read

Many companies want "ChatGPT with company data". Usually, they do not mean a general chatbot, but a system that searches internal documents, knowledge bases, product information, policies or support data and creates traceable answers from them. Technically, this is often built with RAG: Retrieval-Augmented Generation.

Short answer: An AI chatbot with company knowledge connects approved data sources with search, permissions and a language model. A first RAG prototype often costs 20,000 to 60,000 EUR. A production enterprise chatbot with role model, SSO, several integrations, monitoring, evaluation, privacy concept and operations often ranges from 60,000 to 180,000 EUR or more. A simple website FAQ bot can be significantly cheaper.

The difference between a demo and a production system is large. A demo answers questions from a few PDFs. A production enterprise chatbot must know which data a user is allowed to see, when it is uncertain, how sources are cited, which answers are logged and how quality is measured continuously.

What is an AI chatbot with company knowledge?

An AI chatbot with company knowledge does not answer only from general model knowledge. It uses approved company sources as context. These can include documents, wikis, tickets, CRM data, product data sheets, manuals, website content or internal policies.

Typical use cases:

internal knowledge assistant for policies, processes and onboarding
support chatbot with access to help center, tickets and product documentation
sales enablement for product information, pricing logic and proposal modules
technical assistant for manuals, specifications and maintenance data
HR assistant for internal rules, benefits and onboarding documents
customer portal assistant for contracts, invoices, status and documents
project assistant for meeting notes, decisions, specifications and open points

Important: The chatbot should not "know everything". It should use clearly defined sources, know its limits and avoid inventing answers when evidence is missing.

Website FAQ bot or enterprise RAG chatbot?

Many misunderstandings happen because very different systems are all called "AI chatbot". For a website FAQ, a standard tool is often enough. For company knowledge with permissions, SSO, sensitive data and multiple systems, a different architecture is required.

Type	Suitable for	Limits
Website FAQ bot	public questions, help center, simple lead qualification	limited permissions, limited integrations, usually simple sources
Internal knowledge bot	policies, onboarding, process knowledge, product documentation	requires data maintenance, roles and clear source ownership
Support RAG bot	tickets, product documentation, issue database, answer drafts	requires quality assurance and human approval for critical cases
Customer portal assistant	customer-specific documents, status, contracts, invoices	requires strict tenant separation and backend permissions
Agentic workflow	chatbot plus actions in CRM, ticketing, email or ERP	requires tool limits, audit logs, approvals and monitoring

The biggest mistake is to start with a simple demo chatbot and later unleash it on sensitive company data without control. It is better to define data sources, permissions and pilot goals properly from the beginning.

When a RAG chatbot makes sense and when it does not

RAG is useful when a lot of information already exists in documents or systems, but is hard to find or understand. The approach is especially strong when answers need to reference sources.

Good use cases:

support teams search product and issue databases faster
new employees find policies and process knowledge
sales teams get consistent answers to product questions
customers receive self-service answers from approved documents
project teams search specifications, meeting notes and decisions
departments review long documents faster for relevant passages

Bad use cases:

the data base is outdated, contradictory or not approved
there are no clear data owners
answers would have legal, medical or financial impact without human review
user permissions are unclear and sensitive data is mixed together
the chatbot should make decisions before answer quality is measurable
the company actually has a process or data quality problem but calls it "AI"

An AI chatbot is not a replacement for information architecture. If nobody knows which source is authoritative, a model cannot magically create that governance.

How RAG works technically

RAG stands for Retrieval-Augmented Generation. The system first searches for relevant information and then lets the language model answer based on that context.

A typical flow:

Sources are connected: for example Confluence, SharePoint, Google Drive, Notion, website, CRM or PDFs.
Content is cleaned, split into segments and enriched with metadata.
Text segments are converted into embeddings and stored in a search index or vector database.
When a user asks a question, the system searches for matching segments.
Permissions filter which results the user is allowed to see.
The language model creates an answer with source references.
Feedback, logs and evaluation show whether the answer was useful.

The critical point sits between steps 4 and 6. Without permission checks, the chatbot can expose confidential information. Without good retrieval quality, the model receives the wrong context. Without evaluation, nobody notices when answer quality gets worse.

Data sources and prioritization

The data sources determine quality more than the model. Common sources include:

knowledge bases: Confluence, Notion, SharePoint, Google Drive
documents: PDFs, Word files, presentations, product sheets, contracts
support: Zendesk, Intercom, Jira Service Management, Freshdesk
CRM and sales: HubSpot, Salesforce, Pipedrive, internal databases
product data: PIM, ERP, CMS, technical documentation
website and help center: public content, FAQ, documentation
internal systems: project management, tickets, wikis, databases

Not every source should be connected first. Good first sources are current, maintained, clearly owned and useful for a specific question type. Bad first sources are old shared drives, duplicated PDF collections, unclear exports or documents with mixed permissions.

For a pilot, one strong source is often better than ten weak ones. Production systems need synchronization, change tracking, error handling, duplicate detection and clear data ownership.

Permissions: the most important enterprise topic

An enterprise chatbot must not only provide good answers. It must provide the right answers to the right people. This is often the biggest technical and organizational challenge.

Examples:

HR documents must not be visible to all employees.
Customer data may only be read by the responsible team.
Financial data does not belong in general search results.
Drafts, outdated versions or confidential contracts must not appear unchecked in answers.
External customers may only see their own documents.

Permissions must be considered during retrieval, not only in the interface. If an unauthorized text segment reaches the model context, the damage has already happened. A clean system connects user identity, roles, groups, document metadata and backend checks.

Security: prompt injection, tool limits and output validation

LLM applications have their own security risks. OWASP's LLM Top 10 names prompt injection, insecure output handling, supply-chain vulnerabilities and excessive agency as relevant risk areas. For a company chatbot, this means security is not limited to server hardening.

Practical safeguards include:

separate system instructions, retrieved content and user input clearly
never treat retrieved documents as trusted instructions
limit which tools the chatbot may call
validate tool parameters before execution
use allowlists for actions and systems
require human approval for risky actions
log retrieval, prompts, tool calls and answers in a privacy-aware way
sanitize generated output before it is used in downstream systems
test with malicious documents and prompt injection attempts

The chatbot should not be able to execute arbitrary actions because a document or user message instructs it to do so. RAG answers and agentic tool calls must be treated as separate risk levels.

Privacy, logging and the EU AI Act

AI chatbots for companies almost always touch personal or confidential data. This does not replace legal advice, but these technical questions should be clarified early:

Which data sources contain personal data?
Which content may be sent to model providers?
Where are prompts, answers, logs and embeddings stored?
How long are interactions retained?
Who can view chat histories and feedback?
Can users request access, correction or deletion?
How are sources, permissions and answer quality documented?

The EU AI Act is also relevant. The European Commission describes the AI Act as a risk-based framework. It entered into force on 1 August 2024. Prohibited-practices and AI-literacy obligations have applied since 2 February 2025, GPAI and governance obligations since 2 August 2025, and many rules become applicable from 2 August 2026. The Commission also describes transparency obligations for chatbots: people should be informed when they interact with a machine. According to the Commission, transparency rules apply from August 2026.

In practice, this means an internal knowledge chatbot needs clear notices, responsibilities, purpose definition, data flows, risk assessment and control mechanisms. The closer the chatbot gets to decisions with legal, financial, HR or health impact, the more tightly the system must be limited.

Cost and timeline: pilot vs. production

An AI chatbot should be built in stages. The first pilot answers whether data quality, retrieval and value are viable. The production version answers whether permissions, quality, operations and integration are reliable.

Phase	Duration	Typical effort
Discovery and data check	1 to 2 weeks	use case, sources, privacy, risks, success metrics
RAG prototype	2 to 5 weeks	one data source, simple interface, source-based answers
Pilot	4 to 8 weeks	user group, feedback, evaluation, first permission checks
Production system	8 to 20 weeks	roles, SSO, integrations, monitoring, admin, operations
Ongoing development	continuous	new sources, quality assurance, cost control, governance

Typical cost ranges:

Project type	Realistic range
Data and use-case check	5,000 to 15,000 EUR
RAG prototype	20,000 to 60,000 EUR
Pilot with real users	40,000 to 100,000 EUR
Production enterprise chatbot	60,000 to 180,000 EUR and more

Additional running costs include model usage, embedding generation, vector database, hosting, monitoring, maintenance, data synchronization, evaluation and support.

Pilot metrics: how to decide whether it works

A pilot should not end with "people liked the demo". It should produce measurable evidence.

Useful metrics:

answer accuracy on a fixed evaluation set
source coverage: how often the correct source is found
refusal quality: whether the bot declines when evidence is missing
time saved per question or ticket
reduction in repeated support questions
user satisfaction by role
number of escalations to humans
cost per answer
latency and uptime
incidents related to permissions or wrong sources

These metrics help distinguish a promising demo from a system that should actually enter production.

How to reduce hallucinations

Hallucinations cannot be eliminated completely, but they can be reduced and measured. The key is not just a better prompt, but a quality system.

Important measures:

clear sources that are current and approved
retrieval tests with known questions and expected sources
answer rules: answer only when enough context exists
source references directly next to the answer
refusal behavior when uncertain
feedback buttons and error categories
regular evaluation sets
monitoring for cost, latency, retrieval quality and error rates
escalation to humans for critical questions

A production chatbot should be able to say: "I cannot find a reliable answer in the approved sources." That boundary is a quality feature, not a bug.

SaaS tool, custom chatbot or hybrid approach?

Many SaaS tools provide quick chatbots for help centers, website FAQ or internal documents. That is useful when data sources are standardized, permissions are simple and deep integrations are not needed.

Custom development is worthwhile when:

several internal systems need to be connected
permissions apply per user, customer or document
sensitive data is processed
answers need to include business logic
the chatbot is embedded into a customer portal, CRM or internal tool
audit logs, evaluation and monitoring should stay under your control

A hybrid approach is often best: SaaS for standard functionality, a custom backend layer for permissions, integrations, data preparation and quality assurance.

AI data readiness check

Before a RAG project, a company should answer these questions:

Which user group has the highest value?
Which three question types should the chatbot answer first?
Which sources are current, approved and maintained by clear owners?
Which sources contain personal or confidential data?
Which permissions currently apply in the source systems?
Is there SSO, groups or roles that can be reused?
Which answers may only be suggestions?
Which sources must be cited?
How is quality measured?
Who is the business owner after launch?
Which cost limits apply to model usage and infrastructure?

If these questions are not answered, the first step should not be chatbot development, but a data and permissions check.

How hafencity.dev builds AI chatbots

We build AI chatbots as AI integrations, not as isolated demos. That includes data check, RAG architecture, backend, permission model, evaluation, monitoring and integration into existing workflows. Depending on maturity, we start with an AI strategy, a limited RAG pilot or a production-oriented integration.

In a first conversation, it is usually possible to clarify whether a RAG chatbot is realistic for your data situation, which sources are suitable first and which risks must be checked before a pilot.

If you want to assess this, you can request an AI chatbot project. A complete specification is not required. A short description of target users, data sources, permissions and the most critical question-answer workflow is enough.

FAQ

What does an AI chatbot with company knowledge cost?

A simple RAG prototype often costs 20,000 to 60,000 EUR. A production enterprise chatbot with permissions, SSO, several data sources, evaluation and monitoring often ranges from 60,000 to 180,000 EUR or more.

Can ChatGPT be connected to company data?

Yes, but not by simply uploading arbitrary documents. Production use requires data preparation, permissions, privacy review, logging, source references and clear rules for which data may go to which provider.

Yes, if data flows, legal basis, processing agreements, hosting, retention periods, deletion concept and access controls are planned properly. It is especially important not to store personal data unnecessarily in prompts, logs or embeddings.

How do you prevent wrong answers?

Through good sources, retrieval tests, source references, answer rules, refusal behavior, feedback and regular evaluation. Errors cannot be eliminated completely, so critical processes need human control.

Which data sources are suitable?

Good first sources are current, maintained and clearly owned content: help centers, product documentation, internal process manuals, approved PDFs or knowledge bases. Poor sources are old, contradictory or unauthorized document collections.

Sources and further reading

Conclusion

An enterprise chatbot does not become production-ready because of a language model. It becomes reliable through data quality, permissions, evaluation, logging and integration into real workflows. RAG is the technical core, but governance and operations decide whether it becomes a dependable system.

Written by

Marius Gill

Managing Director and software developer with over 10 years of experience

Share

All posts

Keep reading

Let's talk about your project

Book a 30-minute discovery call. We'll review your goals, surface unknowns, and outline how we would run the engagement.

Schedule a call

Booking calendar (Cal.com)

This area embeds the external service Cal.com. By loading it you agree that a connection to Cal.com is established and data may be transferred to the USA.

Privacy policy