Gomus AI Documentation

Gomus AI is an AI-powered knowledge platform built on a state-of-the-art RAG (Retrieval-Augmented Generation) engine. Upload documents, crawl websites, and build knowledge bases that your AI assistants and agents can query with precision. Every answer includes chunk-level citations so you can verify the source instantly.

New to Gomus AI?

Follow this quickstart to go from sign-up to your first AI-powered conversation in under 5 minutes.

Create an Account

Go to Gomus AI

Visit app.gomus.org and click Sign Up.

Register

Enter your email address and choose a password. You will receive a confirmation email.

Choose your plan

All new accounts start on the Free plan with access to 20+ AI models, 1 knowledge base, and 500 credits/month. You can upgrade anytime from the Account settings.

Your First Knowledge Base

Create a Knowledge Base

Click the Knowledge Base tab at the top of the page, then click Create Knowledge Base. Give it a name and click OK.

Choose an Embedding Model

Select an embedding model for your knowledge base. We recommend Cohere Embed v4 (multilingual) or Titan Embed v2 (English-optimized). This cannot be changed once files are parsed.

Upload Files

Click + Add file and upload your documents. Supported formats include PDF, DOCX, XLSX, CSV, TXT, MD, PPTX, images (JPEG, PNG), and more.

Parse Files

Click the play button on each uploaded file to start parsing. Gomus AI will extract text, tables, images, and structure from your documents and build a searchable index.

Your First AI Chat

Create a Chat Assistant

Click the Chat tab, then Create Chat. Name your assistant and select one or more knowledge bases to use as context.

Choose a Model

Pick an AI model from the dropdown. Try Claude Sonnet 4 for best quality or Amazon Nova Lite for fast, cost-effective responses.

Start Chatting

Ask a question about your documents. Every answer comes with source citations you can click to verify.

Knowledge Bases

A knowledge base is a collection of parsed documents and web pages that serve as the context for your AI chats and agents. Gomus AI parses your files into semantic chunks, builds embeddings, and indexes everything for high-precision retrieval.

Create a Knowledge Base

Navigate to the Knowledge Base tab and click Create Knowledge Base. Each knowledge base has its own configuration for chunking method and embedding model.

Plan Limits

Free 1 KB, 50 MB • Base 5 KBs, 1 GB • Premium 20 KBs, 10 GB • Business Unlimited, 100 GB

Configure a Knowledge Base

After creating a knowledge base, configure it by selecting:

Chunking method — How files are split into retrievable chunks
Embedding model — Which model converts chunks into vectors
Similarity threshold — Minimum relevance score for retrieval (default: 0.2)
Vector similarity weight — Balance between vector and keyword search (default: 0.3)

Chunking Methods

Gomus AI offers multiple built-in chunking templates optimized for different document types:

Template	Best For	Supported Formats
General	Most documents	PDF, DOCX, XLSX, PPT, TXT, MD, CSV, HTML, images
Q&A	FAQ-style content	XLSX, CSV, TXT
Table	Structured data	XLSX, CSV, TXT
Paper	Academic papers	PDF
Book	Long-form content	DOCX, PDF, TXT
Laws	Legal documents	DOCX, PDF, TXT
Presentation	Slide decks	PDF, PPTX
Picture	Image-based docs	JPEG, JPG, PNG, TIF, GIF
One	Entire doc as one chunk	DOCX, XLSX, PDF, TXT
Manual	Technical manuals	PDF
Tag	Tag sets for other KBs	XLSX, CSV, TXT

You can also override the chunking method per file after upload.

Custom Ingestion Pipeline

For advanced use cases, create a custom ingestion pipeline using the Agent Builder. Select Ingestion Pipeline when creating a new agent, then link it to your knowledge base configuration.

Embedding Models

An embedding model converts your document chunks into vector representations for semantic search. Available embedding models:

Model	Provider	Best For
Cohere Embed v4	Cohere (via Bedrock)	Multilingual, general purpose
Amazon Titan Embed v2	Amazon Bedrock	English-optimized, cost-effective
Cohere Embed Multilingual v3	Cohere (via Bedrock)	100+ languages

Important

Once you select an embedding model and parse files with it, you cannot change it. All files in a knowledge base must use the same embedding model to ensure they are compared in the same vector space.

Upload & Parse Files

After configuring your knowledge base:

Click + Add file to upload documents from your device
Click the play button on each file to start parsing
Once parsed, click the file name to view individual chunks
Double-click any chunk to add keywords, questions, or make manual edits

Parsing Tips

If parsing stalls below 1%, cancel and restart. If it stalls near completion, the file may be too large — try splitting it. Adding keywords to chunks improves their ranking for queries containing those terms.

Retrieval Testing

Before setting up a chat, test your knowledge base retrieval:

Open your knowledge base and go to Retrieval Testing
Type a test question in the Test text field
Review the retrieved chunks and their similarity scores
Adjust Similarity threshold and Vector similarity weight as needed

Web Crawler

Turn any website into a searchable knowledge base. Base+

In your knowledge base, click + Add file and select Web Crawl
Enter the root URL of the website you want to crawl
Configure crawl depth and page limits
Gomus AI will crawl all linked pages, extract content, and index it

The crawler respects robots.txt rules and you can schedule periodic re-crawls to keep content fresh. Crawled web pages and uploaded documents can coexist in the same knowledge base for unified retrieval.

Chat Assistants

Chat assistants let you have AI-powered conversations grounded in your knowledge bases. Each answer is backed by citations from your documents, reducing hallucinations and ensuring accuracy.

Create a Chat Assistant

Navigate to Chat

Click the Chat tab at the top of the page, then click Create Chat.

Configure the Assistant

Give it a name, select one or more knowledge bases, and choose your AI model.

Set the Empty Response

Define what the assistant says when it cannot find an answer in your documents. Leave blank to allow the model to improvise (may cause hallucinations).

Start Chatting

Click on your assistant to open the chat interface. Ask questions and get cited answers.

Chat Configuration

Each chat assistant has its own configuration:

Model — Choose any available LLM (can differ per assistant)
Knowledge bases — Select which knowledge bases to query
System prompt — Customize the AI's behavior and persona
Empty response — Response when no relevant information is found
Multi-turn optimization — Uses conversation history to refine queries (enable under Prompt Engine)
TopN — Number of chunks to retrieve (reduce to speed up responses)
Similarity threshold — Minimum relevance score for chunks

AI Search vs AI Chat

Feature	AI Search	AI Chat
Conversation	Single-turn	Multi-turn
Retrieval strategy	Fixed hybrid search	Configurable
Model selection	System default	Per assistant
Knowledge graph	Not available	Configurable
Retrieved chunks	Shown below answer	Available via citations

Use AI Search for quick one-off queries and retrieval debugging. Use AI Chat for production conversations.

Agent Builder

The Agent Builder is a visual drag-and-drop editor for designing multi-step AI workflows. Connect retrieval, generation, web search, conditional logic, and custom components into powerful pipelines — no code required. All Plans

Create an Agent

Open Agent Tab

Click the Agent tab at the top of the page to see your existing agents or create a new one.

Choose Template or Start Blank

Click + Create Agent. Choose from pre-built templates (Deep Research, Customer Support, etc.) or click Create from Blank to start fresh.

Build Your Workflow

Use the visual canvas to add and connect components. Click the + button on any component to add downstream steps.

Save & Test

Click Save to apply your changes. Test your agent by chatting with it directly from the editor.

Agent Components

Component	Description
Begin	Entry point of every agent workflow
Retrieval	Query one or more knowledge bases
Agent (LLM)	Send prompts to an AI model for generation
Categorize	Classify user intent and route accordingly
Switch	Conditional branching based on rules
Iteration	Loop over a list of items
Message	Send a fixed message to the user
Await Response	Wait for user input mid-workflow
Code	Execute custom Python code (sandbox)
HTTP	Make HTTP requests to external APIs
Execute SQL	Run SQL queries against databases
Text Processing	Transform, extract, or reformat text
Docs Generator	Generate formatted documents from data
Parser	Parse files within a workflow
Indexer	Index parsed content into a knowledge base

Pre-built Templates

Gomus AI provides ready-to-use agent templates:

Deep Research — Multi-step research with web search + knowledge base retrieval
E-Commerce Customer Support — Intent classification + knowledge-grounded responses
Ingestion Pipeline — Custom document parsing and indexing workflow
Sandbox — Code execution for data analysis tasks

Available Models

Gomus AI provides 40+ pre-configured AI models from leading providers. All models are ready to use — no API keys required from your side.

Chat Models (LLMs)

Claude Opus 4.6 Claude Opus 4.5 Claude Sonnet 4.6 Claude Sonnet 4.5 Claude Sonnet 4 Claude 3.7 Sonnet Claude 3.5 Sonnet Claude Haiku 4.5 Claude 3 Haiku Amazon Nova Pro Amazon Nova Lite Amazon Nova Micro Llama 4 Maverick Llama 3.3 70B Llama 3.2 3B Llama 3.2 1B Qwen 3 32B Kimi K2 Pixtral Large + more

Embedding Models

Cohere Embed v4 Amazon Titan Embed v2 Cohere Embed Multilingual v3

Reranking Models

Cohere Rerank v3.5

Models by Tier

Tier	Available Models
Free	20+ models including Nova, Llama, Titan Embed, Cohere Embed & Rerank
Base	All Free models + Claude, Pixtral, Qwen, and more
Premium	All Base models + Claude Opus, advanced vision models
Business	Full catalog of 40+ models, priority capacity

No API Keys Needed

Unlike self-hosted solutions, Gomus AI comes with all models pre-configured. Just select a model and start using it. Your usage is tracked via the credit system.

API Reference

Gomus AI provides RESTful APIs and an OpenAI-compatible endpoint to integrate AI capabilities into your applications. Base+

Authentication

All API requests require an API key. To get your key:

Click your avatar (top right) and select API Key
Click Create new key and copy the generated key
Include it in the Authorization header of every request

Authorization: Bearer YOUR_API_KEY

Base URL

https://app.gomus.org/api/v1

Error Codes

Code	Message	Description
400	Bad Request	Invalid request parameters
401	Unauthorized	Invalid or missing API key
403	Forbidden	Access denied (plan limit or permission)
404	Not Found	Resource not found
429	Rate Limited	Too many requests, slow down
500	Server Error	Internal server error

Chat Completion (OpenAI-Compatible)

Gomus AI supports the OpenAI chat completion format, making it easy to integrate with existing tools and libraries.

POST /api/v1/chats_openai/{chat_id}/chat/completions

{
  "model": "model",
  "messages": [
    {"role": "user", "content": "What does the contract say about termination?"}
  ],
  "stream": true
}

Request Headers

Content-Type: application/json
Authorization: Bearer YOUR_API_KEY

cURL Example

curl -X POST https://app.gomus.org/api/v1/chats_openai/{chat_id}/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "model",
    "messages": [{"role": "user", "content": "Summarize the key findings"}],
    "stream": true
  }'

Parameters

Parameter	Type	Required	Description
`model`	string	Yes	Model name (auto-parsed, can be any value)
`messages`	array	Yes	Conversation history with at least one user message
`stream`	boolean	No	Stream response (default: true)

Datasets API

Manage knowledge bases and documents programmatically.

Method	Endpoint	Description
POST	`/api/v1/datasets`	Create a new dataset
GET	`/api/v1/datasets`	List all datasets
PUT	`/api/v1/datasets/{id}`	Update a dataset
DELETE	`/api/v1/datasets/{id}`	Delete a dataset
POST	`/api/v1/datasets/{id}/documents`	Upload documents
GET	`/api/v1/datasets/{id}/documents`	List documents in a dataset
DELETE	`/api/v1/datasets/{id}/documents/{doc_id}`	Delete a document
POST	`/api/v1/datasets/{id}/chunks`	Parse documents into chunks
GET	`/api/v1/datasets/{id}/chunks`	List chunks
POST	`/api/v1/retrieval`	Run retrieval query

Agents API

Method	Endpoint	Description
GET	`/api/v1/agents`	List all agents
POST	`/api/v1/agents/{id}/sessions`	Create a new agent session
POST	`/api/v1/agents/{id}/completions`	Converse with an agent
GET	`/api/v1/agents/{id}/sessions`	List agent sessions
DELETE	`/api/v1/agents/{id}/sessions/{session_id}`	Delete a session

Billing & Plans

Feature	Free	Base ($19.90/mo)	Premium ($49.90/mo)	Business ($149.90/mo)
AI Models	20+ free models	40+ including Claude	40+ including Claude	Full catalog + priority
Knowledge Bases	1	5	20	Unlimited
Storage	50 MB	1 GB	10 GB	100 GB
Credits / month	500	5,000	25,000	100,000
Web Crawler	—	Yes	Unlimited	Unlimited
Agent Builder	Yes	Yes	Yes	Yes
API Access	Yes	Yes	Yes	Yes
Team Collaboration	—	—	—	Yes
Support	Community	Email	Priority	Dedicated

Credit System

Gomus AI uses a credit-based system to track AI model usage. Different models consume different amounts of credits per request:

Free models (Nova Micro, Llama 1B/3B) — Minimal credit usage
Standard models (Nova Pro/Lite, Claude Haiku) — Moderate credit usage
Premium models (Claude Sonnet/Opus) — Higher credit usage

Credits reset monthly. You can monitor your usage from Account Settings. If you exhaust your credits, you can wait for the monthly reset or upgrade your plan.

Frequently Asked Questions

What is Gomus AI?

Gomus AI is an AI-powered knowledge platform built on a state-of-the-art RAG (Retrieval-Augmented Generation) engine. It combines deep document understanding with visual AI agent orchestration and web crawling, all in one platform. Upload documents or crawl websites, then get precise, cited answers from your data.

What makes Gomus AI different from ChatGPT?

ChatGPT answers based on its training data. Gomus AI answers based on your documents. Every answer includes citations pointing to the exact source chunks, so you can verify accuracy. This dramatically reduces hallucinations and makes it suitable for professional use cases where accuracy matters.

Is my data secure?

Yes. All data is processed and stored on AWS infrastructure in the EU (Frankfurt, eu-central-1). Your documents are never used to train AI models. All data is encrypted at rest and in transit. The platform is designed with GDPR compliance in mind.

Which file formats are supported?

Gomus AI supports a wide range of file formats:

Category	Formats
Documents	PDF, DOCX, DOC, TXT, MD, MDX, HTML, EML
Spreadsheets	XLSX, XLS, CSV
Presentations	PPTX, PPT
Images	JPEG, JPG, PNG, TIF, GIF
Data	JSON

Can I use the API to integrate Gomus AI into my app?

Yes! Gomus AI provides RESTful APIs and an OpenAI-compatible chat completion endpoint. See the API Reference section for details. API access is available on all plans.

How does the Web Crawler work?

Paste the root URL of any website. Gomus AI crawls all linked pages, extracts content, cleans HTML into structured text, and indexes everything into a knowledge base. You can then query the entire site through chat, just like uploaded documents. The crawler respects robots.txt and you can configure crawl depth and filters.

What is the Agent Flow Builder?

A visual drag-and-drop canvas where you design multi-step AI workflows. Connect components like knowledge retrieval, LLM generation, web search, conditional logic, and custom tools into a pipeline. No coding required. See Agent Builder for details.

Can I switch AI models per conversation?

Yes. Each chat assistant can use a different AI model. You can also change models within the same assistant at any time.

Do you support multi-turn conversations?

Yes. Enable Multi-turn optimization in your chat assistant's settings (under Prompt Engine) to let the AI use previous messages as context for better follow-up responses.

What happens when I run out of credits?

When your monthly credits are exhausted, AI model requests will be paused until credits reset at the start of your next billing cycle. You can upgrade your plan at any time for additional credits. Free-tier models with minimal credit usage may still be available.

Contact & Support

Email: support@gomus.org
Website: www.gomus.org
App: app.gomus.org