Gomus AI Documentation
Gomus AI is an AI-powered knowledge platform built on a state-of-the-art RAG (Retrieval-Augmented Generation) engine. Upload documents, crawl websites, and build knowledge bases that your AI assistants and agents can query with precision. Every answer includes chunk-level citations so you can verify the source instantly.
Follow this quickstart to go from sign-up to your first AI-powered conversation in under 5 minutes.
Create an Account
Go to Gomus AI
Visit app.gomus.org and click Sign Up.
Register
Enter your email address and choose a password. You will receive a confirmation email.
Choose your plan
All new accounts start on the Free plan with access to 20+ AI models, 1 knowledge base, and 500 credits/month. You can upgrade anytime from the Account settings.
Your First Knowledge Base
Create a Knowledge Base
Click the Knowledge Base tab at the top of the page, then click Create Knowledge Base. Give it a name and click OK.
Choose an Embedding Model
Select an embedding model for your knowledge base. We recommend Cohere Embed v4 (multilingual) or Titan Embed v2 (English-optimized). This cannot be changed once files are parsed.
Upload Files
Click + Add file and upload your documents. Supported formats include PDF, DOCX, XLSX, CSV, TXT, MD, PPTX, images (JPEG, PNG), and more.
Parse Files
Click the play button on each uploaded file to start parsing. Gomus AI will extract text, tables, images, and structure from your documents and build a searchable index.
Your First AI Chat
Create a Chat Assistant
Click the Chat tab, then Create Chat. Name your assistant and select one or more knowledge bases to use as context.
Choose a Model
Pick an AI model from the dropdown. Try Claude Sonnet 4 for best quality or Amazon Nova Lite for fast, cost-effective responses.
Start Chatting
Ask a question about your documents. Every answer comes with source citations you can click to verify.
Knowledge Bases
A knowledge base is a collection of parsed documents and web pages that serve as the context for your AI chats and agents. Gomus AI parses your files into semantic chunks, builds embeddings, and indexes everything for high-precision retrieval.
Create a Knowledge Base
Navigate to the Knowledge Base tab and click Create Knowledge Base. Each knowledge base has its own configuration for chunking method and embedding model.
Free 1 KB, 50 MB • Base 5 KBs, 1 GB • Premium 20 KBs, 10 GB • Business Unlimited, 100 GB
Configure a Knowledge Base
After creating a knowledge base, configure it by selecting:
- Chunking method — How files are split into retrievable chunks
- Embedding model — Which model converts chunks into vectors
- Similarity threshold — Minimum relevance score for retrieval (default: 0.2)
- Vector similarity weight — Balance between vector and keyword search (default: 0.3)
Chunking Methods
Gomus AI offers multiple built-in chunking templates optimized for different document types:
| Template | Best For | Supported Formats |
|---|---|---|
| General | Most documents | PDF, DOCX, XLSX, PPT, TXT, MD, CSV, HTML, images |
| Q&A | FAQ-style content | XLSX, CSV, TXT |
| Table | Structured data | XLSX, CSV, TXT |
| Paper | Academic papers | |
| Book | Long-form content | DOCX, PDF, TXT |
| Laws | Legal documents | DOCX, PDF, TXT |
| Presentation | Slide decks | PDF, PPTX |
| Picture | Image-based docs | JPEG, JPG, PNG, TIF, GIF |
| One | Entire doc as one chunk | DOCX, XLSX, PDF, TXT |
| Manual | Technical manuals | |
| Tag | Tag sets for other KBs | XLSX, CSV, TXT |
You can also override the chunking method per file after upload.
For advanced use cases, create a custom ingestion pipeline using the Agent Builder. Select Ingestion Pipeline when creating a new agent, then link it to your knowledge base configuration.
Embedding Models
An embedding model converts your document chunks into vector representations for semantic search. Available embedding models:
| Model | Provider | Best For |
|---|---|---|
| Cohere Embed v4 | Cohere (via Bedrock) | Multilingual, general purpose |
| Amazon Titan Embed v2 | Amazon Bedrock | English-optimized, cost-effective |
| Cohere Embed Multilingual v3 | Cohere (via Bedrock) | 100+ languages |
Once you select an embedding model and parse files with it, you cannot change it. All files in a knowledge base must use the same embedding model to ensure they are compared in the same vector space.
Upload & Parse Files
After configuring your knowledge base:
- Click + Add file to upload documents from your device
- Click the play button on each file to start parsing
- Once parsed, click the file name to view individual chunks
- Double-click any chunk to add keywords, questions, or make manual edits
If parsing stalls below 1%, cancel and restart. If it stalls near completion, the file may be too large — try splitting it. Adding keywords to chunks improves their ranking for queries containing those terms.
Retrieval Testing
Before setting up a chat, test your knowledge base retrieval:
- Open your knowledge base and go to Retrieval Testing
- Type a test question in the Test text field
- Review the retrieved chunks and their similarity scores
- Adjust Similarity threshold and Vector similarity weight as needed
Web Crawler
Turn any website into a searchable knowledge base. Base+
- In your knowledge base, click + Add file and select Web Crawl
- Enter the root URL of the website you want to crawl
- Configure crawl depth and page limits
- Gomus AI will crawl all linked pages, extract content, and index it
The crawler respects robots.txt rules and you can schedule periodic re-crawls to keep content fresh. Crawled web pages and uploaded documents can coexist in the same knowledge base for unified retrieval.
Chat Assistants
Chat assistants let you have AI-powered conversations grounded in your knowledge bases. Each answer is backed by citations from your documents, reducing hallucinations and ensuring accuracy.
Create a Chat Assistant
Navigate to Chat
Click the Chat tab at the top of the page, then click Create Chat.
Configure the Assistant
Give it a name, select one or more knowledge bases, and choose your AI model.
Set the Empty Response
Define what the assistant says when it cannot find an answer in your documents. Leave blank to allow the model to improvise (may cause hallucinations).
Start Chatting
Click on your assistant to open the chat interface. Ask questions and get cited answers.
Chat Configuration
Each chat assistant has its own configuration:
- Model — Choose any available LLM (can differ per assistant)
- Knowledge bases — Select which knowledge bases to query
- System prompt — Customize the AI's behavior and persona
- Empty response — Response when no relevant information is found
- Multi-turn optimization — Uses conversation history to refine queries (enable under Prompt Engine)
- TopN — Number of chunks to retrieve (reduce to speed up responses)
- Similarity threshold — Minimum relevance score for chunks
AI Search vs AI Chat
| Feature | AI Search | AI Chat |
|---|---|---|
| Conversation | Single-turn | Multi-turn |
| Retrieval strategy | Fixed hybrid search | Configurable |
| Model selection | System default | Per assistant |
| Knowledge graph | Not available | Configurable |
| Retrieved chunks | Shown below answer | Available via citations |
Use AI Search for quick one-off queries and retrieval debugging. Use AI Chat for production conversations.
Agent Builder
The Agent Builder is a visual drag-and-drop editor for designing multi-step AI workflows. Connect retrieval, generation, web search, conditional logic, and custom components into powerful pipelines — no code required. All Plans
Create an Agent
Open Agent Tab
Click the Agent tab at the top of the page to see your existing agents or create a new one.
Choose Template or Start Blank
Click + Create Agent. Choose from pre-built templates (Deep Research, Customer Support, etc.) or click Create from Blank to start fresh.
Build Your Workflow
Use the visual canvas to add and connect components. Click the + button on any component to add downstream steps.
Save & Test
Click Save to apply your changes. Test your agent by chatting with it directly from the editor.
Agent Components
| Component | Description |
|---|---|
| Begin | Entry point of every agent workflow |
| Retrieval | Query one or more knowledge bases |
| Agent (LLM) | Send prompts to an AI model for generation |
| Categorize | Classify user intent and route accordingly |
| Switch | Conditional branching based on rules |
| Iteration | Loop over a list of items |
| Message | Send a fixed message to the user |
| Await Response | Wait for user input mid-workflow |
| Code | Execute custom Python code (sandbox) |
| HTTP | Make HTTP requests to external APIs |
| Execute SQL | Run SQL queries against databases |
| Text Processing | Transform, extract, or reformat text |
| Docs Generator | Generate formatted documents from data |
| Parser | Parse files within a workflow |
| Indexer | Index parsed content into a knowledge base |
Pre-built Templates
Gomus AI provides ready-to-use agent templates:
- Deep Research — Multi-step research with web search + knowledge base retrieval
- E-Commerce Customer Support — Intent classification + knowledge-grounded responses
- Ingestion Pipeline — Custom document parsing and indexing workflow
- Sandbox — Code execution for data analysis tasks
Available Models
Gomus AI provides 40+ pre-configured AI models from leading providers. All models are ready to use — no API keys required from your side.
Chat Models (LLMs)
Embedding Models
Reranking Models
Models by Tier
| Tier | Available Models |
|---|---|
| Free | 20+ models including Nova, Llama, Titan Embed, Cohere Embed & Rerank |
| Base | All Free models + Claude, Pixtral, Qwen, and more |
| Premium | All Base models + Claude Opus, advanced vision models |
| Business | Full catalog of 40+ models, priority capacity |
Unlike self-hosted solutions, Gomus AI comes with all models pre-configured. Just select a model and start using it. Your usage is tracked via the credit system.
API Reference
Gomus AI provides RESTful APIs and an OpenAI-compatible endpoint to integrate AI capabilities into your applications. Base+
Authentication
All API requests require an API key. To get your key:
- Click your avatar (top right) and select API Key
- Click Create new key and copy the generated key
- Include it in the
Authorizationheader of every request
Authorization: Bearer YOUR_API_KEY
Base URL
https://app.gomus.org/api/v1
Error Codes
| Code | Message | Description |
|---|---|---|
| 400 | Bad Request | Invalid request parameters |
| 401 | Unauthorized | Invalid or missing API key |
| 403 | Forbidden | Access denied (plan limit or permission) |
| 404 | Not Found | Resource not found |
| 429 | Rate Limited | Too many requests, slow down |
| 500 | Server Error | Internal server error |
Chat Completion (OpenAI-Compatible)
Gomus AI supports the OpenAI chat completion format, making it easy to integrate with existing tools and libraries.
POST /api/v1/chats_openai/{chat_id}/chat/completions
{
"model": "model",
"messages": [
{"role": "user", "content": "What does the contract say about termination?"}
],
"stream": true
}
Request Headers
Content-Type: application/json
Authorization: Bearer YOUR_API_KEY
cURL Example
curl -X POST https://app.gomus.org/api/v1/chats_openai/{chat_id}/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "model",
"messages": [{"role": "user", "content": "Summarize the key findings"}],
"stream": true
}'
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Model name (auto-parsed, can be any value) |
messages | array | Yes | Conversation history with at least one user message |
stream | boolean | No | Stream response (default: true) |
Datasets API
Manage knowledge bases and documents programmatically.
| Method | Endpoint | Description |
|---|---|---|
| POST | /api/v1/datasets | Create a new dataset |
| GET | /api/v1/datasets | List all datasets |
| PUT | /api/v1/datasets/{id} | Update a dataset |
| DELETE | /api/v1/datasets/{id} | Delete a dataset |
| POST | /api/v1/datasets/{id}/documents | Upload documents |
| GET | /api/v1/datasets/{id}/documents | List documents in a dataset |
| DELETE | /api/v1/datasets/{id}/documents/{doc_id} | Delete a document |
| POST | /api/v1/datasets/{id}/chunks | Parse documents into chunks |
| GET | /api/v1/datasets/{id}/chunks | List chunks |
| POST | /api/v1/retrieval | Run retrieval query |
Agents API
| Method | Endpoint | Description |
|---|---|---|
| GET | /api/v1/agents | List all agents |
| POST | /api/v1/agents/{id}/sessions | Create a new agent session |
| POST | /api/v1/agents/{id}/completions | Converse with an agent |
| GET | /api/v1/agents/{id}/sessions | List agent sessions |
| DELETE | /api/v1/agents/{id}/sessions/{session_id} | Delete a session |
Billing & Plans
| Feature | Free | Base ($19.90/mo) | Premium ($49.90/mo) | Business ($149.90/mo) |
|---|---|---|---|---|
| AI Models | 20+ free models | 40+ including Claude | 40+ including Claude | Full catalog + priority |
| Knowledge Bases | 1 | 5 | 20 | Unlimited |
| Storage | 50 MB | 1 GB | 10 GB | 100 GB |
| Credits / month | 500 | 5,000 | 25,000 | 100,000 |
| Web Crawler | — | Yes | Unlimited | Unlimited |
| Agent Builder | Yes | Yes | Yes | Yes |
| API Access | Yes | Yes | Yes | Yes |
| Team Collaboration | — | — | — | Yes |
| Support | Community | Priority | Dedicated |
Credit System
Gomus AI uses a credit-based system to track AI model usage. Different models consume different amounts of credits per request:
- Free models (Nova Micro, Llama 1B/3B) — Minimal credit usage
- Standard models (Nova Pro/Lite, Claude Haiku) — Moderate credit usage
- Premium models (Claude Sonnet/Opus) — Higher credit usage
Credits reset monthly. You can monitor your usage from Account Settings. If you exhaust your credits, you can wait for the monthly reset or upgrade your plan.
Frequently Asked Questions
What is Gomus AI?
Gomus AI is an AI-powered knowledge platform built on a state-of-the-art RAG (Retrieval-Augmented Generation) engine. It combines deep document understanding with visual AI agent orchestration and web crawling, all in one platform. Upload documents or crawl websites, then get precise, cited answers from your data.
What makes Gomus AI different from ChatGPT?
ChatGPT answers based on its training data. Gomus AI answers based on your documents. Every answer includes citations pointing to the exact source chunks, so you can verify accuracy. This dramatically reduces hallucinations and makes it suitable for professional use cases where accuracy matters.
Is my data secure?
Yes. All data is processed and stored on AWS infrastructure in the EU (Frankfurt, eu-central-1). Your documents are never used to train AI models. All data is encrypted at rest and in transit. The platform is designed with GDPR compliance in mind.
Which file formats are supported?
Gomus AI supports a wide range of file formats:
| Category | Formats |
|---|---|
| Documents | PDF, DOCX, DOC, TXT, MD, MDX, HTML, EML |
| Spreadsheets | XLSX, XLS, CSV |
| Presentations | PPTX, PPT |
| Images | JPEG, JPG, PNG, TIF, GIF |
| Data | JSON |
Can I use the API to integrate Gomus AI into my app?
Yes! Gomus AI provides RESTful APIs and an OpenAI-compatible chat completion endpoint. See the API Reference section for details. API access is available on all plans.
How does the Web Crawler work?
Paste the root URL of any website. Gomus AI crawls all linked pages, extracts content, cleans HTML into structured text, and indexes everything into a knowledge base. You can then query the entire site through chat, just like uploaded documents. The crawler respects robots.txt and you can configure crawl depth and filters.
What is the Agent Flow Builder?
A visual drag-and-drop canvas where you design multi-step AI workflows. Connect components like knowledge retrieval, LLM generation, web search, conditional logic, and custom tools into a pipeline. No coding required. See Agent Builder for details.
Can I switch AI models per conversation?
Yes. Each chat assistant can use a different AI model. You can also change models within the same assistant at any time.
Do you support multi-turn conversations?
Yes. Enable Multi-turn optimization in your chat assistant's settings (under Prompt Engine) to let the AI use previous messages as context for better follow-up responses.
What happens when I run out of credits?
When your monthly credits are exhausted, AI model requests will be paused until credits reset at the start of your next billing cycle. You can upgrade your plan at any time for additional credits. Free-tier models with minimal credit usage may still be available.
Contact & Support
- Email: support@gomus.org
- Website: www.gomus.org
- App: app.gomus.org
© 2025-2026 Gomus Digital Agency. All rights reserved.