Skip to main content

An Assistant is an executable instance of a System that enables end-user interaction and provides enhanced capabilities beyond the base System workflow. While Systems are the blueprint or template that defines AI processing workflows, Assistants are the functional, deployable versions that users can actually interact with.

Think of an Assistant as a specialized AI agent built from a System that can:

  • Chat with users in real-time with conversational memory
  • Access knowledge bases through uploaded documents and files
  • Execute System workflows with user-provided inputs
  • Maintain session history for tracking interactions and results
  • Provide streaming responses for real-time communication

Key Characteristics:

  • System-Based: Every Assistant is built from an existing System's workflow
  • Interactive: Users can chat directly with Assistants through a conversational interface
  • Knowledge-Enhanced: Support for RAG (Retrieval-Augmented Generation) with document upload
  • Persistent: Maintains conversation history and session data
  • Configurable: Each Assistant has its own settings separate from the underlying System

Assistant Structure and Components

Core Assistant Properties

Basic Information

  • Name (string, required): A descriptive name for your Assistant
  • Description (string, required): Explanation of what the Assistant does and its purpose
  • ID (string, auto-generated): Unique identifier for the Assistant
  • System ID (string, required): Reference to the System this Assistant is based on
  • Created At (Date): Timestamp when the Assistant was first created
  • Updated At (Date): Timestamp when the Assistant was last modified

Classification and Access

  • Tags (string[]): Array of tags for categorizing and organizing Assistants
  • Is Public (boolean): Whether this Assistant is publicly accessible or private

Assistant Configuration

Every Assistant includes a comprehensive configuration object that controls its behavior:

Core Settings

  • Temperature (number, 0-1): Controls creativity/randomness of AI responses (inherited from System)
  • Max Tokens (number): Maximum number of tokens the Assistant can generate per response (inherited from System)
  • Enable RAG (boolean): Whether to use uploaded knowledge base documents for enhanced responses

RAG (Retrieval-Augmented Generation) Settings

  • Enable RAG (boolean): Whether to use knowledge base documents for enhanced responses
  • RAG Settings (object, optional):
    • Enabled (boolean): Master toggle for RAG functionality
    • Document Sources (string[]): List of knowledge base document identifiers
    • Chunk Size (number): Size of text chunks for processing
    • Overlap Size (number): Overlap between text chunks
    • Similarity Threshold (number): Minimum similarity score for including knowledge chunks

Chat Settings

  • Chat Settings (object, optional):
    • Enabled (boolean): Whether chat functionality is active
    • Max History (number): Maximum number of previous messages to remember
    • Context Window (number): Total token limit for conversation context
    • Personality Prompt (string, optional): Custom personality instructions

How to Create an Assistant

Step 1: Access Assistant Creation

[Screenshot placeholder: Assistants list page with "Create Assistant" button]

  1. Navigate to the Assistants page in the application
  2. Click the "Create Assistant" button in the top-right corner

Step 2: Configure Basic Information

[Screenshot placeholder: Assistant creation form]

The Assistant creation form includes:

Assistant Details

  • Assistant Name: Enter a descriptive name (e.g., "Marketing Helper", "Code Reviewer")
  • System Selection: Choose the System this Assistant will be based on
    • Dropdown shows all available Systems with descriptions
    • The Assistant will inherit the System's workflow and AI model configuration

Step 3: Automatic Configuration

When you create an Assistant, it automatically inherits the AI model configuration from the selected System and applies default Assistant-specific settings:

{
"enableRag": false,
"chatSettings": {
"enabled": true,
"maxHistory": 10,
"contextWindow": 4000
}
}

The AI model, temperature, max tokens, and system prompts are inherited directly from the underlying System's workflow nodes.

Step 4: Assistant Management

After creation, you're automatically taken to the Assistant Management interface with three main tabs:

Assistant Management Interface

Knowledge Base Tab

[Screenshot placeholder: Knowledge base management interface]

The Knowledge Base tab allows you to enhance your Assistant with documents and files:

File Upload

  • Drag & Drop: Drag files anywhere on the tab to upload them
  • File Browser: Click "Add Files" to select files from your computer
  • Supported Formats: PDF, DOC/DOCX, TXT, MD, CSV, JSON, HTML, and more

Upload Process

The system handles file processing in three stages:

  1. Getting Upload URL: Secure signed URL generation for S3 storage
  2. Uploading to S3: Direct file upload to cloud storage
  3. Processing: Text extraction, chunking, and embedding generation

File Management

  • File Status: Monitor processing status (Available, Extracting Text, Embedding, etc.)
  • File Details: View size, type, upload date, and extracted text
  • File Actions: Inspect content, download original files, or remove from knowledge base

Knowledge Base Features

  • Search: Find specific files in your knowledge base
  • Status Monitoring: Real-time updates on file processing progress
  • Text Extraction: Automatic text extraction from various document types
  • Embedding Generation: AI-powered text embeddings for similarity search

Playground Tab

[Screenshot placeholder: Chat playground interface]

The Playground provides a real-time chat interface for testing and interacting with your Assistant:

Chat Interface

  • Conversational UI: Natural chat experience with message history
  • Streaming Responses: Real-time response generation with live updates
  • Message History: Persistent conversation within the session
  • Rich Formatting: Support for markdown formatting in responses

Chat Features

  • Knowledge Integration: Automatic use of uploaded documents when relevant
  • Debug Information: Access to detailed execution logs for each response
  • Response Streaming: Watch responses generate in real-time
  • Error Handling: Clear error messages for failed interactions

User Experience

  • Auto-scroll: Automatic scrolling to new messages
  • Typing Indicators: Shows when the Assistant is processing
  • Send Methods: Enter to send, Shift+Enter for new lines
  • Response Status: Visual indicators for message processing states

History Tab

[Screenshot placeholder: Session history interface]

The History tab provides comprehensive tracking of all Assistant interactions:

Session Management

  • Session List: All previous conversations with timestamps
  • Search Functionality: Find specific sessions by input or output content
  • Session Details: Input messages, AI responses, and execution metadata
  • Chronological Order: Sessions sorted by most recent first

Data Tracking

  • Input/Output Pairs: Complete record of user inputs and Assistant responses
  • Timestamps: Precise timing information for each interaction
  • Session Metadata: Additional data about execution context
  • Export Capabilities: Access to session data for analysis

Advanced Features

Real-Time Streaming

Assistants support advanced streaming capabilities:

Server-Sent Events (SSE)

  • Live Response Generation: Responses appear as they're generated
  • Progress Indicators: Real-time status updates during processing
  • Error Recovery: Graceful handling of connection issues
  • Multi-step Processing: Status updates for complex System workflows

Streaming Benefits

  • Immediate Feedback: Users see responses starting immediately
  • Better UX: No waiting for complete responses
  • Progress Transparency: Clear indication of processing stages
  • Reduced Latency: Perceived faster response times

Knowledge Base Integration

RAG (Retrieval-Augmented Generation)

  • Automatic Context: Relevant documents automatically included in responses
  • Similarity Search: AI-powered matching of user queries to knowledge content
  • Chunk Management: Intelligent text chunking for optimal retrieval
  • Context Windows: Balanced inclusion of relevant knowledge without overwhelming

Document Processing Pipeline

  1. Upload: Secure file upload to cloud storage
  2. Text Extraction: Convert documents to searchable text
  3. Chunking: Split content into manageable pieces
  4. Embedding: Generate AI embeddings for similarity search
  5. Indexing: Make content searchable for real-time retrieval

Session Management

Conversation Persistence

  • Cross-Session Memory: Conversations persist across browser sessions
  • History Tracking: Complete record of all interactions
  • Context Preservation: Maintain conversation context within sessions
  • Data Analytics: Session data available for usage analysis

Session Features

  • Automatic Saving: All interactions automatically saved
  • Search Capabilities: Find previous conversations quickly
  • Export Options: Access to session data in structured formats
  • Privacy Controls: Manage data retention and access

Assistant Operations

Managing Assistants

[Screenshot placeholder: Assistants list with management options]

The Assistants list provides comprehensive management capabilities:

List View Features

  • Search and Filter: Find Assistants by name or description
  • Status Indicators: Public/Private visibility status
  • System Information: Shows which System each Assistant is based on
  • Quick Actions: Chat, Manage, Duplicate, Delete options

Assistant Actions

Chatting
  • Direct Access: Quick access to Assistant chat interface
  • Playground Mode: Full-featured testing environment
  • Real-time Interaction: Immediate conversation capabilities
Management
  • Knowledge Base: Add, remove, and organize knowledge documents
  • Configuration: Modify Assistant settings and behavior
  • History Review: Access to all previous interactions
Duplication
  • Quick Copying: Create copies of existing Assistants
  • Configuration Inheritance: Duplicated Assistants inherit all settings
  • Independent Management: Copies can be modified independently
Deletion
  • Permanent Removal: Complete deletion of Assistant and all data
  • Data Cleanup: Removes knowledge base files and session history
  • Confirmation Required: Safety measures to prevent accidental deletion

Integration with Systems

System Relationship

  • Foundation: Every Assistant is built on a System workflow
  • Model Inheritance: AI models, temperature, and max tokens are inherited from the System
  • Updates: Changes to Systems don't automatically affect existing Assistants
  • Flexibility: Multiple Assistants can be created from the same System

Assistant-Specific Features

  • Knowledge Base: Upload and manage documents for RAG functionality
  • Chat Interface: Real-time conversational capabilities with streaming responses
  • Session History: Persistent conversation tracking and analytics
  • RAG Configuration: Enable and configure knowledge base integration

Best Practices

Assistant Design

  1. Clear Purpose: Define specific use cases and target users
  2. Appropriate Knowledge: Upload relevant, high-quality documents
  3. System Selection: Choose Systems with appropriate AI model configuration for your use case
  4. Testing: Thoroughly test in Playground before deployment

Knowledge Base Management

  1. Quality Content: Upload well-structured, relevant documents
  2. Regular Updates: Keep knowledge base current and accurate
  3. File Organization: Use clear file names and maintain good organization
  4. Size Management: Monitor file sizes and processing times

User Experience Optimization

  1. Response Quality: Select Systems with appropriate AI model settings for your use case
  2. Knowledge Relevance: Ensure uploaded content matches user needs
  3. Testing Coverage: Test various scenarios and edge cases
  4. Performance Monitoring: Track response times and user satisfaction

Security and Privacy

  1. Access Controls: Properly configure public/private settings
  2. Data Management: Regularly review and clean up knowledge base
  3. Session Privacy: Understand session data retention policies
  4. Content Sensitivity: Be mindful of confidential information in uploads

API Integration

Chat API

Assistants provide robust API endpoints for integration:

  • Standard Chat: Synchronous request/response chat
  • Streaming Chat: Real-time streaming responses via Server-Sent Events
  • Session Management: Access to conversation history and context
  • File Upload: Programmatic knowledge base management

Knowledge Management API

  • File Upload: Programmatic document upload and processing
  • File Management: List, update, and delete knowledge base files
  • Processing Status: Monitor file processing and embedding generation
  • Search Capabilities: Query knowledge base content directly

Session Analytics API

  • Session History: Access to all Assistant interactions
  • Usage Metrics: Performance and usage analytics
  • Export Functions: Data export for analysis and reporting
  • Search and Filter: Programmatic access to session data

Comparison: Systems vs. Assistants

FeatureSystemsAssistants
PurposeWorkflow definition and configurationUser-facing interaction and execution
InteractivityStatic workflow blueprintsReal-time chat and interaction
Knowledge BaseSystem-level resource settingsIndividual document upload and management
User AccessDevelopment/configuration toolEnd-user application
Chat CapabilityNone (workflow execution only)Full conversational interface
Session HistoryExecution logs onlyComplete conversation persistence
DeploymentCannot be used directlyReady for user interaction
CustomizationWorkflow and AI configurationKnowledge base and RAG settings

Future Enhancements

Planned improvements for Assistant functionality include:

Advanced RAG Features

  • Multi-modal Knowledge: Support for images, audio, and video content
  • Advanced Retrieval: Hybrid search combining semantic and keyword matching
  • Knowledge Graphs: Structured knowledge representation and reasoning
  • Real-time Updates: Dynamic knowledge base updates during conversations

Enhanced Chat Capabilities

  • Voice Integration: Speech-to-text and text-to-speech capabilities
  • Multi-language Support: Conversation in multiple languages
  • Conversation Branching: Support for complex conversation flows
  • Tool Integration: Connect to external APIs and services

Enterprise Features

  • Team Collaboration: Shared Assistants and knowledge bases
  • Advanced Analytics: Detailed usage and performance metrics
  • Access Controls: Fine-grained permission management
  • Integration APIs: Enhanced enterprise system integration

AI Capabilities

  • Multi-model Support: Integration with various AI providers
  • Custom Models: Support for fine-tuned and specialized models
  • Advanced Reasoning: Enhanced logical reasoning and problem-solving
  • Context Understanding: Improved long-term conversation memory

This documentation covers the complete Assistant ecosystem, from creation and configuration to advanced features and best practices. Assistants represent the user-facing layer of the AI workflow system, transforming Systems into interactive, intelligent agents capable of sophisticated user interactions.