An Assistant is an executable instance of a System that enables end-user interaction and provides enhanced capabilities beyond the base System workflow. While Systems are the blueprint or template that defines AI processing workflows, Assistants are the functional, deployable versions that users can actually interact with.
Think of an Assistant as a specialized AI agent built from a System that can:
- Chat with users in real-time with conversational memory
- Access knowledge bases through uploaded documents and files
- Execute System workflows with user-provided inputs
- Maintain session history for tracking interactions and results
- Provide streaming responses for real-time communication
Key Characteristics:
- System-Based: Every Assistant is built from an existing System's workflow
- Interactive: Users can chat directly with Assistants through a conversational interface
- Knowledge-Enhanced: Support for RAG (Retrieval-Augmented Generation) with document upload
- Persistent: Maintains conversation history and session data
- Configurable: Each Assistant has its own settings separate from the underlying System
Assistant Structure and Components
Core Assistant Properties
Basic Information
- Name (
string
, required): A descriptive name for your Assistant - Description (
string
, required): Explanation of what the Assistant does and its purpose - ID (
string
, auto-generated): Unique identifier for the Assistant - System ID (
string
, required): Reference to the System this Assistant is based on - Created At (
Date
): Timestamp when the Assistant was first created - Updated At (
Date
): Timestamp when the Assistant was last modified
Classification and Access
- Tags (
string[]
): Array of tags for categorizing and organizing Assistants - Is Public (
boolean
): Whether this Assistant is publicly accessible or private
Assistant Configuration
Every Assistant includes a comprehensive configuration object that controls its behavior:
Core Settings
- Temperature (
number
, 0-1): Controls creativity/randomness of AI responses (inherited from System) - Max Tokens (
number
): Maximum number of tokens the Assistant can generate per response (inherited from System) - Enable RAG (
boolean
): Whether to use uploaded knowledge base documents for enhanced responses
RAG (Retrieval-Augmented Generation) Settings
- Enable RAG (
boolean
): Whether to use knowledge base documents for enhanced responses - RAG Settings (
object
, optional):- Enabled (
boolean
): Master toggle for RAG functionality - Document Sources (
string[]
): List of knowledge base document identifiers - Chunk Size (
number
): Size of text chunks for processing - Overlap Size (
number
): Overlap between text chunks - Similarity Threshold (
number
): Minimum similarity score for including knowledge chunks
- Enabled (
Chat Settings
- Chat Settings (
object
, optional):- Enabled (
boolean
): Whether chat functionality is active - Max History (
number
): Maximum number of previous messages to remember - Context Window (
number
): Total token limit for conversation context - Personality Prompt (
string
, optional): Custom personality instructions
- Enabled (
How to Create an Assistant
Step 1: Access Assistant Creation
[Screenshot placeholder: Assistants list page with "Create Assistant" button]
- Navigate to the Assistants page in the application
- Click the "Create Assistant" button in the top-right corner
Step 2: Configure Basic Information
[Screenshot placeholder: Assistant creation form]
The Assistant creation form includes:
Assistant Details
- Assistant Name: Enter a descriptive name (e.g., "Marketing Helper", "Code Reviewer")
- System Selection: Choose the System this Assistant will be based on
- Dropdown shows all available Systems with descriptions
- The Assistant will inherit the System's workflow and AI model configuration
Step 3: Automatic Configuration
When you create an Assistant, it automatically inherits the AI model configuration from the selected System and applies default Assistant-specific settings:
{
"enableRag": false,
"chatSettings": {
"enabled": true,
"maxHistory": 10,
"contextWindow": 4000
}
}
The AI model, temperature, max tokens, and system prompts are inherited directly from the underlying System's workflow nodes.
Step 4: Assistant Management
After creation, you're automatically taken to the Assistant Management interface with three main tabs:
Assistant Management Interface
Knowledge Base Tab
[Screenshot placeholder: Knowledge base management interface]
The Knowledge Base tab allows you to enhance your Assistant with documents and files:
File Upload
- Drag & Drop: Drag files anywhere on the tab to upload them
- File Browser: Click "Add Files" to select files from your computer
- Supported Formats: PDF, DOC/DOCX, TXT, MD, CSV, JSON, HTML, and more
Upload Process
The system handles file processing in three stages:
- Getting Upload URL: Secure signed URL generation for S3 storage
- Uploading to S3: Direct file upload to cloud storage
- Processing: Text extraction, chunking, and embedding generation
File Management
- File Status: Monitor processing status (Available, Extracting Text, Embedding, etc.)
- File Details: View size, type, upload date, and extracted text
- File Actions: Inspect content, download original files, or remove from knowledge base
Knowledge Base Features
- Search: Find specific files in your knowledge base
- Status Monitoring: Real-time updates on file processing progress
- Text Extraction: Automatic text extraction from various document types
- Embedding Generation: AI-powered text embeddings for similarity search
Playground Tab
[Screenshot placeholder: Chat playground interface]
The Playground provides a real-time chat interface for testing and interacting with your Assistant:
Chat Interface
- Conversational UI: Natural chat experience with message history
- Streaming Responses: Real-time response generation with live updates
- Message History: Persistent conversation within the session
- Rich Formatting: Support for markdown formatting in responses
Chat Features
- Knowledge Integration: Automatic use of uploaded documents when relevant
- Debug Information: Access to detailed execution logs for each response
- Response Streaming: Watch responses generate in real-time
- Error Handling: Clear error messages for failed interactions
User Experience
- Auto-scroll: Automatic scrolling to new messages
- Typing Indicators: Shows when the Assistant is processing
- Send Methods: Enter to send, Shift+Enter for new lines
- Response Status: Visual indicators for message processing states
History Tab
[Screenshot placeholder: Session history interface]
The History tab provides comprehensive tracking of all Assistant interactions:
Session Management
- Session List: All previous conversations with timestamps
- Search Functionality: Find specific sessions by input or output content
- Session Details: Input messages, AI responses, and execution metadata
- Chronological Order: Sessions sorted by most recent first
Data Tracking
- Input/Output Pairs: Complete record of user inputs and Assistant responses
- Timestamps: Precise timing information for each interaction
- Session Metadata: Additional data about execution context
- Export Capabilities: Access to session data for analysis
Advanced Features
Real-Time Streaming
Assistants support advanced streaming capabilities:
Server-Sent Events (SSE)
- Live Response Generation: Responses appear as they're generated
- Progress Indicators: Real-time status updates during processing
- Error Recovery: Graceful handling of connection issues
- Multi-step Processing: Status updates for complex System workflows
Streaming Benefits
- Immediate Feedback: Users see responses starting immediately
- Better UX: No waiting for complete responses
- Progress Transparency: Clear indication of processing stages
- Reduced Latency: Perceived faster response times
Knowledge Base Integration
RAG (Retrieval-Augmented Generation)
- Automatic Context: Relevant documents automatically included in responses
- Similarity Search: AI-powered matching of user queries to knowledge content
- Chunk Management: Intelligent text chunking for optimal retrieval
- Context Windows: Balanced inclusion of relevant knowledge without overwhelming
Document Processing Pipeline
- Upload: Secure file upload to cloud storage
- Text Extraction: Convert documents to searchable text
- Chunking: Split content into manageable pieces
- Embedding: Generate AI embeddings for similarity search
- Indexing: Make content searchable for real-time retrieval
Session Management
Conversation Persistence
- Cross-Session Memory: Conversations persist across browser sessions
- History Tracking: Complete record of all interactions
- Context Preservation: Maintain conversation context within sessions
- Data Analytics: Session data available for usage analysis
Session Features
- Automatic Saving: All interactions automatically saved
- Search Capabilities: Find previous conversations quickly
- Export Options: Access to session data in structured formats
- Privacy Controls: Manage data retention and access
Assistant Operations
Managing Assistants
[Screenshot placeholder: Assistants list with management options]
The Assistants list provides comprehensive management capabilities:
List View Features
- Search and Filter: Find Assistants by name or description
- Status Indicators: Public/Private visibility status
- System Information: Shows which System each Assistant is based on
- Quick Actions: Chat, Manage, Duplicate, Delete options
Assistant Actions
Chatting
- Direct Access: Quick access to Assistant chat interface
- Playground Mode: Full-featured testing environment
- Real-time Interaction: Immediate conversation capabilities
Management
- Knowledge Base: Add, remove, and organize knowledge documents
- Configuration: Modify Assistant settings and behavior
- History Review: Access to all previous interactions
Duplication
- Quick Copying: Create copies of existing Assistants
- Configuration Inheritance: Duplicated Assistants inherit all settings
- Independent Management: Copies can be modified independently
Deletion
- Permanent Removal: Complete deletion of Assistant and all data
- Data Cleanup: Removes knowledge base files and session history
- Confirmation Required: Safety measures to prevent accidental deletion
Integration with Systems
System Relationship
- Foundation: Every Assistant is built on a System workflow
- Model Inheritance: AI models, temperature, and max tokens are inherited from the System
- Updates: Changes to Systems don't automatically affect existing Assistants
- Flexibility: Multiple Assistants can be created from the same System
Assistant-Specific Features
- Knowledge Base: Upload and manage documents for RAG functionality
- Chat Interface: Real-time conversational capabilities with streaming responses
- Session History: Persistent conversation tracking and analytics
- RAG Configuration: Enable and configure knowledge base integration
Best Practices
Assistant Design
- Clear Purpose: Define specific use cases and target users
- Appropriate Knowledge: Upload relevant, high-quality documents
- System Selection: Choose Systems with appropriate AI model configuration for your use case
- Testing: Thoroughly test in Playground before deployment
Knowledge Base Management
- Quality Content: Upload well-structured, relevant documents
- Regular Updates: Keep knowledge base current and accurate
- File Organization: Use clear file names and maintain good organization
- Size Management: Monitor file sizes and processing times
User Experience Optimization
- Response Quality: Select Systems with appropriate AI model settings for your use case
- Knowledge Relevance: Ensure uploaded content matches user needs
- Testing Coverage: Test various scenarios and edge cases
- Performance Monitoring: Track response times and user satisfaction
Security and Privacy
- Access Controls: Properly configure public/private settings
- Data Management: Regularly review and clean up knowledge base
- Session Privacy: Understand session data retention policies
- Content Sensitivity: Be mindful of confidential information in uploads
API Integration
Chat API
Assistants provide robust API endpoints for integration:
- Standard Chat: Synchronous request/response chat
- Streaming Chat: Real-time streaming responses via Server-Sent Events
- Session Management: Access to conversation history and context
- File Upload: Programmatic knowledge base management
Knowledge Management API
- File Upload: Programmatic document upload and processing
- File Management: List, update, and delete knowledge base files
- Processing Status: Monitor file processing and embedding generation
- Search Capabilities: Query knowledge base content directly
Session Analytics API
- Session History: Access to all Assistant interactions
- Usage Metrics: Performance and usage analytics
- Export Functions: Data export for analysis and reporting
- Search and Filter: Programmatic access to session data
Comparison: Systems vs. Assistants
Feature | Systems | Assistants |
---|---|---|
Purpose | Workflow definition and configuration | User-facing interaction and execution |
Interactivity | Static workflow blueprints | Real-time chat and interaction |
Knowledge Base | System-level resource settings | Individual document upload and management |
User Access | Development/configuration tool | End-user application |
Chat Capability | None (workflow execution only) | Full conversational interface |
Session History | Execution logs only | Complete conversation persistence |
Deployment | Cannot be used directly | Ready for user interaction |
Customization | Workflow and AI configuration | Knowledge base and RAG settings |
Future Enhancements
Planned improvements for Assistant functionality include:
Advanced RAG Features
- Multi-modal Knowledge: Support for images, audio, and video content
- Advanced Retrieval: Hybrid search combining semantic and keyword matching
- Knowledge Graphs: Structured knowledge representation and reasoning
- Real-time Updates: Dynamic knowledge base updates during conversations
Enhanced Chat Capabilities
- Voice Integration: Speech-to-text and text-to-speech capabilities
- Multi-language Support: Conversation in multiple languages
- Conversation Branching: Support for complex conversation flows
- Tool Integration: Connect to external APIs and services
Enterprise Features
- Team Collaboration: Shared Assistants and knowledge bases
- Advanced Analytics: Detailed usage and performance metrics
- Access Controls: Fine-grained permission management
- Integration APIs: Enhanced enterprise system integration
AI Capabilities
- Multi-model Support: Integration with various AI providers
- Custom Models: Support for fine-tuned and specialized models
- Advanced Reasoning: Enhanced logical reasoning and problem-solving
- Context Understanding: Improved long-term conversation memory
This documentation covers the complete Assistant ecosystem, from creation and configuration to advanced features and best practices. Assistants represent the user-facing layer of the AI workflow system, transforming Systems into interactive, intelligent agents capable of sophisticated user interactions.