  • Documentation Home
  • OPEA Overview
    • OPEA Project Architecture
      • Microservices: Flexible and Scalable Architecture
      • Megaservices: A Comprehensive Solution
      • Gateways: Customized Access to Mega- and Microservices
    • Next Step
      • Open Platform for Enterprise AI (OPEA) Framework Draft Proposal
        • 1. Summary
        • 2. Introduction
        • 3. Framework Components, Architecture and Flow
        • 4. Assessing GenAI components and flows
        • 5. Grading Structure
        • 6. Reference flows
        • Appendix A – Draft OPEA Specifications
  • Getting Started with OPEA
    • Understanding OPEA’s Core Components
    • Prerequisites
    • Create and Configure a Virtual Server
    • Deploy the ChatQnA Solution
      • Interact with ChatQnA
    • What’s Next
      • Get Involved
  • OPEA Tutorials
    • AgentQnA
      • Overview
      • Purpose
      • How It Works
      • Deployment
        • Single Node
    • AudioQnA
      • Overview
      • Purpose
      • Key Implementation Details
      • How It Works
      • Deployment
        • Single Node
    • ChatQnA
      • Overview
      • Purpose
      • Key Implementation Details
      • How It Works
        • Customize with new VectorDB
        • Expected Output
        • Validation Matrix and Prerequisites
      • Architecture
        • Microservice Outline and Diagram
      • Deployment
      • Single Node
        • Xeon Scalable Processor
        • Gaudi AI Accelerator
        • NVIDIA GPU
        • AI PC
      • Kubernetes
        • Getting Started
        • Kubernetes Deployment with Helm on Xeon
      • Cloud Native
      • Troubleshooting
      • Monitoring
        • Set Up the Prometheus Server
        • Set Up the Grafana Dashboard
        • Summary and Next Steps
    • CodeGen
      • Overview
      • Purpose
      • How It Works
      • Deployment
        • Intel® Xeon® Scalable processor
        • Gaudi AI Accelerator
    • Code Translation
      • Overview
      • Purpose
      • How It Works
      • Deployment
        • Single Node
    • DocSum
      • Overview
      • Purpose
      • How It Works
      • Deployment
        • Intel® Xeon® Scalable processor
        • Gaudi AI Accelerator
    • DocIndexRetriever
      • Overview
      • Purpose
      • Key Implementation Details
      • How It Works
      • Deployment
        • Single Node
    • VideoQnA
      • Overview
      • Purpose
      • How It Works
      • Deployment
    • Enterprise Inference Guide
      • Overview
      • How It Works
      • Setting Up a Remote Server or Cluster
      • Using Remote Endpoints on OPEA GenAIExamples
        • 1. Endpoints with Megaservices
        • 2. Endpoints with Microservices
      • Next Steps
    • OpenTelemetry on OPEA Guide
      • Overview
      • How It Works
      • How to Monitor
        • 1. Prometheus
        • 2. Grafana
        • 3. Jaeger
      • Code Instrumentations for OPEA Tracing
      • OpenTelemetry on GenAIExamples
        • ChatQnA
        • AgentQnA
  • GenAI Examples
    • Generative AI Examples
      • Introduction
      • Architecture
      • Use Cases
      • Documentation
      • Getting Started
        • Deployment Guide
      • Supported Examples
      • Validated Configurations
      • Contributing to OPEA
      • Additional Content
    • Examples
      • AgentQnA Application
        • Agents for Question and Answering Application
        • Build Service Docker Image
        • Deploy AgentQnA on AMD GPU (ROCm)
        • Deploying AgentQnA on Intel® Xeon® Processors
        • Deploying AgentQnA on Intel® Gaudi® Processors
        • Deploy AgentQnA on Kubernetes cluster
        • Retrieval tool for agent
      • ArbPostHearingAssistant Application
        • Arbitration Post-Hearing Assistant
        • Table of Contents
        • Deploy Arbitration Post-Hearing Assistant Application on AMD EPYC™ Processors with Docker Compose
        • Example Arbitration Post-Hearing Assistant deployments on AMD GPU (ROCm)
        • Example Arbitration Post-Hearing Assistant deployments on Intel Xeon Processor
        • Example Arbitration Post-Hearing Assistant deployments on Intel® Gaudi® Platform
        • DocSum E2E test scripts
        • Arbitration Post-Hearing Assistant
      • AudioQnA Application
        • AudioQnA Application
        • AudioQnA Docker Image Build
        • AudioQnA Accuracy
        • AudioQnA Benchmarking
        • Deploying AudioQnA on AMD EPYC™ Processors
        • Deploying AudioQnA on AMD ROCm GPU
        • Deploying AudioQnA on Intel® Xeon® Processors
        • Deploy AudioQnA application
        • Deploying AudioQnA on Intel® Gaudi® Processors
        • Deploy AudioQnA in Kubernetes Cluster on Xeon and Gaudi
        • Deploy AudioQnA on Kubernetes cluster
        • AudioQnA E2E test scripts
        • AudioQnA
      • AvatarChatbot Application
        • AvatarChatbot Application
        • Build Mega Service of AvatarChatbot on AMD GPU
        • Example AvatarChatbot Deployment on Intel® Xeon® Platform
        • Example AvatarChatbot Deployment on Intel® Gaudi® Platform
        • AvatarChatbot E2E test scripts
      • BrowserUseAgent Application
        • Browser-use Agent Application
        • Example BrowserUseAgent deployments on an Intel® Gaudi® Platform
        • Setup Scripts for Webarena
      • ChatQnA Application
        • ChatQnA Application
        • ChatQnA Docker Image Build
        • ChatQnA Benchmark Results
        • ChatQnA Accuracy
        • FaqGen Accuracy
        • FaqGen Benchmarking
        • Deploying ChatQnA on AMD EPYC™ Processors
        • Deploying FAQ Generation on AMD EPYC™ Processors
        • Deploying ChatQnA with Pinecone on AMD EPYC™ Processors
        • Deploying ChatQnA with Qdrant on AMD EPYC™ Processors
        • Deploying ChatQnA on AMD ROCm GPU
        • Build Mega Service of ChatQnA on AIPC
        • Deploying ChatQnA on Intel® Xeon® Processors
        • Build Mega Service of ChatQnA on Xeon with an LLM Endpoint
        • Deploying FAQ Generation on Intel® Xeon® Processors
        • Deploying ChatQnA with MariaDB Vector on Intel® Xeon® Processors
        • Deploying ChatQnA with openGauss on Intel® Xeon® Processors
        • Deploying ChatQnA with Pinecone on Intel® Xeon® Processors
        • Deploying ChatQnA with Qdrant on Intel® Xeon® Processors
        • Example ChatQnA deployments on an Intel® Gaudi® Platform
        • How to Check and Validate Micro Service in the GenAI Example
        • Build MegaService of ChatQnA on NVIDIA GPU
        • Deploy ChatQnA in Kubernetes Cluster on Xeon and Gaudi
        • Deploy ChatQnA on Kubernetes cluster
        • ChatQnA E2E test scripts
        • ChatQnA Conversational UI
        • ChatQnA Customized UI
      • CodeGen Application
        • Code Generation Example (CodeGen)
        • CodeGen Accuracy Benchmark
        • CodeGen Performance Benchmark
        • Deploy CodeGen Application on AMD EPYC™ Processors with Docker Compose
        • Deploy CodeGen Application on AMD GPU (ROCm) with Docker Compose
        • To deploy the CodeGen services, execute the docker compose up command with the appropriate arguments. For a TGI deployment, execute:
        • Deploy CodeGen Application on Intel Xeon CPU with Docker Compose
        • Deploy CodeGen Application on Intel Gaudi HPU with Docker Compose
        • Deploy CodeGen using Kubernetes Microservices Connector (GMC)
        • Deploy CodeGen on Kubernetes using Helm
        • CodeGen E2E test scripts
        • Document Summary
        • Code Gen
      • CodeTrans Application
        • Code Translation Application
        • CodeTrans Docker Image Build
        • CodeTrans Benchmarking
        • Deploy CodeTrans Application on AMD EPYC™ Processors with Docker Compose
        • Deploying CodeTrans on AMD ROCm GPU
        • Deploying CodeTrans on Intel® Xeon® Processors
        • Deploying CodeTrans on Intel® Gaudi® Processors
        • Deploy CodeTrans in a Kubernetes Cluster
        • Deploy CodeTrans on Kubernetes cluster
        • CodeTrans E2E test scripts
        • Code Translation
      • CogniwareIms Application
        • CogniwareIMS - AI-Powered Inventory Management System
        • Kubernetes Deployment for Cogniware IMS
        • Publish Cogniware IMS Helm Chart
        • Cogniware IMS End-to-End Tests
      • DBQnA Application
        • DBQnA Application
        • Example DBQnA Deployment on AMD GPU (ROCm)
        • Example DBQnA Deployment on Intel® Xeon® Platform
        • DBQnA E2E test scripts
        • DBQnA React Application
      • DeepResearchAgent Application
        • Deep Research Agent Application
        • Deep Research Agent Benchmarks
      • DocIndexRetriever Application
        • DocRetriever Application
        • DocRetriever Application with Docker
        • DocIndexRetriever E2E test scripts
      • DocSum Application
        • Document Summarization Application
        • Table of Contents
        • Deploy DocSum Application on AMD EPYC™ Processors with Docker Compose
        • Example DocSum deployments on AMD GPU (ROCm)
        • Example DocSum deployments on Intel Xeon Processor
        • Example DocSum deployments on Intel® Gaudi® Platform
        • Deploy DocSum in Kubernetes Cluster
        • Deploy DocSum on Kubernetes cluster
        • DocSum E2E test scripts
        • Document Summary
        • Doc Summary React