OPEA™
latest
OPEA Project
v: latest
Document Versions
latest
1.0
1.1
1.2
1.3
OPEA Project links
Project Home
Wiki
Documentation Home
OPEA Overview
OPEA Project Architecture
Microservices: Flexible and Scalable Architecture
Megaservices: A Comprehensive Solution
Gateways: Customized Access to Mega- and Microservices
Next Step
Open Platform for Enterprise AI (OPEA) Framework Draft Proposal
1. Summary
2. Introduction
3. Framework Components, Architecture and Flow
4. Assessing GenAI components and flows
5. Grading Structure
6. Reference flows
Appendix A – Draft OPEA Specifications
Getting Started with OPEA
Understanding OPEA’s Core Components
Prerequisites
Create and Configure a Virtual Server
Deploy the ChatQnA Solution
Interact with ChatQnA
What’s Next
Get Involved
OPEA Tutorials
AgentQnA
Overview
Purpose
How It Works
Deployment
Single Node
AudioQnA
Overview
Purpose
Key Implementation Details
How It Works
Deployment
Single Node
ChatQnA
Overview
Purpose
Key Implementation Details
How It Works
Customize with new VectorDB
Expected Output
Validation Matrix and Prerequisites
Architecture
Microservice Outline and Diagram
Deployment
Single Node
Xeon Scalable Processor
Gaudi AI Accelerator
Nvidia GPU
AI PC
Kubernetes
Getting Started
Kubernetes Deployment with Helm on Xeon
Cloud Native
Troubleshooting
Monitoring
Set Up the Prometheus Server
Set Up the Grafana Dashboard
Summary and Next Steps
CodeGen
Overview
Purpose
How It Works
Deployment
Intel® Xeon® Scalable processor
Gaudi AI Accelerator
Code Translation
Overview
Purpose
How It Works
Deployment
Single Node
DocSum
Overview
Purpose
How It Works
Deployment
Intel® Xeon® Scalable processor
Gaudi AI Accelerator
DocIndexRetriever
Overview
Purpose
Key Implementation Details
How It Works
Deployment
Single Node
VideoQnA
Overview
Purpose
How It Works
Deployment
Enterprise Inference Guide
Overview
How It Works
Setting Up a Remote Server or Cluster
Using Remote Endpoints on OPEA GenAIExamples
1. Endpoints with Megaservices
2. Endpoints with Microservices
Next Steps
OpenTelemetry on OPEA Guide
Overview
How It Works
How to Monitor
1. Prometheus
2. Grafana
3. Jaeger
Code Instrumentations for OPEA Tracing
OpenTelemetry on GenAIExamples
ChatQnA
AgentQnA
GenAI Examples
Generative AI Examples
Introduction
Architecture
Use Cases
Documentation
Getting Started
Deployment Guide
Supported Examples
Validated Configurations
Contributing to OPEA
Additional Content
Examples
AgentQnA Application
Agents for Question and Answering Application
Agents for Question and Answering Application
Build Service Docker Image
Deploy AgentQnA on AMD GPU (ROCm)
Deploying AgentQnA on Intel® Xeon® Processors
Deploying AgentQnA on Intel® Gaudi® Processors
Deploy AgentQnA on Kubernetes cluster
Retrieval tool for agent
ArbPostHearingAssistant Application
Arbitration Post-Hearing Assistant
Table of Contents
Deploy Arbitration Post-Hearing Assistant Application on AMD EPYC™ Processors with Docker Compose
Example Arbitration Post-Hearing Assistant deployments on AMD GPU (ROCm)
Example Arbitration Post-Hearing Assistant deployments on Intel Xeon Processor
Example Arbitration Post-Hearing Assistant deployments on Intel® Gaudi® Platform
DocSum E2E test scripts
Arbitration Post-Hearing Assistant
AudioQnA Application
AudioQnA Application
AudioQnA Docker Image Build
AudioQnA Accuracy
AudioQnA Benchmarking
Deploying AudioQnA on AMD EPYC™ Processors
Deploying AudioQnA on AMD ROCm GPU
Deploying AudioQnA on Intel® Xeon® Processors
Deploy AudioQnA application
Deploying AudioQnA on Intel® Gaudi® Processors
Deploy AudioQnA in Kubernetes Cluster on Xeon and Gaudi
Deploy AudioQnA on Kubernetes cluster
AudioQnA E2E test scripts
AudioQnA
AvatarChatbot Application
AvatarChatbot Application
Build Mega Service of AvatarChatbot on AMD GPU
Example AvatarChatbot Deployment on Intel® Xeon® Platform
Example AvatarChatbot Deployment on Intel® Gaudi® Platform
AvatarChatbot E2E test scripts
BrowserUseAgent Application
Browser-use Agent Application
Example BrowserUseAgent deployments on an Intel® Gaudi® Platform
Setup Scripts for Webarena
ChatQnA Application
ChatQnA Application
ChatQnA Docker Image Build
ChatQnA Benchmark Results
ChatQnA Accuracy
FaqGen Accuracy
FaqGen Benchmarking
Deploying ChatQnA on AMD EPYC™ Processors
Deploying FAQ Generation on AMD EPYC™ Processors
Deploying ChatQnA with Pinecone on AMD EPYC™ Processors
Deploying ChatQnA with Qdrant on AMD EPYC™ Processors
Deploying ChatQnA on AMD ROCm GPU
Build Mega Service of ChatQnA on AIPC
Deploying ChatQnA on Intel® Xeon® Processors
Build Mega Service of ChatQnA on Xeon with an LLM Endpoint
Deploying FAQ Generation on Intel® Xeon® Processors
Deploying ChatQnA with MariaDB Vector on Intel® Xeon® Processors
Deploying ChatQnA with openGauss on Intel® Xeon® Processors
Deploying ChatQnA with Pinecone on Intel® Xeon® Processors
Deploying ChatQnA with Qdrant on Intel® Xeon® Processors
Example ChatQnA deployments on an Intel® Gaudi® Platform
How to Check and Validate Micro Service in the GenAI Example
Build MegaService of ChatQnA on NVIDIA GPU
Deploy ChatQnA in Kubernetes Cluster on Xeon and Gaudi
Deploy ChatQnA on Kubernetes cluster
ChatQnA E2E test scripts
ChatQnA Conversational UI
ChatQnA Customized UI
CodeGen Application
Code Generation Example (CodeGen)
CodeGen Accuracy Benchmark
CodeGen Performance Benchmark
Deploy CodeGen Application on AMD EPYC™ Processors with Docker Compose
Deploy CodeGen Application on AMD GPU (ROCm) with Docker Compose
To deploy the CodeGen services, execute the
docker
compose
up
command with the appropriate arguments. For a TGI deployment, execute:
Deploy CodeGen Application on Intel Xeon CPU with Docker Compose
Deploy CodeGen Application on Intel Gaudi HPU with Docker Compose
Deploy CodeGen using Kubernetes Microservices Connector (GMC)
Deploy CodeGen on Kubernetes using Helm
CodeGen E2E test scripts
Document Summary
Code Gen
Code Gen
CodeTrans Application
Code Translation Application
CodeTrans Docker Image Build
CodeTrans Benchmarking
Deploy CodeTrans Application on AMD EPYC™ Processors with Docker Compose
Deploying CodeTrans on AMD ROCm GPU
Deploying CodeTrans on Intel® Xeon® Processors
Deploying CodeTrans on Intel® Gaudi® Processors
Deploy CodeTrans in a Kubernetes Cluster
Deploy CodeTrans on Kubernetes cluster
CodeTrans E2E test scripts
Code Translation
CogniwareIms Application
CogniwareIMS - AI-Powered Inventory Management System
Kubernetes Deployment for Cogniware IMS
Publish Cogniware IMS Helm Chart
Cogniware IMS End-to-End Tests
DBQnA Application
DBQnA Application
Example DBQnA Deployment on AMD GPU (ROCm)
Example DBQnA Deployment on Intel® Xeon® Platform
DBQnA E2E test scripts
DBQnA React Application
DeepResearchAgent Application
Deep Research Agent Application
Deep Research Agent Benchmarks
DocIndexRetriever Application
DocRetriever Application
DocRetriever Application with Docker
DocRetriever Application with Docker
DocIndexRetriever E2E test scripts
DocSum Application
Document Summarization Application
Table of Contents
Deploy DocSum Application on AMD EPYC™ Processors with Docker Compose
Example DocSum deployments on AMD GPU (ROCm)
Example DocSum deployments on Intel Xeon Processor
Example DocSum deployments on Intel® Gaudi® Platform
Deploy DocSum in Kubernetes Cluster
Deploy DocSum on Kubernetes cluster
DocSum E2E test scripts
Document Summary
Doc Summary React