Real-Time Multimodal Inference

Interactive platform for running advanced computer vision models with real-time processing and a production-oriented architecture.

This real-time inference system integrates multiple computer vision models: YOLO Detection, YOLO Pose, OCR, and Farneback.

Inference Configuration

Select the models you want to use:

YOLO Detection: Person and object detection
YOLO Pose: Human pose estimation
OCR: Text recognition
Farneback: Motion detection
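
As a rough illustration of how the options above might be turned into a configuration, here is a minimal sketch. The registry keys, the InferenceConfig class, and the build_config function are hypothetical names chosen for this example; the platform's actual configuration format is not described here.

```python
from dataclasses import dataclass

# Hypothetical registry: keys are illustrative, descriptions mirror the options above.
AVAILABLE_MODELS = {
    "yolo_detection": "Person and object detection",
    "yolo_pose": "Human pose estimation",
    "ocr": "Text recognition",
    "farneback": "Motion detection",
}


@dataclass(frozen=True)
class InferenceConfig:
    """Immutable record of the models chosen for one session (assumed shape)."""
    models: tuple


def build_config(selected):
    """Validate a user selection and return it in registry order without duplicates."""
    unknown = [name for name in selected if name not in AVAILABLE_MODELS]
    if unknown:
        raise ValueError(f"Unknown models: {unknown}")
    if not selected:
        raise ValueError("Select at least one model")
    # Iterate over the registry so the result has a stable order and no repeats.
    ordered = tuple(name for name in AVAILABLE_MODELS if name in selected)
    return InferenceConfig(models=ordered)
```

Validating the selection up front means an empty or misspelled choice is rejected before any server time is spent on it.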

The server is initializing, which can take up to 1 minute. You will then have 5 minutes to test inference with the selected models. The time limit is in place to keep EC2 costs down.
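
The 5-minute limit could be enforced with a simple deadline check. The sketch below is only an assumption about how such a limit might be tracked: the InferenceSession class and the SESSION_LIMIT_S constant are hypothetical and do not come from the platform itself.

```python
import time

# Assumed value taken from the 5-minute limit stated above.
SESSION_LIMIT_S = 5 * 60


class InferenceSession:
    """Tracks how much of the session's time budget is left (hypothetical helper)."""

    def __init__(self, clock=time.monotonic):
        # A monotonic clock is unaffected by system clock adjustments.
        self._clock = clock
        self._started_at = None

    def start(self):
        """Mark the moment the server is ready and the time budget begins."""
        self._started_at = self._clock()

    def remaining(self):
        """Return the seconds left, never below zero."""
        if self._started_at is None:
            return float(SESSION_LIMIT_S)
        elapsed = self._clock() - self._started_at
        return max(0.0, SESSION_LIMIT_S - elapsed)

    def expired(self):
        """Return True once the full time budget has been used."""
        return self.remaining() == 0.0
```

Passing the clock in as a parameter makes the timer easy to test without waiting: a caller can supply a fake clock and advance it by hand.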

The system is preparing the real-time inference environment.