InternVideo2.5 - Action Recognition

Powered by InternVideo2.5-8B on ZeroGPU.

SOTA Performance:

92.1% accuracy on Kinetics-400 (+11.2% over VideoMAE)
Open-vocabulary action detection
Custom sports-specific actions

Capabilities:

Action classification (50+ default actions)
Custom action labels for sports
Foul-related action detection
Multi-frame temporal understanding

API Endpoints for EagleEye:

POST /call/api_classify_action - Action classification

Upload Video

Custom Actions (optional)