InternVideo2.5 - Action Recognition

Powered by InternVideo2.5-8B on ZeroGPU.

State-of-the-Art Performance:

  • 92.1% accuracy on Kinetics-400 (+11.2% over VideoMAE)
  • Open-vocabulary action detection
  • Custom sports-specific actions

Capabilities:

  • Action classification (50+ default actions)
  • Custom action labels for sports
  • Foul-related action detection
  • Multi-frame temporal understanding
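
Multi-frame understanding implies the model sees several frames per clip rather than a single image. The sketch below shows one common way to pick those frames: split the clip into equal segments and take the middle frame of each. It is an illustration only. The function name `sample_frame_indices` and the default of 8 frames are assumptions, not details confirmed for InternVideo2.5.

```python
def sample_frame_indices(total_frames: int, num_samples: int = 8) -> list[int]:
    """Pick num_samples frame indices spread evenly across a clip.

    Hypothetical helper: the clip is split into num_samples equal
    segments and the middle frame of each segment is chosen.
    """
    if total_frames <= 0 or num_samples <= 0:
        raise ValueError("total_frames and num_samples must be positive")
    segment = total_frames / num_samples
    # Clamp to the last valid index so short clips never go out of range.
    return [
        min(total_frames - 1, int(segment * (i + 0.5)))
        for i in range(num_samples)
    ]
```

For clips shorter than the requested number of samples, this version repeats frames instead of failing, which keeps the input length fixed.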

API Endpoints for EagleEye:

  • POST /call/api_classify_action - Action classification
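
The `/call/<name>` path resembles Gradio's HTTP API, where a POST queues a job and returns an `event_id`, and a follow-up GET on `/call/<name>/<event_id>` streams the result as server-sent events. The sketch below assumes that convention applies here. The base URL is a placeholder, and the contents of the `inputs` list (for example a video reference and a list of labels) are assumptions, since the endpoint's parameters are not listed.

```python
import json
import urllib.request

# Hypothetical placeholder: replace with the real Space URL.
BASE_URL = "https://example-space.hf.space"
API_NAME = "api_classify_action"  # from the endpoint listed above


def build_call_request(base_url: str, inputs: list) -> urllib.request.Request:
    """Build the POST request that queues a classification job.

    Assumes the Gradio-style body {"data": [...]}; the order and
    meaning of the inputs are not documented here.
    """
    body = json.dumps({"data": inputs}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/call/{API_NAME}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def result_url(base_url: str, event_id: str) -> str:
    """URL to poll for the result of a queued job."""
    return f"{base_url.rstrip('/')}/call/{API_NAME}/{event_id}"


def parse_sse_result(stream_text: str):
    """Return the JSON payload of the 'complete' event, or None."""
    event = None
    for line in stream_text.splitlines():
        if line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:") and event == "complete":
            return json.loads(line[len("data:"):].strip())
    return None
```

To send the request, pass the result of `build_call_request` to `urllib.request.urlopen`, read `event_id` from the JSON response, then fetch `result_url(...)` and feed the response text to `parse_sse_result`. Some Gradio versions serve these routes under an extra path prefix, so the exact URL may need adjusting for the deployed Space.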