🎯 Multi-Object Tracking (MOT)

Task Overview

Input: Panoramic 4K video (full match)
Output: Per-frame bounding boxes + persistent track IDs for all players

Alignment with SoccerNet/MOTChallenge:
  • Same: MOTChallenge-style format (bounding box + ID per frame)
  • ⚠️ Different: panoramic full-pitch view, long sequences (~90 min)

🏆 SoccerTrack Challenge 2025
A subset of matches with bounding box annotations is featured in the SoccerTrack Challenge 2025.

Input / Output Definition

Input

  • videos/match_XXXXX.mp4 — Full match panoramic video (4K)

Output (per frame, per object)

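| Field      | Type  | Description                                   |
|------------|-------|-----------------------------------------------|
| frame      | int   | Frame number (1-indexed)                      |
| track_id   | int   | Unique persistent ID                          |
| x, y, w, h | float | Bounding box (top-left x, y + width, height)  |
| conf       | float | Confidence score (0–1)                        |
| class      | int   | Object class (1=person, -1=ignored)           |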
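
Since boxes are stored as top-left corner plus width and height, tools that expect corner coordinates need a conversion first. The sketch below illustrates that conversion and an intersection-over-union (IoU) overlap measure of the kind MOT evaluation relies on for matching. The names `xywh_to_xyxy` and `iou` are illustrative and not part of any official devkit.

```python
from typing import Tuple

# (x, y, w, h): top-left corner plus width and height, in pixels.
Box = Tuple[float, float, float, float]


def xywh_to_xyxy(box: Box) -> Box:
    """Convert (top-left x, top-left y, width, height) to (x1, y1, x2, y2)."""
    x, y, w, h = box
    return (x, y, x + w, y + h)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two (x, y, w, h) boxes."""
    ax1, ay1, ax2, ay2 = xywh_to_xyxy(a)
    bx1, by1, bx2, by2 = xywh_to_xyxy(b)
    # Overlap extent along each axis, clamped at zero for disjoint boxes.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0
```

For example, `iou((0, 0, 10, 10), (5, 0, 10, 10))` covers half of each box, giving 50 / 150.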

File Layout

production/
├── videos/
│   ├── 117093.mp4
│   └── ...
├── gsr/
├── bas/
├── mot/
│   ├── 117093/
│   │   ├── gt/
│   │   │   └── gt.txt
│   │   └── seqinfo.ini
│   ├── 117094/
│   │   └── ...
│   └── ...
└── raw/
    ├── 117093/
    │   ├── 117093_tracker_box_data.xml    # Original BePro tracking
    │   └── ...
    └── ...

The layout follows the MOTChallenge directory structure for compatibility with existing tools. The original BePro tracking XML files are preserved in raw/ for reference.
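
As a minimal sketch of how the tree above could be navigated, the helper below builds the paths for one match. It assumes the naming shown in the layout (`videos/<id>.mp4`, `mot/<id>/gt/gt.txt`, `mot/<id>/seqinfo.ini`); the function name `sequence_paths` is hypothetical.

```python
from pathlib import Path
from typing import Dict, Union


def sequence_paths(root: Union[str, Path], match_id: Union[str, int]) -> Dict[str, Path]:
    """Return the expected file locations for one match under `root`.

    Assumes the directory layout shown above; nothing is checked on disk.
    """
    root = Path(root)
    seq_dir = root / "mot" / str(match_id)
    return {
        "video": root / "videos" / f"{match_id}.mp4",
        "gt": seq_dir / "gt" / "gt.txt",
        "seqinfo": seq_dir / "seqinfo.ini",
    }


paths = sequence_paths("production", 117093)
```

Calling `paths["gt"].exists()` before loading is a cheap way to catch a mismatched root directory.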

Schema & MOTChallenge Mapping

Example gt.txt Format (CSV)

# frame, track_id, x, y, w, h, conf, class, visibility
1, 1, 1024.5, 512.3, 64.2, 128.5, 1.0, 1, 1.0
1, 2, 1500.2, 600.1, 68.3, 135.2, 1.0, 1, 1.0
2, 1, 1026.1, 513.0, 64.5, 129.0, 1.0, 1, 1.0
...
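
A minimal parsing sketch for rows in this format follows. It assumes nine comma-separated fields per row, as in the example, and skips blank lines and lines starting with `#`. The `GtRow` dataclass and `parse_gt` function are illustrative names, not part of an official devkit.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass
class GtRow:
    frame: int
    track_id: int
    x: float
    y: float
    w: float
    h: float
    conf: float
    cls: int          # "class" is a Python keyword, hence the shorter name
    visibility: float


def parse_gt(lines: Iterable[str]) -> Iterator[GtRow]:
    """Yield one GtRow per data line, skipping blanks and '#' comments."""
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        fields = [value.strip() for value in line.split(",")]
        if len(fields) != 9:
            raise ValueError(f"expected 9 fields, got {len(fields)}: {line!r}")
        yield GtRow(
            int(fields[0]),
            int(fields[1]),
            *map(float, fields[2:7]),   # x, y, w, h, conf
            int(float(fields[7])),      # tolerate "1" as well as "1.0"
            float(fields[8]),
        )


sample = """# frame, track_id, x, y, w, h, conf, class, visibility
1, 1, 1024.5, 512.3, 64.2, 128.5, 1.0, 1, 1.0
1, 2, 1500.2, 600.1, 68.3, 135.2, 1.0, 1, 1.0
2, 1, 1026.1, 513.0, 64.5, 129.0, 1.0, 1, 1.0
""".splitlines()

rows = list(parse_gt(sample))
track_1_frames = [row.frame for row in rows if row.track_id == 1]
```

For a real file, passing an open file handle (`parse_gt(open(path))`) streams rows without loading a full 90-minute sequence into memory at once.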

seqinfo.ini Example

[Sequence]
name=match_117093
imDir=videos
frameRate=25
seqLength=135000
imWidth=3840
imHeight=2160
imExt=.mp4
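
The file can be read with Python's standard `configparser`. The sketch below parses the example above from a string and derives the sequence duration from `seqLength` and `frameRate`; the function name `load_seqinfo` and the returned dictionary keys are illustrative choices.

```python
import configparser
from typing import Any, Dict

SEQINFO_TEXT = """[Sequence]
name=match_117093
imDir=videos
frameRate=25
seqLength=135000
imWidth=3840
imHeight=2160
imExt=.mp4
"""


def load_seqinfo(text: str) -> Dict[str, Any]:
    """Parse seqinfo.ini content and return typed fields."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    seq = parser["Sequence"]
    return {
        "name": seq["name"],
        "frame_rate": seq.getfloat("frameRate"),
        "seq_length": seq.getint("seqLength"),
        "width": seq.getint("imWidth"),
        "height": seq.getint("imHeight"),
    }


info = load_seqinfo(SEQINFO_TEXT)
# 135000 frames at 25 fps is 5400 seconds, i.e. 90 minutes.
duration_minutes = info["seq_length"] / info["frame_rate"] / 60
```

To read from disk instead, `parser.read(path)` can replace `parser.read_string(text)`.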

MOTChallenge Correspondence

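| Field      | MOTChallenge | Notes                                 |
|------------|--------------|---------------------------------------|
| frame      | Column 1     | 1-indexed frame number                |
| track_id   | Column 2     | Persistent ID                         |
| x, y, w, h | Columns 3–6  | Bounding box (pixels)                 |
| conf       | Column 7     | Confidence (1.0 for GT)               |
| class      | Column 8     | 1 = person                            |
| visibility | Column 9     | Visibility ratio (1.0 = fully visible) |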

Evaluation / Metrics

Evaluation uses standard MOT metrics:

  • HOTA (Higher Order Tracking Accuracy) — primary metric
  • MOTA (Multi-Object Tracking Accuracy)
  • IDF1 (ID F1 Score)
  • AssA, DetA (Association & Detection accuracy)

The format is compatible with the TrackEval library.
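
To make the metrics above concrete, the sketch below writes out the standard definitions of MOTA and IDF1 in terms of error counts, and the relation between HOTA, DetA, and AssA at a single localization threshold. It is not a replacement for TrackEval, which also performs the matching that produces these counts and averages HOTA over thresholds. All input numbers are hypothetical.

```python
import math


def mota(fn: int, fp: int, idsw: int, num_gt: int) -> float:
    """MOTA = 1 - (FN + FP + IDSW) / number of ground-truth boxes."""
    return 1.0 - (fn + fp + idsw) / num_gt


def idf1(idtp: int, idfp: int, idfn: int) -> float:
    """IDF1 = 2 * IDTP / (2 * IDTP + IDFP + IDFN)."""
    return 2 * idtp / (2 * idtp + idfp + idfn)


def hota_at_threshold(det_a: float, ass_a: float) -> float:
    """At one localization threshold, HOTA is the geometric mean of DetA and AssA."""
    return math.sqrt(det_a * ass_a)


# Hypothetical counts, for illustration only.
example_mota = mota(fn=10, fp=5, idsw=5, num_gt=100)
example_idf1 = idf1(idtp=80, idfp=20, idfn=20)
example_hota = hota_at_threshold(det_a=0.64, ass_a=0.81)
```

Note that MOTA can be negative when the total number of errors exceeds the number of ground-truth boxes, whereas IDF1 and HOTA stay in the range 0 to 1.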

Official devkit and challenge leaderboard: see SoccerTrack Challenge 2025.

Differences vs SoccerNet Tracking

Key Differences:
  • Camera view: Panoramic (full-pitch) vs. broadcast (partial/zoomed)
  • Duration: Full match (~90 min, ~135k frames at 25 fps) vs. short clips
  • Resolution: 4K panoramic (3840×2160) vs. HD broadcast
  • Resulting challenges: Long-term ID persistence and scale variation across the pitch