By Akshay Easwaran
Every soccer data analysis group has had the same jump-scare: they sign data provider contract #2, and suddenly, they need a solution to link teams, matches, and players across their data ecosystem. Individual data sources are easy to ingest: you can tuck them in their own little schemas, write code specific to their little universes, and rest easy that changes won’t come thick and fast without proper notice from your providers. But with those individual data sources come individual standards for tracking objects (players, teams, or matches). In most (if not all) major American sports, working across providers is easy: just use an object’s single source of truth identifier from the league itself. But as every analyst finds out very quickly, there is no public single source of truth identifier for teams, matches, and players in international soccer.
Read More