excerpt: In this paper, we introduce the Temporal Graph Benchmark (TGB), a comprehensive collection of large-scale, diverse datasets designed for the realistic, reproducible, and robust evaluation of machine learning models on temporal graphs. These datasets span multiple years and domains, including social, trade, transaction, and transportation networks, and encompass both node and edge-level prediction tasks. The research highlights significant variability in the performance of common models across different datasets and demonstrates that simple methods often outperform existing temporal graph models in dynamic node property prediction tasks. Additionally, TGB provides an automated machine learning pipeline for data loading, experiment setup, and performance evaluation, fostering reproducible and accessible research.
0 commit comments