- 4000+ Python code samples with ground-truth complexity labels
- Randomized variable names, constants, and formatting
- Same dataset used for all model comparisons
...
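As a rough illustration of the name-randomization step, the sketch below renames function arguments and assigned variables in a Python sample while leaving its behavior (and therefore its complexity label) unchanged. This is not the dataset's actual generation code: the function `randomize_names`, the `var_<i>_<n>` naming scheme, and the use of the standard-library `ast` module are all assumptions made for illustration. It does not randomize constants or formatting, it leaves keyword-argument names at call sites untouched, and it requires Python 3.9+ for `ast.unparse`.

```python
import ast
import random


def randomize_names(source: str, seed: int) -> str:
    """Return `source` with arguments and assigned variables renamed.

    Hypothetical helper: function names, imports, and builtins are left
    untouched because they never appear as assigned names or arguments.
    """
    rng = random.Random(seed)
    tree = ast.parse(source)

    # Pass 1: collect every name the sample itself binds.
    bound = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.arg):
            bound.add(node.arg)
        elif isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            bound.add(node.id)

    # Build a deterministic old -> new mapping; the index keeps names unique.
    mapping = {
        name: f"var_{i}_{rng.randrange(10_000)}"
        for i, name in enumerate(sorted(bound))
    }

    # Pass 2: rewrite every occurrence of a bound name.
    for node in ast.walk(tree):
        if isinstance(node, ast.arg) and node.arg in mapping:
            node.arg = mapping[node.arg]
        elif isinstance(node, ast.Name) and node.id in mapping:
            node.id = mapping[node.id]

    return ast.unparse(tree)


if __name__ == "__main__":
    sample = (
        "def total(items):\n"
        "    acc = 0\n"
        "    for x in items:\n"
        "        acc += x\n"
        "    return acc\n"
    )
    print(randomize_names(sample, seed=1))
```

Fixing the seed makes the transformation reproducible, which matters if the same randomized samples are to be reused across every model being compared.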