This came up during #48, but needs to be handled separately since it is not a unit test.
The scenario we should avoid is that someone changes the tokens.txt files that determine the mapping between input tokens (e.g. [H]) and their numbers (e.g. 149). If the tokens.txt is changed, models trained on data from before the change will not work with data after the change.
Task
- Implement an Actions workflow that checks for all tokens files that they are only appended and no lines are deleted