But the other question I have is about the license. The tokenizer.py file is identical, and the rest is very similar - just making minor adjustments here and there.
Can they just take this Apache 2 licensed code, change it a bit and offer it as MIT? They are clearly not the original author.