Hello, I find that some words are cased while some are uncased. They have different word ids in the vocab of tokenizer of GPT. What is the appropriate way to process the words ? Thanks. 