40???100 24-hour time and comma as being a decimal separator isn't a custom made locale. It shouldn't have to be dealt with specially.The model learns by having a piece of textual content from the information (say, the opening sentence of a Wikipedia write-up) and wanting to predict the next token within the sequence. It then compares its output wi