Corpus Processing: Tokenization, word counting, and statistical analysis of a legal text corpus. Evaluation of Sentence Embedding Models: Comparing pre-trained sentence embedding models on semantic ...