These texts are nearly identical, so their embeddings would be almost the same, giving a cosine similarity close to 1. Many issues are outside our control, so the system demands continual effort and optimization.

Crawling: The process starts with a crawler, which reads all the content from the website https://apelab.ch
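
The crawling step can be illustrated with a minimal sketch. The source does not say how its crawler is built, so this is an assumption: it uses only the Python standard library, fetches a single page, and keeps the visible text. The names `TextExtractor`, `extract_text`, and `crawl_page` are hypothetical, and following links across the site is left out.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    SKIP_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0  # > 0 while inside a skipped tag

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def extract_text(html: str) -> str:
    """Return the visible text of an HTML document, one chunk per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


def crawl_page(url: str, timeout: float = 10.0) -> str:
    """Download one page and return its visible text (hypothetical helper)."""
    with urlopen(url, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return extract_text(html)
```

Calling `crawl_page("https://apelab.ch")` would return the page text as a single string, ready to be split into chunks and embedded. A real crawler would also need to follow internal links, respect `robots.txt`, and handle failed requests, none of which this sketch attempts.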