Could I ask a question? Do the HTML tags in the dataset's product descriptions affect matching, and should they be removed before matching?
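To make clear what I mean by "removing the tags", here is a minimal sketch that uses only Python's standard-library `html.parser`. The function name `strip_html` and the sample description are my own placeholders, not anything from the dataset or its tooling, and I am not assuming this is the recommended preprocessing.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment, dropping all tags."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain characters.
        super().__init__(convert_charrefs=True)
        self._chunks = []

    def handle_data(self, data):
        self._chunks.append(data)

    def text(self):
        # Join chunks with a space so that text from adjacent elements
        # (e.g. consecutive <li> items) does not get glued together,
        # then collapse any repeated whitespace.
        return " ".join(" ".join(self._chunks).split())


def strip_html(description: str) -> str:
    """Hypothetical helper: return the description with HTML tags removed."""
    parser = _TextExtractor()
    parser.feed(description)
    parser.close()  # flush any text still buffered by the parser
    return parser.text()


if __name__ == "__main__":
    # Made-up example description, only for illustration.
    print(strip_html("<p>Fast <b>USB-C</b> charger</p>"))
```

One trade-off in this sketch: because chunks are joined with a space, a tag in the middle of a word (for example `<b>USB</b>-C`) would introduce an extra space there.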