https://dev.to/agenthustler/data-quality-in-web-scraping-validation-cleaning-and-deduplication-502k