A method for measuring whether AI search engines (ChatGPT, Perplexity, Google AI Overviews) cite a publisher's URLs in their answers. Cited sources are matched at three levels: exact URL, domain, and text-fingerprint similarity computed with a deterministic SimHash. Attribution windows account for the delay before a search engine re-crawls changed content, and a control group guards against false attribution, so the resulting citation attribution model is deliberately conservative.
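
To make the three matching levels and the attribution window concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the method's actual implementation: the function names (`simhash`, `normalize_url`, `match_level`, `in_attribution_window`), the word 3-gram shingling, the 64-bit fingerprint, the Hamming threshold of 3, and the 1 to 30 day window are all hypothetical choices. Domain matching here compares hostnames with a leading `www.` removed; comparing registrable domains would need a public suffix list. The fingerprint uses `hashlib` rather than Python's built-in `hash()`, which is randomized per process and would not be deterministic.

```python
import hashlib
import re
from datetime import datetime, timedelta
from urllib.parse import urlsplit


def simhash(text: str, bits: int = 64) -> int:
    """Deterministic SimHash over word 3-gram shingles (assumed shingling)."""
    tokens = re.findall(r"\w+", text.lower())
    shingles = [" ".join(tokens[i:i + 3]) for i in range(max(1, len(tokens) - 2))]
    weights = [0] * bits
    for shingle in shingles:
        # blake2b gives a stable hash across runs and machines.
        digest = hashlib.blake2b(shingle.encode("utf-8"), digest_size=bits // 8).digest()
        value = int.from_bytes(digest, "big")
        for bit in range(bits):
            weights[bit] += 1 if (value >> bit) & 1 else -1
    return sum(1 << bit for bit in range(bits) if weights[bit] > 0)


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


def normalize_url(url: str) -> tuple[str, str]:
    """Return (host, path), ignoring scheme, 'www.', query, fragment, trailing slash."""
    parts = urlsplit(url.strip())
    host = parts.hostname or ""
    if host.startswith("www."):
        host = host[4:]
    return host, parts.path.rstrip("/")


def match_level(cited_url, cited_text, publisher_pages, max_hamming=3):
    """Classify one cited source against a publisher's pages.

    publisher_pages maps URL -> page text. Returns "exact_url", "domain",
    "fingerprint", or None, checking the strictest level first.
    """
    cited = normalize_url(cited_url)
    known = [normalize_url(url) for url in publisher_pages]
    if cited in known:
        return "exact_url"
    if any(cited[0] == host for host, _ in known):
        return "domain"
    if cited_text:
        cited_fp = simhash(cited_text)
        for text in publisher_pages.values():
            if hamming(cited_fp, simhash(text)) <= max_hamming:
                return "fingerprint"
    return None


def in_attribution_window(changed_at: datetime, cited_at: datetime,
                          min_lag=timedelta(days=1), max_lag=timedelta(days=30)):
    """True if the citation appears after a plausible re-crawl delay (assumed bounds)."""
    return min_lag <= cited_at - changed_at <= max_lag
```

A citation would count toward the publisher only when `match_level` returns a level and `in_attribution_window` is true. Running the same check against a control group of pages that were not changed gives a baseline rate to subtract, which is what keeps the estimate conservative.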
Use Case
