2022-04-07 11:29
By Priyanjana Bengani (@acookiecrumbles) and Jon Keegan (@jonkeegan) IRE NICAR Conference – March 4, 2022 Slides: English | Russian
This checklist is meant to be used as a reporting tool to help journalists and researchers when trying to find out who published a website. This is meant to be used in conjunction with offline reporting techniques.
Following this checklist does not guarantee that you can unmask an owner of a website who does not want to be found, but it can help surface crucial clues and connections that can act as leads for further reporting.
???? Strong recommendation: while running through this checklist, create a data diary—it can be a TextEdit doc, a Google Doc, just the Notes app, whatever. It is important to be able to retrace your steps.
✍️ Are there any authors listed?
???? Are there any email addresses or contact information?
???? What’s the server’s local time?
datetime
attribute in links on WordPress sites. GMT timestamp can reveal time zone based on GMT offset: <time class="updated" datetime="2022-03-04T10:21:40+06:00">March 4, 2022</time>
???? Does the website have a privacy policy or terms and conditions that mentions an LLC or what regional laws apply?
???? Does the website have an RSS feed?
If there are any social media profiles mentioned on the site, they are worth investigating.
On the Facebook profile, go to Page Transparency:
On Twitter, the account might be part of a pod or network that boosts it. Using en.whotwi.com, it’s worth checking:
Don’t forget to check to see if the site has accounts on Youtube, Instagram, Reddit, Github…
???? Have you archived the website? (You always should!)
wget
: wget -mpEk <yourwebsite.com>
???? What is the website using?
☁️ Where is it hosted?
???? Are there any trackers present?
???? How is the site monetized?
???? What are the various tracking identifiers, and are those shared with other domains?
Are there any relevant subdomains?
???? Are there historic WHOIS records?
⌛️ Has the site changed over time?
???? Did the earlier version of the site have more information?
Open Source Intelligence Techniques – Michael Bazzell https://inteltechniques.com/book1.html
Verification Handbook – edited by Craig Silverman https://datajournalism.com/read/handbook/verification-3