(There are some exceptions to the informed consent rule, but those do not apply when there's a chance a person's identity can be linked to sensitive information.) This data scrape, and potential future studies built on it, won't provide any of those protections.
And scientists who use this data set may be in breach of the standard ethical code.
Kirkegaard, the lead author, is a graduate student at Aarhus University in Denmark.
(The university But the data set reveals deeply personal information about many of the users. But it's entirely possible to use clues from a user's location, demographics, and Ok Cupid user name to determine their identity.
The users hail from a few dozen countries around the world.
an online forum where researchers are encouraged to share raw data to increase transparency and collaboration across social science.
They have a right to know how their data will be used, and they have the right to withdraw their data from that research."This is without a doubt one of the most grossly unprofessional, unethical and reprehensible data releases I have ever seen," writes Oliver Keyes, a social computing researcher*, on his blog.A separate paper by Kirkegaard and describing the methods they used in the Ok Cupid data scrape (also published on the Open Science Framework) contains another big ethical red flag.A group of researchers has released a data set on nearly 70,000 users of the online dating site Ok Cupid.The data dump breaks the cardinal rule of social science research ethics: It took identifiable personal data without permission.
The authors report that they didn't scrape profile pictures because it "would have taken up a lot of hard drive space." And when researchers asked Kirkegaard about these concerns on Twitter, he shrugged them off.