Skip to main content

Posts

Showing posts from July, 2018

cheatR: an R package for catching cheaters

An updated version of this post can be found here >> . cheatR is a mini package to help you find cheaters by comparing hand-ins. It was developed by Almog Simchon and me in response to students overheard bragging about how an assignment in an first-year undergrad course was "super easy" because "we all just copied from each other!" (though this would later turn out to be an exaggeration). Our idea was to compare each hand-in to all other hand-ins and see the degree of overlap between them. This was achieved using the  ngram  r-package to break each hand-in into a list of "phrases" and then to count how many times each phrase appeared across a pair of documents 1 . Finally, the percent of non-unique phrases was calculated.