papers.fingerprint module

papers.fingerprint.create_paper_plain_fingerprint(title, authors, year)

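Creates a robust plain-text summary of a bibliographic reference, normalized so that variations in case, whitespace, HTML markup, and accents do not change the result. The plain fingerprint has variable length, so it should then be hashed to obtain the actual fingerprint, which has a constant length.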
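The hashing step can be sketched as follows. This is a minimal illustration, not the module's own code: the helper name hash_plain_fingerprint is hypothetical, and the choice of MD5 is an assumption made only to obtain a fixed-length digest. The input strings are taken from the examples on this page.

```python
import hashlib


def hash_plain_fingerprint(plain_fingerprint):
    """Hypothetical helper: turn a plain fingerprint into a fixed-length digest."""
    # MD5 is an assumption for this sketch; any stable hash function would do.
    return hashlib.md5(plain_fingerprint.encode('utf-8')).hexdigest()


short = hash_plain_fingerprint(u'ambiguity-2014/doe')
long_ = hash_plain_fingerprint(
    u'long-titles-are-unambiguous-enough-to-be-unique-by-themselves-no-need-for-authors')

# Both digests have the same length, whatever the length of the input.
print(len(short), len(long_))  # 32 32
```

Because the digest length does not depend on the title length, the result may be easier to store and index than the plain fingerprint itself.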

Parameters:
  • title – the title of the paper
  • authors – the list of author names, represented as (first_name, last_name) pairs
  • year – the year of publication of the paper

>>> create_paper_plain_fingerprint(' It  cleans whitespace And Case\n',[('John','Doe')], 2015)
u'it-cleans-whitespace-and-case/doe'
>>> create_paper_plain_fingerprint('HTML tags are <emph>removed</emph>',[('John','Doe')], 2015)
u'html-tags-are-removed/doe'
>>> create_paper_plain_fingerprint('Les accents sont supprimés', [('John','Doe')],2015)
u'les-accents-sont-supprimes/doe'
>>> create_paper_plain_fingerprint('Long titles are unambiguous enough to be unique by themselves, no need for authors', [('John','Doe')], 2015)
u'long-titles-are-unambiguous-enough-to-be-unique-by-themselves-no-need-for-authors'
>>> create_paper_plain_fingerprint('Ambiguity', [('John','Doe')], 2014)
u'ambiguity-2014/doe'
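The title normalization visible in the examples above can be approximated with the standard library. This is only a sketch under stated assumptions: slugify_title is a hypothetical helper, not a function of papers.fingerprint, and it covers the title part only. The rules that decide when author names or the year are appended are not reproduced here. It is written for Python 3, so results print without the u prefix.

```python
import re
import unicodedata


def slugify_title(title):
    """Hypothetical helper approximating the title part of the examples."""
    # Drop anything that looks like an HTML tag.
    text = re.sub(r'<[^>]+>', ' ', title)
    # Decompose accented characters, then discard the combining marks.
    text = unicodedata.normalize('NFKD', text)
    text = ''.join(c for c in text if not unicodedata.combining(c))
    # Lowercase, then keep runs of ASCII letters and digits as tokens.
    tokens = re.findall(r'[a-z0-9]+', text.lower())
    # Join tokens with hyphens, which also collapses whitespace and punctuation.
    return '-'.join(tokens)


print(slugify_title(' It  cleans whitespace And Case\n'))
print(slugify_title('HTML tags are <emph>removed</emph>'))
print(slugify_title('Les accents sont supprim\u00e9s'))
```

Under these assumptions, the three calls reproduce the title portion of the first three examples, that is, the text before the /doe suffix.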