This repository was archived by the owner on Nov 29, 2019. It is now read-only.

Dupes with extract-bibs #16

Open
paulusm opened this issue Sep 14, 2014 · 1 comment


@paulusm

paulusm commented Sep 14, 2014

The extracted BibTeX files often contain exact duplicate entries, which causes problems when I try to parse them.

@jdherman
Contributor

Yeah, this is a pain, but I don't think there is an easy fix. Duplicates can happen for two reasons:

  1. the PDF parser incorrectly splits a single reference into two, and both pieces resolve to the same DOI, or
  2. the web API incorrectly maps two different references (say, with similar authors) to the same DOI.

In either case it would be tough to guarantee no duplicates. I usually use a BibTeX manager like JabRef or BibDesk to clean things up and remove duplicates before merging into the main bib file. I wouldn't trust the extract-bibs output to go straight to compiling without cleaning it up first.
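For the narrower problem of exact duplicate entries breaking a parser, a small pre-processing pass may be enough before reaching for a manager. The following is only a rough sketch, not part of extract-bibs: the function names `split_entries` and `drop_exact_duplicates` are made up for illustration, and it assumes every entry begins with `@` at the start of a line and that no field value contains a line starting with `@`. It only drops entries whose text is identical after collapsing whitespace, so it does nothing about the second case above, where two genuinely different references were mapped to the same DOI.

```python
import re


def split_entries(bibtex_text):
    """Split a BibTeX string into raw entry strings.

    Assumption: each entry starts with '@' at the beginning of a line.
    """
    starts = [m.start() for m in re.finditer(r"^@", bibtex_text, flags=re.MULTILINE)]
    ends = starts[1:] + [len(bibtex_text)]
    return [bibtex_text[a:b].strip() for a, b in zip(starts, ends)]


def drop_exact_duplicates(bibtex_text):
    """Return the BibTeX text with exact duplicate entries removed.

    Two entries count as duplicates only if they are identical once
    runs of whitespace are collapsed. The first occurrence is kept.
    """
    seen = set()
    unique = []
    for entry in split_entries(bibtex_text):
        key = " ".join(entry.split())  # whitespace-insensitive comparison key
        if key not in seen:
            seen.add(key)
            unique.append(entry)
    return "\n\n".join(unique) + "\n"
```

Something like `cleaned = drop_exact_duplicates(open("refs.bib").read())` (with `refs.bib` as a placeholder filename) could then be handed to the parser. Comparing whole entries rather than DOIs is deliberate: deduplicating on DOI alone would silently discard one of the two distinct references in the mis-resolved case, so those still need a manual look.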
