BibTeX, biblatex and biberCollecting BiBTeX data from pdf files

Information and discussion about BiBTeX - the bibliography tool for LaTeX documents.
Post Reply
M.A
Posts: 58
Joined: Sun Nov 30, 2008 10:42 am

Collecting BiBTeX data from pdf files

Post by M.A »

Hi everyone,

I have hundreds of research articles as pdf files, and it's tedious to collect their BiBTeX data from Google Scholar.

Does there exist a program that collect BiBTeX data automatically from a folder containing pdf files ?


Thanks,

Recommended reading 2024:

LaTeXguide.org • LaTeX-Cookbook.net • TikZ.org

NEW: TikZ book now 40% off at Amazon.com for a short time.

M.A
Posts: 58
Joined: Sun Nov 30, 2008 10:42 am

Re: Collecting BiBTeX data from pdf files

Post by M.A »

Nobody came across such a program before? :shock:
Last edited by M.A on Wed Jun 17, 2009 10:40 pm, edited 1 time in total.
User avatar
localghost
Site Moderator
Posts: 9202
Joined: Fri Feb 02, 2007 12:06 pm

Collecting BiBTeX data from pdf files

Post by localghost »

M.A wrote:Nobody came across such a program? :shock:
Obviously not. Think about which demands are to be fulfilled to make such a program working.
  • Every PDF file must contain the information for the fields required by BibTeX for the entries in the database.
  • This information must be translated to a corresponding database file.
  • A citation key must be generated and added to the database entry.
From my point of view the first point is the most critical one. The others may be translated by a script.

Perhaps JabRef can serve this purpose.


Best regards
Thorsten
M.A
Posts: 58
Joined: Sun Nov 30, 2008 10:42 am

Re: Collecting BiBTeX data from pdf files

Post by M.A »

Thanks for the response!
Actually I have expected that it will be a difficult task, but I thought it might be possible since Google Scholar generates BiBTeX data automatically for each research article (which is a pdf file), so it must be following some algorithms.

Best Regards,
M.A
User avatar
localghost
Site Moderator
Posts: 9202
Joined: Fri Feb 02, 2007 12:06 pm

Collecting BiBTeX data from pdf files

Post by localghost »

M.A wrote:[...] Actually I have expected that it will be a difficult task, but I thought it might be possible since Google Scholar generates BiBTeX data automatically for each research article (which is a pdf file), so it must be following some algorithms. [...]
But it's not for sure that the information is collected from the files themselves.
M.A
Posts: 58
Joined: Sun Nov 30, 2008 10:42 am

Re: Collecting BiBTeX data from pdf files

Post by M.A »

I found this software:
http://www.mendeley.com
It does the required job, although it's not perfect.
Post Reply