Collecting BiBTeX data from pdf files
I have hundreds of research articles as PDF files, and it's tedious to collect their BibTeX data from Google Scholar one by one.
Is there a program that automatically collects BibTeX data from a folder containing PDF files?
Thanks,
Re: Collecting BiBTeX data from pdf files

- localghost
- Site Moderator
- Posts: 9202
- Joined: Fri Feb 02, 2007 12:06 pm
M.A wrote: Nobody came across such a program?
Obviously not. Think about which requirements must be met for such a program to work:
- Every PDF file must contain the information for the fields required by BibTeX for the entries in the database.
- This information must be translated to a corresponding database file.
- A citation key must be generated and added to the database entry.
Perhaps JabRef can serve this purpose.
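The last two requirements in the list above can be sketched in Python. This is only a minimal illustration, assuming the field values have already been obtained somehow (which is the hard part and is not addressed here). The function names `make_citation_key` and `format_entry`, the key scheme (first author's surname + year + first title word), and the sample data are hypothetical and not taken from JabRef or any other existing tool.

```python
import re

# Fields that BibTeX requires for an @article entry.
REQUIRED_ARTICLE_FIELDS = ("author", "title", "journal", "year")


def make_citation_key(author: str, year: str, title: str) -> str:
    """Build a key such as 'doe2020an' (hypothetical scheme)."""
    first_author = author.split(" and ")[0]
    # "Doe, Jane" -> "Doe"; "Jane Doe" -> "Doe"
    surname = first_author.split(",")[0].strip().split()[-1]
    words = re.findall(r"[A-Za-z0-9]+", title)
    first_word = words[0] if words else ""
    # Keep only lowercase ASCII letters and digits in the key.
    return re.sub(r"[^a-z0-9]", "", (surname + year + first_word).lower())


def format_entry(fields: dict) -> str:
    """Translate a dict of already known fields into an @article entry."""
    missing = [name for name in REQUIRED_ARTICLE_FIELDS if not fields.get(name)]
    if missing:
        raise ValueError("missing required fields: " + ", ".join(missing))
    key = make_citation_key(fields["author"], fields["year"], fields["title"])
    lines = [f"@article{{{key},"]
    for name, value in fields.items():
        lines.append(f"  {name} = {{{value}}},")
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder data, not a real reference.
    print(format_entry({
        "author": "Doe, Jane and Roe, Richard",
        "title": "An Example Title",
        "journal": "Journal of Examples",
        "year": "2020",
    }))
```

The check for missing fields reflects the first requirement: if the PDF does not provide the required information, no valid entry can be written.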
Best regards
Thorsten
Board Rules
Avoidable Mistakes
¹ System: TeX Live 2025 (vanilla), TeXworks 0.6.10
Re: Collecting BiBTeX data from pdf files
Actually, I expected it to be a difficult task, but I thought it might be possible, since Google Scholar automatically generates BibTeX data for each research article (which is a PDF file), so it must be following some algorithm.
Best Regards,
M.A
- localghost
M.A wrote: [...] Actually, I expected it to be a difficult task, but I thought it might be possible, since Google Scholar automatically generates BibTeX data for each research article (which is a PDF file), so it must be following some algorithm. [...]
But it is not certain that the information is collected from the files themselves.
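To illustrate how bibliographic data can come from somewhere other than the file's own contents, here is a minimal Python sketch. It is not a description of how Google Scholar works. It assumes that plain text has already been extracted from the PDF by some other tool and that the article carries a DOI. The names `find_doi` and `bibtex_request` are hypothetical, the regular expression is only a heuristic, and the request relies on the DOI resolver answering an `Accept: application/x-bibtex` header, which should be verified for the registrar in question.

```python
import re
import urllib.request

# Heuristic pattern for a DOI such as 10.1000/xyz123 (not exhaustive).
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")


def find_doi(text: str):
    """Return the first DOI-like string in the text, or None."""
    match = DOI_PATTERN.search(text)
    if match is None:
        return None
    # Drop punctuation that often trails a DOI at the end of a sentence.
    return match.group(0).rstrip(".,;)")


def bibtex_request(doi: str) -> urllib.request.Request:
    """Prepare (but do not send) a request asking doi.org for BibTeX."""
    return urllib.request.Request(
        f"https://doi.org/{doi}",
        headers={"Accept": "application/x-bibtex"},
    )


if __name__ == "__main__":
    # 10.1000/xyz123 is a placeholder DOI, not a real article.
    sample = "Example article. doi:10.1000/xyz123. Published 2020."
    doi = find_doi(sample)
    print(doi)
    request = bibtex_request(doi)
    print(request.full_url)
    # Sending it would be: urllib.request.urlopen(request).read().decode()
```

In this approach the file only supplies an identifier; the actual author, title, and journal fields come from an external database.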