Hi, I'm looking for an example of parsing pdf. I tried to implement this with ruby and docsplit gem, but it uses an external tool to extract the text, and there are problems with number references, and you have to parse the text file according to the regular expressions I want to parse some papers in pdf format, to extract it's title, keywords, authors, authors's mails, institutions, etc. I'm looking for some experience ruby developer with a better way to do this without parsing a textfile through regular expressions Greetings -- Posted via http://www.ruby-forum.com/.