Please have a look at “Read Cube”. It has an excellent way of extracting document information directly from the PDF document. When one marks a passage of the text, items appear that say e.g. “title”. Clicking on such an item assigns the highlighted text to the corresponding field, here the title of the document.