PyPDF2 spacy transformers pytesseract torch spacy-transformers