-
-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug: Error handling needs changes #109
Comments
Addition: I assume it might be one of the documents, which already IS a JPEG if that helps for debugging? |
Addition: Got this one on a normal document. It seems the database is overloaded at the moment. time="2025-01-11T12:41:38Z" level=error msg="Error updating document 2905: 500, \n<!doctype html>\n<html lang="en">\n\n <title>Server Error (500)</title>\n\n\n Server Error (500)\n\n\n"time="2025-01-11T12:41:38Z" level=error msg="Error in processAutoTagDocuments: error in processAutoOcrTagDocuments: error updating documents: error updating document 2905: 500, \n<!doctype html>\n<html lang="en">\n\n <title>Server Error (500)</title>\n\n\n Server Error (500)\n\n\n" |
got another one. Any ideas here? time="2025-01-12T15:20:00Z" level=error msg="Error in processAutoTagDocuments: error in processAutoOcrTagDocuments: error processing document OCR: error downloading document images: fitz: cannot open document" |
First step of enhanced logging and error reporting is implemented in #114 |
I just discovered this morning, that paperless-gpt is kind of "stalled" with OCR
That is due the fact that it tried to process a "too big" document as it seems:
time="2025-01-11T10:54:35Z" level=debug msg="Image dimensions: 12600x16800"
time="2025-01-11T10:54:35Z" level=debug msg="Image size: 15274 KB"
time="2025-01-11T10:54:43Z" level=error msg="Error in processAutoTagDocuments: error in processAutoOcrTagDocuments: error processing document OCR: error performing OCR: error getting response from LLM: API returned unexpected status code: 400: You uploaded an unsupported image. Please make sure your image is valid."
Now it everytime tries to reprocess this document - but obiously the error message will not change.
At least we need two things here i think:
For the moment is my question: How can i identify which exact document this is ... ?
I have 953 documents in the processing queue ...
@icereed - Based on which API query to paperless do you get the next document to be processed?
The text was updated successfully, but these errors were encountered: