Added copy button for copy/pasting markdown, html and JSON exactly as is. Can copy text from within chunks now as well. #212
Merged
m-chadda (Collaborator) commented on Oct 10, 2024
- pdf to image
- extension error
- cropping error
- getting images back, not working
- .
- pyscripts
- fixed scaling factor for page
- new compose
- scaling factor issues
- fixed pdla deploy
- added image url stuff :
- added image url stuff
- task creation has image_url
- rm threadlock and run in threadpool in pdla
- image file path fixes
- s3 upload issues
- done with ocr
- get tasks error fixed
- process lock on / route
- Refactored frontend
- Rebased with main successfully
- Added OCR strategy and Task stats in viewer & status view
- main test
- removed segment density
- Moved dashboard to pages
- Added curl command placeholder for no tasks
- deployed new pdla
- added timing for ocr
- added timing for ocr
- table html to mkd
- table html to mkd
- table html to mkd
- Cleaned up svg errors, added OCR selection on upload form, images optimized
- fixing mkd table issue
- fixing mkd table issue
- adding \n as string literal
- adding \n as string literal
- | is not |?
- doing it on frontend
- doing it on frontend
- base
- frontend markdown resolved
- Rebasing with main
- Added tables markdown css, html viewing options, API key fixes for small screen menu dropdown
- updated google timeout
- list item fixes
- list item ol start from given digit
- clean ul
- migrations
- new migs
- new api
- new api spec
- wiping pg
- updating gpu
- updated gpu
- separated out task workers
- new workers - fast - highquality
- cleaned up pyscripts
- refactoring the models
- syntax error fix
- fixed syntax error
- fixed syntax error
- fixed pyscripts
- new dockers for processors
- new processor names
- cleanup
- Need to add chunking strategy in upload form
- Updating with main
- Added HTML viewing fixes and chunking strategy button on upload form
- No tasks state text prompt added
- task service docker created
- docker updated
- docker updated
- docker updated
- added uv venv
- added uv venv
- new kube
- paddle ocr issue
- created init
- paddle ocr issues, going to build with gpu
- added libgomp1 to dockerfile
- added openCV deps in docker
- updated num of workers
- repushing
- new kube
- unexpected end of data error
- dockerfile updated because of cudnn error
- new kube
- updated env for docker image
- dockerignore update
- docker image only copying over essentials
- docker image only copying over essentials
- fixed COPY
- kube
- deploying new frontend
- task ocr parsed to float
- get content updated
- prints
- idek
- ready for deployment
- deploying new version
- deploying new version
- deployment with ocr
- deployment successful
- new cmds
- Fixed bug for API pricing calculator - fast pricing is now correct
- updated api spec
- Fixed billing amount bugs
- better tests
- stripe tested on dev
- invoicing done
- added authorization
- clean up
- rm unused code
- deployed
- token incorrect
- fixed incorrect server url
- rm cron from self deploy
- deployed newest version
- updated pdla
- updated pdla
- added
- added threadpool
- pdla code updated
- clean up services
- pdla fast new deployment
- deploying higher fast throughput
- updated number of connections
- invoicing fixes
- comment purge
- deployed invoicer update
- updated max dims for segments
- updated task
- cloudflare full strict
- Added small fixes for frontend and OCRBBox view on hover + OCRText for OCRBBox on hover
- Small fix - Removed random console.log
- updating queue consumer
- updated secrets
- updated deployment file
- Fixed MZ and Safari bugs for upload component on landing
- silly
- Removed speed approximations for Segmentation model cards
- file support
- deployed web
- silly
- added polyfill for promise with resolvers
- libre office integration
- Fixed chunk_length undefined bug
- open api spec
- new venv
- Removed speed approximations for Segmentation model cards
- deployed web
- added polyfill for promise with resolvers
- Fixed chunk_length undefined bug
- open api spec
- new venv
- sending to task service
- invalid header
- idk headers in pdfs suck
- updates to generate
- qwen
- qwen batch
- qwen batch
- qwen batch
- qwen batch
- qwen fixes
- qwen fixes
- qwen fixes
- qwen batch
- docx
- ppt, docx support added
- lfg
- lfg
- added task
- updated accept list
- merged
- fixed conflicts in qwen
- LLM based ocr for tables added
- updated html logic
- updated cost values
- fixed conflict in process
- fixed circular deps
- fixed circular deps
- updated default value
- removed print statements
- deploying new version with llm table support
- deployed new version with docx, pptx support
- added dropzone type
- added module declaration for react-dropzone
- deployed with LLM table support
- updated num_workers
- updated example kube env
- adding o1 support
- added latex api
- testing o1
- removed o1 stuff
- fixed monthly order for usage and multi file type support
- notices
- Update README.md
- Update LICENSE
- Delete LICENSE
- Create LICENSE
- Create COMMERCIAL_LICENSE
- bug fixes
- prompting
- Fixed free tier rollover on dashboard
- Api dialog width fix
- updated conversion
- Added file support byline
- added ppocr to table with llm
- camelot for tables
- deploying new web and backend
- adding o1 support
- added latex api
- testing o1
- removed o1 stuff
- fixed monthly order for usage and multi file type support
- bug fixes
- prompting
- updated conversion
- added ppocr to table with llm
- camelot for tables
- Small fix in byline for homepage
- adding sliding window to ppocr
- img to table
- img to table
- ocr result fix
- ocr result fix
- fixed pyscripts scaling issues and time logging
- fixed
- Added a lot of phone fixes
- moved file conversion to process, added pdf url to output
- fixed pyscripts and gen signed url
- rm prints
- Removed authentication from phone
- Fixed up pricing page header and styling
- cleanup - unused code
- cleanup - unused code
- default payment method
- new deployment
- cooked
- desc month
- month desc
- reverting
- reverting
- updated example secrets
- pdf_url
- added new logs for debugging
- removed logs
- adding logs to upload task
- newest version deployed
- changed api schema
- implemented new segment models in task
- update cropping
- Implemented new bbox stuff on frontend
- updated rust in docker
- deploying api schema change
- deploying api schema change
- added message for usage limit exceeded
- Using legacy version of pdfjs
- Added posthog analytics
- updated web
- Update README.md
- multi gpu support
- fixed typo for curl command
- fixed typo for curl command
- Update README.md
- Update README.md
- updated readme
- updated readme
- updated readme
- updated readme
- increased chunkr timeout
- increased timeout to 180s
- Updates
- added lazy static to client
- updated terraform
- add timeout for rrq
- server on fire
- server on fire
- lfg
- updating nodes to stop ooming
- separated ocr from segmentation
- removed ocr lock
- dep hell
- throughput maxxing
- load model for every request with thread locks
- adding telemetry
- ocr more stable
- Added copy button for copy/pasting markdown, html and JSON exactly as is. Can copy text from within chunks now as well.
- Got rid of extra console logs
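
For readers skimming the commit list, the copy behaviour named in the PR title can be pictured roughly as below. This is only a minimal sketch, not the code merged in this PR: the `ChunkContent` shape, the `CopyFormat` type, and the `textToCopy` helper are hypothetical names chosen for illustration, and pretty-printing JSON with two spaces is an assumption.

```typescript
// Hypothetical shape of a chunk's content; not taken from the PR itself.
type CopyFormat = "markdown" | "html" | "json";

interface ChunkContent {
  markdown: string;
  html: string;
  json: unknown;
}

// Return the exact string that a copy button would place on the clipboard.
// Markdown and HTML are returned untouched so the pasted text matches the source.
function textToCopy(content: ChunkContent, format: CopyFormat): string {
  switch (format) {
    case "markdown":
      return content.markdown;
    case "html":
      return content.html;
    case "json":
      // Assumption: JSON is serialized with two-space indentation.
      return JSON.stringify(content.json, null, 2);
  }
}

// Example value used only to illustrate the helper.
const exampleChunk: ChunkContent = {
  markdown: "| a | b |\n|---|---|",
  html: "<table><tr><td>a</td><td>b</td></tr></table>",
  json: { text: "a b" },
};
```

In a browser click handler, the returned string could then be passed to the standard `navigator.clipboard.writeText`, which resolves once the text has been written to the clipboard.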