Added copy button for copy/pasting markdown, html and JSON exactly as is. Can copy text from within chunks now as well. #212

Merged
2 commits merged on Oct 10, 2024

Conversation

m-chadda
Collaborator

  • pdf to image
  • extension error
  • cropping error
  • getting images back, not working
  • .
  • pyscripts
  • fixed scaling factor for page
  • new compose
  • scaling factor issues
  • fixed pdla deploy
  • added image url stuff :
  • added image url stuff
  • task creation has image_url
  • rm threadlock and run in threadpool in pdla
  • image file path fixes
  • s3 upload issues
  • done with ocr
  • get tasks error fixed
  • process lock on / route
  • Refactored frontend
  • Rebased with main successfully
  • Added OCR strategy and Task stats in viewer & status view
  • main test
  • removed segment density
  • Moved dashboard to pages
  • Added curl command placeholder for no tasks
  • deployed new pdla
  • added timing for ocr
  • added timing for ocr
  • table html to mkd
  • table html to mkd
  • table html to mkd
  • Cleaned up svg errors, added OCR selection on upload form, images optimized
  • fixing mkd table issue
  • fixing mkd table issue
  • adding \n as string literal
  • adding \n as string literal
  • | is not |?
  • doing it on frontend
  • doing it on frontend
  • base
  • frontend markdown resolved
  • Rebasing with main
  • Added tables markdown css, html viewing options, API key fixes for small screen menu dropdown
  • updated google timeout
  • list item fixes
  • list item ol start from given digit
  • clean ul
  • migrations
  • new migs
  • new api
  • new api spec
  • wiping pg
  • updating gpu
  • updated gpu
  • separated out task workers
  • new workers - fast - highquality
  • cleaned up pyscripts
  • refactoring the models
  • syntax error fix
  • fixed syntax error
  • fixed syntax error
  • fixed pyscripts
  • new dockers for processors
  • new processor names
  • cleanup
  • Need to add chunking strategy in upload form
  • Updating with main
  • Added HTML viewing fixes and chunking strategy button on upload form
  • No tasks state text prompt added
  • task service docker created
  • docker updated
  • docker updated
  • docker updated
  • added uv venv
  • added uv venv
  • new kube
  • paddle ocr issue
  • created init
  • paddle ocr issues, going to build with gpu
  • added libgomp1 to dockerfile
  • added openCV deps in docker
  • updated num of workers
  • repushing
  • new kube
  • unexpected end of data error
  • dockerfile updated because cudnn error
  • new kube
  • updated env for docker image
  • dockerignore update
  • docker image only copying over essentials
  • docker image only copying over essentials
  • fixed COPY
  • kube
  • deploying new frontend
  • task ocr parsed to float
  • get content updated
  • prints
  • idek
  • ready for deployment
  • deploying new version
  • deploying new version
  • deployment with ocr
  • deployment successful
  • new cmds
  • Fixed bug for API pricing calculator - fast pricing is now correct
  • updated api spec
  • Fixed billing amount bugs
  • better tests
  • stripe tested on dev
  • invoicing done
  • added authorization
  • clean up
  • rm unused code
  • deployed
  • token incorrect
  • fixed incorrect server url
  • rm cron from self deploy
  • deployed newest version
  • updated pdla
  • updated pdla
  • added
  • added threadpool
  • pdla code updated
  • clean up services
  • pdla fast new deployment
  • deploying higher fast throughput
  • updated number of connections
  • invoicing fixes
  • comment purge
  • deployed invoicer update
  • updated max dims for segments
  • updated task
  • cloudflare full strict
  • Added small fixes for frontend and OCRBBox view on hover + OCRText for OCRBBox on hover
  • Small fix - Removed random console.log
  • updating queue consumer
  • updated secrets
  • updated deployment file
  • Fixed MZ and Safari bugs for upload component on landing
  • silly
  • Removed speed approximations for Segmentation model cards
  • file support
  • deployed web
  • silly
  • added polyfill for promise with resolvers
  • libre office integration
  • Fixed chunk_length undefined bug
  • open api spec
  • new venv
  • Removed speed approximations for Segmentation model cards
  • deployed web
  • added polyfill for promise with resolvers
  • Fixed chunk_length undefined bug
  • open api spec
  • new venv
  • sending to task service
  • invalid header
  • idk headers in pdfs suck
  • updates to generate
  • qwen
  • qwen batch
  • qwen batch
  • qwen batch
  • qwen batch
  • qwen fixes
  • qwen fixes
  • qwen fixes
  • qwen batch
  • docx
  • ppt, docx support added
  • lfg
  • lfg
  • added task
  • updated accept list
  • merged
  • fixed conflicts in qwen
  • LLM based ocr for tables added
  • updated html logic
  • updated cost values
  • fixed conflict in process
  • fixed circular deps
  • fixed circular deps
  • updated default value
  • removed print statements
  • deploying new version with llm table support
  • deployed new version with docx, pptx support
  • added dropzone type
  • added module declaration for react-dropzone
  • deployed with LLM table support
  • updated num_workers
  • updated example kube env
  • adding o1 support
  • added latex api
  • testing o1
  • removed o1 stuff
  • fixed monthly order for usage and multi file type support
  • notices
  • Update README.md
  • Update LICENSE
  • Delete LICENSE
  • Create LICENSE
  • Create COMMERCIAL_LICENSE
  • bug fixes
  • prompting
  • Fixed free tier rollover on dashboard
  • Api dialog width fix
  • updated conversion
  • Added file support byline
  • added ppocr to table with llm
  • camelot for tables
  • deploying new web and backend
  • adding o1 support
  • added latex api
  • testing o1
  • removed o1 stuff
  • fixed monthly order for usage and multi file type support
  • bug fixes
  • prompting
  • updated conversion
  • added ppocr to table with llm
  • camelot for tables
  • Small fix in byline for homepage
  • adding sliding window to ppocr
  • img to table
  • img to table
  • ocr result fix
  • ocr result fix
  • fixed pyscripts scaling issues and time logging
  • fixed
  • Added a lot of phone fixes
  • moved file conversion to process, added pdf url to output
  • fixed pyscripts and gen signed url
  • rm prints
  • Removed authentication from phone
  • Fixed up pricing page header and styling
  • cleanup - unused code
  • cleanup - unused code
  • default payment method
  • new deployment
  • cooked
  • desc month
  • month desc
  • reverting
  • reverting
  • updated example secrets
  • pdf_url
  • added new logs for debugging
  • removed logs
  • adding logs to upload task
  • newest version deployed
  • changed api schema
  • implemented new segment models in task
  • update cropping
  • Implemented new bbox stuff on frontend
  • updated rust in docker
  • deploying api schema change
  • deploying api schema change
  • added message for usage limit exceeded
  • Using legacy version of pdfjs
  • Added posthog analytics
  • updated web
  • Update README.md
  • multi gpu support
  • fixed typo for curl command
  • fixed typo for curl command
  • Update README.md
  • Update README.md
  • updated readme
  • updated readme
  • updated readme
  • updated readme
  • increased chunkr timeout
  • increased timeout to 180s
  • Updates
  • added lazy static to client
  • updated terraform
  • add timeout for rrq
  • server on fire
  • server on fire
  • lfg
  • updating nodes to stop ooming
  • separated ocr from segmentation
  • removed ocr lock
  • dep hell
  • throughput maxxing
  • load model for every request with thread locks
  • adding telemetry
  • ocr more stable
  • Added copy button for copy/pasting markdown, html and JSON exactly as is. Can copy text from within chunks now as well.
  • Got rid of extra console logs

@m-chadda m-chadda merged commit 5b6b849 into main Oct 10, 2024
1 of 2 checks passed
@akhileshsharma99 akhileshsharma99 deleted the copy-pasting branch January 19, 2025 23:10
Development

Successfully merging this pull request may close these issues.

feature: allow copy pasting and selecting of the text outputs of the web viewer
1 participant