PDF Accessibility with OpenAI

A docker image that automatically fixes accessibility issues in PDF files using PDFix SDK and OpenAI.

Available Actions

Generate Alternate Text for images
Generate Table Summary
Generate MathML Associated Files for Formulas

Generate Alternate Text for images

Action that automatically generates and applies alternative image descriptions in PDF tags with OpenAI

Generate Table Summary

Action that automatically generates and applies the table summary attribute in PDF tags with OpenAI

Generate MathML Associated Files for Formulas

Action that automatically generates and adds MathML accociated files with OpenAI

PDF Accessibility with OpenAI

commands:
    config                  Save the default configuration file
    generate-alt-text       Generate alternate text for images
    generate-table-summary  Generate Table Summary
    generate-mathml         Generate MathML for formulas

common options:
  -h, --help                show this help message and exit
  --openai-key OPENAI_KEY   OpenAI API key
  --input INPUT             The input PDF file
  --output OUTPUT           The output file
  --tags TAGS               Regular expression defining the tag names tpo process
  --name NAME               PDFix license name
  --key KEY                 PDFix license key

generate-alt-text, generate-table-summary:
  --lang LANG               The laguage of the alternate description and table summary
  --overwrite OVERWRITE     Overwrite the existing value

generate-mathml:
  --mathml-version          MathML version
      {mathml-1,mathml-2,mathml-3,mathml-4}

Getting Started

To use this Docker application, you'll need to have Docker installed on your system. If Docker is not installed, please follow the instructions on the official Docker website to install it.

Run using Command Line Interface

To run the docker container as CLI, you should share the folder containing the PDF for processing using the -i parameter. In this example, the current folder is used.

docker run -v $(pwd):/data/ -w /data pdfix/pdf-accessibility-openai:latest generate-alt-text --input document.pdf --output out.pdf --tags "Figure|Formula" --openai_key <api_key> --lang English --overwrite true

To run With a PDFix License add these arguments.

--name ${LICENSE_NAME} --key ${LICENSE_KEY}

Contact support for more infomation.

First run will pull the docker image, which may take some time. Make your own image for more advanced use.

For more detailed information about the available command-line arguments, you can run the following command:

docker run --rm pdfix/pdf-accessibility-openai:latest --help

Run Description Generation using REST API

Comming soon. Please contact us.

Export the Configuration File for Integration

To export the configuration JSON file, use the following command:

docker run -v $(pwd):/data -w /data --rm pdfix/pdf-accessibility-openai:latest config --output config.json

License & libraries used

PDFix SDK - https://pdfix.net/terms
OpenAI API - https://openai.com/policies/

Trial version of the PDFix SDK may apply a watermark on the page and redact random parts of the PDF including the scanned image in background. Contact us to get an evaluation or production license.

Help & Support

To obtain a PDFix SDK license or report an issue please contact us at support@pdfix.net. For more information visit https://pdfix.net

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
config.json		config.json
requirements.txt		requirements.txt
update_version.sh		update_version.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF Accessibility with OpenAI

Table of Contents

Available Actions

Generate Alternate Text for images

Generate Table Summary

Generate MathML Associated Files for Formulas

Getting Started

Run using Command Line Interface

Run Description Generation using REST API

Export the Configuration File for Integration

License & libraries used

Help & Support

About

Releases 1

Packages

Languages

pdfix/action-pdf-accessibility-openai

Folders and files

Latest commit

History

Repository files navigation

PDF Accessibility with OpenAI

Table of Contents

Available Actions

Generate Alternate Text for images

Generate Table Summary

Generate MathML Associated Files for Formulas

Getting Started

Run using Command Line Interface

Run Description Generation using REST API

Export the Configuration File for Integration

License & libraries used

Help & Support

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages