Skip to content

This plugin provides tools for image analysis with OCR and AI using Azure AI Vision.

License

Notifications You must be signed in to change notification settings

fujita-h/dify-plugin-azure-ai-vision

Repository files navigation

Icon

Azure AI Vision

GitHub Repo GitHub Release GitHub License

This plugin provides tools for image analysis with OCR and AI using Azure AI Vision.

Important

This plugin requires an Azure subscription. If you don't have an Azure subscription, create a free account before you begin. Learn more about the free trial here.

Tools provided by this plugin

Discover computer vision insights from image analysis with OCR and AI.

Caption

he Caption feature generates a one-sentence description of all the image contents.

Learn more about Caption at here.

Dense Caption

Dense Captions provides more detail by generating one-sentence descriptions of up to 10 different regions of the image in addition to describing the whole image.

Learn more about Dense Caption at here.

Object Detection

Object detection returns the bounding box coordinates (in pixels) for each object found in the image. For example, if an image contains a dog, cat, and person, the object detection operation lists those objects with their coordinates in the image.

Learn more about Object Detection at here.

People Detection

People detection offers the ability to detect people appearing in images. The bounding box coordinates of each detected person are returned, along with a confidence score.

Learn more about People Detection at here.

OCR

OCR is a machine-learning-based technique for extracting text from in-the-wild and non-document images like product labels, user-generated images, screenshots, street signs, and posters.

Learn more about OCR at here.

Smart-Cropping

Smart-Cropping is a reduced-size representation of an image. Smart-Cropping are used to represent images and other data in a more economical, layout-friendly way. The Azure AI Vision 4.0 API uses smart cropping to create intuitive image thumbnails that include the most important regions of an image with priority given to any detected faces.

Learn more about Smart-Cropping at here.

Tags

The tags feature returns content tags for thousands of recognizable objects, living beings, scenery, and actions that appear in images.

Learn more about Tags at here.

Configuration and Usage

See Plugin README for configuration and usage details.

Notes

This plugin uses the Image Analysis API v4.0 of Azure AI Vision. Please refer to the official documentation for more information.

The functionality provided by this plugin is subject to the requirements and limitations of Azure AI Vision. Please refer to the official documentation for more information.

Input Requirements

Image Analysis API v4.0 have specific requirements for input images.

Region Availability

Image Analysis API v4.0 is limited to specific regions.

Language Support

You can specify the language for output generation by setting the language parameter. If this parameter is not specified, the default value is "en". See supported languages for more information.

Contributing

This plugin is open-source and contributions are welcome. Please visit the GitHub repository to contribute.

About

This plugin provides tools for image analysis with OCR and AI using Azure AI Vision.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published