Florence-2 Image Captioner

Provided by FalLearn More

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks.

Preview

Inputs

Image
image
The image to generate a caption for.
Detail Level
dropdown
The level of detail in the generated caption.
default: detailedAccepts: Basic (simple description), Detailed (comprehensive description), More Detailed (extensive information)

Outputs

Caption
text
The generated caption for the image.