Generative AI Models for Creatives

Blip Image Capturing

External Service
Recognition
Writing
Augmentation
Workflow

Get information from an image

With BLIP (Bootstrapping Language-Image Pre-training) you get an unified vision-language understanding and can generate text based on an image. BLIP can be used for 

  1. Image captioning
  2. Open-ended visual question answering
  3. Multimodal / unimodal feature extraction
  4. Image-text matching
Model details
AuthorSalesforce
Published in2021
Architecturetransformer
Licensebsd
Related models:
Didn't find what you are looking for? Send us your suggestions!
notfound::false