You can use the CLIP model to generate embeddings ...
# community-help
j
You can use the CLIP model to generate embeddings for both your image and text together in a single embedding field