ollama4j/docs/docs/apis-generate/generate-with-images.md

1.8 KiB

sidebar_position
sidebar_position
4

import CodeEmbed from '@site/src/components/CodeEmbed'; import TypewriterTextarea from '@site/src/components/TypewriterTextarea';

Generate with Images

This API lets you ask questions along with the image files to the LLMs. This API corresponds to the completion API.

:::note

Executing this on Ollama server running in CPU-mode will take longer to generate response. Hence, GPU-mode is recommended.

:::

If you have this image downloaded and you pass the path to the downloaded image to the following code:

Img

You will get a response similar to:

:::tip[LLM Response] This image features a white boat with brown cushions, where a dog is sitting on the back of the boat. The dog seems to be enjoying its time outdoors, perhaps on a lake. :::

If you want the response to be streamed, you can use the following code:

Img

You will get a response similar to:

:::tip[Response Tokens] <TypewriterTextarea textContent={This image features a white boat with brown cushions, where a dog is sitting on the back of the boat. The dog seems to be enjoying its time outdoors, perhaps on a lake.} typingSpeed={10} pauseBetweenSentences={1200} height="auto" width="100%" style={{ whiteSpace: 'pre-line' }} /> :::