forked from Mirror/ollama4j
44 lines
1.3 KiB
Markdown
44 lines
1.3 KiB
Markdown
---
|
|
sidebar_position: 3
|
|
---
|
|
|
|
# Generate - With Image Files
|
|
|
|
This API lets you ask questions along with the image files to the LLMs.
|
|
These APIs correlate to
|
|
the [completion](https://github.com/jmorganca/ollama/blob/main/docs/api.md#generate-a-completion) APIs.
|
|
|
|
:::note
|
|
|
|
Executing this on Ollama server running in CPU-mode will take longer to generate response. Hence, GPU-mode is
|
|
recommended.
|
|
|
|
:::
|
|
|
|
## Synchronous mode
|
|
|
|
If you have this image downloaded and you pass the path to the downloaded image to the following code:
|
|
|
|

|
|
|
|
```java
|
|
public class Main {
|
|
|
|
public static void main(String[] args) {
|
|
String host = "http://localhost:11434/";
|
|
OllamaAPI ollamaAPI = new OllamaAPI(host);
|
|
ollamaAPI.setRequestTimeoutSeconds(10);
|
|
|
|
OllamaResult result = ollamaAPI.generateWithImageFiles(OllamaModelType.LLAVA,
|
|
"What's in this image?",
|
|
List.of(
|
|
new File("/path/to/image")));
|
|
System.out.println(result.getResponse());
|
|
}
|
|
}
|
|
```
|
|
|
|
You will get a response similar to:
|
|
|
|
> This image features a white boat with brown cushions, where a dog is sitting on the back of the boat. The dog seems to
|
|
> be enjoying its time outdoors, perhaps on a lake. |