Multi-Modal Prompt Builder & Model Inference
Select Task Mode
Text Chat
Vision-Language
Speech-Language
Vision-Speech
System Message
You are a helpful assistant.
User Message
Upload Image(s) (Multiple)
Upload Audio (wav, mp3, flac)
Upload Image(s) for Vision-Speech
Upload Audio for Vision-Speech
Submit
Result