Phi-4-multimodal-instruct

Lowest Price: $0.05 per 1M tokens
Providers: 1 available
Context: N/A

Price Comparison

Provider            Input / Output    Latency    Status
DeepInfra (Lowest)  $0.05 / $0.10     ...        Verified
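
To make the listed rates concrete, the following is a minimal sketch of a cost estimate for a single workload. It assumes both the input and the output price are quoted per 1M tokens (the page states that unit only for the lowest price), and the function name `estimate_cost` is illustrative, not part of any provider SDK.

```python
from decimal import Decimal

# Rates copied from the price comparison above.
# Assumption: both figures are USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.10")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost for the given token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
    print(f"Estimated cost: ${estimate_cost(200_000, 50_000)}")
```

Decimal arithmetic is used here so that small per-token amounts do not pick up binary floating-point rounding noise.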

About This Model

Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that builds on the language, vision, and speech research and datasets used for the Phi-3.5 and 4.0 models. It accepts text, image, and audio inputs, generates text outputs, and supports a 128K-token context length. The...

Quick Start