Llama-3.3-70B-Instruct-Turbo
Meta

Lowest Price: $0.10 per 1M tokens
Providers: 1 available
Context: N/A

Price Comparison

Provider              Input / Output (per 1M tokens)   Latency   Status
DeepInfra (Lowest)    $0.10 / $0.32                    ...       Verified
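The per-token rates above make it straightforward to estimate the cost of a request. A minimal sketch, using the DeepInfra rates from the table ($0.10 input, $0.32 output, each per 1M tokens):

```python
def cost_usd(input_tokens: int, output_tokens: int,
             in_price: float = 0.10, out_price: float = 0.32) -> float:
    """Estimate request cost in USD; prices are per 1M tokens."""
    return input_tokens / 1e6 * in_price + output_tokens / 1e6 * out_price

# e.g. a request with 500k input tokens and 100k output tokens
estimate = cost_usd(500_000, 100_000)  # about $0.082
```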

About This Model

Llama 3.3-70B Turbo is a highly optimized version of the Llama 3.3-70B model, utilizing FP8 quantization to deliver significantly faster inference speeds with a minor trade-off in accuracy. The model is designed to be helpful, safe, and flexible, with a focus on responsible deployment and mitigating...

Quick Start
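The page does not include the quick-start snippet itself, so what follows is a hedged sketch of a typical chat-completion request. The endpoint URL and the model identifier (`meta-llama/Llama-3.3-70B-Instruct-Turbo`) are assumptions based on DeepInfra's OpenAI-compatible API, not values confirmed by this page:

```python
import json

# Assumed OpenAI-compatible endpoint and model id (verify against the
# provider's own documentation before use).
API_URL = "https://api.deepinfra.com/v1/openai/chat/completions"
MODEL = "meta-llama/Llama-3.3-70B-Instruct-Turbo"

payload = {
    "model": MODEL,
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 128,
}
body = json.dumps(payload)

# Send with any HTTP client, supplying your API key, e.g.:
# requests.post(API_URL, json=payload,
#               headers={"Authorization": f"Bearer {API_KEY}"})
```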