LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 ... Speculative Decoding, Cost Optimization

Prix à partir de
9,18

En vedette

COMPARER TOUS LES MAGASINS EN LIGNE (2)

Description

Amazon LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 Quantization, ... Speculative Decoding, and Cost Optimization

Comparer les boutiques en ligne (2)

Shop
Prix
Affranchissement
Prix total
9,18 
2,49 €
11,67 
Voir l’offre
2,49 € Shipping Costs
9,18 
2,49 €
11,67 
Voir l’offre
2,49 € Shipping Costs
Description (1)

LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 Quantization, ... Speculative Decoding, and Cost Optimization


Spécifications du produit

Marque Independently Published
EAN
  • 9798180985187

Prix mis à jour pour la dernière fois le :

Choix en vedette
9,18 
Voir l’offre