DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Independently Published
DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Afbeelding van DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Prijzen vanaf

45,55

Uitgelicht

	45,55	Naar shop
	45,55	Naar shop

Beschrijving

Amazon Pages: 288, Hardcover, Independently published

Lees meer

Vergelijk aanbieders (2)

Shop

Prijs

Verzendkosten

Totale prijs

45,55

Gratis

45,55

Naar shop

Gratis

45,55

Gratis

45,55

Naar shop

Gratis

Beschrijving (1)

Pages: 288, Hardcover, Independently published

Lees meer

Productspecificaties

Merk	Independently Published
EAN	9798274508001

Prijzen voor het laatst bijgewerkt op: 14-06-2026, 20:42

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

26,17

Vergelijk 2 shops 2 shops

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

15,15

Vergelijk 3 shops 3 shops

DeepSpeed Inference

8,52

Meer informatie Meer info

Independently Published

LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's...

9,18

Vergelijk 2 shops 2 shops

Uitgelichte Keuze

45,55

Naar shop