Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG Workflows
Uitgelicht
|
18,19 |
Naar shop
|
|
18,23 |
Naar shop
|
|
18,23 |
Naar shop
|
Beschrijving
Bol
Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG WorkflowsBuild private, fast, cost-controlled AI systems directly on your own machine.Tired of sending every prompt, code snippet, and internal document to a remote API? Want to experiment with open-source LLMs without unpredictable usage bills, rate limits, or cloud dependency?Local AI for Developers gives you a practical path to running, optimizing, and building with local language models using Ollama, llama.cpp, GGUF models, quantization, embeddings, and Retrieval-Augmented Generation workflows. Instead of treating local AI as a toy demo, this book shows how to turn your laptop or workstation into a usable AI development environment.You'll learn how to choose models that fit your hardware, run open-source LLMs locally, expose them through developer-friendly APIs, build private document assistants, create local embeddings, improve RAG quality, benchmark performance, and move from laptop prototypes toward repeatable local AI workflows. The manuscript covers Ollama, llama.cpp, quantization, model selection, RAG, reranking, benchmarking, and production-style local workflows.Inside, you'll gain practical skills for: - Setting up a reliable local AI development stack- Running and managing models with Ollama- Using llama.cpp and GGUF models for deeper control- Choosing quantization levels for speed and quality- Building local APIs, assistants, embeddings, and RAG pipelines- Evaluating latency, throughput, memory use, and response qualityThis book is for software developers, AI engineers, ML practitioners, technical founders, and builders who want private, offline-capable, open-source AI systems they can control.Get your copy today and start building local AI workflows that run on your hardware, protect your data, and fit real developer work.
Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG WorkflowsBuild private, fast, cost-controlled AI systems directly on your own machine.Tired of sending every prompt, code snippet, and internal document to a remote API? Want to experiment with open-source LLMs without unpredictable usage bills, rate limits, or cloud dependency?Local AI for Developers gives you a practical path to running, optimizing, and building with local language models using Ollama, llama.cpp, GGUF models, quantization, embeddings, and Retrieval-Augmented Generation workflows. Instead of treating local AI as a toy demo, this book shows how to turn your laptop or workstation into a usable AI development environment.You'll learn how to choose models that fit your hardware, run open-source LLMs locally, expose them through developer-friendly APIs, build private document assistants, create local embeddings, improve RAG quality, benchmark performance, and move from laptop prototypes toward repeatable local AI workflows. The manuscript covers Ollama, llama.cpp, quantization, model selection, RAG, reranking, benchmarking, and production-style local workflows.Inside, you'll gain practical skills for: - Setting up a reliable local AI development stack- Running and managing models with Ollama- Using llama.cpp and GGUF models for deeper control- Choosing quantization levels for speed and quality- Building local APIs, assistants, embeddings, and RAG pipelines- Evaluating latency, throughput, memory use, and response qualityThis book is for software developers, AI engineers, ML practitioners, technical founders, and builders who want private, offline-capable, open-source AI systems they can control.Get your copy today and start building local AI workflows that run on your hardware, protect your data, and fit real developer work.
AmazonPages: 175, Paperback, Independently published
Prijzen voor het laatst bijgewerkt op: