Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG Workflows

By Harvey Chandler

No Customer Reviews

Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG Workflows

Build private, fast, cost-controlled AI systems directly on your own machine.

Tired of sending every prompt, code snippet, and internal document to a remote API? Want to experiment with open-source LLMs without unpredictable usage bills, rate limits, or cloud dependency?

Local AI for Developers gives you a practical path to running, optimizing, and building with local language models using Ollama, llama.cpp, GGUF models, quantization, embeddings, and Retrieval-Augmented Generation workflows. Instead of treating local AI as a toy demo, this book shows how to turn your laptop or workstation into a usable AI development environment.

You'll learn how to choose models that fit your hardware, run open-source LLMs locally, expose them through developer-friendly APIs, build private document assistants, create local embeddings, improve RAG quality, benchmark performance, and move from laptop prototypes toward repeatable local AI workflows. The manuscript covers Ollama, llama.cpp, quantization, model selection, RAG, reranking, benchmarking, and production-style local workflows.

Inside, you'll gain practical skills for:

Setting up a reliable local AI development stackRunning and managing models with OllamaUsing llama.cpp and GGUF models for deeper controlChoosing quantization levels for speed and qualityBuilding local APIs, assistants, embeddings, and RAG pipelinesEvaluating latency, throughput, memory use, and response quality

This book is for software developers, AI engineers, ML practitioners, technical founders, and builders who want private, offline-capable, open-source AI systems they can control.

Get your copy today and start building local AI workflows that run on your hardware, protect your data, and fit real developer work.

Format:Paperback

Language:English

ISBN:B0H2JXSBJ5

ISBN13:9798198005006

Release Date:May 2026

Publisher:Independently Published

Length:176 Pages

Weight:0.70 lbs.

Dimensions:0.4" x 7.0" x 10.0"

Related Subjects

Computers Computers & Technology

Customer Reviews

0 rating

Write a review

ThriftBooks sells millions of used books at the lowest everyday prices. We personally assess every book's quality and offer rare, out-of-print treasures. We deliver the joy of reading in recyclable packaging with free standard shipping on US orders over $20. ThriftBooks.com. Read more. Spend less.

Copyright © 2026 Thriftbooks.com Terms of Use | Privacy Policy | Do Not Sell/Share My Personal Information | Cookie Policy | Cookie Preferences | Accessibility Statement
ThriftBooks ^® and the ThriftBooks ^® logo are registered trademarks of Thrift Books Global, LLC

Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG Workflows

Recommended

Customer Reviews

Popular Categories

Website

My Account

Partnerships

Quick Help

About Us

Follow Us