Multimodal AI Workflows with Hugging Face: Combining embeddings, image and text models, and retrieval frameworks for advanced anomaly detection and r

ASIN: B0FT3MLBPG

ISBN13: 9798267557214

Master multimodal AI with Hugging Face and build real systems that combine text, images, and retrieval for production-grade workflows.

Modern AI no longer relies on a single data type. Real applications demand models that connect text with images, integrate structured retrieval, and deliver results that scale in production. The challenge for practitioners is moving beyond theory into working pipelines that handle these tasks reliably. Multimodal AI Workflows with Hugging Face shows you exactly how to do it.

This book takes you from core embeddings and vector search to advanced multimodal retrieval-augmented generation, anomaly detection, and recommender systems. Each section connects the underlying concepts with practical code, helping you move from understanding to implementation with confidence.

What you will learn:
How to work with modern text and image embeddings using CLIP, OpenCLIP, SigLIP, and Sentence-Transformers
Practical vector search with FAISS, Weaviate, Milvus, and pgvector
Building multimodal retrieval-augmented generation systems with LlamaIndex and Haystack
Implementing anomaly detection with Anomalib, PaDiM, PatchCore, and MVTec AD
Designing recommendation engines that combine image and text signals
Using vision-language models such as BLIP-2 and Idefics2 for document and chart understanding
Evaluating systems with metrics including AUROC, PRO, Recall, NDCG, and pixel-level accuracy
Deploying and monitoring multimodal systems in real-world finance, healthcare, manufacturing, and retail scenarios

Code included:
This is a code-heavy guide packed with working Python examples. Every major concept is illustrated with runnable code so you can build your own retrieval, anomaly detection, and recommendation pipelines directly from the text.

Whether you are a machine learning engineer, data scientist, or developer interested in production-ready AI, this book provides the hands-on foundation you need to connect multimodal models with real industry use cases.

Grab your copy today and start building the next generation of AI systems.

Format: Paperback

Temporarily Unavailable

We receive fewer than 1 copy every 6 months.

Copyright © 2026 Thriftbooks.com