- vLLM in Practice: A Developer’s Guide to High-Performance Inference, Scalable Serving, and Efficient Large Language Model Deployment
- C++ Programming for Backend Systems: Designing Scalable Services, APIs, and High-Performance Server Architectures (C++ Systems Engineering Series)
- Agentic AI Coding Automation with Claude Code In 2026: Designing Intelligent Development Workflows with Assisted Tooling, and Scalable Software Practices
- A2A Agentic Systems with MCP Servers: Architecting Agent-to-Agent Communication, Modular and Distributed Intelligence Frameworks
- C++ Memory Programming: Understanding Allocation, Ownership, and Performance-Oriented Design













