The Transformer Principles Series is a three-volume graduate-level treatise that builds a complete mathematical and engineering understanding of modern AI systems, from the foundational attention mechanism to large language models and multimodal architectures. Volume I - Mathematical Foundations and Transformer Principles begins with the historical evolution from symbolic AI to deep learning, then develops the essential mathematics: linear algebra, probability, optimization, neural network backpropagation, and information theory. These tools are applied through a systematic construction of the Transformer - self-attention, multi-head projections, positional encodings, feed-forward networks, residual connections, and normalization - culminating in the complete encoder-decoder architecture and an exploration of efficient attention variants, mixture-of-experts, and state-space models.
ThriftBooks sells millions of used books at the lowest everyday prices. We personally assess every book's quality and offer rare, out-of-print treasures. We deliver the joy of reading in recyclable packaging with free standard shipping on US orders over $20. ThriftBooks.com. Read more. Spend less.