Back to Search

DeepSeek in Action: LLM Deployment, Fine-Tuning, and Application

AUTHOR Dai, Jing
PUBLISHER CRC Press (11/18/2025)
PRODUCT TYPE Hardcover (Hardcover)

Description

From fundamental concepts to advanced implementations, this book thoroughly explores the DeepSeek-V3 model, focusing on its Transformer-based architecture, technological innovations, and applications.

The book begins with a thorough examination of theoretical foundations, including self-attention, positional encoding, the Mixture of Experts mechanism, and distributed training strategies. It then explores DeepSeek-V3's technical advancements, including sparse attention mechanisms, FP8 mixed-precision training, and hierarchical load balancing, which optimize memory and energy efficiency. Through case studies and API integration techniques, the model's high-performance capabilities in text generation, mathematical reasoning, and code completion are examined. The book highlights DeepSeek's open platform and covers secure API authentication, concurrency strategies, and real-time data processing for scalable AI applications. Additionally, the book addresses industry applications, such as chat client development, utilizing DeepSeek's context caching and callback functions for automation and predictive maintenance.

This book is aimed primarily at AI researchers and developers working on large-scale AI models. It is an invaluable resource for professionals seeking to understand the theoretical underpinnings and practical implementation of advanced AI systems, particularly those interested in efficient, scalable applications.

Show More
Product Format
Product Details
ISBN-13: 9781041090007
ISBN-10: 1041090005
Binding: Hardback or Cased Book (Sewn)
Content Language: English
More Product Details
Page Count: 14
Carton Quantity: 9
Product Dimensions: 7.00 x 0.88 x 10.00 inches
Weight: 1.98 pound(s)
Feature Codes: Illustrated
Country of Origin: US
Subject Information
BISAC Categories
Computers | Artificial Intelligence - Generative AI
Computers | Artificial Intelligence - Natural Language Processing
Descriptions, Reviews, Etc.
publisher marketing

From fundamental concepts to advanced implementations, this book thoroughly explores the DeepSeek-V3 model, focusing on its Transformer-based architecture, technological innovations, and applications.

The book begins with a thorough examination of theoretical foundations, including self-attention, positional encoding, the Mixture of Experts mechanism, and distributed training strategies. It then explores DeepSeek-V3's technical advancements, including sparse attention mechanisms, FP8 mixed-precision training, and hierarchical load balancing, which optimize memory and energy efficiency. Through case studies and API integration techniques, the model's high-performance capabilities in text generation, mathematical reasoning, and code completion are examined. The book highlights DeepSeek's open platform and covers secure API authentication, concurrency strategies, and real-time data processing for scalable AI applications. Additionally, the book addresses industry applications, such as chat client development, utilizing DeepSeek's context caching and callback functions for automation and predictive maintenance.

This book is aimed primarily at AI researchers and developers working on large-scale AI models. It is an invaluable resource for professionals seeking to understand the theoretical underpinnings and practical implementation of advanced AI systems, particularly those interested in efficient, scalable applications.

Show More
List Price $180.00
Your Price  $178.20
Hardcover