Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications
| AUTHOR | Subramanian, Shreyas; Henning, Daniel |
| PUBLISHER | Ascent Audio (11/19/2024) |
| PRODUCT TYPE | Audio (MP3 CD) |
Description
In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine-tuning. The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find: ? Effective strategies to address the challenge of the high computational cost associated with LLMs ? Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques ? Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models
Show More
Product Format
Product Details
ISBN-13:
9798228316300
Binding:
CD-Audio (MP3 Format)
Content Language:
English
More Product Details
Carton Quantity:
100
Feature Codes:
Unabridged
Country of Origin:
US
Subject Information
BISAC Categories
Computers | Machine Theory
Computers | Artificial Intelligence - Natural Language Processing
Descriptions, Reviews, Etc.
publisher marketing
In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine-tuning. The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find: ? Effective strategies to address the challenge of the high computational cost associated with LLMs ? Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques ? Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models
Show More
List Price $45.95
Your Price
$45.49
