How to Build Large Language Models (LLMs): From Data Preparation to Deployment and Beyond
Vemula, Anand
- Publisher: Independently Published
- Publication date: 2024-08-09
- List price: $1,000
- VIP price: $950 (5% off)
- Language: English
- Pages: 104
- Binding: Quality Paper (also called trade paper)
- ISBN-13: 9798335439497
Related categories:
LangChain
Overseas special-order title (must be checked out separately)
Product description
"How to Build Large Language Models (LLMs): From Data Preparation to Deployment and Beyond" provides a comprehensive guide to the entire lifecycle of creating and deploying large language models. The book is an essential resource for AI practitioners, data scientists, and machine learning engineers who want to master the intricacies of LLMs.
The book begins with an introduction to LLMs, covering foundational concepts and the evolution of language models from early recurrent neural networks (RNNs) to modern transformer architectures. It explores popular LLM architectures, including GPT and BERT, highlighting their unique features and applications.
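To ground the architecture discussion, here is a minimal sketch of scaled dot-product attention, the core operation shared by GPT- and BERT-style transformers. The function name, tensor shapes, and random inputs are illustrative choices, not material from the book.

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, head_dim)
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)  # similarity of each query to each key
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))  # e.g. a causal mask for GPT-style decoding
    weights = F.softmax(scores, dim=-1)  # attention distribution over positions
    return weights @ v                   # weighted sum of value vectors

# Illustrative shapes: batch of 2, 4 heads, 8 tokens, 16-dim heads
q = k = v = torch.randn(2, 4, 8, 16)
print(scaled_dot_product_attention(q, k, v).shape)  # torch.Size([2, 4, 8, 16])
```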
Part II delves into data preparation and management, a crucial phase for building effective LLMs. It provides detailed guidance on sourcing and curating datasets, addressing biases, and ensuring data diversity. Techniques for data preprocessing, such as tokenization and normalization, are discussed along with methods for handling missing data and generating synthetic data. The section also covers data storage and management strategies to design scalable pipelines and ensure data security.
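As a small illustration of the preprocessing steps mentioned above, the sketch below normalizes raw text and tokenizes it with a pretrained BPE tokenizer. It assumes the Hugging Face transformers package is installed; the gpt2 tokenizer and the normalize helper are illustrative choices, not the book's pipeline.

```python
import unicodedata
from transformers import AutoTokenizer  # assumes the Hugging Face transformers package is installed

def normalize(text: str) -> str:
    # Unicode normalization plus simple case and whitespace cleanup
    text = unicodedata.normalize("NFKC", text)
    return " ".join(text.lower().split())

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # any pretrained BPE tokenizer works here

raw = "  Large  Language Models\u00a0(LLMs) learn from TEXT!  "
clean = normalize(raw)
ids = tokenizer(clean)["input_ids"]          # integer token ids fed to the model
print(clean)
print(tokenizer.convert_ids_to_tokens(ids))  # the subword pieces behind those ids
```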
In Part III, the focus shifts to the technical aspects of building the model: setting up the development environment, choosing an appropriate model architecture, and deciding between building from scratch and fine-tuning a pre-trained model. The book also offers guidance on training LLMs, including distributed training techniques and strategies for addressing common challenges such as overfitting and underfitting, and it covers hyperparameter tuning and optimization techniques to enhance model performance.
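As a toy illustration of the training concerns described above, the sketch below runs a tiny PyTorch training loop with weight decay and validation-based early stopping, two common ways to curb overfitting. The model, random data, and hyperparameters are placeholders for illustration only and bear no relation to training a real LLM.

```python
import torch
from torch import nn, optim

torch.manual_seed(0)

# Hypothetical toy task: predict the next token id from a window of 16 token ids
# drawn from a 1000-word vocabulary, using random data as a stand-in for a corpus.
vocab, window = 1000, 16
x_train = torch.randint(0, vocab, (512, window))
y_train = torch.randint(0, vocab, (512,))
x_val = torch.randint(0, vocab, (128, window))
y_val = torch.randint(0, vocab, (128,))

model = nn.Sequential(nn.Embedding(vocab, 32), nn.Flatten(), nn.Linear(32 * window, vocab))
optimizer = optim.AdamW(model.parameters(), lr=3e-4, weight_decay=0.01)  # weight decay helps curb overfitting
loss_fn = nn.CrossEntropyLoss()

best_val, patience, stale = float("inf"), 2, 0
for epoch in range(20):
    model.train()
    optimizer.zero_grad()
    loss = loss_fn(model(x_train), y_train)
    loss.backward()
    optimizer.step()

    model.eval()
    with torch.no_grad():
        val_loss = loss_fn(model(x_val), y_val).item()
    print(f"epoch {epoch}: train {loss.item():.3f}  val {val_loss:.3f}")

    # Early stopping: halt when validation loss stops improving (a common overfitting signal).
    if val_loss < best_val:
        best_val, stale = val_loss, 0
    else:
        stale += 1
        if stale > patience:
            break
```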
Part IV addresses evaluating and fine-tuning the model, emphasizing metrics for assessing model performance, fine-tuning techniques, and debugging strategies. It offers practical solutions for improving model accuracy and adapting it to specific use cases.
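One concrete example of such a metric is perplexity, the exponential of the mean per-token cross-entropy. The sketch below computes it from random placeholder logits; a real evaluation would use a trained model's outputs on held-out text.

```python
import math
import torch
from torch import nn

# Placeholder model output: scores for 64 token positions over a 1000-word vocabulary.
vocab, tokens = 1000, 64
logits = torch.randn(tokens, vocab)            # stand-in for real model logits
labels = torch.randint(0, vocab, (tokens,))    # stand-in for reference token ids

nll = nn.CrossEntropyLoss()(logits, labels)    # mean negative log-likelihood per token
perplexity = math.exp(nll.item())
print(f"perplexity: {perplexity:.1f}")         # on the order of the vocab size for an untrained model
```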
Finally, Part V explores deployment and maintenance strategies, including deployment options, monitoring, and securing LLMs in production environments. The book concludes with real-world case studies and examples demonstrating practical applications of LLMs across industries.
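To make the deployment discussion concrete, here is a minimal sketch of a text-generation HTTP endpoint with basic request logging as a stand-in for monitoring. It assumes FastAPI, pydantic, and the Hugging Face transformers pipeline are installed; the model name, route, and log format are illustrative choices, not the book's recommendations.

```python
import logging

from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

logging.basicConfig(level=logging.INFO)
app = FastAPI()
generator = pipeline("text-generation", model="gpt2")  # small model chosen purely for demonstration

class Prompt(BaseModel):
    text: str
    max_new_tokens: int = 50

@app.post("/generate")
def generate(prompt: Prompt):
    result = generator(prompt.text, max_new_tokens=prompt.max_new_tokens)[0]["generated_text"]
    # Minimal monitoring: log request and response sizes; production systems track latency, errors, drift, etc.
    logging.info("prompt_len=%d output_len=%d", len(prompt.text), len(result))
    return {"completion": result}

# Run with: uvicorn app:app --port 8000   (assuming this file is saved as app.py)
```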