Asif Abbasi, Olivier Bergeret, Joel Farvault
Genai on AWS
A Practical Approach to Building Generative AI Applications on AWS
Asif Abbasi, Olivier Bergeret, Joel Farvault
Genai on AWS
A Practical Approach to Building Generative AI Applications on AWS
- Broschiertes Buch
- Merkliste
- Auf die Merkliste
- Bewerten Bewerten
- Teilen
- Produkt teilen
- Produkterinnerung
- Produkterinnerung
The definitive guide to leveraging AWS for generative AI GenAI on AWS: A Practical Approach to Building Generative AI Applications on AWS is an essential guide for anyone looking to dive into the world of generative AI with the power of Amazon Web Services (AWS). Crafted by a team of experienced cloud and software engineers, this book offers a direct path to developing innovative AI applications. It lays down a hands-on roadmap filled with actionable strategies, enabling you to write secure, efficient, and reliable generative AI applications utilizing the latest AI capabilities on AWS. This…mehr
Andere Kunden interessierten sich auch für
- Olivier CardinDigitalization and Control of Industrial Cyber-Physical Systems173,99 €
- Wee Hyong TokPractical Weak Supervision75,99 €
- Applications of Machine Learning and Deep Learning on Biological Data154,99 €
- Deep Learning and Its Applications for Vehicle Networks143,99 €
- Shajulin BenedictEdge Intelligence209,99 €
- Digital Afterlife155,99 €
- Nicolas SabouretUnderstanding Artificial Intelligence122,99 €
-
-
-
The definitive guide to leveraging AWS for generative AI GenAI on AWS: A Practical Approach to Building Generative AI Applications on AWS is an essential guide for anyone looking to dive into the world of generative AI with the power of Amazon Web Services (AWS). Crafted by a team of experienced cloud and software engineers, this book offers a direct path to developing innovative AI applications. It lays down a hands-on roadmap filled with actionable strategies, enabling you to write secure, efficient, and reliable generative AI applications utilizing the latest AI capabilities on AWS. This comprehensive guide starts with the basics, making it accessible to both novices and seasoned professionals. You'll explore the history of artificial intelligence, understand the fundamentals of machine learning, and get acquainted with deep learning concepts. It also demonstrates how to harness AWS's extensive suite of generative AI tools effectively. Through practical examples and detailed explanations, the book empowers you to bring your generative AI projects to life on the AWS platform. In the book, you'll: * Gain invaluable insights from practicing cloud and software engineers on developing cutting-edge generative AI applications using AWS * Discover beginner-friendly introductions to AI and machine learning, coupled with advanced techniques for leveraging AWS's AI tools * Learn from a resource that's ideal for a broad audience, from technical professionals like cloud engineers and software developers to non-technical business leaders looking to innovate with AI Whether you're a cloud engineer, software developer, business leader, or simply an AI enthusiast, Gen AI on AWS is your gateway to mastering generative AI development on AWS. Seize this opportunity for an enduring competitive advantage in the rapidly evolving field of AI. Embark on your journey to building practical, impactful AI applications by grabbing a copy today.
Hinweis: Dieser Artikel kann nur an eine deutsche Lieferadresse ausgeliefert werden.
Hinweis: Dieser Artikel kann nur an eine deutsche Lieferadresse ausgeliefert werden.
Produktdetails
- Produktdetails
- Verlag: Wiley
- Seitenzahl: 320
- Erscheinungstermin: 8. April 2025
- Englisch
- ISBN-13: 9781394281282
- ISBN-10: 1394281285
- Artikelnr.: 70396262
- Herstellerkennzeichnung
- Libri GmbH
- Europaallee 1
- 36244 Bad Hersfeld
- gpsr@libri.de
- Verlag: Wiley
- Seitenzahl: 320
- Erscheinungstermin: 8. April 2025
- Englisch
- ISBN-13: 9781394281282
- ISBN-10: 1394281285
- Artikelnr.: 70396262
- Herstellerkennzeichnung
- Libri GmbH
- Europaallee 1
- 36244 Bad Hersfeld
- gpsr@libri.de
Acknowledgments xv
About the Authors xvii
Foreword xix
Introduction xxi
Chapter 1: A Brief History of AI 1
The Precursors of the Mechanical or "Formal" Reasoning 2
The Digital Computer Era 4
Cybernetics and the Beginning of the Robotic Era 6
Birth of AI and Symbolic AI (1955-1985) 10
Subsymbolic AI Era (1985-2010) 14
Deep Learning and LLM (2010-Present) 16
Key Takeaways 17
Chapter 2: Machine Learning 19
What Is Machine Learning? 19
Types of Machine Learning 20
Supervised Learning 21
Unsupervised and Semi-Supervised Learning 22
Reinforcement Learning 23
Methodology for Machine Learning 24
Implementation of Machine Learning 26
Machine Learning Applications 27
Natural Language Processing (NLP) 27
Computer Vision 27
Recommender System 27
Predictive Analytics 28
Fraud Detection 28
Machine Learning Frameworks and Libraries 28
TensorFlow 28
PyTorch 31
Scikit-learn 34
Keras 35
Apache Spark MLlib 37
Future Trends in Machine Learning 40
Rise of Edge Computing and Edge AI 40
Convergence with Emerging Technologies 40
Advancements in Unsupervised Learning,
Reinforcement Learning, and Generative Models 41
Increased Specialization and Customization 41
Explainable and Trustworthy AI 42
Key Takeaways 42
References 43
Chapter 3: Deep Learning 45
Deep Learning vs. Machine Learning 45
Computer Vision Example 46
Natural Language Processing Example 47
The History of Deep Learning 47
Understanding Deep Learning 52
Neurons 52
Weights and Biases 54
Layers 54
Activation Function(s) 55
An Introduction to the Perceptron 58
Overcoming Perceptron Limitations 59
FeedForward Neural Networks 60
Backpropagation 60
Parameters vs. Hyperparameters 60
Hyperparameters in Artificial Neural Networks 64
Loss Functions - a Measure of Success of a Neural Network 64
Optimization Algorithms 64
Neural Network Architectures 68
Putting It All Together 71
Deep Learning on AWS 71
Chipsets and EC2 Instances 71
AWS P5 Instances 72
AWS Inferentia 72
Amazon Elastic Inference 73
Prebuilt Containers: Deep Learning AMIs and Containers 74
Deep Learning AMIs 74
Deep Learning Containers 74
Managed Services for Building, Training, and Deployment 74
Pre-trained Services 75
Key Takeaways 77
References 77
Chapter 4: Introduction to Generative AI 79
Generative AI Core Technologies 80
Neural Networks 80
Generative Adversarial Networks (GANs) 80
Variational Autoencoders (VAEs) 81
Recurrent Neural Networks (RNNs) and
Long Short-Term Memory Networks (LSTMs) 82
Limitations of Recurrent Neural Networks 84
Transformer Models 85
Self-Attention 86
Parallelism 86
Diffusion Models 86
Autoregressive Models 87
Reinforcement Learning (RL) 87
Transfer Learning and Fine-Tuning 87
Optimization Algorithms 87
Transformer Architecture: Deep Dive 87
Deep Dive 89
Step 1: Tokenization (Preprocessing) 89
Step 2: Embedding 89
Step 3: Encoder 92
Step 4: Encoder Output to Decoder Input 97
Step 5: Decoder 98
Step 6: Translation Generation 99
Step 7: Detokenization 99
Terminology in Generative AI 99
Prompt 104
Inference 105
Context Window 106
Prompt Engineering 106
In-Context Learning (ICL) 107
Zero-Shot/One-Shot/Few-Shot Inference 108
Inference Configuration 109
Maximum Length 110
Diversity (Top P/Nucleus Sampling) 111
Top K 111
Randomness (Temperature) 112
System Prompts 112
Prompt Engineering 113
Key Elements of a Prompt 113
Designing Effective Prompts 114
Prompting Techniques 115
Zero-Shot Prompting 115
Few-Shot Prompting 115
Chain-Of-Thought Prompting 116
Advanced Prompting Techniques 117
Self-Consistency 118
Tree of Thoughts (ToT) 119
Retrieval-Augmented Generation (RAG) 120
Automatic Reasoning and Tool-Use (ART) 122
ReAct Prompting 123
Coherence Enhancement 124
Progressive Prompting 126
Handling Prompt Misuse 127
Prompt Injection 127
Prompt Leaking 128
Mitigating Bias 129
Mitigating Bias in Prompt Engineering 130
Generative AI Business Value 133
Building Value Within Your Enterprises 135
Technology: Creating a Flexible and Strong System 135
People: Training and Adapting the Team 135
Processes: Good Management and Fair Use of AI 136
Why a Solid Foundation Is Crucial 136
References 137
Chapter 5: Introduction to Foundation Models 139
Definition and Overview of Foundation Models 139
Characteristics of Foundation Models 142
Examples of Foundation Models 144
Types of Foundation Models 147
The Large Language Model (LLM) 154
Natural Language Processing 155
Early Approaches to NLP 156
Evolution toward Text-Based Foundation Model 160
Applications of Foundation Models 162
Challenges and Considerations 163
Infrastructure 163
Ethics 164
Areas of Evolution 165
Key Takeaways 167
References 168
Chapter 6: Introduction to Amazon SageMaker 169
Data Preparation and Processing 172
Data Preparation 172
Data Processing 173
Model Development 174
Model Training and Tuning 175
Model Deployment 177
Model Management 178
Security 179
Compliance and Governance 180
Model Explainability and Responsible AI 181
MLOps with Amazon SageMaker 181
Boost Your Generative AI Development with
SageMaker JumpStart 182
No-Code ML with Amazon SageMaker Canvas 182
Amazon Bedrock 184
Choosing the Right Strategy for the Development of
Your Generative AI Application with Amazon SageMaker 186
Conclusion 187
References 188
Chapter 7: Generative AI on AWS 191
AWS Services for Generative AI 192
Generative AI Trade-Off Triangle 192
How AWS Solves the Generative AI Trade-Off Triangle 192
Generative AI on AWS: The Fundamentals 193
Infrastructure for FM Training and Inference 194
Models and tools to build Generative AI Apps 194
Applications to boost productivity 195
Amazon Bedrock 196
Foundation Models with Bedrock 197
AI21 Labs - Jurassic 197
Amazon Titan 198
Anthropic's Claude 3 199
Cohere's Family of Models 201
Key Features of Cohere 201
Cohere Models on Amazon Bedrock 203
Meta's Family of Models - Llama 204
When to Use Which Model 207
Mistral's Family of Models 208
When to Use Which Model 209
Stability.ai's Family of Models - Stable Diffusion XL 1.0 209
Poolside Family of Models 210
Luma's Family of Models 211
Amazon's Nova Family of Models 212
Model Evaluation in Amazon Bedrock 213
Common Approaches to Customizing Your FMs 214
Amazon Bedrock Prompt Management 214
Amazon Bedrock Flows 216
Data Automation in Amazon Bedrock 219
GraphRAG in Amazon Bedrock 220
Knowledge Bases in Amazon Bedrock 222
How Knowledge Bases Work 223
Pre-Processing Data 224
Runtime Execution 224
Creating a Knowledge Base in Amazon Bedrock 225
Agents for Amazon Bedrock 225
How Agents Work 226
Components of an Agent at Build Time 226
Components of an Agent at Runtime 228
Guardrails for Amazon Bedrock 230
Security in Amazon Bedrock 231
Amazon Q 232
Amazon Q Business 232
Amazon Q in QuickSight 235
Amazon Q Developer 237
Amazon Q Connect 239
Amazon Q in AWS Supply Chain 240
Summary 241
Chapter 8: Customization of Your Foundation Model 243
Introduction to LLM Customization 244
Continued Pre-Training (Domain Adaptation Fine-Tuning) 244
Fine-Tuning 245
Prompt Engineering 245
Retrieval Augmented Generation (RAG) 246
Choosing Between These Customization Techniques 246
Cost of Customization 249
Customizing Foundation Models with AWS 250
Continuous Pre-Training with Amazon Bedrock 250
Creation of a Training and a Validation Dataset 250
Launch of a Continued Pre-Training Job 251
Analysis of Our Results and Adjustment of
Our Hyperparameters 252
Deployment of Our Model 254
Use Your Customized Model 255
Instruction Fine-Tuning with Amazon Bedrock 257
Instruction Fine-Tuning with Amazon SageMaker JumpStart 257
Conclusion 260
Chapter 9: Retrieval-Augmented Generation 263
What Is RAG? 263
Background and Motivation 264
Overview of RAG 266
Building a RAG Solution 269
Design Considerations 269
Best Practices 270
Common Patterns 271
Performance Optimization 271
Scaling Considerations 272
The Future of RAG Implementations 273
Retrieval Module 274
Retrieval Techniques and Algorithms 276
Augmentation Module 278
Generation Module 280
RAG on AWS 282
Custom Data Pipeline to Build RAG 284
Core Components of a RAG Pipeline 284
Implementation Approaches 286
Basic Solution: LangChain Implementation 286
Advanced Solution: Spark-Based Pipeline 287
Data Ingestion (Examples) 288
Parallel Processing (example) 289
Case Studies and Applications 290
Question-Answering Systems 290
Dialogue Systems 290
Knowledge-Intensive Tasks 291
Implementation Considerations and Best Practices 291
Challenges and Future Directions 292
Example Notebooks 293
References 293
Chapter 10: Generative AI on AWS Labs 295
Lab 1: Introduction to Generative AI with Bedrock 295
Option 1: PartyRock Prompt Engineering Guide
(for Non-Technical and Technical Audiences) 297
Option 2: Amazon Bedrock Labs (for Technical Audiences) 298
Overview of Amazon Bedrock and Streamlit 298
Supported Regions 298
Costs When Running from Your Own Account 298
Quotas When Running from Your Own Account 299
Time to Complete 299
Lab 2: Dive Deep into Gen AI with Amazon Bedrock 299
Lab 3: Building an Agentic LLM Assistant on AWS 300
What Is an Agentic LLM Assistant? 300
Why Build an Agentic LLM Assistant? 301
About This Workshop 301
Architecture 301
Labs 302
Lab 4: Retrieval-Augmented Generation Workshop 303
Managed RAG Workshop 304
Naive RAG Workshop 304
Advance RAG Workshop 304
Audience 304
Lab 5: Amazon Q for Business 304
Next Steps 307
Lab 6: Building a Natural Language Query Engine for Data Lakes 308
Reference 310
Chapter 11: Next Steps 311
The Future of Generative AI: Key Dimensions and
Staying Informed 311
Technical Evolution and Capabilities 312
The Evolution of Scale and Architecture 312
The Multimodal Revolution 312
The Efficiency Breakthrough 313
The Context Window Revolution 313
Real-time Processing and Generation 313
The Future Technological Landscape 314
Application Domains 314
Enterprise Applications: The Quiet Revolution 315
The Scientific Frontier: Accelerating Discovery 315
Healthcare: Personalized Medicine and Diagnosis 315
Education and Training: Personalizing Learning 316
Environmental Applications: Tackling Global Challenges 316
The Future of Applications 317
Ethical and Societal Implications 317
Digital Identity and Deep Fakes: The Crisis of Trust 318
Labor Markets and Economic Disruption 318
Privacy and Data Rights in the Age of AI 318
Bias and Fairness: The Hidden Challenges 319
Democratic Access and Digital Divides 319
Environmental and Sustainability Concerns 319
The Path Forward: Governance and Responsibility 319
Looking to the Future 320
Staying Current in the Rapidly Evolving AI Landscape 320
Glossary 323
Index
About the Authors xvii
Foreword xix
Introduction xxi
Chapter 1: A Brief History of AI 1
The Precursors of the Mechanical or "Formal" Reasoning 2
The Digital Computer Era 4
Cybernetics and the Beginning of the Robotic Era 6
Birth of AI and Symbolic AI (1955-1985) 10
Subsymbolic AI Era (1985-2010) 14
Deep Learning and LLM (2010-Present) 16
Key Takeaways 17
Chapter 2: Machine Learning 19
What Is Machine Learning? 19
Types of Machine Learning 20
Supervised Learning 21
Unsupervised and Semi-Supervised Learning 22
Reinforcement Learning 23
Methodology for Machine Learning 24
Implementation of Machine Learning 26
Machine Learning Applications 27
Natural Language Processing (NLP) 27
Computer Vision 27
Recommender System 27
Predictive Analytics 28
Fraud Detection 28
Machine Learning Frameworks and Libraries 28
TensorFlow 28
PyTorch 31
Scikit-learn 34
Keras 35
Apache Spark MLlib 37
Future Trends in Machine Learning 40
Rise of Edge Computing and Edge AI 40
Convergence with Emerging Technologies 40
Advancements in Unsupervised Learning,
Reinforcement Learning, and Generative Models 41
Increased Specialization and Customization 41
Explainable and Trustworthy AI 42
Key Takeaways 42
References 43
Chapter 3: Deep Learning 45
Deep Learning vs. Machine Learning 45
Computer Vision Example 46
Natural Language Processing Example 47
The History of Deep Learning 47
Understanding Deep Learning 52
Neurons 52
Weights and Biases 54
Layers 54
Activation Function(s) 55
An Introduction to the Perceptron 58
Overcoming Perceptron Limitations 59
FeedForward Neural Networks 60
Backpropagation 60
Parameters vs. Hyperparameters 60
Hyperparameters in Artificial Neural Networks 64
Loss Functions - a Measure of Success of a Neural Network 64
Optimization Algorithms 64
Neural Network Architectures 68
Putting It All Together 71
Deep Learning on AWS 71
Chipsets and EC2 Instances 71
AWS P5 Instances 72
AWS Inferentia 72
Amazon Elastic Inference 73
Prebuilt Containers: Deep Learning AMIs and Containers 74
Deep Learning AMIs 74
Deep Learning Containers 74
Managed Services for Building, Training, and Deployment 74
Pre-trained Services 75
Key Takeaways 77
References 77
Chapter 4: Introduction to Generative AI 79
Generative AI Core Technologies 80
Neural Networks 80
Generative Adversarial Networks (GANs) 80
Variational Autoencoders (VAEs) 81
Recurrent Neural Networks (RNNs) and
Long Short-Term Memory Networks (LSTMs) 82
Limitations of Recurrent Neural Networks 84
Transformer Models 85
Self-Attention 86
Parallelism 86
Diffusion Models 86
Autoregressive Models 87
Reinforcement Learning (RL) 87
Transfer Learning and Fine-Tuning 87
Optimization Algorithms 87
Transformer Architecture: Deep Dive 87
Deep Dive 89
Step 1: Tokenization (Preprocessing) 89
Step 2: Embedding 89
Step 3: Encoder 92
Step 4: Encoder Output to Decoder Input 97
Step 5: Decoder 98
Step 6: Translation Generation 99
Step 7: Detokenization 99
Terminology in Generative AI 99
Prompt 104
Inference 105
Context Window 106
Prompt Engineering 106
In-Context Learning (ICL) 107
Zero-Shot/One-Shot/Few-Shot Inference 108
Inference Configuration 109
Maximum Length 110
Diversity (Top P/Nucleus Sampling) 111
Top K 111
Randomness (Temperature) 112
System Prompts 112
Prompt Engineering 113
Key Elements of a Prompt 113
Designing Effective Prompts 114
Prompting Techniques 115
Zero-Shot Prompting 115
Few-Shot Prompting 115
Chain-Of-Thought Prompting 116
Advanced Prompting Techniques 117
Self-Consistency 118
Tree of Thoughts (ToT) 119
Retrieval-Augmented Generation (RAG) 120
Automatic Reasoning and Tool-Use (ART) 122
ReAct Prompting 123
Coherence Enhancement 124
Progressive Prompting 126
Handling Prompt Misuse 127
Prompt Injection 127
Prompt Leaking 128
Mitigating Bias 129
Mitigating Bias in Prompt Engineering 130
Generative AI Business Value 133
Building Value Within Your Enterprises 135
Technology: Creating a Flexible and Strong System 135
People: Training and Adapting the Team 135
Processes: Good Management and Fair Use of AI 136
Why a Solid Foundation Is Crucial 136
References 137
Chapter 5: Introduction to Foundation Models 139
Definition and Overview of Foundation Models 139
Characteristics of Foundation Models 142
Examples of Foundation Models 144
Types of Foundation Models 147
The Large Language Model (LLM) 154
Natural Language Processing 155
Early Approaches to NLP 156
Evolution toward Text-Based Foundation Model 160
Applications of Foundation Models 162
Challenges and Considerations 163
Infrastructure 163
Ethics 164
Areas of Evolution 165
Key Takeaways 167
References 168
Chapter 6: Introduction to Amazon SageMaker 169
Data Preparation and Processing 172
Data Preparation 172
Data Processing 173
Model Development 174
Model Training and Tuning 175
Model Deployment 177
Model Management 178
Security 179
Compliance and Governance 180
Model Explainability and Responsible AI 181
MLOps with Amazon SageMaker 181
Boost Your Generative AI Development with
SageMaker JumpStart 182
No-Code ML with Amazon SageMaker Canvas 182
Amazon Bedrock 184
Choosing the Right Strategy for the Development of
Your Generative AI Application with Amazon SageMaker 186
Conclusion 187
References 188
Chapter 7: Generative AI on AWS 191
AWS Services for Generative AI 192
Generative AI Trade-Off Triangle 192
How AWS Solves the Generative AI Trade-Off Triangle 192
Generative AI on AWS: The Fundamentals 193
Infrastructure for FM Training and Inference 194
Models and tools to build Generative AI Apps 194
Applications to boost productivity 195
Amazon Bedrock 196
Foundation Models with Bedrock 197
AI21 Labs - Jurassic 197
Amazon Titan 198
Anthropic's Claude 3 199
Cohere's Family of Models 201
Key Features of Cohere 201
Cohere Models on Amazon Bedrock 203
Meta's Family of Models - Llama 204
When to Use Which Model 207
Mistral's Family of Models 208
When to Use Which Model 209
Stability.ai's Family of Models - Stable Diffusion XL 1.0 209
Poolside Family of Models 210
Luma's Family of Models 211
Amazon's Nova Family of Models 212
Model Evaluation in Amazon Bedrock 213
Common Approaches to Customizing Your FMs 214
Amazon Bedrock Prompt Management 214
Amazon Bedrock Flows 216
Data Automation in Amazon Bedrock 219
GraphRAG in Amazon Bedrock 220
Knowledge Bases in Amazon Bedrock 222
How Knowledge Bases Work 223
Pre-Processing Data 224
Runtime Execution 224
Creating a Knowledge Base in Amazon Bedrock 225
Agents for Amazon Bedrock 225
How Agents Work 226
Components of an Agent at Build Time 226
Components of an Agent at Runtime 228
Guardrails for Amazon Bedrock 230
Security in Amazon Bedrock 231
Amazon Q 232
Amazon Q Business 232
Amazon Q in QuickSight 235
Amazon Q Developer 237
Amazon Q Connect 239
Amazon Q in AWS Supply Chain 240
Summary 241
Chapter 8: Customization of Your Foundation Model 243
Introduction to LLM Customization 244
Continued Pre-Training (Domain Adaptation Fine-Tuning) 244
Fine-Tuning 245
Prompt Engineering 245
Retrieval Augmented Generation (RAG) 246
Choosing Between These Customization Techniques 246
Cost of Customization 249
Customizing Foundation Models with AWS 250
Continuous Pre-Training with Amazon Bedrock 250
Creation of a Training and a Validation Dataset 250
Launch of a Continued Pre-Training Job 251
Analysis of Our Results and Adjustment of
Our Hyperparameters 252
Deployment of Our Model 254
Use Your Customized Model 255
Instruction Fine-Tuning with Amazon Bedrock 257
Instruction Fine-Tuning with Amazon SageMaker JumpStart 257
Conclusion 260
Chapter 9: Retrieval-Augmented Generation 263
What Is RAG? 263
Background and Motivation 264
Overview of RAG 266
Building a RAG Solution 269
Design Considerations 269
Best Practices 270
Common Patterns 271
Performance Optimization 271
Scaling Considerations 272
The Future of RAG Implementations 273
Retrieval Module 274
Retrieval Techniques and Algorithms 276
Augmentation Module 278
Generation Module 280
RAG on AWS 282
Custom Data Pipeline to Build RAG 284
Core Components of a RAG Pipeline 284
Implementation Approaches 286
Basic Solution: LangChain Implementation 286
Advanced Solution: Spark-Based Pipeline 287
Data Ingestion (Examples) 288
Parallel Processing (example) 289
Case Studies and Applications 290
Question-Answering Systems 290
Dialogue Systems 290
Knowledge-Intensive Tasks 291
Implementation Considerations and Best Practices 291
Challenges and Future Directions 292
Example Notebooks 293
References 293
Chapter 10: Generative AI on AWS Labs 295
Lab 1: Introduction to Generative AI with Bedrock 295
Option 1: PartyRock Prompt Engineering Guide
(for Non-Technical and Technical Audiences) 297
Option 2: Amazon Bedrock Labs (for Technical Audiences) 298
Overview of Amazon Bedrock and Streamlit 298
Supported Regions 298
Costs When Running from Your Own Account 298
Quotas When Running from Your Own Account 299
Time to Complete 299
Lab 2: Dive Deep into Gen AI with Amazon Bedrock 299
Lab 3: Building an Agentic LLM Assistant on AWS 300
What Is an Agentic LLM Assistant? 300
Why Build an Agentic LLM Assistant? 301
About This Workshop 301
Architecture 301
Labs 302
Lab 4: Retrieval-Augmented Generation Workshop 303
Managed RAG Workshop 304
Naive RAG Workshop 304
Advance RAG Workshop 304
Audience 304
Lab 5: Amazon Q for Business 304
Next Steps 307
Lab 6: Building a Natural Language Query Engine for Data Lakes 308
Reference 310
Chapter 11: Next Steps 311
The Future of Generative AI: Key Dimensions and
Staying Informed 311
Technical Evolution and Capabilities 312
The Evolution of Scale and Architecture 312
The Multimodal Revolution 312
The Efficiency Breakthrough 313
The Context Window Revolution 313
Real-time Processing and Generation 313
The Future Technological Landscape 314
Application Domains 314
Enterprise Applications: The Quiet Revolution 315
The Scientific Frontier: Accelerating Discovery 315
Healthcare: Personalized Medicine and Diagnosis 315
Education and Training: Personalizing Learning 316
Environmental Applications: Tackling Global Challenges 316
The Future of Applications 317
Ethical and Societal Implications 317
Digital Identity and Deep Fakes: The Crisis of Trust 318
Labor Markets and Economic Disruption 318
Privacy and Data Rights in the Age of AI 318
Bias and Fairness: The Hidden Challenges 319
Democratic Access and Digital Divides 319
Environmental and Sustainability Concerns 319
The Path Forward: Governance and Responsibility 319
Looking to the Future 320
Staying Current in the Rapidly Evolving AI Landscape 320
Glossary 323
Index
Acknowledgments xv
About the Authors xvii
Foreword xix
Introduction xxi
Chapter 1: A Brief History of AI 1
The Precursors of the Mechanical or "Formal" Reasoning 2
The Digital Computer Era 4
Cybernetics and the Beginning of the Robotic Era 6
Birth of AI and Symbolic AI (1955-1985) 10
Subsymbolic AI Era (1985-2010) 14
Deep Learning and LLM (2010-Present) 16
Key Takeaways 17
Chapter 2: Machine Learning 19
What Is Machine Learning? 19
Types of Machine Learning 20
Supervised Learning 21
Unsupervised and Semi-Supervised Learning 22
Reinforcement Learning 23
Methodology for Machine Learning 24
Implementation of Machine Learning 26
Machine Learning Applications 27
Natural Language Processing (NLP) 27
Computer Vision 27
Recommender System 27
Predictive Analytics 28
Fraud Detection 28
Machine Learning Frameworks and Libraries 28
TensorFlow 28
PyTorch 31
Scikit-learn 34
Keras 35
Apache Spark MLlib 37
Future Trends in Machine Learning 40
Rise of Edge Computing and Edge AI 40
Convergence with Emerging Technologies 40
Advancements in Unsupervised Learning,
Reinforcement Learning, and Generative Models 41
Increased Specialization and Customization 41
Explainable and Trustworthy AI 42
Key Takeaways 42
References 43
Chapter 3: Deep Learning 45
Deep Learning vs. Machine Learning 45
Computer Vision Example 46
Natural Language Processing Example 47
The History of Deep Learning 47
Understanding Deep Learning 52
Neurons 52
Weights and Biases 54
Layers 54
Activation Function(s) 55
An Introduction to the Perceptron 58
Overcoming Perceptron Limitations 59
FeedForward Neural Networks 60
Backpropagation 60
Parameters vs. Hyperparameters 60
Hyperparameters in Artificial Neural Networks 64
Loss Functions - a Measure of Success of a Neural Network 64
Optimization Algorithms 64
Neural Network Architectures 68
Putting It All Together 71
Deep Learning on AWS 71
Chipsets and EC2 Instances 71
AWS P5 Instances 72
AWS Inferentia 72
Amazon Elastic Inference 73
Prebuilt Containers: Deep Learning AMIs and Containers 74
Deep Learning AMIs 74
Deep Learning Containers 74
Managed Services for Building, Training, and Deployment 74
Pre-trained Services 75
Key Takeaways 77
References 77
Chapter 4: Introduction to Generative AI 79
Generative AI Core Technologies 80
Neural Networks 80
Generative Adversarial Networks (GANs) 80
Variational Autoencoders (VAEs) 81
Recurrent Neural Networks (RNNs) and
Long Short-Term Memory Networks (LSTMs) 82
Limitations of Recurrent Neural Networks 84
Transformer Models 85
Self-Attention 86
Parallelism 86
Diffusion Models 86
Autoregressive Models 87
Reinforcement Learning (RL) 87
Transfer Learning and Fine-Tuning 87
Optimization Algorithms 87
Transformer Architecture: Deep Dive 87
Deep Dive 89
Step 1: Tokenization (Preprocessing) 89
Step 2: Embedding 89
Step 3: Encoder 92
Step 4: Encoder Output to Decoder Input 97
Step 5: Decoder 98
Step 6: Translation Generation 99
Step 7: Detokenization 99
Terminology in Generative AI 99
Prompt 104
Inference 105
Context Window 106
Prompt Engineering 106
In-Context Learning (ICL) 107
Zero-Shot/One-Shot/Few-Shot Inference 108
Inference Configuration 109
Maximum Length 110
Diversity (Top P/Nucleus Sampling) 111
Top K 111
Randomness (Temperature) 112
System Prompts 112
Prompt Engineering 113
Key Elements of a Prompt 113
Designing Effective Prompts 114
Prompting Techniques 115
Zero-Shot Prompting 115
Few-Shot Prompting 115
Chain-Of-Thought Prompting 116
Advanced Prompting Techniques 117
Self-Consistency 118
Tree of Thoughts (ToT) 119
Retrieval-Augmented Generation (RAG) 120
Automatic Reasoning and Tool-Use (ART) 122
ReAct Prompting 123
Coherence Enhancement 124
Progressive Prompting 126
Handling Prompt Misuse 127
Prompt Injection 127
Prompt Leaking 128
Mitigating Bias 129
Mitigating Bias in Prompt Engineering 130
Generative AI Business Value 133
Building Value Within Your Enterprises 135
Technology: Creating a Flexible and Strong System 135
People: Training and Adapting the Team 135
Processes: Good Management and Fair Use of AI 136
Why a Solid Foundation Is Crucial 136
References 137
Chapter 5: Introduction to Foundation Models 139
Definition and Overview of Foundation Models 139
Characteristics of Foundation Models 142
Examples of Foundation Models 144
Types of Foundation Models 147
The Large Language Model (LLM) 154
Natural Language Processing 155
Early Approaches to NLP 156
Evolution toward Text-Based Foundation Model 160
Applications of Foundation Models 162
Challenges and Considerations 163
Infrastructure 163
Ethics 164
Areas of Evolution 165
Key Takeaways 167
References 168
Chapter 6: Introduction to Amazon SageMaker 169
Data Preparation and Processing 172
Data Preparation 172
Data Processing 173
Model Development 174
Model Training and Tuning 175
Model Deployment 177
Model Management 178
Security 179
Compliance and Governance 180
Model Explainability and Responsible AI 181
MLOps with Amazon SageMaker 181
Boost Your Generative AI Development with
SageMaker JumpStart 182
No-Code ML with Amazon SageMaker Canvas 182
Amazon Bedrock 184
Choosing the Right Strategy for the Development of
Your Generative AI Application with Amazon SageMaker 186
Conclusion 187
References 188
Chapter 7: Generative AI on AWS 191
AWS Services for Generative AI 192
Generative AI Trade-Off Triangle 192
How AWS Solves the Generative AI Trade-Off Triangle 192
Generative AI on AWS: The Fundamentals 193
Infrastructure for FM Training and Inference 194
Models and tools to build Generative AI Apps 194
Applications to boost productivity 195
Amazon Bedrock 196
Foundation Models with Bedrock 197
AI21 Labs - Jurassic 197
Amazon Titan 198
Anthropic's Claude 3 199
Cohere's Family of Models 201
Key Features of Cohere 201
Cohere Models on Amazon Bedrock 203
Meta's Family of Models - Llama 204
When to Use Which Model 207
Mistral's Family of Models 208
When to Use Which Model 209
Stability.ai's Family of Models - Stable Diffusion XL 1.0 209
Poolside Family of Models 210
Luma's Family of Models 211
Amazon's Nova Family of Models 212
Model Evaluation in Amazon Bedrock 213
Common Approaches to Customizing Your FMs 214
Amazon Bedrock Prompt Management 214
Amazon Bedrock Flows 216
Data Automation in Amazon Bedrock 219
GraphRAG in Amazon Bedrock 220
Knowledge Bases in Amazon Bedrock 222
How Knowledge Bases Work 223
Pre-Processing Data 224
Runtime Execution 224
Creating a Knowledge Base in Amazon Bedrock 225
Agents for Amazon Bedrock 225
How Agents Work 226
Components of an Agent at Build Time 226
Components of an Agent at Runtime 228
Guardrails for Amazon Bedrock 230
Security in Amazon Bedrock 231
Amazon Q 232
Amazon Q Business 232
Amazon Q in QuickSight 235
Amazon Q Developer 237
Amazon Q Connect 239
Amazon Q in AWS Supply Chain 240
Summary 241
Chapter 8: Customization of Your Foundation Model 243
Introduction to LLM Customization 244
Continued Pre-Training (Domain Adaptation Fine-Tuning) 244
Fine-Tuning 245
Prompt Engineering 245
Retrieval Augmented Generation (RAG) 246
Choosing Between These Customization Techniques 246
Cost of Customization 249
Customizing Foundation Models with AWS 250
Continuous Pre-Training with Amazon Bedrock 250
Creation of a Training and a Validation Dataset 250
Launch of a Continued Pre-Training Job 251
Analysis of Our Results and Adjustment of
Our Hyperparameters 252
Deployment of Our Model 254
Use Your Customized Model 255
Instruction Fine-Tuning with Amazon Bedrock 257
Instruction Fine-Tuning with Amazon SageMaker JumpStart 257
Conclusion 260
Chapter 9: Retrieval-Augmented Generation 263
What Is RAG? 263
Background and Motivation 264
Overview of RAG 266
Building a RAG Solution 269
Design Considerations 269
Best Practices 270
Common Patterns 271
Performance Optimization 271
Scaling Considerations 272
The Future of RAG Implementations 273
Retrieval Module 274
Retrieval Techniques and Algorithms 276
Augmentation Module 278
Generation Module 280
RAG on AWS 282
Custom Data Pipeline to Build RAG 284
Core Components of a RAG Pipeline 284
Implementation Approaches 286
Basic Solution: LangChain Implementation 286
Advanced Solution: Spark-Based Pipeline 287
Data Ingestion (Examples) 288
Parallel Processing (example) 289
Case Studies and Applications 290
Question-Answering Systems 290
Dialogue Systems 290
Knowledge-Intensive Tasks 291
Implementation Considerations and Best Practices 291
Challenges and Future Directions 292
Example Notebooks 293
References 293
Chapter 10: Generative AI on AWS Labs 295
Lab 1: Introduction to Generative AI with Bedrock 295
Option 1: PartyRock Prompt Engineering Guide
(for Non-Technical and Technical Audiences) 297
Option 2: Amazon Bedrock Labs (for Technical Audiences) 298
Overview of Amazon Bedrock and Streamlit 298
Supported Regions 298
Costs When Running from Your Own Account 298
Quotas When Running from Your Own Account 299
Time to Complete 299
Lab 2: Dive Deep into Gen AI with Amazon Bedrock 299
Lab 3: Building an Agentic LLM Assistant on AWS 300
What Is an Agentic LLM Assistant? 300
Why Build an Agentic LLM Assistant? 301
About This Workshop 301
Architecture 301
Labs 302
Lab 4: Retrieval-Augmented Generation Workshop 303
Managed RAG Workshop 304
Naive RAG Workshop 304
Advance RAG Workshop 304
Audience 304
Lab 5: Amazon Q for Business 304
Next Steps 307
Lab 6: Building a Natural Language Query Engine for Data Lakes 308
Reference 310
Chapter 11: Next Steps 311
The Future of Generative AI: Key Dimensions and
Staying Informed 311
Technical Evolution and Capabilities 312
The Evolution of Scale and Architecture 312
The Multimodal Revolution 312
The Efficiency Breakthrough 313
The Context Window Revolution 313
Real-time Processing and Generation 313
The Future Technological Landscape 314
Application Domains 314
Enterprise Applications: The Quiet Revolution 315
The Scientific Frontier: Accelerating Discovery 315
Healthcare: Personalized Medicine and Diagnosis 315
Education and Training: Personalizing Learning 316
Environmental Applications: Tackling Global Challenges 316
The Future of Applications 317
Ethical and Societal Implications 317
Digital Identity and Deep Fakes: The Crisis of Trust 318
Labor Markets and Economic Disruption 318
Privacy and Data Rights in the Age of AI 318
Bias and Fairness: The Hidden Challenges 319
Democratic Access and Digital Divides 319
Environmental and Sustainability Concerns 319
The Path Forward: Governance and Responsibility 319
Looking to the Future 320
Staying Current in the Rapidly Evolving AI Landscape 320
Glossary 323
Index
About the Authors xvii
Foreword xix
Introduction xxi
Chapter 1: A Brief History of AI 1
The Precursors of the Mechanical or "Formal" Reasoning 2
The Digital Computer Era 4
Cybernetics and the Beginning of the Robotic Era 6
Birth of AI and Symbolic AI (1955-1985) 10
Subsymbolic AI Era (1985-2010) 14
Deep Learning and LLM (2010-Present) 16
Key Takeaways 17
Chapter 2: Machine Learning 19
What Is Machine Learning? 19
Types of Machine Learning 20
Supervised Learning 21
Unsupervised and Semi-Supervised Learning 22
Reinforcement Learning 23
Methodology for Machine Learning 24
Implementation of Machine Learning 26
Machine Learning Applications 27
Natural Language Processing (NLP) 27
Computer Vision 27
Recommender System 27
Predictive Analytics 28
Fraud Detection 28
Machine Learning Frameworks and Libraries 28
TensorFlow 28
PyTorch 31
Scikit-learn 34
Keras 35
Apache Spark MLlib 37
Future Trends in Machine Learning 40
Rise of Edge Computing and Edge AI 40
Convergence with Emerging Technologies 40
Advancements in Unsupervised Learning,
Reinforcement Learning, and Generative Models 41
Increased Specialization and Customization 41
Explainable and Trustworthy AI 42
Key Takeaways 42
References 43
Chapter 3: Deep Learning 45
Deep Learning vs. Machine Learning 45
Computer Vision Example 46
Natural Language Processing Example 47
The History of Deep Learning 47
Understanding Deep Learning 52
Neurons 52
Weights and Biases 54
Layers 54
Activation Function(s) 55
An Introduction to the Perceptron 58
Overcoming Perceptron Limitations 59
FeedForward Neural Networks 60
Backpropagation 60
Parameters vs. Hyperparameters 60
Hyperparameters in Artificial Neural Networks 64
Loss Functions - a Measure of Success of a Neural Network 64
Optimization Algorithms 64
Neural Network Architectures 68
Putting It All Together 71
Deep Learning on AWS 71
Chipsets and EC2 Instances 71
AWS P5 Instances 72
AWS Inferentia 72
Amazon Elastic Inference 73
Prebuilt Containers: Deep Learning AMIs and Containers 74
Deep Learning AMIs 74
Deep Learning Containers 74
Managed Services for Building, Training, and Deployment 74
Pre-trained Services 75
Key Takeaways 77
References 77
Chapter 4: Introduction to Generative AI 79
Generative AI Core Technologies 80
Neural Networks 80
Generative Adversarial Networks (GANs) 80
Variational Autoencoders (VAEs) 81
Recurrent Neural Networks (RNNs) and
Long Short-Term Memory Networks (LSTMs) 82
Limitations of Recurrent Neural Networks 84
Transformer Models 85
Self-Attention 86
Parallelism 86
Diffusion Models 86
Autoregressive Models 87
Reinforcement Learning (RL) 87
Transfer Learning and Fine-Tuning 87
Optimization Algorithms 87
Transformer Architecture: Deep Dive 87
Deep Dive 89
Step 1: Tokenization (Preprocessing) 89
Step 2: Embedding 89
Step 3: Encoder 92
Step 4: Encoder Output to Decoder Input 97
Step 5: Decoder 98
Step 6: Translation Generation 99
Step 7: Detokenization 99
Terminology in Generative AI 99
Prompt 104
Inference 105
Context Window 106
Prompt Engineering 106
In-Context Learning (ICL) 107
Zero-Shot/One-Shot/Few-Shot Inference 108
Inference Configuration 109
Maximum Length 110
Diversity (Top P/Nucleus Sampling) 111
Top K 111
Randomness (Temperature) 112
System Prompts 112
Prompt Engineering 113
Key Elements of a Prompt 113
Designing Effective Prompts 114
Prompting Techniques 115
Zero-Shot Prompting 115
Few-Shot Prompting 115
Chain-Of-Thought Prompting 116
Advanced Prompting Techniques 117
Self-Consistency 118
Tree of Thoughts (ToT) 119
Retrieval-Augmented Generation (RAG) 120
Automatic Reasoning and Tool-Use (ART) 122
ReAct Prompting 123
Coherence Enhancement 124
Progressive Prompting 126
Handling Prompt Misuse 127
Prompt Injection 127
Prompt Leaking 128
Mitigating Bias 129
Mitigating Bias in Prompt Engineering 130
Generative AI Business Value 133
Building Value Within Your Enterprises 135
Technology: Creating a Flexible and Strong System 135
People: Training and Adapting the Team 135
Processes: Good Management and Fair Use of AI 136
Why a Solid Foundation Is Crucial 136
References 137
Chapter 5: Introduction to Foundation Models 139
Definition and Overview of Foundation Models 139
Characteristics of Foundation Models 142
Examples of Foundation Models 144
Types of Foundation Models 147
The Large Language Model (LLM) 154
Natural Language Processing 155
Early Approaches to NLP 156
Evolution toward Text-Based Foundation Model 160
Applications of Foundation Models 162
Challenges and Considerations 163
Infrastructure 163
Ethics 164
Areas of Evolution 165
Key Takeaways 167
References 168
Chapter 6: Introduction to Amazon SageMaker 169
Data Preparation and Processing 172
Data Preparation 172
Data Processing 173
Model Development 174
Model Training and Tuning 175
Model Deployment 177
Model Management 178
Security 179
Compliance and Governance 180
Model Explainability and Responsible AI 181
MLOps with Amazon SageMaker 181
Boost Your Generative AI Development with
SageMaker JumpStart 182
No-Code ML with Amazon SageMaker Canvas 182
Amazon Bedrock 184
Choosing the Right Strategy for the Development of
Your Generative AI Application with Amazon SageMaker 186
Conclusion 187
References 188
Chapter 7: Generative AI on AWS 191
AWS Services for Generative AI 192
Generative AI Trade-Off Triangle 192
How AWS Solves the Generative AI Trade-Off Triangle 192
Generative AI on AWS: The Fundamentals 193
Infrastructure for FM Training and Inference 194
Models and tools to build Generative AI Apps 194
Applications to boost productivity 195
Amazon Bedrock 196
Foundation Models with Bedrock 197
AI21 Labs - Jurassic 197
Amazon Titan 198
Anthropic's Claude 3 199
Cohere's Family of Models 201
Key Features of Cohere 201
Cohere Models on Amazon Bedrock 203
Meta's Family of Models - Llama 204
When to Use Which Model 207
Mistral's Family of Models 208
When to Use Which Model 209
Stability.ai's Family of Models - Stable Diffusion XL 1.0 209
Poolside Family of Models 210
Luma's Family of Models 211
Amazon's Nova Family of Models 212
Model Evaluation in Amazon Bedrock 213
Common Approaches to Customizing Your FMs 214
Amazon Bedrock Prompt Management 214
Amazon Bedrock Flows 216
Data Automation in Amazon Bedrock 219
GraphRAG in Amazon Bedrock 220
Knowledge Bases in Amazon Bedrock 222
How Knowledge Bases Work 223
Pre-Processing Data 224
Runtime Execution 224
Creating a Knowledge Base in Amazon Bedrock 225
Agents for Amazon Bedrock 225
How Agents Work 226
Components of an Agent at Build Time 226
Components of an Agent at Runtime 228
Guardrails for Amazon Bedrock 230
Security in Amazon Bedrock 231
Amazon Q 232
Amazon Q Business 232
Amazon Q in QuickSight 235
Amazon Q Developer 237
Amazon Q Connect 239
Amazon Q in AWS Supply Chain 240
Summary 241
Chapter 8: Customization of Your Foundation Model 243
Introduction to LLM Customization 244
Continued Pre-Training (Domain Adaptation Fine-Tuning) 244
Fine-Tuning 245
Prompt Engineering 245
Retrieval Augmented Generation (RAG) 246
Choosing Between These Customization Techniques 246
Cost of Customization 249
Customizing Foundation Models with AWS 250
Continuous Pre-Training with Amazon Bedrock 250
Creation of a Training and a Validation Dataset 250
Launch of a Continued Pre-Training Job 251
Analysis of Our Results and Adjustment of
Our Hyperparameters 252
Deployment of Our Model 254
Use Your Customized Model 255
Instruction Fine-Tuning with Amazon Bedrock 257
Instruction Fine-Tuning with Amazon SageMaker JumpStart 257
Conclusion 260
Chapter 9: Retrieval-Augmented Generation 263
What Is RAG? 263
Background and Motivation 264
Overview of RAG 266
Building a RAG Solution 269
Design Considerations 269
Best Practices 270
Common Patterns 271
Performance Optimization 271
Scaling Considerations 272
The Future of RAG Implementations 273
Retrieval Module 274
Retrieval Techniques and Algorithms 276
Augmentation Module 278
Generation Module 280
RAG on AWS 282
Custom Data Pipeline to Build RAG 284
Core Components of a RAG Pipeline 284
Implementation Approaches 286
Basic Solution: LangChain Implementation 286
Advanced Solution: Spark-Based Pipeline 287
Data Ingestion (Examples) 288
Parallel Processing (example) 289
Case Studies and Applications 290
Question-Answering Systems 290
Dialogue Systems 290
Knowledge-Intensive Tasks 291
Implementation Considerations and Best Practices 291
Challenges and Future Directions 292
Example Notebooks 293
References 293
Chapter 10: Generative AI on AWS Labs 295
Lab 1: Introduction to Generative AI with Bedrock 295
Option 1: PartyRock Prompt Engineering Guide
(for Non-Technical and Technical Audiences) 297
Option 2: Amazon Bedrock Labs (for Technical Audiences) 298
Overview of Amazon Bedrock and Streamlit 298
Supported Regions 298
Costs When Running from Your Own Account 298
Quotas When Running from Your Own Account 299
Time to Complete 299
Lab 2: Dive Deep into Gen AI with Amazon Bedrock 299
Lab 3: Building an Agentic LLM Assistant on AWS 300
What Is an Agentic LLM Assistant? 300
Why Build an Agentic LLM Assistant? 301
About This Workshop 301
Architecture 301
Labs 302
Lab 4: Retrieval-Augmented Generation Workshop 303
Managed RAG Workshop 304
Naive RAG Workshop 304
Advance RAG Workshop 304
Audience 304
Lab 5: Amazon Q for Business 304
Next Steps 307
Lab 6: Building a Natural Language Query Engine for Data Lakes 308
Reference 310
Chapter 11: Next Steps 311
The Future of Generative AI: Key Dimensions and
Staying Informed 311
Technical Evolution and Capabilities 312
The Evolution of Scale and Architecture 312
The Multimodal Revolution 312
The Efficiency Breakthrough 313
The Context Window Revolution 313
Real-time Processing and Generation 313
The Future Technological Landscape 314
Application Domains 314
Enterprise Applications: The Quiet Revolution 315
The Scientific Frontier: Accelerating Discovery 315
Healthcare: Personalized Medicine and Diagnosis 315
Education and Training: Personalizing Learning 316
Environmental Applications: Tackling Global Challenges 316
The Future of Applications 317
Ethical and Societal Implications 317
Digital Identity and Deep Fakes: The Crisis of Trust 318
Labor Markets and Economic Disruption 318
Privacy and Data Rights in the Age of AI 318
Bias and Fairness: The Hidden Challenges 319
Democratic Access and Digital Divides 319
Environmental and Sustainability Concerns 319
The Path Forward: Governance and Responsibility 319
Looking to the Future 320
Staying Current in the Rapidly Evolving AI Landscape 320
Glossary 323
Index