Q: What are large-language foundation models?
A: Large-language foundation models are generative AI models trained on vast amounts of public data from the internet, spanning many languages and topics. Through this training they develop a deep understanding of human language and encode broad knowledge across many domains in their parameters.
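Here is a minimal sketch of prompting a pretrained foundation model, assuming the Hugging Face `transformers` library; the small public `gpt2` model stands in for the much larger foundation models discussed here.

```python
from transformers import pipeline

# gpt2 is a small, public stand-in for much larger foundation models.
generator = pipeline("text-generation", model="gpt2")

completion = generator("Generative AI on AWS helps teams", max_new_tokens=30)
print(completion[0]["generated_text"])
```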
Q: Why are pretraining datasets important in model development?
A: Pretraining datasets are crucial because they supply the bulk of the data (often terabytes or even petabytes of text) that a generative model learns from. Popular sources include Wikipedia and Common Crawl.
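For illustration, here is a minimal sketch of streaming a public pretraining corpus, assuming the Hugging Face `datasets` library and the `allenai/c4` cleaned Common Crawl snapshot; any large text corpus follows the same pattern.

```python
from datasets import load_dataset

# Stream the corpus so the full multi-terabyte dataset is never downloaded at once.
c4 = load_dataset("allenai/c4", "en", split="train", streaming=True)

for i, example in enumerate(c4):
    print(example["text"][:200])  # first 200 characters of each document
    if i == 2:
        break
```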
Q: How do tokenizers function in these models?
A: Tokenizers in these models convert human-readable text into a sequence of token_ids, where each token_id indexes a token in the model's vocabulary. This numeric representation of the text is what the model actually processes.
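A minimal sketch of this conversion, assuming the Hugging Face `transformers` library and the public `gpt2` tokenizer as an example:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")

text = "Generative AI on AWS"
token_ids = tokenizer.encode(text)                   # text -> list of token_ids
tokens = tokenizer.convert_ids_to_tokens(token_ids)  # the subword tokens those ids represent

print(token_ids)                    # each id indexes the model's vocabulary
print(tokens)
print(tokenizer.decode(token_ids))  # token_ids -> back to text
```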
Q: What is the significance of embedding vectors in large-language models?
A: Embedding vectors in large-language models are crucial because they represent entities (text, images, video, audio) as points in a high-dimensional vector space. They encode the meaning and context of tokens, which lets the model capture human language and the semantic similarity between words.
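A minimal sketch of semantic similarity with embedding vectors, assuming the `sentence-transformers` library and its public `all-MiniLM-L6-v2` model; any embedding model follows the same pattern.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

sentences = [
    "The cat sat on the mat.",
    "A kitten is resting on a rug.",
    "Quarterly revenue grew by 10%.",
]

embeddings = model.encode(sentences)           # one high-dimensional vector per sentence
scores = util.cos_sim(embeddings, embeddings)  # pairwise cosine similarity

print(scores)  # semantically similar sentences score higher than the unrelated one
```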
Q: Can you describe the transformer architecture used in these models?
A: The Transformer architecture, central to modern language models such as BERT and GPT, consists of an input token context window, embeddings, encoder and decoder stacks built from self-attention layers, and a softmax output layer. It is used to build contextual understanding of language during pretraining and fine-tuning, and to generate model completions to input prompts at inference time.
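The core operation is scaled dot-product self-attention. Below is a minimal NumPy sketch of a single attention head; real implementations add multiple heads, per-layer learned projections, residual connections, and layer normalization.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, W_q, W_k, W_v):
    Q, K, V = X @ W_q, X @ W_k, X @ W_v      # project tokens to queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])  # how strongly each token attends to the others
    weights = softmax(scores)                # attention weights; each row sums to 1
    return weights @ V                       # context-aware representation of each token

seq_len, d_model = 4, 8                      # 4 tokens, 8-dimensional embeddings
rng = np.random.default_rng(0)
X = rng.normal(size=(seq_len, d_model))      # token embeddings for one sequence
W_q, W_k, W_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))

print(self_attention(X, W_q, W_k, W_v).shape)  # (4, 8): one updated vector per token
```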
Q: How do input and context windows affect model performance?
A: The input/context window size defines the number of tokens a model can process at once and significantly affects performance: a larger window lets the model understand and generate more contextually rich text. Modern context windows span hundreds of thousands of tokens.
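A minimal sketch of how a fixed context window constrains input length, again assuming the Hugging Face `transformers` library and the `gpt2` tokenizer (whose model has a 1,024-token context window):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
context_window = 1024  # maximum number of tokens gpt2 can attend to at once

prompt = "Summarize the following document: " + "lorem ipsum " * 2000
token_ids = tokenizer.encode(prompt)  # may warn that this exceeds the model's maximum length
print(len(token_ids), "tokens in the raw prompt")

# Anything beyond the window must be truncated, chunked, or retrieved on demand.
truncated = token_ids[:context_window]
print(len(truncated), "tokens fit in the context window")
```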
Q: What are scaling laws in the context of large-language models?
A: Scaling laws describe the relationship between model size, dataset size, and compute budget for generative models. They suggest that increasing the number of training tokens or model parameters improves performance, but doing so requires a larger compute budget, typically measured in total floating point operations (FLOPs).
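As a back-of-the-envelope example, a widely used rule of thumb estimates training compute as C ≈ 6 × N × D FLOPs, where N is the parameter count and D is the number of training tokens; the exact constant varies, so treat the result as an order-of-magnitude estimate.

```python
N = 70e9    # 70-billion-parameter model
D = 1.4e12  # 1.4 trillion training tokens

C = 6 * N * D  # rough total floating point operations for one training run
print(f"~{C:.2e} FLOPs")  # on the order of 10^23 floating point operations
```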
Q: What challenges do large-language models face in terms of computational resources?
A: Large generative models face challenges such as GPU memory limitations and distributed-computing overhead. Techniques like quantization reduce memory requirements, while distributed-computing strategies such as fully sharded data parallel (FSDP) scale model training efficiently across multiple GPUs.
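A minimal sketch of reducing GPU memory with quantization, assuming the Hugging Face `transformers` integration with `bitsandbytes` and a CUDA GPU; the `facebook/opt-350m` model is illustrative, and the multi-GPU FSDP setup is omitted here.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # store weights in 4-bit precision
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bfloat16 for numerical stability
)

model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",                    # illustrative; larger models benefit most
    quantization_config=quant_config,
    device_map="auto",                      # place layers on available GPUs/CPU
)

print(model.get_memory_footprint(), "bytes for the quantized weights")
```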
- Chapter 1 - Generative AI Use Cases, Fundamentals, Project Lifecycle
- Chapter 2 - Prompt Engineering and In-Context Learning
- Chapter 3 - Large-Language Foundation Models
- Chapter 4 - Quantization and Distributed Computing
- Chapter 5 - Fine-Tuning and Evaluation
- Chapter 6 - Parameter-Efficient Fine-Tuning (PEFT)
- Chapter 7 - Fine-Tuning with Reinforcement Learning from Human Feedback (RLHF)
- Chapter 8 - Optimize and Deploy Generative AI Applications
- Chapter 9 - Retrieval Augmented Generation (RAG) and Agents
- Chapter 10 - Multimodal Foundation Models
- Chapter 11 - Controlled Generation and Fine-Tuning with Stable Diffusion
- Chapter 12 - Amazon Bedrock Managed Service for Generative AI
- YouTube Channel: https://youtube.generativeaionaws.com
- Generative AI on AWS Meetup (Global, Virtual): https://meetup.generativeaionaws.com
- Generative AI on AWS O'Reilly Book: https://www.amazon.com/Generative-AI-AWS-Multimodal-Applications/dp/1098159225/
- Data Science on AWS O'Reilly Book: https://www.amazon.com/Data-Science-AWS-End-End/dp/1492079391/