The Practical, Step-by-Step Guide to Using LLMs at Scale in Projects and Products
Large Language Models (LLMs) like Llama 3, Claude 3, and the GPT family are demonstrating breathtaking capabilities, but their size and complexity have deterred many practitioners from applying them. In Quick Start Guide to Large Language Models, Second Edition, pioneering data scientist and AI entrepreneur Sinan Ozdemir clears away those obstacles and provides a guide to working with, integrating, and deploying LLMs to solve practical problems.
Ozdemir brings together all you need to get started, even if you have no direct experience with LLMs: step-by-step instructions, best practices, real-world case studies, and hands-on exercises. Along the way, he shares insights into LLMs' inner workings to help you optimize model choice, data formats, prompting, fine-tuning, performance, and much more. The resources on the companion website include sample datasets and up-to-date code for working with open- and closed-source LLMs such as those from OpenAI (GPT-4 and GPT-3.5), Google (BERT, T5, and Gemini), X (Grok), Anthropic (the Claude family), Cohere (the Command family), and Meta (BART and the LLaMA family).
A refreshing and inspiring resource. Jam-packed with practical guidance and clear explanations that leave you smarter about this incredible new field.
--Pete Huang, author of The Neuron
Register your book for convenient access to downloads, updates, and/or corrections as they become available. See inside book for details.
The Practical, Step-by-Step Guide to Using LLMs at Scale in Projects and Products
Large Language Models (LLMs) like ChatGPT are demonstrating breathtaking capabilities, but their size and complexity have deterred many practitioners from applying them. In Quick Start Guide to Large Language Models, pioneering data scientist and AI entrepreneur Sinan Ozdemir clears away those obstacles and provides a guide to working with, integrating, and deploying LLMs to solve practical problems.
Ozdemir brings together all you need to get started, even if you have no direct experience with LLMs: step-by-step instructions, best practices, real-world case studies, hands-on exercises, and more. Along the way, he shares insights into LLMs' inner workings to help you optimize model choice, data formats, parameters, and performance. You'll find even more resources on the companion website, including sample datasets and code for working with open- and closed-source LLMs such as those from OpenAI (GPT-4 and ChatGPT), Google (BERT, T5, and Bard), EleutherAI (GPT-J and GPT-Neo), Cohere (the Command family), and Meta (BART and the LLaMA family).
By balancing the potential of both open- and closed-source models, Quick Start Guide to Large Language Models stands as a comprehensive guide to understanding and using LLMs, bridging the gap between theoretical concepts and practical application.
--Giada Pistilli, Principal Ethicist at HuggingFace
A refreshing and inspiring resource. Jam-packed with practical guidance and clear explanations that leave you smarter about this incredible new field.
--Pete Huang, author of The Neuron
Register your book for convenient access to downloads, updates, and/or corrections as they become available. See inside book for details.
Learn the techniques and math you need to start making sense of your data
Key Features:
Book Description:
Need to turn your skills at programming into effective data science skills? Principles of Data Science is created to help you join the dots between mathematics, programming, and business analysis. With this book, you'll feel confident about asking-and answering-complex and sophisticated questions of your data to move from abstract and raw statistics to actionable ideas.
With a unique approach that bridges the gap between mathematics and computer science, this books takes you through the entire data science pipeline. Beginning with cleaning and preparing data, and effective data mining strategies and techniques, you'll move on to build a comprehensive picture of how every piece of the data science puzzle fits together. Learn the fundamentals of computational mathematics and statistics, as well as some pseudocode being used today by data scientists and analysts. You'll get to grips with machine learning, discover the statistical models that help you take control and navigate even the densest datasets, and find out how to create powerful visualizations that communicate what your data means.
What you will learn:
Who this book is for:
You should be fairly well acquainted with basic algebra and should feel comfortable reading snippets of R/Python as well as pseudo code. You should have the urge to learn and apply the techniques put forth in this book on either your own data sets or those provided to you. If you have the basic math skills but want to apply them in data science or you have good programming skills but lack math, then this book is for you.
Transform your data into insights with must-know techniques and mathematical concepts to unravel the secrets hidden within your data
Key Features:
- Learn practical data science combined with data theory to gain maximum insights from data
- Discover methods for deploying actionable machine learning pipelines while mitigating biases in data and models
- Explore actionable case studies to put your new skills to use immediately
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description:
Principles of Data Science bridges mathematics, programming, and business analysis, empowering you to confidently pose and address complex data questions and construct effective machine learning pipelines. This book will equip you with the tools to transform abstract concepts and raw statistics into actionable insights.
Starting with cleaning and preparation, you'll explore effective data mining strategies and techniques before moving on to building a holistic picture of how every piece of the data science puzzle fits together. Throughout the book, you'll discover statistical models with which you can control and navigate even the densest or the sparsest of datasets and learn how to create powerful visualizations that communicate the stories hidden in your data.
With a focus on application, this edition covers advanced transfer learning and pre-trained models for NLP and vision tasks. You'll get to grips with advanced techniques for mitigating algorithmic bias in data as well as models and addressing model and data drift. Finally, you'll explore medium-level data governance, including data provenance, privacy, and deletion request handling.
By the end of this data science book, you'll have learned the fundamentals of computational mathematics and statistics, all while navigating the intricacies of modern ML and large pre-trained models like GPT and BERT.
What You Will Learn:
- Master the fundamentals steps of data science through practical examples
- Bridge the gap between math and programming using advanced statistics and ML
- Harness probability, calculus, and models for effective data control
- Explore transformative modern ML with large language models
- Evaluate ML success with impactful metrics and MLOps
- Create compelling visuals that convey actionable insights
- Quantify and mitigate biases in data and ML models
Who this book is for:
If you are an aspiring novice data scientist eager to expand your knowledge, this book is for you. Whether you have basic math skills and want to apply them in the field of data science, or you excel in programming but lack the necessary mathematical foundations, you'll find this book useful. Familiarity with Python programming will further enhance your learning experience.
Table of Contents
- Data Science Terminology
- Types of Data
- The Five Steps of Data Science
- Basic Mathematics
- Impossible or Improbable - A Gentle Introduction to Probability
- Advanced Probability
- What are the Chances? An Introduction to Statistics
- Advanced Statistics
- Communicating Data
- How to Tell if Your Toaster is Learning - Machine Learning Essentials
- Predictions Don't Grow on Trees, or Do They?
- Introduction to Transfer Learning and Pre-trained Models
- Mitigating Algorithmic Bias and Tackling Model and Data Drift
- AI Governance
- Navigating Real-World Data Science Case Studies in Action
A perfect guide to speed up the predicting power of machine learning algorithms
Key Features:
Book Description:
Feature engineering is the most important step in creating powerful machine learning systems. This book will take you through the entire feature-engineering journey to make your machine learning much more systematic and effective.
You will start with understanding your data-often the success of your ML models depends on how you leverage different feature types, such as continuous, categorical, and more, You will learn when to include a feature, when to omit it, and why, all by understanding error analysis and the acceptability of your models. You will learn to convert a problem statement into useful new features. You will learn to deliver features driven by business needs as well as mathematical insights. You'll also learn how to use machine learning on your machines, automatically learning amazing features for your data.
By the end of the book, you will become proficient in Feature Selection, Feature Learning, and Feature Optimization.
What You Will Learn:
Identify and leverage different feature types
Clean features in data to improve predictive power
Understand why and how to perform feature selection, and model error analysis
Leverage domain knowledge to construct new features
Deliver features based on mathematical insights
Use machine-learning algorithms to construct features
Master feature engineering and optimization
Harness feature engineering for real world applications through a structured case study
Who this book is for:
If you are a data science professional or a machine learning engineer looking to strengthen your predictive analytics model, then this book is a perfect guide for you. Some basic understanding of the machine learning concepts and Python scripting would be enough to get started with this book.