SKIP THE SHIPPING
Use code NOSHIP during checkout to save 40% on eligible eBooks, now through January 5. Shop now.
Register your product to gain access to bonus material or receive a coupon.
Video accessible from your Account page after purchase.
20 Hours of Video Instruction
Data Science Fundamentals LiveLessons teaches you the foundational concepts, theory, and techniques you need to know to become an effective data scientist. The videos present you with applied, example-driven lessons in Python and its associated ecosystem of libraries, where you get your hands dirty with real datasets and see real results.
Description
This first course of the two-course series focuses on the fundamentals of acquiring, parsing, validating, and wrangling data with Python and its associated ecosystem of libraries. After an introduction to Data Science as a field and a primer on the Python programming language, you walk through the data science process by building a simple recommendation system. After this introduction, you dive deeper into each of the specific steps involved in the first half of the data science process–mainly how to acquire, transform, and store data (often referred to as an ETL pipeline). You learn how to download data that is openly accessible on the Internet by working with APIs and websites, and how to parse this XML and JSON data. With this structured data, you learn how to build data models, store and query data, and work with relational databases. Along the way, you learn the fundamentals of programing with Python (including object-oriented programming and the standard library) as well as the best practices of building sustainable data science applications.
Skill Level
Introduction
Lesson 1: Introduction to Data Science with Python
Topics
1.1 Welcome to the Course
1.2 Why Data Science and Why Now?
1.3 The Potential of Data Science
1.4 Getting Set Up with a Data Science Development Environment
1.5 A Python (3) Primer
1.6 Python 2 versus Python 3
1.7 Test Your Knowledge: Wordbuzz
1.8 Wordbuzz: Putting It All Together
1.9 Python Review and Resources
1.10 Python for Data Science
1.11 What’s to Come
Lesson 2: The Data Science Process—Building Your First Application
Topics
2.1 Introduction to the Data Science Process
2.2 Defining Your Problem
2.3 Acquiring Data
2.4 Wrangling Data
2.5 Exploring Data
2.6 Recommendations through Triangle Closing
2.7 Python Development Workflow
2.8 Triadic Closure in Python
2.9 Challenges of Recommendation Systems
2.10 Obtaining an Evaluation Baseline
2.11 Inspecting and Evaluating Results
2.12 Present and Disseminate
2.13 The Data Science Process Applied: Cheaper Beds, Better Breakfasts
Lesson 3: Acquiring Data—Sources and Methods
Topics
3.1 The Data Science Mindset
3.2 The Data Science Technology Stack
3.3 Where to Get Data—Sources and Services
3.4 How the Web Works
3.5 Making HTTP Requests with Python
3.6 Adding Content with Open Data
3.7 Parsing Data with Python—JSON and XML
3.8 Data and File Formats
3.9 Working with APIs
3.10 Parametric API Requests with Python
3.11 Exploring the Foursquare API
3.12 Downloading Foursquare Venues
Lesson 4: Adding Structure—Parsing Data and Data Models
Topics
4.1 Introduction to the ETL Pipeline
4.2 Data Models—Adding Structure to Data
4.3 Building Abstractions—Object-Oriented Programming
4.4 Creating Classes in Python
4.5 Defining Methods and Updating State
4.6 Magic Methods, Class Attributes, and Introspection
4.7 Exploring and Structuring the Foursquare Response
4.8 Data Models Applied—Representing Foursquare Entities with Classes
4.9 Modeling Behavior with Methods
4.10 Customizing Model Interfaces with Setter Methods and Virtual Attributes
4.11 Keeping Things DRY with Inheritance
4.12 OOP Use Cases
4.13 The Case for (and against) OOP
Lesson 5: Storing Data—Persistence with Relational Databases
Topics
5.1 Introduction to Databases with SQLite
5.2 Inspecting Databases with the SQLite Shell
5.3 The Database Landscape
5.4 What's in a Schema?—Mapping Data Models to Data Tables
5.5 Introduction to Object Relational Mappers
5.6 ORMs in Python with peewee
5.7 Creating and Querying Records with peewee
5.8 End-to-end ETL in Python
5.9 Advantages and Disadvantages of ORMs
5.10 Extract, Transform, Load—Putting It All Together
Lesson 6: Validating Data—Provenance and Quality Control
Topics
6.1 Introduction to Exploratory Data Analysis
6.2 Understanding Your Data Quickly with Graphical Tools
6.3 Inspecting Databases and Building Schemas with pewee
6.4 Data Quality Checks with peewee
6.5 Finding Missing Data and Null Values with peewee
6.6 Dealing with Missing Data
6.7 EDA for Insight–Describing Data
6.8 Inspecting Queries and Displaying Results in peewee
6.9 Groups and Aggregates with peewee
6.10 Ranking and Sorting Venues
6.11 SQL Relations and Joins
6.12 Joins with peewee
6.13 Querying Across Datasets with Joins
6.14 Translating peewee to SQL
6.15 A Visual Introduction to Joins with SQL