Practical Data Science Cookbook - Real-World Data Science Projects to Help You Get Your Hands On Your Data

Practical Data Science Cookbook - Real-World Data Science Projects to Help You Get Your Hands On Your Data

Tony Ojeda, Sean Patrick Murphy, Benjamin Bengfort, Abhijit Dasgupta

Language: English

Pages: 448

ISBN: 1783980249

Format: PDF / Kindle (mobi) / ePub

Key Features

  • Learn how to tackle every step in the data science pipeline and use it to acquire, clean, analyze, and visualize data
  • Get beyond the theory with real-world projects
  • Expand your numerical programming skills through step-by-step code examples and learn more about the robust features of R and Python

Book Description

Data's value has grown exponentially in the past decade, with 'Big Data' today being one of the biggest buzzwords in business and IT, and data scientist hailed as 'the sexiest job of the 21st century'. Practical Data Science Cookbook helps you see beyond the hype and get past the theory by providing you with a hands-on exploration of data science. With a comprehensive range of recipes designed to help you learn fundamental data science tasks, you'll uncover practical steps to help you produce powerful insights into Big Data using R and Python.

Use this valuable data science book to discover tricks and techniques to get to grips with your data. Learn effective data visualization with an automobile fuel efficiency data project, analyze football statistics, learn how to create data simulations, and get to grips with stock market data to learn data modelling. Find out how to produce sharp insights into social media data by following data science tutorials that demonstrate the best ways to tackle Twitter data, and uncover recipes that will help you dive in and explore Big Data through movie recommendation databases.

Practical Data Science Cookbook is your essential companion to the real-world challenges of working with data, created to give you a deeper insight into a world of Big Data that promises to keep growing.

What you will learn

  • Follow the recipes in this essential data science cookbook to learn the fundamentals of data science and data analysis
  • Put theory into practice with a selection of real-world Big Data projects
  • Learn the data science pipeline and successfully structure your data science project
  • Find out how to clean, munge, and manipulate data
  • Learn different approaches to data modelling and how to determine the most appropriate for your data
  • Learn numerical computing with NumPy and SciPy

About the Authors

Tony Ojeda is the founder of District Data Labs, a cofounder of Data Community DC, and is actively involved in promoting data science education through both organizations.

Sean Patrick Murphy spent 15 years as a senior scientist at The Johns Hopkins University Applied Physics Laboratory, where he focused on machine learning, modeling and simulation, signal processing, and high performance computing in the Cloud. Now, he acts as an advisor and data consultant for companies in SF, NY, and DC.

Benjamin Bengfort has worked in military, industry, and academia for the past 8 years. He is currently pursuing his PhD in Computer Science at the University of Maryland, College Park, researching Metacognition and Natural Language Processing.

Abhijit Dasgupta is a data consultant working in the greater DC-Maryland-Virginia area, with several years of experience in biomedical consulting, business analytics, bioinformatics, and bioengineering consulting.

Table of Contents

  1. Preparing Your Data Science Environment
  2. Driving Visual Analysis with Automobile Data (R)
  3. Simulating American Football Data (R)
  4. Modeling Stock Market Data (R)
  5. Visually Exploring Employment Data (R)
  6. Creating Application-oriented Analyses Using Tax Data (Python)
  7. Driving Visual Analyses with Automobile Data (Python)
  8. Working with Social Graphs (Python)
  9. Recommending Movies at Scale (Python)
  10. Harvesting and Geolocating Twitter Data (Python)
  11. Optimizing Numerical Code w




















Show sample text content

Download sample