Cleaning Data for Effective Data Science: Doing the other 80% of the work with Python, R, and commandline tools

$39.51
SKU: DADAX1801071292
ISBN: 9781801071291
Publisher: Packt Publishing
Availability: In Stock

Sold by Ergodemedia, an authorized reseller of Authentic New & Used Books with Free US Shipping.

30-day returns by mail  ·  Refunded to original payment method  |  support@ergodemedia.com

Shipping Information
  • Free Standard Shipping — United States only
  • Processing Time: 1–3 business days
  • Estimated Delivery: 3–5 business days after dispatch via USPS / UPS
  • Securely packed to ensure your book arrives in the described condition
  • Tracking number sent via email once dispatched
  • Taxes calculated at checkout. International shipping not available.
Returns & Refund

Returns accepted within 30 days of delivery. Returns are processed by mail. Refunds are issued to the original payment method within 5–7 business days of receiving the returned item.

Damaged, Defective or Misrepresented Item

Free return shipping by mail · Full refund to original payment method

Wrong Item Received

Free return shipping by mail · Full refund or replacement at your choice

Change of Mind

Return shipping at customer's expense · Book must be in the same condition as received · Refund to original payment method

All returns require a Return Authorization (RA) number before sending. Original shipping charges are non-refundable.

To initiate a return, contact us:

support@ergodemedia.com  ·  +1 832-802-7787
Safety & Compliance

California Proposition 65 Warning

Some products sold on this website may expose you to chemicals known to the State of California to cause cancer, birth defects, or other reproductive harm.

www.P65Warnings.ca.gov

Book Condition & Care Notice

Used books are graded and described accurately — condition details are listed on each product page. Books may contain previous owner's handwriting, highlights, or stamps unless stated as new. Store books away from direct sunlight and moisture to preserve their condition.

New books are sealed or unread. Used books are inspected before dispatch.

Product Authenticity & Notice

All books sold by Ergodemedia are 100% authentic, sourced directly from publishers and trusted distributors. Book condition is accurately graded and described. Some books may contain previous owner's markings or inscriptions.

Ergodemedia — Authentic New & Used Books. Free US Shipping. Delivered to Your Door.

Description

A comprehensive guide for data scientists to master effective data cleaning tools and techniques

Key Features:
  • Think about your data intelligently and ask the right questions
  • Master data cleaning techniques using hands-on examples belonging to diverse domains
  • Work with detailed, commented, well-tested code samples in Python and R

Book Description:

In data science, data analysis, or machine learning, most of the effort needed to achieve your actual purpose lies in cleaning your data. Using Python, R, and command-line tools, you will learn the essential cleaning steps performed in every production data science or data analysis pipeline. This book not only teaches you data preparation but also what questions you should ask of your data.

The book dives into the practical application of tools and techniques needed for data ingestion, anomaly detection, value imputation, and feature engineering. It also offers long-form exercises at the end of each chapter to practice the skills acquired.

You will begin by looking at data ingestion of a range of data formats. Moving on, you will impute missing values, detect unreliable data and statistical anomalies, and generate synthetic features that are necessary for successful data analysis and visualization goals.

By the end of this book, you will have acquired a firm understanding of the data cleaning process necessary to perform real-world data science and machine learning tasks.

What You Will Learn:
  • Ingest and work with common tabular, hierarchical, and other data formats
  • Apply useful rules and heuristics for assessing data quality and detecting bias
  • Identify and handle unreliable data and outliers in their many forms
  • Impute sensible values into missing data and use sampling to fix imbalances
  • Generate synthetic features that help to draw out patterns in your data
  • Prepare data competently and correctly for analytic and machine learning tasks

Who this book is for:

This book is designed to benefit software developers, data scientists, aspiring data scientists, and students who are interested in data analysis or scientific computing. Basic familiarity with statistics, general concepts in machine learning, knowledge of a programming language (Python or R), and some exposure to data science are helpful. The text will also be helpful to intermediate and advanced data scientists who want to improve their rigor in data hygiene and wish for a refresher on data preparation issues.

Product Notice

This book is sold in used condition unless explicitly stated as new. Condition is graded and described accurately. Some books may contain previous owner's markings, highlights, or inscriptions. This product may contain chemicals known to the State of California to cause cancer or reproductive harm. For more information, visit www.P65Warnings.ca.gov
