Machine learning with dataframes
Updated May 12, 2026 - Python
Simple tools for data cleaning in R
Tutorial material on machine learning with dirty data in Python
Synthetic dirty data generator
Missing data handling: visualize and impute
Precise object change detection library - Automatically tracks property changes with zero intrusion.
Cleaning the NIH chest x-ray dataset using an image classifier.
A machine learning model for the auto_mpg_dirty data that compares five prediction models (linear regression, random forest, tuned random forest, bagging, and tuned bagging) to find the best one, after preprocessing with a column transformer.
SQL-based data cleaning and transformation of employee training datasets, focusing on handling missing values, correcting inconsistencies, and optimizing data quality for analysis
Data wrangling using Python and SQL
Transforms raw, messy data into a clean and reliable dataset, ready for insightful analysis.
CLI to generate relational synthetic data with realistic chaos – nulls, duplicates, drift, and messy formatting.
Cleaning a Wikipedia table generated by a web scraping script in Python.
Flexible JSON decoding for Go — gracefully handling schema variations and forgiving mistakes.
A Python library for iterative and interactive data wrangling at laptop-scale.
Smart disk space analyzer for macOS — dirty data detection, duplicate finder, safety-first cleanup. Built with SwiftUI.