PyWrangle¶
PyWrangle is an open-source Python library for data wrangling. Wikipedia defines data wrangling as follows:
- Data wrangling is the process of transforming and mapping data from one “raw” data form into another format with the intent of making it more appropriate and valuable for a variety of downstream purposes such as analytics.
PyWrangle helps:
Clean strings
Track DataFrame changes
Identify & correct string data entry errors
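To make the first and third tasks concrete, here is a minimal sketch written with the standard library only. It does not use or reproduce PyWrangle's API: the function names `clean_string` and `suggest_corrections` are hypothetical, and `difflib` stands in for whatever fuzzy-matching approach the library applies. The idea is to normalize strings first, then map rare spellings onto a more frequent, closely matching value.

```python
import difflib
from collections import Counter


def clean_string(value):
    """Trim, collapse internal whitespace, and lower-case a string."""
    return " ".join(value.split()).lower()


def suggest_corrections(values, cutoff=0.8):
    """Map likely data entry errors to a more frequent close match.

    Hypothetical helper for illustration; not part of PyWrangle.
    """
    counts = Counter(clean_string(v) for v in values)
    canonical = []  # spellings accepted so far, most frequent first
    corrections = {}
    for value, _ in counts.most_common():
        match = difflib.get_close_matches(value, canonical, n=1, cutoff=cutoff)
        if match:
            # A more frequent, similar spelling already exists.
            corrections[value] = match[0]
        else:
            canonical.append(value)
    return corrections


cities = ["New York", "new york ", "New Yrok", "Boston", "boston"]
print(suggest_corrections(cities))
```

The `cutoff` value is a judgment call: lower values catch more typos but also merge strings that are genuinely different, so any suggested correction should be reviewed before it is applied.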
PyWrangle is available on PyPI.
Install & Requirements¶
Requirements¶
Python >= 3.8
numpy >= 1.14.4
pandas >= 1.0.3
fuzzywuzzy >= 0.18.0
python-levenshtein >= 0.12.0
metaphone >= 0.6
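One way to check an environment against these minimums is sketched below, using only the standard library (`importlib.metadata`, available from Python 3.8). This is an illustrative helper, not something PyWrangle ships: the names `parse_version`, `is_at_least`, and `check_requirements` are hypothetical, the distribution names are assumed to match the list above, and the version parsing is deliberately simplified (it reads only the leading numeric components and ignores pre-release tags).

```python
import re
from importlib import metadata

# Minimum versions copied from the requirements list above.
MINIMUM_VERSIONS = {
    "numpy": "1.14.4",
    "pandas": "1.0.3",
    "fuzzywuzzy": "0.18.0",
    "python-levenshtein": "0.12.0",
    "metaphone": "0.6",
}


def parse_version(text):
    """Return the leading numeric components of a version as a tuple."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        return ()
    return tuple(int(part) for part in match.group().split("."))


def is_at_least(installed, minimum):
    """Compare two version strings after padding them to equal length."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_requirements(minimums):
    """Report 'ok', 'too old', or 'missing' for each distribution."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        report[name] = "ok" if is_at_least(installed, minimum) else "too old"
    return report


if __name__ == "__main__":
    for name, status in check_requirements(MINIMUM_VERSIONS).items():
        print(f"{name}: {status}")
```

For anything beyond a quick sanity check, a full version parser such as the one in the `packaging` project handles pre-releases and other edge cases that this sketch ignores.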
Convention¶
Following the convention of Python data science libraries, import pywrangle as follows:
>>> import pywrangle as pw