My data manipulation library includes functions built on top of popular Python libraries such as Pandas and PySpark.
Data Manipulation is a Python package of utility functions. It is organized into subpackages, with utilities built for popular packages such as Pandas, PySpark and others.
The source code is currently hosted on GitHub at: https://github.com/shawnngtq/data-manipulation
pip install data-manipulation
The dependencies are declared in setup.py and are installed automatically along with this package.
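To confirm that the installation succeeded, the installed version can be looked up from Python. The snippet below is a minimal sketch using only the standard library's `importlib.metadata` (Python 3.8+). The helper name `installed_version` is hypothetical and not part of this package; the distribution name `data-manipulation` is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper for illustration; it is not provided by this package.
    """
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "data-manipulation" is the name used in the pip install command.
    found = installed_version("data-manipulation")
    if found is None:
        print("data-manipulation is not installed")
    else:
        print(f"data-manipulation {found} is installed")
```

Looking up the distribution metadata avoids guessing the import name, which may differ from the name used with pip.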
For usage questions, the best place to go is Stack Overflow.
Most development discussion takes place on GitHub in this repo.
All contributions, bug reports, bug fixes, documentation improvements, enhancements and ideas are welcome.
As a contributor or maintainer of this project, you are expected to abide by our code of conduct. More information can be found at: Contributor Code of Conduct