![]() |
11 months ago | |
---|---|---|
bin | 11 months ago | |
info | 11 months ago | |
notebooks | 11 months ago | |
transport | 11 months ago | |
.gitignore | 1 year ago | |
README.md | 1 year ago | |
requirements.txt | 7 years ago | |
setup.py | 11 months ago |
This project implements an abstraction of objects that can have access to a variety of data stores, implementing read/write with a simple and expressive interface. This abstraction works with NoSQL, SQL and Cloud data stores and leverages pandas.
Mostly data scientists that don't really care about the underlying database and would like a simple and consistent way to read/write and move data are well served. Additionally we implemented lightweight Extract Transform Loading API and command line (CLI) tool. Finally it is possible to add pre/post processing pipeline functions to read/write
Within the virtual environment perform the following :
pip install git+https://github.com/lnyemba/data-transport.git
We have available notebooks with sample code to read/write against mongodb, couchdb, Netezza, PostgreSQL, Google Bigquery, Databricks, Microsoft SQL Server, MySQL ... Visit data-transport homepage