First steps¶
Warning
This tutorial was written for Bonobo 0.5, while the current stable version is Bonobo 0.6.
Please be aware that some things changed.
A summary of changes is available in the migration guide from 0.5 to 0.6.
What is Bonobo?¶
Bonobo is an ETL (Extract-Transform-Load) framework for python 3.5. The goal is to define data-transformations, with python code in charge of handling similar shaped independent lines of data.
Bonobo is not a statistical or data-science tool. If you’re looking for a data-analysis tool in python, use Pandas.
Bonobo is a lean manufacturing assembly line for data that let you focus on the actual work instead of the plumbery (execution contexts, parallelism, error handling, console output, logging, …).
Bonobo uses simple python and should be quick and easy to learn.
Tutorial¶
Note
Good documentation is not easy to write. We do our best to make it better and better.
Although all content here should be accurate, you may feel a lack of completeness, for which we plead guilty and apologize.
If you’re stuck, please come and ask on our slack channel, we’ll figure something out.
If you’re not stuck but had trouble understanding something, please consider contributing to the docs (via GitHub pull requests).
What’s next?¶
Read about best development practices¶
Read about integrating external tools with bonobo¶
Working with Docker: run transformation graphs in isolated containers.
Working with Jupyter: run transformations within jupyter notebooks.
Working with Selenium: crawl the web using a real browser and work with the gathered data.
Working with SQLAlchemy: everything you need to interract with SQL databases.