The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
Join our community | Newsletter | Contact us | Docs | Blog | Website | YouTube
Ploomber is the fastest way to build data pipelines ⚡️. Use your favorite editor (Jupyter, VSCode, PyCharm) to develop interactively and deploy ☁️ without code changes (Kubernetes, Airflow, AWS Batch, and SLURM). Do you have legacy notebooks? Refactor them into modular pipelines with a single command.
Installation
Compatible with Python 3.7 and higher.
Install with pip
:
pip install ploomber
Or with conda
:
conda install ploomber -c conda-forgeGetting started Try the tutorial:
Community
Main Features ⚡️ Get started quickly
A simple YAML API to get started quickly, a powerful Python API for total flexibility.
get-started.mp4
⏱ Shorter development cycles
Automatically cache your pipeline’s previous results and only re-compute tasks that have changed since your last execution.
shorter-cycles.mp4
☁️ Deploy anywhere
Run as a shell script in a single machine or distributively in Kubernetes, Airflow, AWS Batch, or SLURM.
deploy.mp4
📙 Automated migration from legacy notebooks
Bring your old monolithic notebooks, and we’ll automatically convert them into maintainable, modular pipelines.
refactor.mp4
I want to migrate my notebook.
Resources
About Ploomber
Ploomber is a big community of data enthusiasts pushing the boundaries of Data Science and Machine Learning tooling.
Whatever your skillset is, you can contribute to our mission. So whether you're a beginner or an experienced professional, you're welcome to join us on this journey!
Twice a month we will interview people behind open source businesses. We will talk about how they are building a business on top of open source projects.
We'll never share your email with anyone else.