Databend

Databend is a modern cloud data warehouse focused on elasticity and performance. It turns your object storage into a platform for real-time analytics. Databend Serverless is available at https://app.databend.com/

Stars: 6.76K
Forks: 669
Open issues: 694
Closed issues: 4.04K
Last release: 5 months ago
Last commit: 5 months ago
Watchers: 6.76K
Total releases: 1K
Total commits: 30.1K
Open PRs: 36
Closed PRs: 8.16K
Repo URL:
Project Website: https://databend.rs
Platform:
License: apache-2.0
Category:
Offers premium version? NO
Proprietary? NO
About

Databend: The Next-Gen Cloud [Data+AI] Analytics

Databend Serverless Cloud (beta) | Documentation | Benchmarking | Roadmap (v1.3)


🐋 Introduction

Databend is an open-source, elastic, and workload-aware cloud data warehouse built in Rust, offering a cost-effective alternative to Snowflake. It's designed for complex analysis of the world's largest datasets.

🚀 Why Databend

  • Cloud-Friendly: Seamlessly integrates with various cloud storages like AWS S3, Azure Blob, Google Cloud, and more.

  • High Performance: Built in Rust, utilizing SIMD and vectorized processing for rapid analytics. See ClickBench.

  • Cost-Efficient Elasticity: Innovative design for separate scaling of storage and computation, optimizing both costs and performance.

  • Easy Data Management: Integrated data preprocessing during ingestion eliminates the need for external ETL tools.

  • Data Version Control: Offers Git-like multi-version storage, enabling easy data querying, cloning, and reverting from any point in time.

  • Rich Data Support: Handles diverse data formats (JSON, CSV, Parquet) and data types (ARRAY, TUPLE, MAP, JSON).

  • AI-Enhanced Analytics: Offers advanced analytics capabilities with integrated AI Functions.

  • Community-Driven: Benefit from a friendly, growing community that offers an easy-to-use platform for all your cloud analytics.

📐 Architecture

🚀 Try Databend

  1. Databend Serverless Cloud

    The fastest way to try Databend is Databend Cloud.

  2. Install Databend from Docker

    Prepare the image (once) from Docker Hub (this downloads about 170 MB of data):

    docker pull datafuselabs/databend

    To run Databend quickly:

    docker run --net=host datafuselabs/databend
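
Once the container is up, one way to confirm it answers queries is to send a SQL statement over HTTP. The sketch below is illustrative only. It assumes the query service listens on localhost:8000, accepts POST requests at /v1/query/, and allows the root user with an empty password. None of that is stated above, so check it against the documentation for your image version. The helper names build_payload and run_query are hypothetical.

```shell
# Sketch: send one SQL statement to a local Databend over HTTP.
# Assumptions (not from the text above): HTTP handler on localhost:8000,
# endpoint /v1/query/, user "root" with an empty password.
DATABEND_URL="${DATABEND_URL:-http://localhost:8000/v1/query/}"

# Wrap a SQL string in a JSON body of the form {"sql": "..."}.
# Escapes backslashes and double quotes only; newlines and other
# control characters are not handled in this sketch.
build_payload() {
  escaped=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
  printf '{"sql": "%s"}' "$escaped"
}

# POST the statement with curl and print the raw JSON response.
run_query() {
  curl -sS -u root: -X POST "$DATABEND_URL" \
    -H 'Content-Type: application/json' \
    -d "$(build_payload "$1")"
}
```

After sourcing these functions, a call such as run_query 'SELECT 1' should return a JSON document if the assumptions hold. Keeping the payload builder separate from the network call makes the escaping easy to check without a running server.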

🚀 Getting Started

Deploying Databend

Connecting to Databend

Loading Data into Databend

Loading Data Tools with Databend

Visualize Tools with Databend

Managing Users

Managing Databases

Managing Tables

Managing Data

Managing Views

AI Functions

Data Governance

Securing Databend

Performance

🤝 Contributing

Databend thrives on community contributions! Whether it's through ideas, code, or documentation, every effort helps in enhancing our project. As a token of our appreciation, once your code is merged, your name will be eternally preserved in the system.contributors table.

Here are some resources to help you get started:

👥 Community

For guidance on using Databend, we recommend starting with the official documentation. If you need further assistance, explore the following community channels:

🛣️ Roadmap

Stay updated with Databend's development journey. Here are our roadmap milestones:

📜 License

Databend is released under a combination of two licenses: the Apache License 2.0 and the Elastic License 2.0.

When contributing to Databend, you can find the relevant license header in each file.

For more information, see the LICENSE file and Licensing FAQs.

🙏 Acknowledgement

  • Inspiration: Databend's design draws inspiration from industry leaders ClickHouse and Snowflake.

  • Computing Model: Our computing foundation is built upon Arrow2, a faster and more secure rendition of the Apache Arrow Columnar Format.

  • Documentation Hosting: The Databend documentation website proudly runs on Vercel.
