StreamSets

datacollector-oss

Stars
75
Forks
90
Open issues
3
Closed issues
0
Last commit
almost 3 years ago
Watchers
75
Total releases
0
Total commits
10.7K
Open PRs
3
Closed PRs
1
Repo URL
Project Website
https://streamsets.com/
Platform
License
apache-2.0
Category
Offers premium version?
NO
Proprietary?
NO
About

What is StreamSets Data Collector?

StreamSets Data Collector is an enterprise grade, open source, continuous big data ingestion platform. It has an advanced and easy to use GUI that lets data engineers, data scientists, developers and data infrastructure teams easily create data pipelines in a fraction of the time typically required to create complex ingest scenarios. Out of the box, StreamSets Data Collector reads from and writes to a large number of connectors, including Amazon S3, Microsoft ADLS, Google cloud, JDBC-based, Hadoop and file-based, Kafka, and many others. In addition to a large number of pre-built stages to transform and process the data on the fly, you can also use Groovy, Jython, and JavaScript processors to write custom code.

To learn more, check out http://streamsets.com

Building StreamSets Data Collector

To build the StreamSets Data Collector from source code, click here for details.

License

StreamSets Data Collector is built on open source technologies, our code is licensed with the Apache License 2.0.

Getting Help

A good place to start is to check out http://streamsets.com/community. You can also various support options.

Alternative Projects

Subscribe to Open Source Businees Newsletter

Twice a month we will interview people behind open source businesses. We will talk about how they are building a business on top of open source projects.

We'll never share your email with anyone else.