RumbleDB

With RumbleDB, you can easily query data in many formats, including nested and heterogeneous ones, such as JSON, CSV, Parquet, Avro, LibSVM, and plain text.

RumbleDB exposes a query language rather than a DataFrame API. This gives you more flexibility and productivity, and it also matters because a lot of data simply will not fit in DataFrames.
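To make the point about data that does not fit in DataFrames concrete, here is a minimal Python sketch (not RumbleDB code) that scans a few JSON Lines records and reports which fields take more than one JSON type. The sample records and the helper names `type_name`, `field_types`, and `multi_typed_fields` are hypothetical and exist only for illustration. A field that is an array in one record and a string in another has no single column type, which is the kind of heterogeneity meant here.

```python
import json

# Hypothetical JSON Lines input (an assumption for illustration): the "tags"
# and "address" fields change shape from one record to the next.
RAW_LINES = [
    '{"name": "Ada", "tags": ["math", "code"], "address": {"city": "London"}}',
    '{"name": "Bob", "tags": "sales"}',
    '{"name": "Cy", "address": "unknown", "age": 41}',
]


def type_name(value):
    """Return a JSON-style type name for a parsed Python value."""
    if isinstance(value, dict):
        return "object"
    if isinstance(value, list):
        return "array"
    if isinstance(value, bool):  # checked before numbers: bool is an int subclass
        return "boolean"
    if isinstance(value, (int, float)):
        return "number"
    if value is None:
        return "null"
    return "string"


def field_types(lines):
    """Map each field name to the set of JSON types it takes across records."""
    seen = {}
    for line in lines:
        record = json.loads(line)
        for key, value in record.items():
            seen.setdefault(key, set()).add(type_name(value))
    return seen


def multi_typed_fields(lines):
    """Return, in sorted order, the fields that occur with more than one type."""
    return sorted(k for k, types in field_types(lines).items() if len(types) > 1)


if __name__ == "__main__":
    print(multi_typed_fields(RAW_LINES))
```

In this sample, `tags` and `address` would each need more than one column type, and `age` is absent from most records. A query language that works directly on nested items can process such records without first forcing them into a fixed tabular schema.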

You can query your data in place on any local file system or data lake (Azure Blob Storage, Amazon S3, HDFS, etc.).

You can prepare, clean up, and validate your data, then feed it directly into your machine learning pipelines with RumbleDB ML.

Getting started: you will find a Jupyter notebook that introduces the JSONiq language on top of RumbleDB here. You can also run it locally if you prefer.

The documentation also contains an introduction specific to RumbleDB, including how to read input datasets, but we have not converted it to Jupyter notebooks yet (this will follow).

The documentation of the latest official release is available here.

The documentation of the current master (for the adventurous and curious) is available here.

