TileDB alternatives and similar libraries
Based on the "Database" category
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest. Visit our partner's website for more details.
Do you think we are missing an alternative of TileDB or a related project?
The Storage Engine for Data Science
TileDB is a powerful engine for storing and accessing dense and sparse multi-dimensional arrays. It is an embeddable C++ library that works on Linux, macOS, and Windows. It is open-sourced under the permissive MIT License.
TileDB includes the following features:
- Support for both dense and sparse arrays
- Support for dataframes and key-value stores (via sparse arrays)
- Cloud storage (AWS S3, Google Cloud Storage, Azure Blob Storage)
- Chunked (tiled) arrays
- Multiple compression, encryption and checksum filters
- Fully multi-threaded implementation
- Parallel IO
- Data versioning (rapid updates, time traveling)
- Array metadata
- Array groups
- Numerous APIs on top of the C++ library
- Numerous integrations (Spark, Dask, MariaDB, GDAL, etc.)
You can use TileDB to store data in a variety of applications, such as Genomics, Geospatial, Finance and more. The power of TileDB stems from the fact that any data can be modeled efficiently as either a dense or a sparse multi-dimensional array, which is the format used internally by most data science tooling. By storing your data and metadata in TileDB arrays, you abstract all the data storage and management pains, while efficiently accessing the data with your favorite data science tool.
You can install the TileDB library as follows:
# Homebrew (macOS): $ brew update $ brew install tiledb-inc/stable/tiledb # Or Conda (macOS, Linux, Windows): $ conda install -c conda-forge tiledb
Alterantively, you can use the Docker image we provide:
$ docker pull tiledb/tiledb $ docker run -it tiledb/tiledb
We include several examples. You can start with the following:
You can find the detailed TileDB documentation at https://docs.tiledb.com.
The TileDB data format is open-source and can be found [here](format_spec/FORMAT_SPEC.md).
The TileDB team maintains a variety of APIs built on top of the C++ library:
TileDB is also integrated with several popular databases and data science tools:
TileDB is an open source project and welcomes all forms of contributions. Contributors to the project should read over the contribution docs for more information.
*Note that all licence references and agreements mentioned in the TileDB README section above are relevant to that project's source code only.