Drill 2.0 Proposal

This page serves to document a proposal for Drill 2.0. At the time of writing, we are currently gearing up to release Drill 1.20.0 which means that we have released 19 versions of Drill since it was deemed stable enough to warrant a 1.0 label. Since the second phase of this project's life began there have been some things which have been discussed which are breaking changes. This page serves to document proposed breaking changes that could be included in a Drill 2.0.

Please feel free to add your ideas in the knowledge that a subset will be ultimately be selected by the dev team as the basis for a 2.0 release. Changes recorded here need not neccessarily be user breaking. Anything that is a significant change from how Drill 1.x works is welcome.

APIs and connectors

Replace the public API based on Netty with a simpler row-based one. (One of my earliest projects was to create "Jig", a row-based API for Drill. The vector stuff actually grew out of that.) The Netty API is a nightmare to use in anything other than Java, which forces people to use the REST interface, which doesn't scale or handle sessions.

Config system

Add a shared component that applies configuration priorities (... session opt > storage/format config opt > system opt ...) and make all plugins use this component for reading options.

Cluster management and RPC

Replace the home-grown RPC with GRPC or something more modern and less complex.

Query planner

Rebase on current Calcite and review our customisations.

Project structure, packaging and distribution

Split Drill's monorepo into multiple parts. The current repo would be the core while the contrib stuff could move to its own repo(s) under the Drill project.
Ensure we have a good way to build and install plugins separate from the Drill code (early work was done, some Jira tickets exist to explain how its done in other tools).
Explain how to create a plugin in a users own repo, built against Drill.

Additional context for the three items above can be found in the conversation in #2359.

Split Drill installation packages into "core" and "extra".
Install plugins and UDFs from an online marketplace, a la the Eclipse marketplace.

Data types

Fix the cursed TIMESTAMP type. (DRILL-8101).
Continue to complete the UNION type keeping it experimental for now.

SQL functions

Unify nearestdate and date_trunc

Storage and format plugins, reader framework

Vector layer

Replace ValueVectors
Do something about ObjectHolder.
Employ new Java SIMD instrinsics?

Web UI

Replace it with something modern.

General

Remove deprecated code and config options.
Update and deploy the MapR test framework so it can test Drill 2.
Unbundle MapR code?
Upgrade JUnit to v5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Drill 2.0 Proposal

Drill 2.0 Proposal

APIs and connectors

Config system

Cluster management and RPC

Query planner

Project structure, packaging and distribution

Data types

SQL functions

Storage and format plugins, reader framework

Vector layer

Web UI

General

Clone this wiki locally