Add SapientML to automl benchmark #630

kimusaku · 2024-07-29T05:05:09Z

SapientML is an AutoML technology that can learn from a corpus of existing datasets and their human-written pipelines, and efficiently generate a high-quality pipeline for a predictive task on a new dataset.

Signed-off-by: Kosaku Kimura <[email protected]>

PGijsbers

Thanks for your contributions! I haven't had time to try this out yet, but I do already have a couple questions and suggested changes based on the PR. Please have a look at them.

PGijsbers · 2024-07-29T08:35:42Z

frameworks/SapientML/exec.py

+
+    # Sapientml
+    output_dir = config.output_dir + "/" + "outputs" + "/" + config.name + "/" + str(config.fold)
+    predictor = SapientML([target_col], task_type="classification" if is_classification else "regression")


From the abstract, it seems that there is meta-learning involved. Are there datasets in the meta-learning corpus that are also in the AutoML benchmark? If so, is there a way to "turn off" the inclusion of that data in the meta-model for individual evaluations (e.g., not using meta-information derived from the Santander dataset while evaluating on the Santander dataset)?
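To make the request concrete, here is a minimal sketch of a leave-one-dataset-out filter. It is not SapientML's API: `MetaRecord`, `exclude_dataset`, and the idea that the corpus is a list of records keyed by dataset name are all assumptions for illustration. Matching on a normalized name is a simplification; a real check would probably need stable dataset identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetaRecord:
    """Hypothetical meta-corpus entry: a dataset and a pipeline written for it."""
    dataset_name: str
    pipeline_id: str


def exclude_dataset(corpus, held_out):
    """Return the corpus without any record that came from the held-out dataset."""
    key = held_out.strip().lower()
    return [r for r in corpus if r.dataset_name.strip().lower() != key]


# Hypothetical corpus: two records come from the dataset under evaluation.
corpus = [
    MetaRecord("santander", "p1"),
    MetaRecord("adult", "p2"),
    MetaRecord("Santander", "p3"),
]

# Only records from other datasets remain available to the meta-model.
filtered = exclude_dataset(corpus, "Santander")
```

The filter returns a new list, so the full corpus stays intact for evaluations on other datasets.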

PGijsbers · 2024-07-29T08:39:09Z

frameworks/SapientML/requirements.txt

+openml
+boto3==1.26.98


I haven't tried it yet, but it looks like the exec file does not use these dependencies. What are they for?
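One way to check this kind of question is to compare the requirement names against the modules a file actually imports. The sketch below uses only the standard library; the `source` string is a hypothetical stand-in for exec.py, not its real content. It assumes each package is imported under its own name, and it cannot see packages that are only needed transitively.

```python
import ast
import re


def imported_modules(source):
    """Collect the top-level module names imported by a piece of Python source."""
    modules = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            modules.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            modules.add(node.module.split(".")[0])
    return modules


def unused_requirements(requirements, source):
    """Return requirement names that the source never imports directly."""
    used = imported_modules(source)
    unused = []
    for line in requirements:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Drop version specifiers such as "==1.26.98".
        name = re.split(r"[=<>!~\[; ]", line, maxsplit=1)[0]
        if name not in used:
            unused.append(name)
    return unused


# Hypothetical stand-in for the imports in exec.py.
source = "import pandas as pd\nfrom sapientml import SapientML\n"
requirements = ["sapientml", "openml", "boto3==1.26.98"]
result = unused_requirements(requirements, source)
```

With this stand-in, `openml` and `boto3` are reported as not imported directly, which matches the concern raised in the comment.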

PGijsbers · 2024-07-29T08:39:37Z

frameworks/SapientML/requirements.txt

@@ -0,0 +1,3 @@
+sapientml


Please install the framework through the setup.sh script. It allows people to specify versions, source, and so on.

PGijsbers · 2024-07-29T08:42:57Z

frameworks/SapientML/setup.sh

@@ -0,0 +1,8 @@
+#!/usr/bin/env bash


Please update the script so that it can install both from source (as latest) and from PyPI (as stable or with a specified version). See, for example, GAMA's script: https://github.com/openml/automlbenchmark/blob/master/frameworks/GAMA/setup.sh
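The branching asked for here can be sketched as follows. The real setup.sh is a bash script, so this Python function only illustrates the decision logic, not the script itself. The function name, the repository URL, and the `main` branch name are assumptions.

```python
def pip_install_args(version="stable", package="sapientml",
                     repo="https://github.com/sapientml/sapientml",
                     branch="main"):
    """Map a requested framework version to pip arguments (illustrative only)."""
    base = ["install", "--no-cache-dir", "-U"]
    if version == "stable":
        # Newest release published on PyPI.
        return base + [package]
    if version == "latest":
        # Install from source, using the assumed default branch.
        return base + [f"git+{repo}@{branch}"]
    if version[:1].isdigit():
        # A pinned PyPI release such as "1.2.3".
        return base + [f"{package}=={version}"]
    # Anything else is treated as a git branch or tag name.
    return base + [f"git+{repo}@{version}"]


stable_args = pip_install_args("stable")
pinned_args = pip_install_args("1.2.3")
latest_args = pip_install_args("latest")
```

Keeping the version as a single argument lets the benchmark's framework definitions choose between a release and a development build without editing the script.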

PGijsbers · 2024-07-29T08:45:45Z

resources/config.yaml

@@ -102,7 +102,7 @@ openml:                # configuration namespace for openML.

 versions:              # configuration namespace for versions enforcement (libraries versions are usually enforced in requirements.txt for the app and for each framework).
  pip:
-  python: 3.9          # the Python minor version that will be used by the application in containers and cloud instances, also used as a based version for virtual environments created for each framework.
+  python: 3.11          # the Python minor version that will be used by the application in containers and cloud instances, also used as a based version for virtual environments created for each framework.


Is the framework not compatible with Python 3.9? Changing this number here will affect all frameworks. While we will raise it over time (and also plan to allow framework-specific definitions for it), we can't currently bump it without ensuring compatibility for all other frameworks.

Add SapientML to automl benchmark

49f4a22

Signed-off-by: Kosaku Kimura <[email protected]>

PGijsbers requested changes Jul 29, 2024


PGijsbers added the "framework add" label (for issues with a framework to be added) Aug 30, 2024
