Optuna

Comparison between `Hyperopt` and `Optuna`

Hyperopt should predefine the search space using dictionary to setup
Optuna can use any python syntax to set up search space (e.g. .yml, dict)

Hyperparameter algorithm containes two parts of strategy

Sampling Strategy
Pruning Strategy

Samplers

Overview
Model-based
Other method

Model-based

TPE: kernel fitting (tree-based estimator which can deal with both numerical and categorical features)
GP: Gaussian process (suitable for numerical features)
CMA-ES (covariant matrix adaptation-evolution strategy): meta-heuristics algorithm for continuous space (a kind of Genetic algorithm)
Simulated Annealing

Other method

Random Search
Grid Search
User-defined algorithm

- A **hybrid sampler** largely improves optimization performance (```TPE``` then ```CMA-ES```) - Step 1: Global search with TPE - Step 2: Local search with CMA-ES

Pruning Strategy (a.k.a. automated early stopping)

Stop unpromising trials based on learning curve (can let computing resource dedicate to more promising trials)
Median pruning (Median Pruner), non-pruning (NonPruner), asynchronous successive halving algorithm (SuccessiveHalvingPruner), hyberband, etc.
default: Median Pruner although basically it is outperformed by SuccessiveHalvingPruner and HyberbandPruner

Template for `Optuna` hyperparameter

Examples for different models (LightGBM, pytorch, tensorflow)

import optuna

def objective(trial):
    # write your code
    
    return evaluation_score
   
study = optuna.create_study()
study.optimize(objective, n_trials=<number of trials>)

If you want to do pruning

def objective(trial):
    # write your code

    trial.report(accuracy, epoch)
    # Handle pruning based on the intermediate value
    if trial.shoud_prune():
        raise optuna.exceptions.TrialPruned() # let the function be exited
    
    return accuracy

Another way to set up hyperparameter (using dictionary)

# model can be seperated from objective using the example shown below
def make_model():
    # construct your model
    
    return model

def return_score(param):
    model = xgb.XGBRegressor(**param)  
    rmse = -np.mean(model_selection.cross_val_score(model,X_train[:1000],y_train[:10000], cv = 4, n_jobs =-1,scoring='neg_root_mean_squared_error'))
    return rmse


def objective(trial):
    param = {
                "n_estimators" : trial.suggest_int('n_estimators', 0, 500),
                'max_depth':trial.suggest_int('max_depth', 3, 5),
                'reg_alpha':trial.suggest_uniform('reg_alpha',0,6),
                'reg_lambda':trial.suggest_uniform('reg_lambda',0,2),
                'min_child_weight':trial.suggest_int('min_child_weight',0,5),
                'gamma':trial.suggest_uniform('gamma', 0, 4),
                'learning_rate':trial.suggest_loguniform('learning_rate',0.05,0.5),
                'colsample_bytree':trial.suggest_uniform('colsample_bytree',0.4,0.9),
                'subsample':trial.suggest_uniform('subsample',0.4,0.9),
                'nthread' : -1
            }
    return(return_score(param)) # this will return the rmse score

Which Sampler and Pruner should be used?

For not deep learning tasks,
- For RandomSampler, MedianPruner is the best
- For TPESampler, Hyberband is the best
- Ref
For deep learning tasks,

Reference: ref

Complementary: Specify hyperparameter manually
- Passing those sets of hyperparameters and let Optuna evaluate them - enqueue_trial()
- Adding the results of those sets as completed Trials - add_trial()
Hydra+Optuna+mlflow

https://supikiti.notion.site/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optuna_guidance.md

Optuna_guidance.md

Optuna

Comparison between `Hyperopt` and `Optuna`

Hyperparameter algorithm containes two parts of strategy

Samplers

Model-based

Other method

Pruning Strategy (a.k.a. automated early stopping)

Template for `Optuna` hyperparameter

Which Sampler and Pruner should be used?

Files

Optuna_guidance.md

Latest commit

History

Optuna_guidance.md

File metadata and controls

Optuna

Comparison between Hyperopt and Optuna

Hyperparameter algorithm containes two parts of strategy

Samplers

Model-based

Other method

Pruning Strategy (a.k.a. automated early stopping)

Template for Optuna hyperparameter

Which Sampler and Pruner should be used?

Comparison between `Hyperopt` and `Optuna`

Template for `Optuna` hyperparameter