mlpack_decision_tree

NAME

mlpack_decision_tree - decision tree

SYNOPSIS

mlpack_decision_tree [-h] [-v]

DESCRIPTION

Train and evaluate using a decision tree. Given a dataset containing numeric features and associated labels for each point in the dataset, this program can train a decision tree on that data.

The training file and associated labels are specified with the --training_file (-t) and --labels_file (-l) options, respectively. The labels should be in the range [0, num_classes - 1]. If --labels_file is not specified, the labels are assumed to be the last dimension of the training dataset.
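As a minimal sketch of the two ways of supplying labels, the following creates a tiny made-up dataset and trains on it twice: once with a separate labels file, and once with the labels appended as the last dimension. The file names (dt_points.csv, dt_labels.csv, dt_combined.csv), the data, and the choice of --minimum_leaf_size 1 are assumptions for illustration only. The training commands are skipped if mlpack_decision_tree is not found on the PATH.

```shell
# Made-up data: two numeric features per point, one point per line.
cat > dt_points.csv <<'EOF'
1.0,2.0
1.5,1.8
5.0,8.0
6.0,9.0
EOF

# One label per point, in the range [0, num_classes - 1] (two classes here).
cat > dt_labels.csv <<'EOF'
0
0
1
1
EOF

# The same data with the label appended as the last dimension.
paste -d, dt_points.csv dt_labels.csv > dt_combined.csv

if command -v mlpack_decision_tree >/dev/null 2>&1; then
  # Labels given explicitly with --labels_file.
  mlpack_decision_tree --training_file dt_points.csv \
    --labels_file dt_labels.csv \
    --minimum_leaf_size 1 --print_training_error

  # No --labels_file: labels are read from the last dimension.
  mlpack_decision_tree --training_file dt_combined.csv \
    --minimum_leaf_size 1 --print_training_error
else
  echo "mlpack_decision_tree not found; skipping training"
fi
```

Both invocations describe the same training set, so they would be expected to produce equivalent trees.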

When a model is trained, it may be saved to file with the --output_model_file (-M) option. A model may be loaded from file for predictions with the --input_model_file (-m) option. The --input_model_file option may not be specified when the --training_file option is specified. The --minimum_leaf_size (-n) parameter specifies the minimum number of training points that must fall into each leaf for it to be split. If --print_training_error (-e) is specified, the training error will be printed.
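The train-then-reuse workflow described above might look like the following sketch. The file names (dt_train.csv, dt_query.csv, dt_model.bin, dt_predictions.csv) and the data are hypothetical, and the .bin extension is an assumption about which model file formats the installed mlpack accepts. The commands are skipped if mlpack_decision_tree is not found on the PATH.

```shell
# Made-up training data; the last column holds the label (0 or 1).
cat > dt_train.csv <<'EOF'
1.0,2.0,0
1.5,1.8,0
5.0,8.0,1
6.0,9.0,1
EOF

# Made-up points to classify later (features only, no labels).
cat > dt_query.csv <<'EOF'
1.2,1.9
5.5,8.5
EOF

if command -v mlpack_decision_tree >/dev/null 2>&1; then
  # First run: train, print the training error, and save the model.
  mlpack_decision_tree --training_file dt_train.csv \
    --minimum_leaf_size 1 --print_training_error \
    --output_model_file dt_model.bin

  # Second run: load the saved model instead of training.
  # --training_file is deliberately absent, since it may not be
  # combined with --input_model_file.
  mlpack_decision_tree --input_model_file dt_model.bin \
    --test_file dt_query.csv \
    --predictions_file dt_predictions.csv
else
  echo "mlpack_decision_tree not found; skipping"
fi
```

A small --minimum_leaf_size is used here only because the example dataset has four points; the default of 20 would leave such a tiny dataset unsplit.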

A file containing test data may be specified with the --test_file (-T) option, and if performance numbers are desired for that test set, labels may be specified with the --test_labels_file (-L) option. Predictions for each test point may be stored in the file specified by the --predictions_file (-p) option. Class probabilities for each prediction may be stored in the file specified by the --probabilities_file (-P) option.
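To illustrate evaluation on a test set, the sketch below trains and tests in a single run, supplying test labels and requesting both predictions and class probabilities. All file names and data are made up for illustration, and the commands are skipped if mlpack_decision_tree is not found on the PATH.

```shell
# Made-up training data; the last column holds the label (0 or 1).
cat > dt_train.csv <<'EOF'
1.0,2.0,0
1.5,1.8,0
5.0,8.0,1
6.0,9.0,1
EOF

# Made-up test points and their true labels, one per line.
cat > dt_test.csv <<'EOF'
1.2,1.9
5.5,8.5
EOF
cat > dt_test_labels.csv <<'EOF'
0
1
EOF

if command -v mlpack_decision_tree >/dev/null 2>&1; then
  # Train on dt_train.csv, then classify dt_test.csv in the same run.
  # Test labels are supplied so that performance can be reported.
  mlpack_decision_tree --training_file dt_train.csv \
    --minimum_leaf_size 1 \
    --test_file dt_test.csv \
    --test_labels_file dt_test_labels.csv \
    --predictions_file dt_predictions.csv \
    --probabilities_file dt_probabilities.csv
else
  echo "mlpack_decision_tree not found; skipping"
fi
```

The --test_labels_file option can be dropped when only the predictions are needed and no accuracy figure is required.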

OPTIONAL INPUT OPTIONS

--help (-h) [bool]

Default help info. Default value 0.

--info [string]

Get help on a specific module or option. Default value ''.

--input_model_file (-m) [string]

Pre-trained decision tree, to be used with test points. Default value ''.

--labels_file (-l) [string]

Training labels. Default value ''.

--minimum_leaf_size (-n) [int]

Minimum number of points in a leaf. Default value 20.

--print_training_error (-e) [bool]

Print the training error. Default value 0.

--test_file (-T) [string]

Matrix of test points. Default value ''.

--test_labels_file (-L) [string]

Test point labels, if accuracy calculation is desired. Default value ''.

--training_file (-t) [string]

Matrix of training points. Default value ''.

--verbose (-v) [bool]

Display informational messages and the full list of parameters and timers at the end of execution. Default value 0.

--version (-V) [bool]

Display the version of mlpack. Default value 0.

--weights_file (-w) [string]

The weight of labels. Default value ''.

OPTIONAL OUTPUT OPTIONS

--output_model_file (-M) [string]

Output for trained decision tree. Default value ''.

--predictions_file (-p) [string]

Class predictions for each test point. Default value ''.

--probabilities_file (-P) [string]

Class probabilities for each test point. Default value ''.

ADDITIONAL INFORMATION

For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your distribution of mlpack.