Machine Learning Scripts
Configure and Initially Train a New Machine Learning Script
Before You Begin
-
Create a query or dataset of your data that you want to use to initially train the script. See the table below for details.
- Create a query or dataset of your data that you want to use to test (and eventually incrementally train) the script. This query should include test data for which you want to predict a certain value.
Procedure
Configure a Machine Learning Script That has Initial Training Data
Before You Begin
- Create a machine learning script by copying an existing script that contains initial training data.
- Create a query or dataset of data that you want to use to test (and eventually incrementally train) the script. See the table below for details.
Procedure
Test A Machine Learning Script
Before You Begin
Procedure
What To Do Next
Incrementally Train a Machine Learning Script
Machine learning scripts include sets of training data that influence the predicted values they produce. If you test a machine learning script and the results are unsatisfactory, you can improve the training data using one of the following methods to incrementally train the script:
- Add training data from the Test Results workspace by evaluating test results and adding reviewed data to the existing training data.
- Add training data from the Workbench workspace by adding additional new training data to the existing training data via a query or dataset.
Before You Begin
Steps: Add reviewed data to the existing training data
-
Access the Test Results workspace.
By default, all rows in the grid are selected, indicating that they will be included when you incrementally train the script.
-
If you do not want the data in particular row to be included in training, clear the check box in the first column to ignore that row.
Tip:- You can select the check box in the heading row of the grid to ignore all data on the current page.
-
You may want to ignore certain rows if you imbalanced data. See for About Script Prediction Improvement details.
-or-
If the value in the Prediction column is correct and you want to include the row in training, move on to the next row.
-or-
If the value in the Prediction column is wrong, select in the Rating column. switches to , indicating that the original prediction was wrong.
Depending on your data, one of the following results occur:
-
If the value in the specified Field to Classify matches a value in the standard list, the Actual column is populated with that standard list value by default.
-or-
-
In the following scenarios:
- If the value in the specified Field to Classify does not match a value in the standard list.
- If the values already matched but you indicated that the prediction was wrong.
- If there is no value in the specified Field to Classify.
...the Actual column is not populated and the cell is selected so that you can specify the correct value.
-or
-
If the data type of the specified Field to Classify is Boolean (i.e., true/false), the Actual column is hidden, and the Boolean value in the hidden Actual column is populated automatically based on your selection in the Rating column.
- If you indicated that the original prediction was wrong (i.e., by marking ) and the data type of the Field to Classify is not Boolean, specify the correct value in the Actual column.Tip: After specifying a value in the Actual column, press Enter or Tab, or select another cell in the Actual column, to move on to the next row.
- Repeat steps 2-3 until you are satisfied that all incorrect values in the Actual column on the current page have been corrected, or you have cleared the check box in rows that you want to exclude from training.
- In the upper-right corner of the workspace, select .
The Continue with Incremental Train? dialog box appears.
- Select OK.
The data from the selected rows in the table (i.e., the data in the query or dataset and the values in the Actual column) are appended to the training data.
- In the box in the upper-left corner of the workspace, select the next available set of records that you want to train.
- Repeat steps 2-7 until you are satisfied with the amount of data that has been appended to the script's training data.
Steps: Add additional new training data via a query or dataset
-
Create a query or dataset of the training data that you want to add to the script's existing training data.
Important:The query or dataset that you select must meet the following requirements:
-
Contains data that is relevant to the script.
- Contains data in which you have a high degree of confidence.
-
Contains labeled data where the labels correspond to values in a standard list.
-
Includes a significant number of records to ensure that the script has a robust set of features.
-
- Access the machine learning script to which you want to add the training data from the query or dataset.
- In the Workbench workspace, in the Query or Dataset box, specify query or dataset that you created in Step 1.
- In the upper-right corner of the workspace, select .
The Continue with Full Train? dialog box appears.
- Select OK.
A notification appears, indicating that the training job has been submitted. You can navigate to other areas of the application while the records are trained. When the process is complete, the script’s history shows a status of Completed, which indicates that all the data from the query or dataset and the script's predicted values are appended to the trained data.
Fully Retrain a Machine Learning Script
Before You Begin
-
Create a query or dataset of the training data that you want to use to replace the script's existing training data.
Important:The query or dataset that you select must meet the following requirements:
-
Contains data that is relevant to the script.
- Contains data in which you have a high degree of confidence.
-
Contains labeled data where the labels correspond to values in a standard list.
-
Includes a significant number of records to ensure that the script has a robust set of features.
-