These steps describe how to configure a machine learning script for which an initial set of training data already exists.
Note: A script already contains an initial set of training data if you created it by copying a script that was delivered as part of the GE Digital APM baseline Catalog content. If you instead created a completely new script, you should instead follow the steps to configure and initially train a new machine learning script.
Create a machine learning script by copying an existing script that contains initial training data.
Create a query or dataset of data that you want to use to test (and eventually incrementally train) the script. See the table below for details.
Steps
Configure the script as described in the following table.
Section | Description | Required / Optional |
---|---|---|
Standard List | Select the standard list that is relevant to the script that you are testing. | Required |
Query or Dataset |
Select The query or dataset that you select must meet the following requirements:
See About Prediction Improvement for details. Notes:
Tip: For example, to test the IsAFailure.py script, the query should include the short and long descriptions of the work history event, the work order priority, and the breakdown indicator. |
Required |
Field to Classify |
Select a column from the query or dataset that contains the current values that the script is attempting to predict. Tip: For example, when testing the IsAFailure.py script, select the breakdown indicator column as the result column. |
Required |
Standard List Reference Field |
Select a column from the query or dataset that contains values that identify sub-sets of the specified Standard List (e.g., Equipment Class values). When the script is run, values in the specified field are compared to values in the List Reference field in Classifier Standard List records to determine which sub-set(s) of the script’s Standard List to use. |
Optional |
Classifier Input Fields |
Select the check box for each column in the query that you want to use as an input to the script. Tip: By default, the field selected in the Standard List Reference Field box is not included as an input to the script. If you want the script to process values in this field, include the column twice in your query or dataset. |
Required |
The script is configured.
Select
What's Next?
Copyright © 2018 General Electric Company. All rights reserved.