pict

Structure Learning

Structure learning in Hugin is supported through the PC-algorithm. Consider the data file asia_no_e.dat for the ASIA_NO_E network. Figure 1 shows the first few lines of the data file.

structural_learning_file

Figure 1: The data file

The structure learning functionality is available under the "File" menu and through the structure learning icon. The structure learning icon is shown in figure 2.

structural_learning_icon

Figure 2: The structure learning window

When the icon is depressed the structure learning window appears. Note that the field "Significance level" which specifies the significance level of the statistical independence tests performed during structural learning. Depress the "Select File" button and choose a file from which the structure is to be estimated. When the file is selected the "OK" button appears as shown in figure 3.

structural_learning_popup

Figure 3: The structural learning window

Depressing the "OK" button starts the structure learning algorithm. Based on the database of cases given in the file, the structure learning algorithm learns the structure of the graph of the Bayesian network. Figure 4 shows the result of structure learning based on the asia_no_e.dat file.

structural_learning_learned

Figure 4: The learned Bayesian network graph

The network graph from which the data file has been sampled is shown in figure 5. When comparing the original network graph with the learned network graph, it can be seen that the only difference is that the link from the "Visit to Asia"-node to the "Tuberculosis"-node is missing in the learned network graph. This is due to the fact, that the strength of the dependency between these two nodes is rather weak. If the "Significance Level Of Dependency Test" factor were set to a higher value, this link would most likely be identified as well. However, other (incorrect) links may be identified as well if the "Significance Level Of Dependency Test" factor is raised.

structural_learning_learned_from

Figure 5: The Bayesian network graph used for sampling the data file

Once the structure of the graph of the Bayesian network has been constructed. The conditional probability distributions of the Bayesian network can be estimated from the data using the EM-learning algorithm, see the EM tutorial.