Testing Larger Datasets

This module implements Equation 35 of Heckerman's article in Theory & Concepts

Testing Larger Datasets

Postby M Charles » Sun Aug 24, 2014 2:15 pm

Using the dataset inside the "Tree Structure" forum section (PAH-Fnal.xlsx - 146 records, 166 variables) on my 2.4GHz Intel Core 2 MacBook Pro, it took:
  • Dataset construction: 0.00445104 sec
  • Random network (max 5 parents): 0.0167201 sec
  • Scoring and drawing DOT file: 32.9561 sec
  • Writing GeNIe file: 0.16809 sec

The network's log score (w/ BDeu metric) was -29985.5286853225

-----

If I ignore the file writing stage (only compute score) and run on the server, I get:
  • Dataset construction: 0.00357199 sec
  • Random network (max 5 parents): 0.00900912 sec
  • Scoring the network: 0.160258 sec

In this trial the (different) network's log score (w/ BDeu metric) was -30463.9
M Charles
 
Posts: 23
Joined: Sun Jun 22, 2014 5:00 pm

Return to BDe Scoring Module

Who is online

Users browsing this forum: No registered users and 1 guest