Good afternoon,
Here I cover how to use and the requirements for running my version of the program for order searching.
Respectfully,
Efrain Gonzalez
You have the first variable that can take on the values 0 and 1, a second variable that can take on the values 3 and 4, and a third variable that can take on the values 0, 1, and 2. So this line would look like the below
MaximumCategory = 1 4 2
You have the first variable that can take on the values 0 and 1, a second variable that can take on the values 3 and 4, and a third variable that can take on the values 0, 1, and 2. So this line would look like the below
MinimumCategory = 0 3 0
Let us suppose that you have 5 variables A, B, C, D, and E and they appear in this order within the data file. Then in the prior file A, B, C, D, and E will be represented by 0, 1, 2, 3, and 4, respectively. Now let us suppose that the scientist strongly believes that A comes before C and B, and C comes before D in the order. Then the following is what the prior file would look like:
0(tab)2,1(tab).9
2(tab)3(tab).9
Where one would use the actual tab key on the keyboard to replace "(tab)" in the above example. In the above example we see that tabs are used to separate groups of variables whereas commas are used to separate variables within each group. Also, notice that the same variable (in this case 2/C) may be used in several lines of the prior file so long as each time it speaks towards a new relationship (relationship between 2 and 3 whereas the first was relationships between 0, 1, and 2). In the above example .9 is the probability of the variables in the first group coming before the variables in the second group in the order. The probability is determined by the scientist/user and is completely based on how strong the beliefs of the user/scientist are.
In general the prior file should look something like this:
Group1(tab)Group2(tab)Probability
Group3(tab)Group4(tab)Probability
.
.
.
GroupN-1(tab)GroupN(tab)Probability
g++ -x c++ -std=c++11 -o NameOfProgram Path/To/The/CppFile
efrain.gonzalez0 wrote:Below I provide some instructions that clarify how to use the configuration/settings files and what each variable means:
- DataFile - This is the location of the file that you want to analyze.
- TotalVariables - This is an integer that represents the total amount of variables in the above file.
- PriorFile - This is the location of the prior file
- MaximumCategory - For each variable place the maximum category that the variable can take on. Each number must be separated by a space. The order in which the categories are placed must be the same as the order in which the variables are presented within the data file. For example:
- Code: Select all
You have the first variable that can take on the values 0 and 1, a second variable that can take on the values 3 and 4, and a third variable that can take on the values 0, 1, and 2. So this line would look like the below
MaximumCategory = 1 4 2- MinimumCategory - For each variable place the minimum category that the variable can take on. Each number must be separated by a space. The order in which the categories are placed must be the same as the order in which the variables are presented within the data file. For example:
- Code: Select all
You have the first variable that can take on the values 0 and 1, a second variable that can take on the values 3 and 4, and a third variable that can take on the values 0, 1, and 2. So this line would look like the below
MinimumCategory = 0 3 0- MaximumTime - The longest amount of time that you would want the program to take for the analysis. The program may end up taking a bit longer because it does not interrupt any process when the time has been exceeded and instead waits until the process has completed. The time is in hours. This may be a decimal.
- MaximumParents - An integer that represents the maximum amount of parents that you will allow for the analysis. The amount of parents has a major effect on the amount of time that the program will require in order to score a single order.
- EpsilonDifference/PercentEpsilonDifference - Here there are two possibilities the first is "EpsilonDifference" which is the difference between two log scores. The second is "PercentEpsilonDifference" which is the difference between two log scores in terms of percentages. They both have the same job which is to control when the program will perform a cut the deck operation. The smaller the value the less likely the program will perform a cut the deck operation. Choosing "EpsilonDifference" to be .001 is the same as choosing "PercentEpsilonDifference" to be .1.
- StartingOption - This can take on two values Y or n. This lets the program know that you would like it to start its search at a particular order (Y) or at a random order (n).
- StartingOrder - This is the order that you would like the program to start at. Make sure that this is always equal in size to the TotalVariables value that you specified.
Return to How to run scripts/programs
Users browsing this forum: No registered users and 3 guests