by lsand039 » Fri Sep 09, 2016 10:26 am
So I discovered that the discretized data for a couple of datasets didn't copy correctly when I combined all 4 studies. I redid the correlations, and the order of the top correlated genes was completely different. (For example, age went from the top correlated variable to the 969th.)
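As a rough illustration of what the re-ranking step amounts to (this is not the actual script: the function names, the toy values, and the choice of absolute Pearson correlation against a single target column are all assumptions made for the example):

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists (0.0 if either is constant)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)

def rank_by_correlation(columns, target):
    """Return variable names sorted by |correlation with target|, strongest first."""
    scores = {name: abs(pearson(vals, target)) for name, vals in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)

def top_n(columns, target, n):
    """Keep only the n most strongly correlated variables."""
    return rank_by_correlation(columns, target)[:n]

# Made-up discretized values, purely to show the mechanics.
toy_target = [0, 0, 1, 1, 2, 2]
toy_columns = {
    "geneB": [1, 1, 1, 1, 1, 1],   # constant, so correlation is treated as 0
    "geneA": [2, 2, 1, 0, 0, 1],
    "age":   [0, 0, 1, 1, 2, 2],
}

ranking = rank_by_correlation(toy_columns, toy_target)
```

A ranking like this depends entirely on the input columns, so a column that was copied incorrectly can move a variable a long way up or down the list. Comparing the ranking before and after a merge is a cheap way to spot that.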
I reran the tests for all 11273 genes along with the top 20, 50, 100, 250, 500, 1000, 2500, and 5000 variables in 1-, 2-, 4-, and 8-hour Banjo runs in all 3 terminals. Sex and age were only included if they fell within the appropriate variable interval. The inputs folder under the Banjo Runs file contains the files I used for all 3 terminals. The outputs are organized by terminal and number of variables. I combined the scores and percentages and organized them in Scores & Percentages.xlsx. In Markov Blanket genes.xlsx, I identified the first and second degree Markov Blanket genes in the top-scoring graph of each variable interval. The graphs of the top score for each interval are labeled [number of variables].[length of time of Banjo run].[terminal of Banjo run].
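For reference, here is a minimal sketch of how a Markov blanket can be read off a directed graph. The edge list and node names are invented, and treating "second degree" as the blankets of the first-degree members (minus the first-degree set and the node itself) is an assumption about the definition, not necessarily the one used in the spreadsheet:

```python
def markov_blanket(edges, node):
    """Parents, children, and the children's other parents of `node`.

    `edges` is an iterable of (parent, child) pairs describing a DAG.
    """
    parents = {p for p, c in edges if c == node}
    children = {c for p, c in edges if p == node}
    spouses = {p for p, c in edges if c in children and p != node}
    return parents | children | spouses

def second_degree_blanket(edges, node):
    """Nodes in the blanket of any first-degree member, excluding the
    first-degree set and `node` itself (assumed definition)."""
    first = markov_blanket(edges, node)
    second = set()
    for member in first:
        second |= markov_blanket(edges, member)
    return second - first - {node}

# Invented toy graph: a -> b -> c <- d, c -> e -> f
toy_edges = [("a", "b"), ("b", "c"), ("d", "c"), ("c", "e"), ("e", "f")]

first_degree = markov_blanket(toy_edges, "b")
second_degree = second_degree_blanket(toy_edges, "b")
```

In the toy graph, the blanket of `b` picks up its parent `a`, its child `c`, and `d`, the other parent of `c`. The second-degree set then adds `e`.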
I'm still running some more Banjo runs to figure out the best graph that includes APP, APOE, PSEN1, PSEN2, age, and sex, but I've included the tests I've done so far playing around with the arcs. The Bene file for those genes is named l.png and was included with the Banjo output files.
Attachments: Revised.tar.gz (61.81 MiB)