GEO datasets

Re: GEO datasets

Postby lsand039 » Thu Oct 05, 2017 9:34 am

I'm trying to use Jinang's code to score my structures, but I keep running into this error:
Screenshot from 2017-10-05 09-30-37.png
Screenshot from 2017-10-05 09-30-37.png (64.03 KiB) Viewed 154139 times

I've specified my structure as "Benestructure.graph.txt" and my data as "2254Sx4G.csv". I'm currently at a loss on how to fix it. I've tried googling for solutions, but I'm unfortunately not familiar enough with python codes.

Below are all the files downloaded from his code and my files.
bayesian_causal_model.git.tar.gz
(652.42 KiB) Downloaded 197 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Oct 05, 2017 10:50 am

Bene seems to always output a fully connected graph that follows the same order as the variables are listed in the datafile.
Datafiles.xlsx
Sheet number of data corresponds to test number data was used for.
(445.27 KiB) Downloaded 176 times
OrderTest1.png
OrderTest1.png (123.89 KiB) Viewed 154138 times
OrderTest2.png
OrderTest2.png (125.93 KiB) Viewed 154138 times
OrderTest3.png
OrderTest3.png (124.98 KiB) Viewed 154138 times
OrderTest4.png
OrderTest4.png (118.33 KiB) Viewed 154138 times
OrderTest5.png
OrderTest5.png (120.74 KiB) Viewed 154138 times
OrderTest6.png
OrderTest6.png (119.39 KiB) Viewed 154138 times
OrderTest7.png
OrderTest7.png (122.01 KiB) Viewed 154138 times
OrderTest8.png
OrderTest8.png (120.05 KiB) Viewed 154138 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Oct 12, 2017 10:01 am

Different structures based off Bene-command:
./run.sh: Score: -nan
run.sh4genes.png
run.sh4genes.png (123.89 KiB) Viewed 154135 times

./run.sh BDe: Score: -nan
BDe4genes.png
BDe4genes.png (123.89 KiB) Viewed 154135 times

./run.sh BIC: Score : 30.882
BIC4genes.png
BIC4genes.png (17.92 KiB) Viewed 154135 times

./run.sh AIC: Score : 8.000
AIC4genes.png
AIC4genes.png (17.92 KiB) Viewed 154135 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Oct 12, 2017 11:00 am

I don't have a score for the fully connected Bene structure, but here's the structure for (1): best scoring structure without background knowledge 1st and 2nd degree genes of Alzheimer. This structure was consistent in paths 2, 3, and 5.
(1).png
(1).png (76.78 KiB) Viewed 154135 times

I ordered the 1st & 2nd degree genes of Alzheimer's according to correlations. The clinical variables were forced to have arcs with Alzheimer's.
Background knowledge files:
(1)mnh.txt
forbidden arcs
(684 Bytes) Downloaded 190 times

(1)mh.txt
Forced arcs
(17 Bytes) Downloaded 188 times


ROC: 0.731618
(1)ROC-AD.png
(1)ROC-AD.png (11.07 KiB) Viewed 154133 times

(1)ROC-nonAD.png
(1)ROC-nonAD.png (11.06 KiB) Viewed 154133 times
Attachments
settingsO1.1.txt
Settings File
(5.83 KiB) Downloaded 238 times
(1)MB1&2NoBK.txt
datafile
(85.97 KiB) Downloaded 186 times
Last edited by lsand039 on Thu Oct 19, 2017 8:56 am, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Mon Oct 16, 2017 12:50 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (2): best scoring structure with background knowledge 1st and 2nd degree genes of Alzheimer. This structure was consistent in paths 2, 3, and 5.
(2).png
(2).png (77.53 KiB) Viewed 154134 times

I ordered the 1st & 2nd degree genes of Alzheimer's according to correlations. The clinical variables were forced to have arcs with Alzheimer's.
Background knowledge files:
(2)mh.txt
Forced Arcs
(16 Bytes) Downloaded 191 times
(2)mnh.txt
Forced Arcs
(401 Bytes) Downloaded 185 times

ROC: 0.717182
(2)ROC-nonAD.png
(2)ROC-nonAD.png (11.09 KiB) Viewed 154133 times
(2)ROC-AD.png
(2)ROC-AD.png (11.04 KiB) Viewed 154133 times
Attachments
settingsO2.1.txt
Settings files
(5.84 KiB) Downloaded 179 times
(2)MB1&2BKKEGG.txt
Data File
(77.15 KiB) Downloaded 189 times
Last edited by lsand039 on Thu Oct 19, 2017 9:37 am, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Mon Oct 16, 2017 2:06 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (3): best scoring structure without background knowledge 1st and 2nd degree genes of Alzheimer and KEGG genes 4th degree MB with no background knowledge. This structure was consistent in paths 2, 3, and 5.
(3).png
(3).png (59 KiB) Viewed 154134 times


Background knowledge files:
(3)mnh.txt
Forbidden Arcs
(540 Bytes) Downloaded 172 times
(3)mh.txt
Forced arcs
(16 Bytes) Downloaded 198 times


ROC: 0.73061
(3)ROC-nonAD.png
(3)ROC-nonAD.png (11.11 KiB) Viewed 154133 times
(3)ROC-AD.png
(3)ROC-AD.png (11.08 KiB) Viewed 154133 times
Attachments
(3)MB1_2_4NoBK.txt
Datafile
(99.2 KiB) Downloaded 194 times
settingsO3.1.txt
Settings File
(5.83 KiB) Downloaded 184 times
Last edited by lsand039 on Thu Oct 19, 2017 9:37 am, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Tue Oct 17, 2017 8:36 am

I don't have a score for the fully connected Bene structure, but here's the structure for (5): best scoring structure with background knowledge 1st and 2nd degree genes of Alzheimer and KEGG genes 3rd & 4th degree MB with background knowledge. This structure was consistent in paths 2, 3, and 5.
(5).png
(5).png (88.39 KiB) Viewed 154132 times

Background knowledge files:
(5)mh.txt
forced arcs
(16 Bytes) Downloaded 168 times
(5)mnh.txt
forbidden arcs
(434 Bytes) Downloaded 178 times

ROC: 0.717238
(5)ROC-AD.png
(5)ROC-AD.png (11.07 KiB) Viewed 154132 times
(5)ROC-nonAD.png
(5)ROC-nonAD.png (11.18 KiB) Viewed 154132 times
Attachments
(5)MB1_2_3_4BKKEGG.txt
data file
(94.78 KiB) Downloaded 185 times
settingsO5.1.txt
Settings file
(5.84 KiB) Downloaded 187 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Wed Oct 25, 2017 12:51 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (6): best scoring structure with background knowledge 1st and 2nd degree genes of Alzheimer KEGG genes 3rd, 4th , &5th degree MB with background knowledge. The BaNJo runs did not agree on a single structure. Here are the scores & log likelihood %:
(6) scores.xlsx
(5.39 KiB) Downloaded 184 times


Here are the top 3 structures. They only accounted for 92% of the log likelihood. There were 2 other structures that would've summed >95%. Please let me know if I should have them posted.
Top scoring structure: 45.89% of total log likelihood score
(6)a.png
(6)a.png (104.73 KiB) Viewed 154131 times

ROC: 0.7192
(6)aROC-AD.png
(6)aROC-AD.png (11.2 KiB) Viewed 154130 times

(6)aROC-nonAD.png
(6)aROC-nonAD.png (11.23 KiB) Viewed 154130 times

2nd top scoring structure: 36.08% of total log likelihood score
(6)b.png
(6)b.png (106.25 KiB) Viewed 154131 times

ROC: 0.736538
(6)bROC-AD.png
(6)bROC-AD.png (11.08 KiB) Viewed 154130 times

(6)bROC-nonAD.png
(6)bROC-nonAD.png (11.08 KiB) Viewed 154130 times

3rd top structure: 9.99% of total log likelihood score
(6)c.png
(6)c.png (103.69 KiB) Viewed 154131 times

ROC: 0.736538
(6)cROC-AD.png
(6)cROC-AD.png (11.3 KiB) Viewed 154130 times

(6)cROC-nonAD.png
(6)cROC-nonAD.png (10.97 KiB) Viewed 154130 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Sat Oct 28, 2017 4:23 pm

Here's the structure for (4): best scoring structure without background knowledge 1st and 2nd degree genes of Alzheimer and KEGG genes 4th&5th degree MB with no background knowledge.
This is the highest scoring structure is below:
(4).png
(4).png (137.95 KiB) Viewed 154128 times

The scores for the other structure:
(6)scores.xlsx
(5.9 KiB) Downloaded 178 times

Background knowledge files:
(4)mh.txt
forced arcs
(16 Bytes) Downloaded 185 times

(4)mnh.txt
forbidden arcs
(1008 Bytes) Downloaded 180 times

ROC: 0.741332
(4)ROC-AD.png
(4)ROC-AD.png (11.13 KiB) Viewed 154128 times

(4)ROC-nonAD.png
(4)ROC-nonAD.png (11.16 KiB) Viewed 154128 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Sat Oct 28, 2017 5:16 pm

Here are the arcs that were different in a couple (6)'s top scoring graphs:
CAPN2->MCL1 (not in A)
UQCRC1->MCL1 (not in B,C)

PPP3CB->NFKB2 (not in B)
UQCRC2->NFKB2 (not in A,C)

MCL1->CAPN2 (not in B,C)
Alzheimer->CAPN2 (not in A)

YWHAZ->APBB1 (not in B,C)
SNCA->APBB1 (not in A)

PPP3CB->MYO1F (not in A,B)
ATP2A2->MYO1F (not in C)

PPP3CB->NFKB2; (not in B)
UQCRC2->NFKB2; (not in B)
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

PreviousNext

Return to Alzheimer

Who is online

Users browsing this forum: No registered users and 4 guests

cron