GEO datasets

Re: GEO datasets

Post by lsand039 » Thu Oct 05, 2017 9:34 am

I'm trying to use Jinang's code to score my structures, but I keep running into this error:
Screenshot from 2017-10-05 09-30-37.png (64.03 KiB)

I've specified my structure as "Benestructure.graph.txt" and my data as "2254Sx4G.csv". I'm at a loss as to how to fix it. I've tried searching for solutions, but unfortunately I'm not familiar enough with Python code.

Attached are all the files downloaded for his code, along with my own files:
bayesian_causal_model.git.tar.gz (652.42 KiB)
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Post by lsand039 » Thu Oct 05, 2017 10:50 am

Bene always seems to output a fully connected graph that follows the order in which the variables are listed in the data file.
Datafiles.xlsx (445.27 KiB): each sheet number corresponds to the number of the test that the data was used for.
OrderTest1.png (123.89 KiB)
OrderTest2.png (125.93 KiB)
OrderTest3.png (124.98 KiB)
OrderTest4.png (118.33 KiB)
OrderTest5.png (120.74 KiB)
OrderTest6.png (119.39 KiB)
OrderTest7.png (122.01 KiB)
OrderTest8.png (120.05 KiB)
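
One way to confirm this behaviour is to compare the arcs Bene reports against the complete DAG implied by the column order. The sketch below is only an illustration: the variable names and the arc list are hypothetical placeholders, not values read from the attached files, and it assumes the arcs have already been extracted as (parent, child) pairs.

```python
from itertools import combinations

def matches_listing_order(variables, arcs):
    """Return True if `arcs` is exactly the complete DAG implied by the
    order of `variables` (every earlier variable linked to every later
    one, with all arcs pointing the same way)."""
    forward = set(combinations(variables, 2))   # earlier -> later
    backward = {(b, a) for a, b in forward}     # later -> earlier
    return set(arcs) in (forward, backward)

# Hypothetical 4-variable example; replace with the real column order and arcs.
columns = ["G1", "G2", "G3", "G4"]
reported = [("G1", "G2"), ("G1", "G3"), ("G1", "G4"),
            ("G2", "G3"), ("G2", "G4"), ("G3", "G4")]

print(matches_listing_order(columns, reported))        # True
print(matches_listing_order(columns, reported[:-1]))   # False
```

If the check still returns True after the columns are reordered in the data file, the output is tracking the column order rather than the data.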

Re: GEO datasets

Post by lsand039 » Thu Oct 12, 2017 10:01 am

Different structures, depending on the Bene command used:
./run.sh: Score: -nan
run.sh4genes.png (123.89 KiB)

./run.sh BDe: Score: -nan
BDe4genes.png (123.89 KiB)

./run.sh BIC: Score: 30.882
BIC4genes.png (17.92 KiB)

./run.sh AIC: Score: 8.000
AIC4genes.png (17.92 KiB)
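
For reference, the textbook forms of the two penalized scores that did return numbers can be sketched as follows. This is a generic illustration, not the code inside run.sh: the function names and the example numbers are made up, and the scoring scripts may use a different sign or scaling. The `is_usable` helper shows one way to flag a `-nan` result before comparing scores.

```python
import math

def aic_score(log_likelihood, num_params):
    """AIC in 'higher is better' form: log-likelihood minus parameter count."""
    return log_likelihood - num_params

def bic_score(log_likelihood, num_params, num_samples):
    """BIC in 'higher is better' form: the penalty grows with log(sample size)."""
    return log_likelihood - 0.5 * num_params * math.log(num_samples)

def is_usable(score):
    """False for nan or infinite scores, which cannot be ranked."""
    return math.isfinite(score)

# Made-up numbers, only to show the calls.
print(aic_score(-100.0, 5))          # -105.0
print(bic_score(-100.0, 5, 200))
print(is_usable(float("nan")))       # False
```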

Re: GEO datasets

Post by lsand039 » Thu Oct 12, 2017 11:00 am

I don't have a score for the fully connected Bene structure, but here's the structure for (1), the best-scoring structure without background knowledge: 1st- and 2nd-degree genes of Alzheimer's. This structure was consistent across paths 2, 3, and 5.
(1).png (76.78 KiB)

I ordered the 1st & 2nd degree genes of Alzheimer's according to correlations. The clinical variables were forced to have arcs with Alzheimer's.
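
The post does not say which correlation was used for the ordering. As a minimal sketch, assuming Pearson correlation with the Alzheimer's column and sorting by absolute value, the ordering step could look like this. The column names and values are hypothetical toy data, not taken from the attached data file.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences;
    returns 0.0 when either sequence has no variance."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

def order_by_correlation(data, target):
    """Names of all non-target columns, strongest |correlation| first."""
    others = [name for name in data if name != target]
    return sorted(others,
                  key=lambda name: abs(pearson(data[name], data[target])),
                  reverse=True)

# Hypothetical toy data; the real columns would come from the data file.
data = {
    "Alzheimer": [0, 0, 1, 1],
    "GeneA": [1, 2, 3, 4],
    "GeneB": [1, 0, 0, 1],
    "GeneC": [5, 1, 1, 1],
}
ordering = order_by_correlation(data, "Alzheimer")
print(ordering)  # ['GeneA', 'GeneC', 'GeneB']
```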
Background knowledge files:
(1)mnh.txt: forbidden arcs (684 Bytes)
(1)mh.txt: forced arcs (17 Bytes)
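
A quick sanity check on a learned structure is to confirm that it contains every forced arc and none of the forbidden ones. The sketch below assumes the arcs have already been read into (parent, child) pairs; it does not parse the mh/mnh files, and the variable names are hypothetical.

```python
def check_background_knowledge(arcs, forced, forbidden):
    """Return (forced arcs missing from the structure,
    forbidden arcs present in the structure)."""
    arcs = set(arcs)
    missing_forced = sorted(set(forced) - arcs)
    present_forbidden = sorted(set(forbidden) & arcs)
    return missing_forced, present_forbidden

# Hypothetical example: one clinical variable forced to have an arc to Alzheimer.
learned = [("Age", "Alzheimer"), ("GeneA", "GeneB")]
forced = [("Age", "Alzheimer")]
forbidden = [("GeneB", "GeneA"), ("Alzheimer", "Age")]

missing, violations = check_background_knowledge(learned, forced, forbidden)
print(missing, violations)  # [] []
```

Two empty lists mean the structure respects both files.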


ROC: 0.731618
(1)ROC-AD.png (11.07 KiB)
(1)ROC-nonAD.png (11.06 KiB)

Attachments:
settingsO1.1.txt: settings file (5.83 KiB)
(1)MB1&2NoBK.txt: data file (85.97 KiB)
Last edited by lsand039 on Thu Oct 19, 2017 8:56 am, edited 1 time in total.

Re: GEO datasets

Post by lsand039 » Mon Oct 16, 2017 12:50 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (2), the best-scoring structure with background knowledge: 1st- and 2nd-degree genes of Alzheimer's. This structure was consistent across paths 2, 3, and 5.
(2).png (77.53 KiB)

I ordered the 1st & 2nd degree genes of Alzheimer's according to correlations. The clinical variables were forced to have arcs with Alzheimer's.
Background knowledge files:
(2)mh.txt: forced arcs (16 Bytes)
(2)mnh.txt: forbidden arcs (401 Bytes)

ROC: 0.717182
(2)ROC-nonAD.png (11.09 KiB)
(2)ROC-AD.png (11.04 KiB)

Attachments:
settingsO2.1.txt: settings file (5.84 KiB)
(2)MB1&2BKKEGG.txt: data file (77.15 KiB)
Last edited by lsand039 on Thu Oct 19, 2017 9:37 am, edited 1 time in total.

Re: GEO datasets

Post by lsand039 » Mon Oct 16, 2017 2:06 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (3), the best-scoring structure without background knowledge: 1st- and 2nd-degree genes of Alzheimer's and KEGG genes, 4th-degree MB. This structure was consistent across paths 2, 3, and 5.
(3).png (59 KiB)


Background knowledge files:
(3)mnh.txt: forbidden arcs (540 Bytes)
(3)mh.txt: forced arcs (16 Bytes)


ROC: 0.73061
(3)ROC-nonAD.png (11.11 KiB)
(3)ROC-AD.png (11.08 KiB)

Attachments:
(3)MB1_2_4NoBK.txt: data file (99.2 KiB)
settingsO3.1.txt: settings file (5.83 KiB)
Last edited by lsand039 on Thu Oct 19, 2017 9:37 am, edited 1 time in total.

Re: GEO datasets

Post by lsand039 » Tue Oct 17, 2017 8:36 am

I don't have a score for the fully connected Bene structure, but here's the structure for (5), the best-scoring structure with background knowledge: 1st- and 2nd-degree genes of Alzheimer's and KEGG genes, 3rd- and 4th-degree MB. This structure was consistent across paths 2, 3, and 5.
(5).png (88.39 KiB)

Background knowledge files:
(5)mh.txt: forced arcs (16 Bytes)
(5)mnh.txt: forbidden arcs (434 Bytes)

ROC: 0.717238
(5)ROC-AD.png (11.07 KiB)
(5)ROC-nonAD.png (11.18 KiB)

Attachments:
(5)MB1_2_3_4BKKEGG.txt: data file (94.78 KiB)
settingsO5.1.txt: settings file (5.84 KiB)

Re: GEO datasets

Post by lsand039 » Wed Oct 25, 2017 12:51 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (6), the best-scoring structure with background knowledge: 1st- and 2nd-degree genes of Alzheimer's and KEGG genes, 3rd-, 4th-, and 5th-degree MB. The Banjo runs did not agree on a single structure. Here are the scores and log likelihood percentages:
(6) scores.xlsx (5.39 KiB)


Here are the top 3 structures. Together they account for only 92% of the log likelihood; including 2 more structures would bring the total above 95%. Please let me know if I should post those as well.
Top-scoring structure: 45.89% of the total log likelihood score
(6)a.png (104.73 KiB)
ROC: 0.7192
(6)aROC-AD.png (11.2 KiB)
(6)aROC-nonAD.png (11.23 KiB)

2nd-highest-scoring structure: 36.08% of the total log likelihood score
(6)b.png (106.25 KiB)
ROC: 0.736538
(6)bROC-AD.png (11.08 KiB)
(6)bROC-nonAD.png (11.08 KiB)

3rd-highest-scoring structure: 9.99% of the total log likelihood score
(6)c.png (103.69 KiB)
ROC: 0.736538
(6)cROC-AD.png (11.3 KiB)
(6)cROC-nonAD.png (10.97 KiB)
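
The coverage arithmetic for the three shares reported in this post can be checked with a short sketch. Only the three percentages come from the post; the helper name and the thresholds passed to it are illustrative.

```python
from itertools import accumulate

shares = [45.89, 36.08, 9.99]  # % of total log likelihood, from the post

def structures_needed(shares, threshold):
    """Number of top structures whose cumulative share reaches `threshold`,
    or None if the listed shares never get there."""
    for count, total in enumerate(accumulate(shares), start=1):
        if total >= threshold:
            return count
    return None

running = [round(total, 2) for total in accumulate(shares)]
print(running)                          # [45.89, 81.97, 91.96]
print(structures_needed(shares, 95.0))  # None
```

The three listed structures sum to 91.96%, so the 95% mark is only reached once the remaining structures are included.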

Re: GEO datasets

Post by lsand039 » Sat Oct 28, 2017 4:23 pm

Here's the structure for (4), the best-scoring structure without background knowledge: 1st- and 2nd-degree genes of Alzheimer's and KEGG genes, 4th- and 5th-degree MB.
The highest-scoring structure is below:
(4).png (137.95 KiB)

The scores for the other structures:
(6)scores.xlsx (5.9 KiB)

Background knowledge files:
(4)mh.txt: forced arcs (16 Bytes)
(4)mnh.txt: forbidden arcs (1008 Bytes)

ROC: 0.741332
(4)ROC-AD.png (11.13 KiB)
(4)ROC-nonAD.png (11.16 KiB)

Re: GEO datasets

Post by lsand039 » Sat Oct 28, 2017 5:16 pm

Here are the arcs that differ among the top-scoring graphs for (6):
CAPN2->MCL1 (not in A)
UQCRC1->MCL1 (not in B,C)

PPP3CB->NFKB2 (not in B)
UQCRC2->NFKB2 (not in A,C)

MCL1->CAPN2 (not in B,C)
Alzheimer->CAPN2 (not in A)

YWHAZ->APBB1 (not in B,C)
SNCA->APBB1 (not in A)

PPP3CB->MYO1F (not in A,B)
ATP2A2->MYO1F (not in C)

PPP3CB->NFKB2; (not in B)
UQCRC2->NFKB2; (not in B)
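
A comparison like this can be generated from the arc sets directly. In the sketch below the sets are rebuilt from the first five pairs in the list; the repeated NFKB2 pair at the end is left out because its annotation differs, and arcs shared by all three graphs are omitted because the post does not list them. The function name is illustrative.

```python
def missing_from(graphs):
    """Map each arc that is absent from at least one graph to the sorted
    list of graph names that lack it."""
    all_arcs = set().union(*graphs.values())
    report = {}
    for arc in sorted(all_arcs):
        absent = sorted(name for name, arcs in graphs.items() if arc not in arcs)
        if absent:
            report[arc] = absent
    return report

# Differing arcs only, as (parent, child) pairs per graph.
graphs = {
    "A": {("UQCRC1", "MCL1"), ("PPP3CB", "NFKB2"), ("MCL1", "CAPN2"),
          ("YWHAZ", "APBB1"), ("ATP2A2", "MYO1F")},
    "B": {("CAPN2", "MCL1"), ("UQCRC2", "NFKB2"), ("Alzheimer", "CAPN2"),
          ("SNCA", "APBB1"), ("ATP2A2", "MYO1F")},
    "C": {("CAPN2", "MCL1"), ("PPP3CB", "NFKB2"), ("Alzheimer", "CAPN2"),
          ("SNCA", "APBB1"), ("PPP3CB", "MYO1F")},
}

report = missing_from(graphs)
for (parent, child), absent in report.items():
    print(f"{parent}->{child} (not in {','.join(absent)})")
```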