GEO datasets

Re: GEO datasets

Postby lsand039 » Thu Oct 05, 2017 9:34 am

I'm trying to use Jinang's code to score my structures, but I keep running into this error:
Screenshot from 2017-10-05 09-30-37.png
Screenshot from 2017-10-05 09-30-37.png (64.03 KiB) Viewed 104715 times

I've specified my structure as "Benestructure.graph.txt" and my data as "2254Sx4G.csv". I'm currently at a loss on how to fix it. I've tried googling for solutions, but I'm unfortunately not familiar enough with python codes.

Below are all the files downloaded from his code and my files.
bayesian_causal_model.git.tar.gz
(652.42 KiB) Downloaded 178 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Oct 05, 2017 10:50 am

Bene seems to always output a fully connected graph that follows the same order as the variables are listed in the datafile.
Datafiles.xlsx
Sheet number of data corresponds to test number data was used for.
(445.27 KiB) Downloaded 159 times
OrderTest1.png
OrderTest1.png (123.89 KiB) Viewed 104714 times
OrderTest2.png
OrderTest2.png (125.93 KiB) Viewed 104714 times
OrderTest3.png
OrderTest3.png (124.98 KiB) Viewed 104714 times
OrderTest4.png
OrderTest4.png (118.33 KiB) Viewed 104714 times
OrderTest5.png
OrderTest5.png (120.74 KiB) Viewed 104714 times
OrderTest6.png
OrderTest6.png (119.39 KiB) Viewed 104714 times
OrderTest7.png
OrderTest7.png (122.01 KiB) Viewed 104714 times
OrderTest8.png
OrderTest8.png (120.05 KiB) Viewed 104714 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Oct 12, 2017 10:01 am

Different structures based off Bene-command:
./run.sh: Score: -nan
run.sh4genes.png
run.sh4genes.png (123.89 KiB) Viewed 104711 times

./run.sh BDe: Score: -nan
BDe4genes.png
BDe4genes.png (123.89 KiB) Viewed 104711 times

./run.sh BIC: Score : 30.882
BIC4genes.png
BIC4genes.png (17.92 KiB) Viewed 104711 times

./run.sh AIC: Score : 8.000
AIC4genes.png
AIC4genes.png (17.92 KiB) Viewed 104711 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Oct 12, 2017 11:00 am

I don't have a score for the fully connected Bene structure, but here's the structure for (1): best scoring structure without background knowledge 1st and 2nd degree genes of Alzheimer. This structure was consistent in paths 2, 3, and 5.
(1).png
(1).png (76.78 KiB) Viewed 104711 times

I ordered the 1st & 2nd degree genes of Alzheimer's according to correlations. The clinical variables were forced to have arcs with Alzheimer's.
Background knowledge files:
(1)mnh.txt
forbidden arcs
(684 Bytes) Downloaded 172 times

(1)mh.txt
Forced arcs
(17 Bytes) Downloaded 169 times


ROC: 0.731618
(1)ROC-AD.png
(1)ROC-AD.png (11.07 KiB) Viewed 104709 times

(1)ROC-nonAD.png
(1)ROC-nonAD.png (11.06 KiB) Viewed 104709 times
Attachments
settingsO1.1.txt
Settings File
(5.83 KiB) Downloaded 219 times
(1)MB1&2NoBK.txt
datafile
(85.97 KiB) Downloaded 168 times
Last edited by lsand039 on Thu Oct 19, 2017 8:56 am, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Mon Oct 16, 2017 12:50 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (2): best scoring structure with background knowledge 1st and 2nd degree genes of Alzheimer. This structure was consistent in paths 2, 3, and 5.
(2).png
(2).png (77.53 KiB) Viewed 104710 times

I ordered the 1st & 2nd degree genes of Alzheimer's according to correlations. The clinical variables were forced to have arcs with Alzheimer's.
Background knowledge files:
(2)mh.txt
Forced Arcs
(16 Bytes) Downloaded 175 times
(2)mnh.txt
Forced Arcs
(401 Bytes) Downloaded 169 times

ROC: 0.717182
(2)ROC-nonAD.png
(2)ROC-nonAD.png (11.09 KiB) Viewed 104709 times
(2)ROC-AD.png
(2)ROC-AD.png (11.04 KiB) Viewed 104709 times
Attachments
settingsO2.1.txt
Settings files
(5.84 KiB) Downloaded 161 times
(2)MB1&2BKKEGG.txt
Data File
(77.15 KiB) Downloaded 175 times
Last edited by lsand039 on Thu Oct 19, 2017 9:37 am, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Mon Oct 16, 2017 2:06 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (3): best scoring structure without background knowledge 1st and 2nd degree genes of Alzheimer and KEGG genes 4th degree MB with no background knowledge. This structure was consistent in paths 2, 3, and 5.
(3).png
(3).png (59 KiB) Viewed 104710 times


Background knowledge files:
(3)mnh.txt
Forbidden Arcs
(540 Bytes) Downloaded 159 times
(3)mh.txt
Forced arcs
(16 Bytes) Downloaded 178 times


ROC: 0.73061
(3)ROC-nonAD.png
(3)ROC-nonAD.png (11.11 KiB) Viewed 104709 times
(3)ROC-AD.png
(3)ROC-AD.png (11.08 KiB) Viewed 104709 times
Attachments
(3)MB1_2_4NoBK.txt
Datafile
(99.2 KiB) Downloaded 174 times
settingsO3.1.txt
Settings File
(5.83 KiB) Downloaded 169 times
Last edited by lsand039 on Thu Oct 19, 2017 9:37 am, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Tue Oct 17, 2017 8:36 am

I don't have a score for the fully connected Bene structure, but here's the structure for (5): best scoring structure with background knowledge 1st and 2nd degree genes of Alzheimer and KEGG genes 3rd & 4th degree MB with background knowledge. This structure was consistent in paths 2, 3, and 5.
(5).png
(5).png (88.39 KiB) Viewed 104708 times

Background knowledge files:
(5)mh.txt
forced arcs
(16 Bytes) Downloaded 152 times
(5)mnh.txt
forbidden arcs
(434 Bytes) Downloaded 161 times

ROC: 0.717238
(5)ROC-AD.png
(5)ROC-AD.png (11.07 KiB) Viewed 104708 times
(5)ROC-nonAD.png
(5)ROC-nonAD.png (11.18 KiB) Viewed 104708 times
Attachments
(5)MB1_2_3_4BKKEGG.txt
data file
(94.78 KiB) Downloaded 167 times
settingsO5.1.txt
Settings file
(5.84 KiB) Downloaded 167 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Wed Oct 25, 2017 12:51 pm

I don't have a score for the fully connected Bene structure, but here's the structure for (6): best scoring structure with background knowledge 1st and 2nd degree genes of Alzheimer KEGG genes 3rd, 4th , &5th degree MB with background knowledge. The BaNJo runs did not agree on a single structure. Here are the scores & log likelihood %:
(6) scores.xlsx
(5.39 KiB) Downloaded 167 times


Here are the top 3 structures. They only accounted for 92% of the log likelihood. There were 2 other structures that would've summed >95%. Please let me know if I should have them posted.
Top scoring structure: 45.89% of total log likelihood score
(6)a.png
(6)a.png (104.73 KiB) Viewed 104707 times

ROC: 0.7192
(6)aROC-AD.png
(6)aROC-AD.png (11.2 KiB) Viewed 104706 times

(6)aROC-nonAD.png
(6)aROC-nonAD.png (11.23 KiB) Viewed 104706 times

2nd top scoring structure: 36.08% of total log likelihood score
(6)b.png
(6)b.png (106.25 KiB) Viewed 104707 times

ROC: 0.736538
(6)bROC-AD.png
(6)bROC-AD.png (11.08 KiB) Viewed 104706 times

(6)bROC-nonAD.png
(6)bROC-nonAD.png (11.08 KiB) Viewed 104706 times

3rd top structure: 9.99% of total log likelihood score
(6)c.png
(6)c.png (103.69 KiB) Viewed 104707 times

ROC: 0.736538
(6)cROC-AD.png
(6)cROC-AD.png (11.3 KiB) Viewed 104706 times

(6)cROC-nonAD.png
(6)cROC-nonAD.png (10.97 KiB) Viewed 104706 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Sat Oct 28, 2017 4:23 pm

Here's the structure for (4): best scoring structure without background knowledge 1st and 2nd degree genes of Alzheimer and KEGG genes 4th&5th degree MB with no background knowledge.
This is the highest scoring structure is below:
(4).png
(4).png (137.95 KiB) Viewed 104704 times

The scores for the other structure:
(6)scores.xlsx
(5.9 KiB) Downloaded 160 times

Background knowledge files:
(4)mh.txt
forced arcs
(16 Bytes) Downloaded 166 times

(4)mnh.txt
forbidden arcs
(1008 Bytes) Downloaded 165 times

ROC: 0.741332
(4)ROC-AD.png
(4)ROC-AD.png (11.13 KiB) Viewed 104704 times

(4)ROC-nonAD.png
(4)ROC-nonAD.png (11.16 KiB) Viewed 104704 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Sat Oct 28, 2017 5:16 pm

Here are the arcs that were different in a couple (6)'s top scoring graphs:
CAPN2->MCL1 (not in A)
UQCRC1->MCL1 (not in B,C)

PPP3CB->NFKB2 (not in B)
UQCRC2->NFKB2 (not in A,C)

MCL1->CAPN2 (not in B,C)
Alzheimer->CAPN2 (not in A)

YWHAZ->APBB1 (not in B,C)
SNCA->APBB1 (not in A)

PPP3CB->MYO1F (not in A,B)
ATP2A2->MYO1F (not in C)

PPP3CB->NFKB2; (not in B)
UQCRC2->NFKB2; (not in B)
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

PreviousNext

Return to Alzheimer

Who is online

Users browsing this forum: No registered users and 436 guests

cron