by cpere117 » Mon Apr 22, 2019 7:04 pm
After discretizing my data and matching genes across 9 datasets (GSE 1102981, GSE1297, GSE28146, GSE29378, GSE44772, GSE45596, GSE5281, GSE84422-GPL570, GSE84422-GPL96), the final tally was 6432 common genes. Note that 3 datasets (GSE36980, GSE37263, GSE39420) were excluded from further analysis for quality control reasons. A total of 1633 samples (950 AD, 683 control) were merged and discretized according to Z-score values. Gender, age, and condition will all be coded as binary variables for the BANJO runs. Furthermore, of the 6432 genes found across the datasets, 846 are validated targets of ID3 according to the integrative ChIP/RNA data gathered in our lab (Mayur conducted the experiment). My plan is to finalize the cleaning process early tomorrow and then run BANJO in three trials of 1 hour, 3 hours, and 9 hours to obtain a Bayesian network analysis.
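For anyone following along, here is a rough sketch of the two preprocessing steps mentioned above (matching genes across datasets, then discretizing by Z-score). It is not the exact script I used. The function names, the three-state coding (0 = low, 1 = baseline, 2 = high), and the cutoff of 1.0 standard deviation are all assumptions for illustration, and each dataset is assumed to be a plain dict mapping gene symbol to a list of expression values.

```python
from statistics import mean, pstdev


def common_genes(datasets):
    """Return the sorted gene symbols present in every dataset.

    `datasets` is a list of dicts (gene symbol -> list of expression values).
    """
    shared = set.intersection(*(set(d) for d in datasets))
    return sorted(shared)


def discretize_z(values, threshold=1.0):
    """Map expression values to states 0/1/2 by Z-score.

    The threshold is an assumed cutoff, not a value taken from the post:
    z < -threshold -> 0 (low), z > threshold -> 2 (high), otherwise 1.
    """
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A gene with no variation carries no signal; call everything baseline.
        return [1] * len(values)
    states = []
    for v in values:
        z = (v - mu) / sigma
        if z > threshold:
            states.append(2)
        elif z < -threshold:
            states.append(0)
        else:
            states.append(1)
    return states


# Toy example with two made-up datasets.
ds_a = {"GENE1": [1.0, 2.0, 3.0], "GENE2": [5.0, 5.0, 5.0]}
ds_b = {"GENE2": [4.0, 6.0, 8.0], "GENE3": [0.1, 0.2, 0.3]}
print(common_genes([ds_a, ds_b]))
print(discretize_z([1, 2, 3, 4, 10]))
```

Whether to compute the Z-scores per dataset before merging or on the merged matrix is a separate choice that this sketch does not settle; the function simply works on whatever list of values it is given.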
- Attachments
- Book4.xlsx
- Discretized and merged file of all samples
- (33.2 MiB)