I found that the results from the BaNJO runs were missing APP in the 7097 genes. Only GSE15222 was missing APP, but I found this odd since this dataset has previously been included in other analyses. As I was going through files, I found that I could not find the original GPL file for GSE15222 that contained gene names and probe IDs.
I was able to find a list that had the gene names and probe IDs for GSE15222 on this other site:
http://www.chibi.ubc.ca/Gemma/expressio ... ml?id=5643This is the link where I found the appropriate GPL information:
http://www.chibi.ubc.ca/Gemma/arrays/sh ... tml?id=293This site seemed to have the same information available on GEO (plus the gene name/ probe ID info I couldn't find on GEO), so I assumed it was a legitimate source to get gene names. I downloaded the GPL information as a text file and used this to match the gene names to probe IDs using Efrain's code.
It turns out that the GPL file I downloaded did not have APP listed a gene that had a corresponding probe ID. Today I looked at all the different types of files available for GSE15222's GPL file, and its Annotation Soft Table contained APP and other genes with corresponding probe names that don't seem to be included in the outside link I previously found.
I found the common genes for all 15 datasets again now that they all had APP, and there are currently 8092 genes.
The new GSE15222 aftexcel & dscrt files that include APP:
The 15 datasets with 8092 genes:
I'm currently rerunning BaNJO with this data.
Combined datasets in Excel form & txt form:
Settings Files: