GEO datasets

Re: GEO datasets

Postby lsand039 » Tue May 30, 2017 12:14 pm

I ran the MB genes through Gene Ontology using the biological process, molecular function, and cell component annotations. I'm not quite sure how to interpret these results.
Attachments
GeneOntology Results.xlsx
(99.2 KiB) Downloaded 158 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Jun 01, 2017 12:54 pm

Using Efrain's code, I've been able to find out how many genes are available in the original GSE files. The file below shows the number of genes and probe IDs in the original GSE file, the number of unmatched probe IDs, the number of unique genes, and the proportion of genes included in our analysis.
Gene Counts.png
Gene Counts.png (40.2 KiB) Viewed 39491 times

Gene counts.xlsx
(5.72 KiB) Downloaded 153 times
Last edited by lsand039 on Tue Jun 06, 2017 12:53 pm, edited 2 times in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Jun 01, 2017 1:31 pm

These are files in my dataset that used Efrain's code to match the probe IDs and gene names. They match all the genes available in the GPL file to probe IDs in the GSE file much more efficiently than using Access or Base which would usually crash or run out of room.

So far the files I've posted using Efrain's code only contain the raw values of gene expressions. The second tab contains the consolidated/ averaged values of repeated genes where I had to use LibreOffice Calc. I'll be checking and comparing these values to the values I got using Access.

Note: The clinical/ demographic data for GSE15222 and GSE48350 didn't align well in the original series matrix file, so I just grabbed this data from the files cleaned up via Access. I checked the GSM IDs to make sure they matched correctly.
Attachments
GSE48350aftexcel.xlsx
(164.82 MiB) Downloaded 153 times
GSE44771aftexcel.xlsx
(134.76 MiB) Downloaded 150 times
GSE44770aftexcel.xlsx
(135.1 MiB) Downloaded 159 times
GSE84422.570aftexcel.xlsx
(61.82 MiB) Downloaded 155 times
GSE28146aftexcel.xlsx
(14.56 MiB) Downloaded 152 times
GSE29378aftexcel.xlsx
(40.34 MiB) Downloaded 162 times
GSE15222aftexcel.xlsx
(80.65 MiB) Downloaded 169 times
GSE26927aftexcel.xlsx
(31.2 MiB) Downloaded 146 times
GSE16759.570aftexcel.xlsx
(5.65 MiB) Downloaded 164 times
GSE5281aftexcel.xlsx
(107.93 MiB) Downloaded 151 times
Last edited by lsand039 on Tue Jun 06, 2017 1:39 pm, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Tue Jun 06, 2017 1:12 pm

continuation of the last post
Attachments
GSE84422.96aftexcel.xlsx
(250.7 MiB) Downloaded 159 times
GSE1297aftexcel.xlsx
(6.29 MiB) Downloaded 150 times
GSE44768aftexcel.xlsx
(134.82 MiB) Downloaded 155 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Jun 08, 2017 3:26 pm

Here are the files that only the contain the genes common to all the datasets I've used previously. Using Efrain's matching code, I found 8316 common genes, 59 genes more than the 8257 common genes I found using Access. I think Access might not recognized common genes that had a space after them in some GSE files.
Attachments
GSE84422570matched.txt
(9.66 MiB) Downloaded 164 times
GSE8442296matched.txt
(85.94 MiB) Downloaded 150 times
GSE5281matched.txt
(15.66 MiB) Downloaded 156 times
GSE48350matched.txt
(25.52 MiB) Downloaded 160 times
GSE29378matched.txt
(5.91 MiB) Downloaded 146 times
GSE28146matched.txt
(2.32 MiB) Downloaded 152 times
GSE26927matched.txt
(1.46 MiB) Downloaded 152 times
GSE16759matched.txt
(834.91 KiB) Downloaded 158 times
GSE15222matched.txt
(21.6 MiB) Downloaded 156 times
GSE1297matched.txt
(2.16 MiB) Downloaded 145 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Jun 08, 2017 3:29 pm

GSE44772 is made up of the same samples in GSE44768, GSE44770, and GSE44771. I'm changing to this file since those three GSE files all come from the same study.

Also attached is the list of 59 common genes that Efrain's code picked up that Access didn't .
Attachments
Missing Genes.txt
(375 Bytes) Downloaded 148 times
GSE44772matched.txt
(73.35 MiB) Downloaded 148 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Fri Jun 09, 2017 1:57 pm

The following contain the Z-score calculations of the all the genes available in the original GSE file.
GSE84422 only contain the definite AD and normal samples.
Attachments
GSE44772.xlsx.tar.gz
(704.2 MiB) Downloaded 161 times
GSE48350.xlsx.tar.gz
(221.32 MiB) Downloaded 155 times
GSE39420.xlsx
(21.47 MiB) Downloaded 152 times
GSE36980.xlsx
(80.36 MiB) Downloaded 140 times
GSE29378.xlsx
(81.98 MiB) Downloaded 129 times
GSE26927.xlsx
(12.34 MiB) Downloaded 131 times
GSE21297.xlsx
(14.8 MiB) Downloaded 128 times
GSE16759.xlsx
(11.07 MiB) Downloaded 124 times
GSE15222.xlsx
(233.84 MiB) Downloaded 126 times
GSE5281.xlsx
(174.79 MiB) Downloaded 114 times
Last edited by lsand039 on Fri Jun 09, 2017 7:11 pm, edited 1 time in total.
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Fri Jun 09, 2017 7:08 pm

A continuation of the last post.
Attachments
GSE37263.xlsx
(10.16 MiB) Downloaded 111 times
GSE23290.xlsx
(6.01 MiB) Downloaded 96 times
GSE28146.xlsx
(32.12 MiB) Downloaded 92 times
GSE84422AD.tar.gz
(436.18 MiB) Downloaded 82 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Mon Jun 12, 2017 4:38 pm

GSE84422 GPL97 has repeats of GSE84422 GPL96 for the same subjects and the same brain regions, so I've excluded GSE84422 GPL97. There are 7097 genes common in the remaining datasets below
Attachments
GSE28146matched.txt
(1.98 MiB) Downloaded 79 times
GSE29378matched.txt
(5.05 MiB) Downloaded 71 times
GSE36980matched.txt
(5.37 MiB) Downloaded 72 times
GSE37263matched.txt
(1.13 MiB) Downloaded 75 times
GSE39420matched.txt
(1.19 MiB) Downloaded 76 times
GSE44772matched.txt
(62.61 MiB) Downloaded 73 times
GSE48350matched.txt
(21.79 MiB) Downloaded 78 times
GSE5281matched.txt
(13.36 MiB) Downloaded 65 times
GSE84422570matched.txt
(8.25 MiB) Downloaded 73 times
GSE8442296matched.txt
(73.29 MiB) Downloaded 55 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Mon Jun 12, 2017 4:38 pm

Continued from last post
Attachments
GSE1297matched.txt
(1.84 MiB) Downloaded 65 times
GSE15222matched.txt
(18.42 MiB) Downloaded 63 times
GSE16759matched.txt
(712.5 KiB) Downloaded 71 times
GSE23290matched.txt
(487.97 KiB) Downloaded 66 times
GSE26927matched.txt
(1.25 MiB) Downloaded 70 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

PreviousNext

Return to Alzheimer

Who is online

Users browsing this forum: No registered users and 1 guest

cron