GEO datasets

GEO datasets

Postby cwyoo » Tue Jan 26, 2016 9:51 pm

Please goto http://www.ncbi.nlm.nih.gov/geo/ and download gene expression data for Alzheimer Disease in human. Post the original data, discritized data, and analysis results of banjo and ARACNE.
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: GEO datasets

Postby shstyoo » Thu Jan 28, 2016 4:35 am

I've gone ahead and downloaded a couple of the .soft files and summarized studies.

The attached output.txt file contains the descriptions for each of the GEO datasets I've found as well as the GDS# that will identify the file on GEO.
The file names that have the word "FULL" in them appear to not have a direct link to the study on GEO (but it can be found by searching the GDS# that precedes it).
The files with the word "FAMILY" have a related link (if given) that links to the study on GEO. Otherwise, they can be found by typing in their GSE# into GEO.

As I find more studies I'll update the descriptions.

edit: You may need to use notepad++ or some other text editor (anything that isn't Window's default notepad) to view the file properly, since notepad doesn't seem to recognize the newline in the output.
Attachments
output.txt
Contains dataset file name and important information found in the dataset.
(11.1 KiB) Downloaded 153 times
shstyoo
 
Posts: 12
Joined: Fri Jun 27, 2014 9:05 pm

Re: GEO datasets

Postby cwyoo » Thu Jan 28, 2016 7:51 am

shstyoo wrote:I've gone ahead and downloaded a couple of the .soft files and summarized studies.

The attached output.txt file contains the descriptions for each of the GEO datasets I've found as well as the GDS# that will identify the file on GEO.
The file names that have the word "FULL" in them appear to not have a direct link to the study on GEO (but it can be found by searching the GDS# that precedes it).
The files with the word "FAMILY" have a related link (if given) that links to the study on GEO. Otherwise, they can be found by typing in their GSE# into GEO.

As I find more studies I'll update the descriptions.

edit: You may need to use notepad++ or some other text editor (anything that isn't Window's default notepad) to view the file properly, since notepad doesn't seem to recognize the newline in the output.


Great work. Could you share your search terms and selection criteria that lead to the results? So others can try different search terms and/or new criteria to discover new data.
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: GEO datasets

Postby shstyoo » Thu Jan 28, 2016 2:46 pm

The script I wrote picked up any study that had the word "Alzheimer", "AD", "Alzheimer's Disease", "neurofibrillary tangle", "amyloid beta protein" and had the species "homo sapien" in either its description or title.

It should ignore anything that isn't human, and doesn't have those keywords. It also will not download datasets that are over 100mb in size.

For some reason, it was only able to download from the first two pages of the GEO datasets so I still need to work on the script. I'll post it later when I've finished it.
shstyoo
 
Posts: 12
Joined: Fri Jun 27, 2014 9:05 pm

Re: GEO datasets

Postby lsand039 » Wed Feb 03, 2016 2:55 pm

Attached are the results I found from the first four pages of searching "Alzheimer" on the GEO datasets. I followed the same format as Steve's output.txt file. I didn't include any results that didn't look relevant to human Alzheimer patients, but let me know if I should be more selective in choosing files.
Attachments
searchresults.txt
(50.45 KiB) Downloaded 130 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby cwyoo » Wed Feb 03, 2016 3:40 pm

lsand039 wrote:Attached are the results I found from the first four pages of searching "Alzheimer" on the GEO datasets. I followed the same format as Steve's output.txt file. I didn't include any results that didn't look relevant to human Alzheimer patients, but let me know if I should be more selective in choosing files.


Great. Let's create a table that identifies for each study, how many females and/or males, their age ranges, and country where subjects were recruited (e.g., U.S.A., Japan, etc.).
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: GEO datasets

Postby lsand039 » Wed Feb 10, 2016 2:33 pm

Attached is a table with the information for my results. I was only able to find enough information on about 15 of the studies. Some of the subjects in the studies were taken from brain banks, so the country listed would be the location of the brain banks.
Attachments
resultstable.txt
(2.51 KiB) Downloaded 136 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Feb 11, 2016 1:57 pm

Here's an updated table containing all the sources so far. I did my best to get information on subject demographics, but I couldn't find all the specific information from the published articles.
Attachments
Table .odt
(47.1 KiB) Downloaded 143 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby lsand039 » Thu Feb 18, 2016 1:53 pm

Here's a table for GDS810 containing the both the demographics of the subjects and their microarray data. More tables are coming soon!
Attachments
GDS810.ods
(4.47 MiB) Downloaded 138 times
lsand039
 
Posts: 237
Joined: Thu Jan 14, 2016 12:17 pm

Re: GEO datasets

Postby cwyoo » Fri Feb 19, 2016 12:44 pm

lsand039 wrote:Here's a table for GDS810 containing the both the demographics of the subjects and their microarray data. More tables are coming soon!


Great! Let's do research on the following questions (please feel free to add any other questions):

- For a study, what are different datasets that are uploaded in GEO?
- What is the difference between *.family.soft and *.soft files?
- How are the gene expression value calculated?
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Next

Return to Alzheimer

Who is online

Users browsing this forum: No registered users and 2 guests