shstyoo wrote:Just a quick update, my computer is having trouble opening the actual GSE6360_family.soft file (found on the GEO website). It looks like the file size is too large for Open Office to handle. Are you able to open it through Microsoft Excel?
Steve, could you create a script that reads in two text files (which are in comma separated (csv) or tap separtate (txt) format) and creates one text file with same format? You may use the two files that Lauren posted here (Probe IDs and Gene Names.csv is already in comma separated format; and GSE63060.xlsx should be converted into a comma separated (csv) or a tap separtate (txt) format).
So, your script should read in Probe IDs and Gene Names.csv and GSE63060.xls (converted into a comma separated (csv) or a tap separtate (txt) format; let's call it GSE63060.csv) and produce a result text file that adds a column called GeneID (that corresponds to the Probe ID from Probe IDs and Gene Names.cs) into GSE63060.csv.
Since Lauren has more files like GSE63060.xlsx, she is planning to use your script and produce the text files that are needed to do further analyses. Please let us know if you have any other questions/comments.