Dr. Yoo,
Please find the merged dataset below. Its dimensions are 463 x 2140. I'll run the correlations on Monday and then run Banjo.
meninonas wrote:
Dr. Yoo,
I have coded the datasets according to Mutation, Amplification, and Expression. I was not able to merge them into one large dataset because LibreOffice Calc cannot open the original dataset in its entirety. LibreOffice Calc gives me the following error:
The data could not be loaded completely because the maximum number of columns per sheet was exceeded.
I would suggest opening the datasets in Microsoft Office if you have it available to you.
I am currently working on labeling the genes according to _MUT, _AMP, and _EXP. Those should be ready tomorrow.
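One way to apply the _MUT, _AMP, and _EXP labels and merge the tables outside a spreadsheet, which avoids the per-sheet column limit, is sketched below. This is only an illustration, not the method used in this thread: it assumes each dataset has already been read into a list of dict rows that share an identifier column. The column name `sample_id`, the function names, and the gene name in the usage note are hypothetical.

```python
def label_columns(rows, suffix, id_column="sample_id"):
    """Append a suffix such as "_MUT" to every column name except the ID column."""
    return [
        {(key if key == id_column else key + suffix): value
         for key, value in row.items()}
        for row in rows
    ]


def merge_on_id(tables, id_column="sample_id"):
    """Combine several labeled tables into one row per sample ID."""
    merged = {}
    for rows in tables:
        for row in rows:
            sample = row[id_column]
            # Start a new combined row the first time a sample ID is seen.
            merged.setdefault(sample, {id_column: sample}).update(row)
    return list(merged.values())
```

For example, `merge_on_id([label_columns(mut_rows, "_MUT"), label_columns(exp_rows, "_EXP")])` keeps a gene that appears in both tables as two distinct columns, such as `TP53_MUT` and `TP53_EXP`.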
meninonas wrote:
Professor,
I have coded the variables as follows:
Diagnosis Age: >50=1, <50=0
ER Status: Negative=0, Positive=1, Otherwise=2
HER2 Final Status: Negative=0, Positive=1, Otherwise=2
Overall Survival Status: DECEASED=0, LIVING=1
Overall Survival Months: <Average=0, >Average=1
Genes: Down=0, (NULL)=1, UP=2
I have added the coded data. I am currently finding the correlated variables.
Once you let me know which clinical variables to keep or delete, I'll start running Banjo.
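The coding scheme in this post could be expressed in Python roughly as follows. This is a minimal sketch under stated assumptions, not the procedure actually used: the function names are hypothetical, an age of exactly 50 and a survival time exactly equal to the average are coded 0 because the post does not say how ties are handled, and missing gene values are assumed to arrive as `None`, `"NULL"`, or `"(NULL)"`.

```python
from statistics import mean


def code_age(age):
    # >50 -> 1, <50 -> 0. Assumption: exactly 50 is coded 0.
    return 1 if age > 50 else 0


def code_receptor_status(value):
    # ER Status and HER2 Final Status: Negative=0, Positive=1, anything else=2.
    return {"Negative": 0, "Positive": 1}.get(value, 2)


def code_survival_status(value):
    # DECEASED=0, LIVING=1. Any other value raises KeyError.
    return {"DECEASED": 0, "LIVING": 1}[value]


def code_survival_months(months):
    # Below average -> 0, above average -> 1.
    # Assumption: a value exactly equal to the average is coded 0.
    average = mean(months)
    return [1 if m > average else 0 for m in months]


def code_gene(value):
    # Down=0, (NULL)=1, UP=2. Matching is case-insensitive;
    # None and unrecognized values are treated as (NULL).
    if value is None:
        return 1
    return {"DOWN": 0, "UP": 2}.get(str(value).upper(), 1)
```

Coding the survival months needs the whole column at once, since the threshold is the average of that column; the other functions work one value at a time.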
meninonas wrote:
Dr. Yoo,
Please find the reduced dataset below. I'm already working on the Banjo analysis.
meninonas wrote:
Dr. Yoo,
Please find the datasets with the correlations below. I did the following:
I took the absolute value of the correlations and sorted each dataset by those values.
Afterwards, I chose the 220 variables with the highest correlations from each individual dataset and placed them in one large dataset.
Finally, I deleted all of the repeated variables and ended up with 907 variables.
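The selection steps above can be sketched as follows. This is an illustration under assumptions, not the code used in this thread: the post does not say what the correlations were computed against, so the sketch assumes a single target variable and Pearson correlation. The function names and the dict-of-columns layout are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column has no defined correlation; treat as 0
    return cov / sqrt(var_x * var_y)


def top_k_by_abs_corr(dataset, target, k):
    """Names of the k variables with the largest absolute correlation to target."""
    ranked = sorted(
        dataset,
        key=lambda name: abs(pearson(dataset[name], target)),
        reverse=True,
    )
    return ranked[:k]


def merge_top_variables(datasets, target, k=220):
    """Take the top k variables from each dataset and drop repeated names."""
    merged = {}
    for dataset in datasets:
        for name in top_k_by_abs_corr(dataset, target, k):
            merged.setdefault(name, dataset[name])  # keep the first occurrence
    return merged
```

Each `dataset` here maps a variable name to its list of values. Because repeated names are dropped, the merged result can contain fewer variables than k times the number of datasets.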
I have already run Banjo for one hour and sent you the results. I have now started a two-hour run and will send you those results tomorrow morning.
meninonas wrote:
Dr. Yoo,
Yes. The correlations are at the end of the file. I did it the same way you showed me.
meninonas wrote:
Professor,
Please find the correlated data below. I have also sent you the 2-hour Banjo results. I am currently running the 4-hour Banjo analysis.