Page 1 of 1

Data Outputs

PostPosted: Thu Dec 14, 2017 5:26 pm
by efrain.gonzalez0
This will contain all data

Coding for Samples

PostPosted: Thu Dec 14, 2017 5:30 pm
by efrain.gonzalez0
Good afternoon,

I have attached the coding for the samples here. Please use it as a reference when looking at any of the other data files under this topic.

Respectfully,

Efrain Gonzalez

HTSEQ Text Files

PostPosted: Thu Dec 14, 2017 6:13 pm
by efrain.gonzalez0
Good evening,

I have attached the outputs of HTSEQ below.

Respectfully,

Efrain Gonzalez

Re: Data Outputs

PostPosted: Wed Jan 03, 2018 10:15 pm
by hoong
Hi Efrain,

Could you please provide us with some updates on the project? Bryan has uploaded the output generated from Partek, so please let us know if you need more information.

Thanks,
Hooi

Gene Check

PostPosted: Mon Jan 08, 2018 1:49 pm
by efrain.gonzalez0
Good afternoon,

Here I have attached the results of checking the HTSEQ outputs for the genes that Brian provided. In summary all of the genes provided in the abbreviated list were present for each one of the HTSEQ outputs.

Respectfully,

Efrain Gonzalez

Re: Data Outputs

PostPosted: Tue Jan 09, 2018 7:03 pm
by hoong
Thank you. We would like to have a more detailed analysis of our data as discussed in our meeting last year.

Differentially Expressed genes

PostPosted: Tue Jan 16, 2018 1:06 pm
by efrain.gonzalez0
Good afternoon,

I will be beginning my analysis of the HTSEQ-Counts results this week. I will be using a software called DeSeq2 in order to answer the question "do different sample groups express genes differentially?" As I have not used this software before it make take me some time to run all the files.

Respectfully,

Efrain Gonzalez

Results from DESeq2 Analysis

PostPosted: Thu Jan 25, 2018 3:00 pm
by efrain.gonzalez0
Good afternoon,

The results of the DESeq2 analysis on the data provided. I have also included the R commands that were used. Technical replicates were combined meaning that we were left only with the three different treatments. The fact that there were no biological replicates for each one of the treatments led to a problem with the analysis. The problem is best summarized by the creators of DESeq2 in their documentation:
Experiments without biological replicates do not allow for estimation of the dispersion of counts around the expected value for each group, which is critical for differential expression analysis. If an experimental design is supplied which does not contain the necessary degrees of freedom for differential analysis, DESeq will provide a warning to the user and follow the strategy outlined in Anders and Huber (2010) under the section 'Working without replicates', wherein all the samples are considered as replicates of a single group for the estimation of dispersion. As noted in the reference above: "Some overestimation of the variance may be expected, which will make that approach conservative." Furthermore, "while one may not want to draw strong conclusions from such an analysis, it may still be useful for exploration and hypothesis generation." We provide this approach for data exploration only, but for accurately identifying differential expression, biological replicates are required.


Respectfully,

Efrain Gonzalez

DESeq2 Analysis for Biological Replicates

PostPosted: Thu Feb 01, 2018 5:44 pm
by efrain.gonzalez0
Good afternoon,

I have attached the results of the DESeq2 analysis on the data provided. In this case the replicates for each treatment were considered biological replicates. The definition that I have been using to differentiate between biological replicates and technical replicates is the following:
A technical replicate is when you test the same sample multiple times - it's used to test the variability in the testing protocol itself. A biological replicate is where you perform the same test on multiple samples of the same material / type of cells / tissue.

You will find that the column names are shifted over to the left by one column. This should be an easy fix if you intend to look at this data in Excel. I have included the names of the genes in these files.
Respectfully,

Efrain Gonzalez

Re: Data Outputs

PostPosted: Tue Feb 13, 2018 2:53 pm
by efrain.gonzalez0
Good afternoon,

I believe that the video link that I have attached here provides the necessary information for continued analysis of this data. If ones interest is just in the gene ontology and pathway analysis then one should skip to minute 42:38.
Link: https://www.youtube.com/watch?v=xh_wpWj0AzM&t=2361s

Respectfully,

Efrain Gonzalez