Alpha-1 Antitrypsin Deficiency

Alpha-1 Antitrypsin Deficiency

Postby rvida034 » Sun Oct 29, 2023 2:58 pm

The following data is focused on different zinc transporter genes and some metallothionein genes. There are different GSE series under investigation in which the wild type and PiZ mutation are compared via the following calculation: "t-Test: Paired Two Sample for Means".
Attachments
Research Project Data Updated.xlsx
(89.21 KiB) Downloaded 76 times
rvida034
 
Posts: 5
Joined: Thu Aug 25, 2022 8:00 am

Re: Alpha-1 Antitrypsin Deficiency

Postby cwyoo » Mon Oct 30, 2023 10:01 am

rvida034 wrote:The following data is focused on different zinc transporter genes and some metallothionein genes. There are different GSE series under investigation in which the wild type and PiZ mutation are compared via the following calculation: "t-Test: Paired Two Sample for Means".


Thank you for the update. Could you also post the information about the original dataset here for reference? Additionally, please share a message here outlining your plans for proceeding with the t-test results.
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: Alpha-1 Antitrypsin Deficiency

Postby rvida034 » Wed Dec 06, 2023 10:04 am

Attached please find the document outlining the characteristics for the GEO DataSets that were analyzed. This document will be periodically updated.
Furthermore, the plan with the "t-Test: Paired Two Sample for Means" is to first determine which genes showed statistical significance (p-value < 0.05) (please refer to the "Research Project Data Updated" documented) to then narrow down which genes will be analyzed in the lab.
Attachments
Alpha-1 antitrypsin deficiency GEO DataSet Characteristics.xlsx
(11.33 KiB) Downloaded 84 times
rvida034
 
Posts: 5
Joined: Thu Aug 25, 2022 8:00 am

Re: Alpha-1 Antitrypsin Deficiency

Postby rvida034 » Thu Dec 07, 2023 5:42 pm

Hi Samantha:

Could you please provide the Bayesian Network for the GEO DataSet series GSE141593 for only the 6-week old mice so that I may interpret it?

The genes of focus are as follows: SLC30A1, SLC30A2, SLC30A3, SLC30A4, SLC30A5, SLC30A6, SLC30A7, SLC30A8, SLC30A9, SLC39A1, SLC39A2, SLC39A3, SLC39A4, SLC39A5, SLC39A6, SLC39A7, SLC39A8, SLC39A9, SLC39A10, SLC39A11, SLC39A12, SLC39A13, SLC39A14, MTF1, MT1, MT2, MT3, and MT4.

Please let me know if you have any questions. Thank you so much!
rvida034
 
Posts: 5
Joined: Thu Aug 25, 2022 8:00 am

Re: Alpha-1 Antitrypsin Deficiency

Postby cwyoo » Tue Dec 12, 2023 6:17 pm

rvida034 wrote:Hi Samantha:

Could you please provide the Bayesian Network for the GEO DataSet series GSE141593 for only the 6-week old mice so that I may interpret it?

The genes of focus are as follows: SLC30A1, SLC30A2, SLC30A3, SLC30A4, SLC30A5, SLC30A6, SLC30A7, SLC30A8, SLC30A9, SLC39A1, SLC39A2, SLC39A3, SLC39A4, SLC39A5, SLC39A6, SLC39A7, SLC39A8, SLC39A9, SLC39A10, SLC39A11, SLC39A12, SLC39A13, SLC39A14, MTF1, MT1, MT2, MT3, and MT4.

Please let me know if you have any questions. Thank you so much!


Rebecca, I understand that Samantha has provided you with instructions for RNAseq data and Bayesian Networks analyses using R. Samantha will share these instructions in the "Manuscripts & Documentation" section, specifically under "Implemented Classes/Modules/Programs Documentation" titled "How to run scripts/programs." There, please click "Bayesian Network Analysis using R package bnlearn". Rebecca, please review the instructions and post there any challenges you encounter while following the steps. If you want to learn more about bnlearn, there is a separate forum created at Board index ‹ Manuscripts & Documentation ‹ Useful Tools for Statistical Analyses ‹ R package for Bayesian network structure learning (BNlearn).

If you prefer to use tools other than R, there are alternative tools available, e.g., Galaxy, Genie, Banjo, etc.. The goal is for you to comprehend the process so you can document and publish the results. Samantha, for the time being, could you conduct the analyses and share the results here?

Best,
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: Alpha-1 Antitrypsin Deficiency

Postby samanthagonzales » Tue Dec 19, 2023 12:33 pm

Here is the full project folder for the initial Bayesian analysis on GSE141593, and the resulting network.
Please let me know if you have questions!
Attachments
GSE141593_network.png
GSE141593_network.png (99.12 KiB) Viewed 1112 times
piz mice.zip
(5.02 MiB) Downloaded 72 times
samanthagonzales
 
Posts: 41
Joined: Wed Jun 07, 2023 2:38 pm

Re: Alpha-1 Antitrypsin Deficiency

Postby cwyoo » Tue Dec 19, 2023 5:41 pm

samanthagonzales wrote:Here is the full project folder for the initial Bayesian analysis on GSE141593, and the resulting network.
Please let me know if you have questions!


Appreciate it, Samantha. Rebecca, could you please document here the data analysis process leading to the result? Typically, after cleaning RNAseq data, we start by creating a dataset containing genes with significant fold changes and genes of interest. Subsequently, we discretize the dataset using z-scores and identify the best-fitting Bayesian Networks.

For now, Dr. Luizzi and I have some comments and questions:
1. Kindly utilize gene names for the final network.
2. Are the genes showing significant fold changes included in this analysis?
3. Could you provide more details on the approach employed to identify the best-fitting Bayesian Networks? For instance, could you specify the duration and the number of runs conducted? Additionally, what criteria guided the decision to conclude the search?
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: Alpha-1 Antitrypsin Deficiency

Postby samanthagonzales » Tue Dec 19, 2023 6:04 pm

cwyoo wrote:
samanthagonzales wrote:Here is the full project folder for the initial Bayesian analysis on GSE141593, and the resulting network.
Please let me know if you have questions!


Appreciate it, Samantha. Rebecca, could you please document here the data analysis process leading to the result? Typically, after cleaning RNAseq data, we start by creating a dataset containing genes with significant fold changes and genes of interest. Subsequently, we discretize the dataset using z-scores and identify the best-fitting Bayesian Networks.

For now, Dr. Luizzi and I have some comments and questions:
1. Kindly utilize gene names for the final network.
2. Are the genes showing significant fold changes included in this analysis?
3. Could you provide more details on the approach employed to identify the best-fitting Bayesian Networks? For instance, could you specify the duration and the number of runs conducted? Additionally, what criteria guided the decision to conclude the search?


1. Kindly utilize gene names for the final network.
    Can do!

2. Are the genes showing significant fold changes included in this analysis?
    Only the genes of interest are included in this network, but I did write in the script the option to include top N differentially expressed genes. I can update the network to include those genes and post it here.

3. Could you provide more details on the approach employed to identify the best-fitting Bayesian Networks? For instance, could you specify the duration and the number of runs conducted? Additionally, what criteria guided the decision to conclude the search?
    I tried different algorithms available to bnlearn, and hill-climbing was the only one to give me a completely directed network. I used the command `pzm.hc <- hc(pizmice.counts, restart = 500)` to generate the network.
    hc() is the hill-climbing function provided by bnlearn. The default scoring is BIC, but it can be changed to BDE if that is preferred. Would it be better to run a for loop to create several structures, then choose the one with the best score (possibly with varying restart values)?



Let me know if you have further questions! I will include differentially expressed genes in the network and update the gene names in the meantime. Thanks!
samanthagonzales
 
Posts: 41
Joined: Wed Jun 07, 2023 2:38 pm

Re: Alpha-1 Antitrypsin Deficiency

Postby cwyoo » Thu Dec 21, 2023 1:30 pm

samanthagonzales wrote:
cwyoo wrote:
samanthagonzales wrote:Here is the full project folder for the initial Bayesian analysis on GSE141593, and the resulting network.
Please let me know if you have questions!


Appreciate it, Samantha. Rebecca, could you please document here the data analysis process leading to the result? Typically, after cleaning RNAseq data, we start by creating a dataset containing genes with significant fold changes and genes of interest. Subsequently, we discretize the dataset using z-scores and identify the best-fitting Bayesian Networks.

For now, Dr. Luizzi and I have some comments and questions:
1. Kindly utilize gene names for the final network.
2. Are the genes showing significant fold changes included in this analysis?
3. Could you provide more details on the approach employed to identify the best-fitting Bayesian Networks? For instance, could you specify the duration and the number of runs conducted? Additionally, what criteria guided the decision to conclude the search?


1. Kindly utilize gene names for the final network.
    Can do!

2. Are the genes showing significant fold changes included in this analysis?
    Only the genes of interest are included in this network, but I did write in the script the option to include top N differentially expressed genes. I can update the network to include those genes and post it here.

3. Could you provide more details on the approach employed to identify the best-fitting Bayesian Networks? For instance, could you specify the duration and the number of runs conducted? Additionally, what criteria guided the decision to conclude the search?
    I tried different algorithms available to bnlearn, and hill-climbing was the only one to give me a completely directed network. I used the command `pzm.hc <- hc(pizmice.counts, restart = 500)` to generate the network.
    hc() is the hill-climbing function provided by bnlearn. The default scoring is BIC, but it can be changed to BDE if that is preferred. Would it be better to run a for loop to create several structures, then choose the one with the best score (possibly with varying restart values)?



Let me know if you have further questions! I will include differentially expressed genes in the network and update the gene names in the meantime. Thanks!


Samantha and Rebecca, please read Learning Bayesian Networks from Data, A Practical Guide that I posted in this forum. Can you perform the search based on the posting and report the results? You may set max_hours to 32 and adjust the pecentage_difference based on the results you get between 1 and 2 hours. The document is work in progress, so please feel free to post any comments and questions there
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Re: Alpha-1 Antitrypsin Deficiency

Postby cwyoo » Wed Jan 24, 2024 6:47 pm

Could you share approximately 200 genes that exhibit the most significant fold change? Please specify the threshold used to identify these significant genes. Additionally, please provide the thresholds for obtaining around 400 and around 100 significant genes.
cwyoo
Site Admin
 
Posts: 378
Joined: Sun Jun 22, 2014 2:38 pm

Next

Return to Zinc

Who is online

Users browsing this forum: No registered users and 4 guests