1. 下載、安裝 MAGMA

wget https://ctg.cncr.nl/software/MAGMA/prog/magma_v1.09.zip
unzip magma_v1.09.zip

2. 下載公共數據

# 下載基因位置文件NCBI37.3.gene.loc
wget https://ctg.cncr.nl/software/MAGMA/aux_files/NCBI37.3.zip
unzip NCBI37.3.zip

# 下載參考人群g1000_eur
wget https://ctg.cncr.nl/software/MAGMA/ref_data/g1000_eur.zip
unzip g1000_eur.zip

注意： 本推文使用的基因組版本是hg19（build 37）、參考人群選用歐洲人群，請各位根據自己研究的實際情況修改；

基因位置文件NCBI37.3.gene.loc如下所示：

參考人群g1000_eur如下所示：

3. 准備輸入文件

到這里，才是自己需要准備的文件，前面的文件全部是通過公共數據獲得。
輸入文件的格式同VEGAS2和GCTA，只需要 GWAS 結果的 P 值和 SNP id 即可
輸入文件snpp包括兩列，第一列是SNP的ID，第二列是SNP的P值；
輸入文件snpp如下所示：

4. 開始分析

4.1 step1: 基因注釋

輸入命令：

magma --annotate --snp-loc g1000_eur.bim --gene-loc NCBI37.3.gene.loc --out g1000_eur

g1000_eur.bim和NCBI37.3.gene.loc文件通過前面的第二個步驟獲得；

該步驟生成g1000_eur.genes.annot結果文件:

4.2 step2: gene-based 關聯分析

輸入命令：

magma --bfile g1000_eur --pval snpp N=401670 --gene-annot g1000_eur.genes.annot --out genebased

g1000_eur文件通過前面的第二個步驟獲得;
snpp文件通過前面的第三個步驟獲得;
g1000_eur.genes.annot文件通過 4.1 步驟獲得;
N指的是研究的樣本量；

執行以上命令后，生成兩個文件：g1000_eur.genes.out和g1000_eur.genes.raw

g1000_eur.genes.out即為gene-based 關聯分析結果：

gene-based 關聯分析結果的釋義如下所示：

• GENE: the gene ID as specified in the annotation file
• CHR: the chromosome the gene is on
• START/STOP: the annotation boundaries of the gene on that chromosome (this includes any window around the gene applied during annotation)
• NSNPS: the number of SNPs annotated to that gene that were found in the data and were not excluded based on internal SNP QC
• NPARAM: the number of relevant parameters used in the model. For the SNP-wise models this is an approximate value; for the principal components regression (raw data default) this is set to the number of principal components retained after pruning; for the multimodels this is the mean NPARAM value of the component base models
• N: the sample size used when analysing that gene; can differ for allosomal chromosomes or when analysing SNP p-value input with variable sample size by SNP (due to missingness or differences in coverage in meta-analysis)
• ZSTAT: the Z-value for the gene, based on its (permutation) p-value; this is what is used as the measure of gene association in the gene-level analyses
• P: the gene p-value

生成的另外一個文件g1000_eur.genes.raw如下所示：

該文件可用於后續的 gene-set 分析。

gene-based 關聯分析研究就介紹到這啦~
祝各位生活愉快~

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 基於GCTA的gene-based關聯分析研究 [方法] 如何做產品的定量分析研究如何研究某個gene的ceRNA 網絡 Photoshop中的高斯模糊、高反差保留和Halcon中的rft頻域分析研究 Mycat連接數據庫之后導致表名全小寫的問題分析研究各行業的分析研究網站大全，建議收藏 Tomcat與WAS應用中間件差異化分析研究系統設計以及javascript筆記：用戶行為分析研究之數據采集四站氣象對比（ECMWF+NOAA+中國氣象網+GFS）——大氣數據分析研究 soundtouch源碼分析__based on csdn ：