Model Description¶
The sum of single shared effect (SuShiE) extends the sum of single effect (SuSiE) [1] model by introducing a prior correlation estimator to account for the ancestral quantitative trait loci (QTL) effect size similarity. Specifically, for \(i^{\text{th}}\) of total \(k \in \mathbb{N}\) ancestries, we model the molecular data \(g_i \in \mathbb{R}^{n_i \times 1}\) for \(n_i \in \mathbb{N}\) individuals as a linear combination of standardized genotype matrix \(X_i \in \mathbb{R}^{n_i \times p}\) for \(p \in \mathbb{N}\) SNPs as
where \(\beta_i \in \mathbb{R}^{p \times1}\) is the shared QTL effects, \(\epsilon_i \in \mathbb{R}^{n_i \times 1}\) is the ancestry-specific effects and other environmental noises, \(L \in \mathbb{R}\) is the number of shared effects, for \(l^{\text{th}}\) single shared effect, \(b_{i,l} \in \mathbb{R}\) is a scaler representing effect size, \(C_l \in \mathbb{R}^{k \times k}\) is the prior covariance matrix with \(\sigma^2_{i,b}\) as variance and \(\rho\) as correlation, \(\gamma_l\) is an binary indicator vector specifying which single SNP is the QTL, \(\pi\) is the prior probability for each SNP to be QTL, and \(\sigma^2_e\) is the prior variance for noises.
SuShiE runs varitional inference to estimate the posterior distribution for \(\beta_l\) and \(\gamma_l\) for each \(l^{\text{th}}\) effect. We can quantify the probability of QTL for each SNP through Posterior Inclusion Probabilities (PIPs). If the posterior distribution of \(\gamma_l\) is \(\text{Multi}(1, \alpha_l)\), then for each SNP \(j\), we have:
For more details in math derivation and algorithm, stay tuned for our upcoming manuscript.