Enrichment Analysis and Prior Setup

A key feature of FastENLOC is its ability to estimate colocalization priors directly from the provided data. Specifically, FastENLOC approaches the problem of enrichment analysis by considering the following model

\[\log\left[ \frac{P(d = 1 \mid \gamma)}{P(d = 0 \mid \gamma)} \right] = \alpha_0 + \alpha_1 \gamma,\]

for each genetic variant, where:

1. Alternative Parameterization

The coloc method parameterizes the prior for colocalized variant by $p_{12}$. It follows that

\[p_{12} = P(d=1, \gamma=1) = \frac{\exp(\alpha_0 + \alpha_1)}{1+\exp(\alpha_0 + \alpha_1)} p_e\]

Additionally, it requires

\[p_1 = P(d=1, \gamma=0) = p_g - p_{12}\]

and

\[p_2 = P(d=0, \gamma=1) = p_e - p_{12}\]

Conversely,

\[p_g = p_{12} + p_1\] \[p_e = p_{12} + p_2\] \[\alpha_0 = \log\left[\frac{p_1}{1-p_1-p_2-p_{12}}\right]\]

and

\[\alpha_1 = \log \left[ \frac{(1-p_{12}-p_1 - p_2) p_{12}}{p_1 p_2} \right],\]

indicating that the two different parameterizations are equivalent.

2. Running Enrichment Analysis

The enrichment analysis procedure is integrated into FastENLOC and runs by default. The following command-line options are key for this procedure:

3. Estimating Enrichment Prior Only

To perform only the enrichment analysis without further calculating colocalization probabilities in FastENLOC, specify the --enrich_only option on the command line.

4. Bypassing Enrichment Analysis

Users can bypass the enrichment analysis by specifying required priors directly via command-line options. While not recommended as a standard colocalization approach, this option allows sensitivity analysis of colocalization results relative to prior specification.

To bypass enrichment analysis, use one of the following options:

  1. -a0 a0_value -a1 a1_value: specify $\alpha_0$ and $\alpha_1$ values.
  2. -p1 p1_value -p2 p2_value -p12 p12_value: specify values for $p_1$, $p_2$, and $p_{12}$.