V2G
Bases: V2GStepConfig
Variant-to-gene (V2G) step.
This step aims to generate a dataset that contains multiple pieces of evidence supporting the functional association of specific variants with genes. Some of the evidence types include:
- Chromatin interaction experiments, e.g. Promoter Capture Hi-C (PCHi-C).
- In silico functional predictions, e.g. Variant Effect Predictor (VEP) from Ensembl.
- Distance between the variant and each gene's canonical transcription start site (TSS).
Source code in src/otg/v2g.py
17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 |
|
run()
Run V2G dataset generation.
Source code in src/otg/v2g.py
31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 |
|
Variant to gene (V2G) step requirements.
Attributes:
Name | Type | Description |
---|---|---|
variant_index_path | str | Input variant index path. |
variant_annotation_path | str | Input variant annotation path. |
gene_index_path | str | Input gene index path. |
vep_consequences_path | str | Input VEP consequences path. |
lift_over_chain_file_path | str | Path to GRCh37 to GRCh38 chain file. |
approved_biotypes | list[str] | List of approved biotypes. |
anderson_path | str | Anderson intervals path. |
javierre_path | str | Javierre intervals path. |
jung_path | str | Jung intervals path. |
thurnman_path | str | Thurnman intervals path. |
liftover_max_length_difference | int | Maximum length difference for liftover. |
max_distance | int | Maximum distance to consider. |
output_path | str | Output V2G path. |
Source code in src/otg/config.py
138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 |
|