variant_index
gentropy.variant_index.VariantIndexStep
¶
Generate variant index based on a VEP output in json format.
The variant index is a dataset that contains variant annotations extracted from VEP output. It is expected that all variants in the VEP output are present in the variant index. There's an option to provide extra variant annotations to be added to the variant index eg. allele frequencies from GnomAD.
Source code in src/gentropy/variant_index.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 |
|
__init__(session: Session, vep_output_json_path: str, variant_index_path: str, hash_threshold: int = VariantIndexConfig().hash_threshold, gnomad_variant_annotations_path: str | None = None) -> None
¶
Run VariantIndex step.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
session |
Session
|
Session object. |
required |
vep_output_json_path |
str
|
Variant effect predictor output path (in json format). |
required |
variant_index_path |
str
|
Variant index dataset path to save resulting data. |
required |
hash_threshold |
int
|
Hash threshold for variant identifier lenght. |
hash_threshold
|
gnomad_variant_annotations_path |
str | None
|
Path to extra variant annotation dataset. |
None
|
Source code in src/gentropy/variant_index.py
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 |
|