GnomAD variant data ingestion
gentropy.gnomad_ingestion.GnomadVariantIndexStep
¶
A step to generate variant index dataset from gnomad data.
Variant annotation step produces a dataset of the type VariantIndex
derived from gnomADs gnomad.genomes.vX.X.X.sites.ht
Hail's table.
This dataset is used to validate variants and as a source of annotation.
Source code in src/gentropy/gnomad_ingestion.py
72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 |
|
__init__(session: Session, variant_annotation_path: str = GnomadVariantConfig().variant_annotation_path, gnomad_genomes_path: str = GnomadVariantConfig().gnomad_genomes_path, gnomad_variant_populations: list[VariantPopulation | str] = GnomadVariantConfig().gnomad_variant_populations, use_version_from_input: bool = GnomadVariantConfig().use_version_from_input) -> None
¶
Run Variant Annotation step.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
session |
Session
|
Session object. |
required |
variant_annotation_path |
str
|
Path to resulting dataset. |
variant_annotation_path
|
gnomad_genomes_path |
str
|
Path to gnomAD genomes hail table, e.g. |
gnomad_genomes_path
|
gnomad_variant_populations |
list[VariantPopulation | str]
|
List of populations to include. |
gnomad_variant_populations
|
use_version_from_input |
bool
|
Append version derived from input gnomad_genomes_path to the output variant_annotation_path. Defaults to False. |
use_version_from_input
|
In case use_version_from_input is set to True, data source version inferred from gnomad_genomes_path is appended as the last path segment to the output path. All defaults are stored in the GnomadVariantConfig.
Source code in src/gentropy/gnomad_ingestion.py
79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 |
|