GnomAD Linkage data ingestion
gentropy.gnomad_ingestion.LDIndexStep
¶
LD index step.
This step is resource intensive
Suggested params: high memory machine, 5TB of boot disk, no SSDs.
Source code in src/gentropy/gnomad_ingestion.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 |
|
__init__(session: Session, ld_index_out: str, min_r2: float = LDIndexConfig().min_r2, ld_matrix_template: str = LDIndexConfig().ld_matrix_template, ld_index_raw_template: str = LDIndexConfig().ld_index_raw_template, ld_populations: list[LD_Population | str] = LDIndexConfig().ld_populations, liftover_ht_path: str = LDIndexConfig().liftover_ht_path, grch37_to_grch38_chain_path: str = LDIndexConfig().grch37_to_grch38_chain_path) -> None
¶
Run step.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
session
|
Session
|
Session object. |
required |
ld_index_out
|
str
|
Output LD index path. (required) |
required |
min_r2
|
float
|
Minimum r2 to consider when considering variants within a window. |
min_r2
|
ld_matrix_template
|
str
|
Input path to the gnomAD ld file with placeholder for population |
ld_matrix_template
|
ld_index_raw_template
|
str
|
Input path to the raw gnomAD LD indices file with placeholder for population string |
ld_index_raw_template
|
ld_populations
|
list[LD_Population | str]
|
Population names derived from the ld file paths |
ld_populations
|
liftover_ht_path
|
str
|
Path to the liftover ht file |
liftover_ht_path
|
grch37_to_grch38_chain_path
|
str
|
Path to the chain file used to lift over the coordinates. |
grch37_to_grch38_chain_path
|
Default values are provided in LDIndexConfig.
Source code in src/gentropy/gnomad_ingestion.py
21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 |
|