Thurman et al.
gentropy.datasource.intervals.thurman.IntervalsThurman
Interval dataset from Thurman et al. 2012.
Source code in src/gentropy/datasource/intervals/thurman.py
parse(thurman_raw: DataFrame, gene_index: GeneIndex, lift: LiftOverSpark) -> Intervals
classmethod
Parse the Thurman et al. 2012 dataset.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
thurman_raw | DataFrame | Raw Thurman et al. 2012 dataset | required |
gene_index | GeneIndex | Gene index | required |
lift | LiftOverSpark | LiftOverSpark instance | required |
Returns:
Name | Type | Description |
---|---|---|
Intervals | Intervals | Interval dataset containing Thurman et al. 2012 data |
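As a usage sketch based only on the signature documented above, a call to `parse` might look like the following. The wrapper name `parse_thurman` is hypothetical, and the `gene_index` and `lift` arguments are assumed to have been built elsewhere; how to construct them is not covered on this page. The import is deferred into the function body so the sketch can be defined without gentropy installed.

```python
def parse_thurman(thurman_raw, gene_index, lift):
    """Hypothetical wrapper around IntervalsThurman.parse.

    thurman_raw: Spark DataFrame with the raw Thurman et al. 2012 data
    gene_index:  a GeneIndex instance (assumed to exist already)
    lift:        a LiftOverSpark instance (assumed to exist already)
    """
    # Deferred import: gentropy is only needed when the function is called.
    from gentropy.datasource.intervals.thurman import IntervalsThurman

    # parse is a classmethod, so it is called on the class itself.
    return IntervalsThurman.parse(
        thurman_raw=thurman_raw,
        gene_index=gene_index,
        lift=lift,
    )
```

Passing the arguments by keyword keeps the call readable, since all three parameters are required.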
Source code in src/gentropy/datasource/intervals/thurman.py
read(spark: SparkSession, path: str) -> DataFrame
staticmethod
Read the Thurman et al. 2012 dataset.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
spark | SparkSession | Spark session | required |
path | str | Path to dataset | required |
Returns:
Name | Type | Description |
---|---|---|
DataFrame | DataFrame | DataFrame with raw Thurman data |
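A minimal sketch of calling `read`, assuming an active `SparkSession` and a local or remote path to the raw file. The helper name `read_thurman_raw` is hypothetical, and the expected file format is not documented on this page, so printing the schema is suggested only as a quick sanity check.

```python
def read_thurman_raw(spark, path):
    """Hypothetical helper: load the raw Thurman data and show its schema.

    spark: an active pyspark.sql.SparkSession
    path:  location of the raw dataset (format as expected by gentropy)
    """
    # Deferred import: gentropy is only needed when the function is called.
    from gentropy.datasource.intervals.thurman import IntervalsThurman

    # read is a staticmethod, so no instance is required.
    raw_df = IntervalsThurman.read(spark, path)

    # printSchema is a standard Spark DataFrame method; it lists column
    # names and types without triggering a full scan of the data.
    raw_df.printSchema()
    return raw_df
```

The returned DataFrame is the value expected by the `thurman_raw` parameter of `parse`.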
Source code in src/gentropy/datasource/intervals/thurman.py