compact.pairwise_scoring
module to perform pairwise rbo scoring between interaction datasets
- Important functions:
pairwise_rbo_scoring: perform rbo scoring between two interaction matrices
score_comparison: rbo scoring between two correlation datasets
determine_top_weightedness: determine contribution of top d ranks to rbo score
det_search_deph: determine required search depth for given value for p parameter an mininum score weight
- compact.pairwise_scoring.pairwise_rbo_scoring(left_scores, right_scores, mapping=False, p_param=0.9, search_depth=None, processes=1, chunksize=1000)
perform rbo scoring between 2 interaction matrices
- Args:
- [left|right]_scores (pd df): symmetric interaction matrix
values: within-sample pairwise interaction scores
- mapping (bool, optional): Defaults to False.
if not False: mapping (dict): id mapping/orthology between two collections keys: identifiers of “left” collection values: corresponding identifiers of “right” collection
- p_param (float, optional): Defaults to 0.90.
Rank Biased Overlap “p” parameter range: 0 to 1 this parameter determines top-weightedness of rbo metric lower values result in more top-weightedness
- search_depth (int, optional): Defaults to None.
number of ranks to consider when computing RBO scores if None, considers complete ranked lists
- processes (int, optional): Defaults to 1.
number of processes/threads.
- Returns:
- pd df: rbo scores for all pairs between left right
matrix-structured dataframe with: rows: left index. columns: right index, values are rbo scores
- compact.pairwise_scoring.determine_top_weightedness(p, d)
determine the contribution the top d ranks have to the rbo score
taken from https://github.com/changyaochen/rbo/blob/master/rbo/rbo.py
- Args:
- p (float): Rank Biased Overlap “p” parameter range: 0 to 1
this parameter determines top-weightedness of rbo metric lower values result in more top-weightedness
- d (int):
top d ranks for which to determine contribution
- Returns:
- float: the fraction of the rbo score that is determined
by the top d ranks for the given p parameter
- compact.pairwise_scoring.det_search_depth(p, min_weight, shortest_list_len, stepsize=1)
determine required search depth for given p and min_weight
- Args:
- p (float): Rank Biased Overlap “p” parameter range: 0 to 1
this parameter determines top-weightedness of rbo metric lower values result in more top-weightedness
- min_weight (float): minimal weight of contribution to total
rbo score requirement
- shortest_list_len (int): length of shortest ranked list to be
used in calculating rbo score (search depth will be maximally this length)
- stepsize (int, optional): Defaults to 1.
stepsize for walking over search depth values to consider. will stop if it reaches search depth with adequate weight contribution
- Returns:
int: the selected search depth