shaman_shuffle_and_score_hic_mat
shaman_shuffle_and_score_hic_mat(obs_track_nms, interval, work_dir, expand = 1e+06, min_dist = 1024, k = 100, dist_resolution = NA, decay_smooth = NA, hic_mcmc_max_resolution = 400, shuffle = 80, grid_small = 5e+05, grid_high = 1e+06, grid_increase = 5e+05, grid_step_iter = 40)
| obs_track_nms | Name of observed 2D genomic tracks for the hic data. |
|---|---|
| interval | 2D interval on which to compute the scores. |
| work_dir | Centralized directory to store temporary files. |
| expand | Size of expansion, points to include outside the matrix for accurate computing of the score. Note that for each observed point, its k-nearest neighbors must be included in the expanded matrix. |
| min_dist | The minimum distance between points. |
| k | The number of neighbor distances used for the score. For higher resolution maps, increase k. For |
| dist_resolution | Number of bins in each log2 distance unit. If NA, value is determined based on observed data (recommended). |
| decay_smooth | Number of bins to use for smoothing the MCMC target function: the decay curve. If NA, value is determined based on observed data (recommended). |
| hic_mcmc_max_resolution | Maximum number of bins for each log2 unit. |
| shuffle | Number of shuffling rounds for each observed point. |
| grid_small | Initial size of maximum distance between contact pairs consdered for switching |
| grid_high | Final size of maximum distance between contact pairs consdered for switching |
| grid_increase | Grid increase size |
| grid_step_iter | Number of iterations in each grid size |
NULL if insufficient observed data, otherwise resturns a list containing 3 elements: 1) points - start1, start2 and score for all observed points. 2) obs - the observed points. 3) exp - the expected points. 4) exp_fn - the name of the expected (shuffled) data file
This function generates an expected 2D hic matrix based on observed hic data, and computes its score.
#Set misha db to test gsetroot(shaman_get_test_track_db()) mat_score = shaman_shuffle_and_score_hic_mat(obs_track_nms="hic_obs", interval=gintervals.2d(2, 175.5e06, 177.5e06, 2, 175.5e06, 177.5e06), expand=5e05, work_dir=tempdir())#>#>#>#>#>#>#>#>#>