Introduction

In this benchmark, we will use Repguide to predict genomic binding sites for a publically available set of 12 guideRNAs targeting LTR5_Hs long terminal repeats and compare the results to experimentally determined Cas9 ChIP-seq peak intensities [1].

Compare predicted and experimental binding sites

We can extract predicted binding sites using the mappings accessor and import provided dCAS9 ChIP peak coordinates from Fuentes et al. [1]. To facilitate comparison, we restrict our analysis to peaks on the main chromosome assemblies.

We then compare the absolute number and binding intensities of correctly predicted binding sites.

References

[1] Daniel R Fuentes, Tomek Swigut, Joanna Wysocka. Systematic perturbation of retroviral LTRs reveals widespread long-range effects on human gene regulation. eLife (2018).

Session information

sessionInfo()
#> R version 3.5.3 (2019-03-11)
#> Platform: x86_64-pc-linux-gnu (64-bit)
#> Running under: CentOS Linux 7 (Core)
#> 
#> Matrix products: default
#> BLAS/LAPACK: /net/mraid14/export/data/users/eladch/tools/CO7/mkl/2018.3/compilers_and_libraries_2018.3.222/linux/mkl/lib/intel64_lin/libmkl_gf_lp64.so
#> 
#> locale:
#>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C               LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8     LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8    LC_PAPER=en_US.UTF-8      
#>  [8] LC_NAME=C                  LC_ADDRESS=C               LC_TELEPHONE=C             LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
#> 
#> attached base packages:
#> [1] stats4    parallel  stats     graphics  grDevices utils     datasets  methods   base     
#> 
#> other attached packages:
#>  [1] forcats_0.4.0                     stringr_1.4.0                     dplyr_0.8.0.1                     purrr_0.3.2                       readr_1.3.1                      
#>  [6] tidyr_0.8.3                       tibble_2.1.1                      ggplot2_3.1.0                     tidyverse_1.2.1                   BSgenome.Hsapiens.UCSC.hg38_1.4.1
#> [11] BSgenome_1.50.0                   rtracklayer_1.42.2                Biostrings_2.50.2                 XVector_0.22.0                    GenomicRanges_1.34.0             
#> [16] GenomeInfoDb_1.18.2               IRanges_2.16.0                    S4Vectors_0.20.1                  BiocGenerics_0.28.0               Repguide_0.0.1.2                 
#> 
#> loaded via a namespace (and not attached):
#>  [1] nlme_3.1-137                bitops_1.0-6                matrixStats_0.54.0          fs_1.2.7                    doMC_1.3.5                  lubridate_1.7.4             httr_1.4.0                 
#>  [8] rprojroot_1.3-2             tools_3.5.3                 backports_1.1.4             R6_2.4.0                    lazyeval_0.2.2              colorspace_1.4-1            withr_2.1.2                
#> [15] tidyselect_0.2.5            compiler_3.5.3              cli_1.1.0                   rvest_0.3.2                 Biobase_2.42.0              xml2_1.2.0                  desc_1.2.0                 
#> [22] DelayedArray_0.8.0          labeling_0.3                scales_1.0.0                pkgdown_1.3.0               commonmark_1.7              digest_0.6.19               Rsamtools_1.34.1           
#> [29] rmarkdown_1.12              R.utils_2.8.0               pkgconfig_2.0.2             htmltools_0.3.6             rlang_0.3.2                 readxl_1.3.1                rstudioapi_0.9.0           
#> [36] generics_0.0.2              jsonlite_1.6                BiocParallel_1.16.6         R.oo_1.22.0                 RCurl_1.95-4.12             magrittr_1.5                GenomeInfoDbData_1.2.0     
#> [43] Matrix_1.2-16               Rcpp_1.0.1                  munsell_0.5.0               Rbowtie_1.22.0              R.methodsS3_1.7.1           stringi_1.4.3               yaml_2.2.0                 
#> [50] MASS_7.3-51.1               SummarizedExperiment_1.12.0 zlibbioc_1.28.0             plyr_1.8.4                  grid_3.5.3                  crayon_1.3.4                lattice_0.20-38            
#> [57] cowplot_0.9.4               haven_2.1.0                 hms_0.4.2                   knitr_1.22                  pillar_1.3.1                codetools_0.2-16            XML_3.98-1.19              
#> [64] glue_1.3.1                  evaluate_0.13               data.table_1.12.2           modelr_0.1.4                foreach_1.4.4               cellranger_1.1.0            gtable_0.3.0               
#> [71] assertthat_0.2.1            xfun_0.5                    broom_0.5.1                 roxygen2_6.1.1              iterators_1.0.10            GenomicAlignments_1.18.1    memoise_1.1.0