A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. It provides newly implemented methods as well as wrapper functions for existing methods that enable calling many different methods in a unified framework. The methods were selected from the review and comparison of Stolte et al. (2024) <doi:10.1214/24-SS149>.
Version: |
0.1.1 |
Depends: |
R (≥ 3.5.0) |
Imports: |
boot, stats |
Suggests: |
ade4, approxOT, Ball, caret, clue, cramer, crossmatch, dbscan, densratio, DWDLargeR, e1071, Ecume, energy, expm, FNN, gTests, gTestsMulti, HDLSSkST, hypoRF, kernlab, kerTests, KMD, knitr, LPKsample, Matrix, mvtnorm, nbpMatching, pROC, purrr, randtoolbox, rlemon, rpart, rpart.plot, testthat, RSNNS |
Published: |
2025-03-18 |
DOI: |
10.32614/CRAN.package.DataSimilarity |
Author: |
Marieke Stolte
[aut, cre, cph],
Luca Sauer [aut],
David Alvarez-Melis [ctb] (Original python implementation of OTDD,
<https://github.com/microsoft/otdd.git>),
Nabarun Deb [ctb] (Original implementation of rank-based Energy test
(DS), <https://github.com/NabarunD/MultiDistFree.git>),
Bodhisattva Sen [ctb] (Original implementation of rank-based Energy
test (DS), <https://github.com/NabarunD/MultiDistFree.git>) |
Maintainer: |
Marieke Stolte <stolte at statistik.tu-dortmund.de> |
License: |
GPL (≥ 3) |
NeedsCompilation: |
no |
CRAN checks: |
DataSimilarity results |