A framework for statistical analysis in content analysis. In addition to a pipeline for preprocessing text corpora and linking to the latent Dirichlet allocation from the 'lda' package, plots are offered for the descriptive analysis of text corpora and topic models. In addition, an implementation of Chang's intruder words and intruder topics is provided. Sample data for the vignette is included in the toscaData package, which is available on gitHub: <https://github.com/Docma-TU/toscaData>.
Version: |
0.3-2 |
Depends: |
R (≥ 3.5.0) |
Imports: |
tm (≥ 0.7-5), lda (≥ 1.4.2), quanteda (≥ 1.4.0), lubridate (≥ 1.7.3), htmltools (≥ 0.3.6), RColorBrewer (≥ 1.1-2), stringr (≥ 1.3.1), WikipediR (≥ 1.5.0), data.table (≥
1.11.4) |
Suggests: |
toscaData, testthat (≥ 2.0.0), knitr (≥ 1.20), devtools (≥
1.13), rmarkdown (≥ 1.9) |
Published: |
2021-10-28 |
DOI: |
10.32614/CRAN.package.tosca |
Author: |
Lars Koppers
[aut, cre],
Jonas Rieger
[aut],
Karin Boczek
[ctb],
Gerret von Nordheim
[ctb] |
Maintainer: |
Lars Koppers <koppers at statistik.tu-dortmund.de> |
License: |
GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
URL: |
https://github.com/Docma-TU/tosca,
https://doi.org/10.5281/zenodo.3591068 |
NeedsCompilation: |
no |
Citation: |
tosca citation info |
CRAN checks: |
tosca results |