docxtractr: Extract Data Tables and Comments from 'Microsoft' 'Word' Documents
'Microsoft Word' 'docx' files provide an 'XML' structure that is fairly
straightforward to navigate, especially for 'Word' tables and comments. Tools
are provided to determine table count and structure, to count comments, and
to extract and clean tables and comments from 'Microsoft Word' 'docx' documents.
There is also nascent support for '.doc' and '.pptx' files.
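As a rough illustration of the workflow described above (count, describe, extract, clean), here is a minimal sketch in R. It assumes the package is installed and that the sample files `examples/data.docx` and `examples/comments.docx` ship with it; if they are missing, `system.file()` returns an empty string and any local 'docx' path can be substituted. Variable names are illustrative only.

```r
library(docxtractr)

# Assumption: these sample documents are bundled with the installed package.
tbl_path  <- system.file("examples/data.docx", package = "docxtractr")
cmnt_path <- system.file("examples/comments.docx", package = "docxtractr")

# Tables: read the document, count the tables, and print their structure.
doc    <- read_docx(tbl_path)
n_tbls <- docx_tbl_count(doc)
docx_describe_tbls(doc)

if (n_tbls > 0) {
  # Extract the first table, treating its first row as the header.
  first_tbl <- docx_extract_tbl(doc, tbl_number = 1, header = TRUE)
  # Tidy the column names (lower case, underscores instead of spaces).
  first_tbl <- mcga(first_tbl)
  # Or pull every table at once as a list of data frames.
  all_tbls <- docx_extract_all_tbls(doc)
}

# Comments: count them, then extract them into a data frame.
cmnt_doc <- read_docx(cmnt_path)
n_cmnts  <- docx_cmnt_count(cmnt_doc)
if (n_cmnts > 0) {
  cmnts <- docx_extract_all_cmnts(cmnt_doc)
}
```

Checking the counts before extracting keeps the sketch from failing on documents that contain no tables or no comments.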
Version: 0.6.5
Depends: R (≥ 3.6.0)
Imports: tools, xml2, purrr, dplyr, utils, httr, magrittr
Suggests: covr, tinytest
Published: 2020-07-05
DOI: 10.32614/CRAN.package.docxtractr
Author: Bob Rudis [aut, cre], Mark Dulhunty [ctb], Karlo Guidoni-Martins [ctb], Chris Muir [aut, ctb], John Muschelli [ctb]
Maintainer: Bob Rudis <bob at rud.is>
BugReports: https://gitlab.com/hrbrmstr/docxtractr/issues
License: MIT + file LICENSE
URL: http://gitlab.com/hrbrmstr/docxtractr
NeedsCompilation: no
SystemRequirements: LibreOffice (<https://www.libreoffice.org/>) required to extract data from .doc files or perform .pptx conversion.
Materials: NEWS
CRAN checks: docxtractr results
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=docxtractr
to link to this page.