- A fix to
get_compressed_data()
to avoid the infinite loop that was caused by the confirmation prompt.
- updates
create_data_origin()
to use base R. It also adds functionality to optionally return the data origin as a data frame and to overwrite the file if it already exists.
-
Adds
curate_enntt_data()
function to curate data from the Europarl Corpus of Native, Non-Native, and Translated Text (ENNTT). -
Updates recipes
- Changes mirror to "https://gutenberg.pglaf.org/" for
get_gutenberg_works()
function - Organizes the output to include the LCC classification as part of the data written to disk for the
get_gutenberg_works()
function
- Updated version number to reflect that the package is now in beta
- Added
curate_swda_data()
function to curate data from the Switchboard Dialog Act Corpus
- Adds
get_gutenberg_works()
function to import data from Project Gutenberg
- Fixes warnings on
calc_assoc_metrics()
- Updates Date on DESCRIPTION file
- Adds test-add_pkg_to_bib.R
- Adds
calc_assoc_metrics
to calculate (pmi, dice, G) for a given type bigram
- Removes
calc_dispersion_metrics()
function and replaces it withcalc_type_metric()
which includes frequency and dispersion metrics.
- Fixes bug in
get_compressed_data()
that caused the function to create dot file copies of the original files
- Adds
idf
measure tocalc_dispersion_metrics()
- Adds
calc_dispersion_metrics()
function to calculate dispersion metrics
- Added
get_talkbank_data()
function to import data from TalkBank - Added internal
confirm_permissions()
function to confirm that users are aware of the permissions required to use data - Updated
get_*()
functions to useconfirm_permissions()
internally - Changed
get_outliers()
tofind_outliers()
to be more consistent with other functions
- Updated
create_data_dictionary()
to provide default scaffold structure for data dictionary, in lieu of OpenAI model. This scaffold is to be updated manually by the user.
- Added
create_data_origin()
function. This creates a .csv file to scaffold a data origin file
- Adds project template for RStudio: "Minimal Reproducible Project"
- Adjusted
create_data_dictionary()
to produce results more in line with the QTALR textbook
- Added
get_outliers()
function - Added Instructor Guide
- Added R tutorial 0
- Added a
NEWS.md
file to track changes to the package.