Skip to content

Commit

Permalink
Update readme
Browse files Browse the repository at this point in the history
  • Loading branch information
Alex Joseph committed Jun 2, 2019
1 parent 2d6df66 commit 1710346
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 3 deletions.
6 changes: 3 additions & 3 deletions NEWS.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# compareDF 1.8.0
* Added new option to keep only the columns which have changed.
* Added new option to keep only the columns which have changed using `keep_unchanged_cols`.
* changed option `keep_unchanged` to `keep_unchanged_rows`

# compareDF 1.7.3
Expand All @@ -14,15 +14,15 @@
# compareDF 1.7.0
* Provided options to name the columns in the HTML output
* Provided option change column name
* Provided optiongroup column name
* Provided option to change group column name

# compareDF 1.6.0
* Added option to specify different types of tolerances. Now you can use `difference` as an argument to use difference rather than ratio
* Fixed some bugs
* Lot more tests

# compareDF 1.5.0
* Added an option to preserve the rows that have not changed in the anlysis using the `keep_unchanged` argument
* Added an option to preserve the rows that have not changed in the anlysis using the `keep_unchanged_rows` argument
* Added an option to set the color scheme in the HTML using the `color_scheme` argument.
* Updated Documentation
* Fixed some bugs
Expand Down
9 changes: 9 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -185,6 +185,15 @@ diff calculation or in the output

Naturally, tolerance has no meaning for non-numeric values.

## Additional features
- set the color scheme in the HTML using the `color_scheme` argument
- preserve the rows that have not changed in the anlysis using the `keep_unchanged_rows` argument
- use `difference` as an argument to use difference rather than ratio for tolerance
- options to name the columns in the HTML output
- option change column name
- option to change group column name
- keep only the columns which have changed using `keep_unchanged_cols`

## Using compare DF in GAP analysis

The compareDF package can be used to conduct effective Gap analyses. If the package is used in this way, a configuration recommendation should be considered when using the compare_df function. Make sure the group column (group_col argument) being used as the gap classifier is unique and is not duplicated within either of the datasets being compared. Duplicate group names can skew gap analysis results. For example, using the results_2010 and results_2011 samples where Student represents the group_col argument, 'Rohit' group results are skewed. This is because student 'Rohit' is provided in the results_2011 sample twice. In this example Rohit represents two different students with the same last name. In short, if there are more than (2) records per grouping in a typical Gap analysis result, it suggests that there is a duplicate grouping contained within a single dataset that should be renamed and re-identified. If this occurs, changes should be made to the comparison datasets, and the Gap analysis should be re-conducted. The best way to prevent skewed results from occurring is to use all the columns that uniquely define a row in the group column argument - in this case, both name and student columns (alternatively a unique key identifier can also be constructed).
Expand Down

0 comments on commit 1710346

Please sign in to comment.