Additions:
- Pair of UDFs for simple random sampling with replacement.
- More dependencies are now packaged in DataFu, so fewer JAR dependencies are required.
- SetDifference UDF for computing set difference (e.g. A-B or A-B-C).
- HyperLogLogPlusPlus UDF for efficient cardinality estimation.
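
To make the semantics of two of the additions concrete, here is a minimal sketch in plain Python. It is not DataFu code and does not use the DataFu or Pig APIs. The function names `set_difference` and `sample_with_replacement` are hypothetical, and dropping duplicates from the result is an assumption of this sketch. It shows what A-B and A-B-C mean as set operations, and what "with replacement" implies for a simple random sample.

```python
import random


def set_difference(first, *others):
    """Return the distinct items of `first` that appear in none of `others`.

    Illustrates A-B (one extra collection) and A-B-C (two extra collections).
    Order of first appearance in `first` is preserved.
    """
    excluded = set().union(*others)  # everything that must be removed
    seen = set()
    result = []
    for item in first:
        if item not in excluded and item not in seen:
            seen.add(item)
            result.append(item)
    return result


def sample_with_replacement(items, k, seed=None):
    """Draw k items uniformly at random; the same item may be drawn repeatedly."""
    rng = random.Random(seed)  # local generator so a seed gives repeatable draws
    return rng.choices(items, k=k)


if __name__ == "__main__":
    a = [1, 2, 3, 4, 5]
    b = [2, 4]
    c = [5]
    print(set_difference(a, b))      # A-B
    print(set_difference(a, b, c))   # A-B-C
    # k may exceed len(items) because draws are made with replacement.
    print(sample_with_replacement(["x", "y", "z"], 5, seed=42))
```

Because draws are independent, the sample size can be larger than the input, which is not possible when sampling without replacement.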