Don't use function string matching for do.call(what=) or lapply(FUN=) #6217

MichaelChirico · 2024-07-03T06:11:43Z

Related: #6192. This first caught my eye because I know passing functions as strings makes things harder for static analyzers: e.g. any code looking for cbind() usage has to special-case the possibility that cbind is passed as a string in an appropriate context.

This is not a terribly big deal for base functions, but it is best practice when using dependencies, since it can be hard to tell statically whether downstreams get broken.
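To make the static-analysis point concrete, here is a minimal base R sketch. It is not how any particular linter works; uses_symbol() is a hypothetical helper that only checks which symbols appear in an expression via all.names().

```r
# Hypothetical helper: does `fn` appear as a symbol anywhere in `expr`?
uses_symbol <- function(expr, fn) fn %in% all.names(expr)

by_name   <- quote(do.call(cbind, l))    # function passed as a symbol
by_string <- quote(do.call("cbind", l))  # function passed as a string

found_by_name   <- uses_symbol(by_name, "cbind")    # TRUE
found_by_string <- uses_symbol(by_string, "cbind")  # FALSE: "cbind" is only a character constant
```

A symbol-based search sees the first form and misses the second unless it also special-cases string arguments of do.call(), lapply(), and friends.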

It's also a bit slower: match.fun() exits immediately when passed a function, but takes an extra lookup step when passed a string. That core part of function matching in lapply() is about 10x slower with a string (median 7290 ns vs. 730 ns below). For do.call(), the two forms are roughly the same:

library(microbenchmark)
microbenchmark(times=1e6, match.fun("is.na"), match.fun(is.na))
# Unit: nanoseconds
#                expr  min   lq     mean median   uq       max neval cld
#  match.fun("is.na") 6892 7183 8837.238   7290 7696 127587928 1e+06   b
#    match.fun(is.na)  575  677 1073.283    730  832  64672336 1e+06  a

l=list(1)
microbenchmark(times=1e6, do.call("identity", l), do.call(identity, l))
# Unit: microseconds
#                    expr   min    lq     mean median    uq      max neval cld
#  do.call("identity", l) 1.885 2.010 2.910809  2.094 2.451 42517.85 1e+06   a
#    do.call(identity, l) 1.850 1.966 2.915019  2.049 2.417 77993.08 1e+06   a
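For reference, a quick base R check that the two spellings give the same results, so only the lookup cost differs. This is just a sketch; `x` is an illustrative input, not something from the PR.

```r
# match.fun() returns a function as-is; a string is looked up by name.
f_from_fun <- match.fun(is.na)
f_from_str <- match.fun("is.na")
same_function <- identical(f_from_fun, f_from_str)

# lapply() resolves FUN through match.fun(), so both forms agree.
x <- list(1, NA, "a")
same_lapply <- identical(lapply(x, is.na), lapply(x, "is.na"))

# do.call() likewise accepts either a function or its name.
l <- list(1)
same_docall <- identical(do.call(identity, l), do.call("identity", l))
```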

PS Shouldn't we use NextMethod(), rather than call the S3 method explicitly?
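A toy illustration of the difference, assuming a made-up generic describe() and made-up classes (none of this is data.table code): an explicit method call hard-codes its target, while NextMethod() follows the object's class vector.

```r
# Hypothetical generic and methods, for illustration only.
describe <- function(x, ...) UseMethod("describe")
describe.default  <- function(x, ...) "default"
describe.base_cls <- function(x, ...) "base_cls"

# Explicit call: always jumps to the default method, skipping "base_cls".
describe.explicit_cls <- function(x, ...) c("explicit_cls", describe.default(x, ...))

# NextMethod(): continues dispatch with the next class of the object.
describe.next_cls <- function(x, ...) c("next_cls", NextMethod())

a <- structure(list(), class = c("explicit_cls", "base_cls"))
b <- structure(list(), class = c("next_cls", "base_cls"))

res_explicit <- describe(a)  # "explicit_cls" "default"
res_next     <- describe(b)  # "next_cls" "base_cls"
```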

github-actions · 2024-07-03T06:27:50Z

Generated via commit 6fb1c1f

Download link for the artifact containing the test results: ↓ atime-results.zip

Time taken to finish the standard R installation steps: 12 minutes and 29 seconds

Time taken to run atime::atime_pkg on the tests: 3 minutes and 50 seconds

tdhock

looks good, thanks

ben-schwen · 2024-07-03T11:22:48Z

LGTM, but I'm not sure I would apply it to test cases since some of them might explicitly check for working with characters, e.g. 23c88e7

MichaelChirico · 2024-07-08T20:06:58Z

> LGTM, but I'm not sure I would apply it to test cases since some of them might explicitly check for working with characters, e.g. 23c88e7

Thanks for flagging! Indeed, I skipped some test cases that specifically test string usage. In general you're right that altering test cases can be risky. The changes I did include are in tests clearly designed to check other things; if behavior w.r.t. strings is intended, that should get its own specific test. The footprint here is small enough that I'm confident we're OK.
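As a sketch of what "its own specific test" could look like, here is a base R version using stopifnot() rather than data.table's own test harness. apply_fun() is a hypothetical function under test, assumed to accept `fun` either as a function or as a string.

```r
# Hypothetical function under test; not part of data.table.
apply_fun <- function(x, fun) match.fun(fun)(x)

# Dedicated check: string input is supported on purpose.
stopifnot(identical(apply_fun(c(1, NA), "is.na"), c(FALSE, TRUE)))

# Tests that target other behavior can pass the function itself.
stopifnot(identical(apply_fun(c(1, NA), is.na), c(FALSE, TRUE)))
```

With a test like the first one in place, switching unrelated tests from strings to functions no longer risks silently dropping coverage of the string path.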

Don't use function string matching for is.na method

292db23

MichaelChirico requested a review from ben-schwen July 3, 2024 06:11

replace other do.call("str" usage

52fa300

MichaelChirico requested a review from tdhock as a code owner July 3, 2024 06:16

MichaelChirico added 2 commits July 3, 2024 08:19

other cases of lapply()

0c901ed

sapply usage

5074821

MichaelChirico changed the title ~~Don't use function string matching for is.na method~~ Don't use function string matching for do.call(what=) or lapply(FUN=) Jul 3, 2024

tdhock approved these changes Jul 3, 2024

ben-schwen approved these changes Jul 3, 2024

Anirban166 approved these changes Jul 4, 2024

Merge branch 'master' into isna-string

6fb1c1f

MichaelChirico merged commit 19a73c7 into master Jul 8, 2024
4 checks passed

MichaelChirico deleted the isna-string branch July 8, 2024 20:07
