Adapt nuisance est for `IV-type` score (PLR) & new score `IV-type` for PLIV #161

MalteKurz · 2022-05-20T13:06:47Z

Description

PLR

Nuisance estimation for IV type score: In this PR the nuisance estimation for the IV-type score in the PLR model is adapted to be in line with the DML paper Chernozhukov et al. (2018).
- Results for the default score='partialling out' (Equation (4.4) in Chernozhukov et al. (2018)) are not affected by the changes in this PR. However, the naming of the nuisance parameter is changed from ml_g to ml_l (analogously predictions g_hat have been renamed to l_hat, etc.) to be better in line with Chernozhukov et al. (2018). To make the transition to the new naming smooth, depreciation warnings have been added (see below for an overview of the API changes and examples for the depreciation warnings).
- For the score='IV-type' (Equation (4.3) in Chernozhukov et al. (2018)) the implementation now follows the approach described on pp. C31-C33 in Chernozhukov et al. (2018). This means that an initial estimate for theta_0 is obtained via the 'partialling out' score. Then an estimate for g_0(X) is obtained by regressing Y - theta_0 * D on X. Therefore, an additional learner (not needed to evaluate the score) needs to be provided, i.e., the nuisance function l_0(X) (needed for the preliminary theta_0 estimate) is estimated with learner ml_l and g_0(X) with learner ml_g. To make the transition to the new API (additional learner) smooth, depreciation warnings have been added (see below for an overview of the API changes and examples for the depreciation warnings). Especially, if only ml_g is specified but not ml_l, then ml_g = clone(ml_l) is being used and a warning is being thrown.

PLIV

In this PR a new score function for the PLIV model is implemented:
- Results for the default score='partialling out' (Equation (4.8) in Chernozhukov et al. (2018)) are not affected by the changes in this PR. However, the naming of the nuisance parameter is changed from ml_g to ml_l (analogously predictions g_hat to l_hat, etc.) to be better in line with Chernozhukov et al. (2018). To make the transition to the new naming smooth, depreciation warnings have been added (see below for examples).
- A new score='IV-type' (Equation (4.7) in Chernozhukov et al. (2018)) is now available for the PLIV model. The estimation of the nuisance parts follows the approach described on p. C33 in Chernozhukov et al. (2018). This means that an initial estimate for theta_0 is obtained via the 'partialling out' score. Then an estimate for g_0(X) is obtained by regressing Y - theta_0 * D on X. Therefore, two additional learners (not needed to evaluate the score) need to be provided, i.e., the nuisance functions l_0(X) and r_0(X) (needed for the preliminary theta_0 estimate) are estimated with learner ml_l and ml_r. g_0(X) is estimated with learner ml_g.

API changes

PLR

API changed from DoubleMLPLR$new(obj_dml_data, ml_g, ml_m [, ...]) to DoubleMLPLR$new(obj_dml_data, ml_l, ml_m, ml_g [, ...]).
- For score='partialling out' ml_l & ml_m are needed.
- For score='IV-type' ml_l, ml_m & ml_g.
- For function()s as score ml_l & ml_m are mandatory and ml_g optional.
If a function() is provided as score, it must be of the form function(y, d, l_hat, m_hat, g_hat, smpls) (previously function(y, d, g_hat, m_hat, smpls)).

PLIV

API changed from DoubleMLPLIV$new(obj_dml_data, ml_g, ml_m, ml_r [, ...]) to DoubleMLPLIV$new(obj_dml_data, ml_g, ml_m, ml_r, ml_g [, ...]).
- For score='partialling out' ml_l, ml_m & ml_r are needed.
- For score='IV-type' ml_l, ml_m, ml_r & ml_g.
- For function()s as score ml_l, ml_m & ml_r are mandatory and ml_g optional.
If a function() is provided as score, it must be of the form function(y, z, d, l_hat, m_hat, r_hat, g_hat, smpls) (previously function(y, z, d, g_hat, m_hat, r_hat, smpls)).

Depreciation warnings for the API changes for `DoubleMLPLR` and `DoubleMLPLIV`

Initialization code for the following code examples:

library(DoubleML)
library(mlr3)
library(mlr3learners)
library(data.table)
set.seed(2)
ml_l = lrn("regr.ranger", num.trees = 10, max.depth = 2)
ml_m = ml_l$clone()
ml_r = ml_l$clone()
ml_g = ml_l$clone()
plr_data = make_plr_CCDDHNR2018(n_obs=500)
pliv_data = make_pliv_CHS2015(n_obs=500)

For PLR & PLIV with score='partialling out' and if the learners are provided as positional arguments, nothing changed.

dml_plr_obj = DoubleMLPLR$new(plr_data, ml_l, ml_m, score='partialling out')
dml_pliv_obj = DoubleMLPLIV$new(pliv_data, ml_l, ml_m, ml_r, score='partialling out')

-- >Note however that, if, besides the learner, other arguments have also been provided as positional arguments, the changed API causes exceptions because the additional learner was added as fourth (PLR) / fifth (PLIV) argument

For PLR with score='partialling out' and keyword arguments ml_g and ml_m (old API naming), the learner provided for ml_g is used for ml_l and a warning is issued.

dml_plr_obj = DoubleMLPLR$new(plr_data, ml_g=ml_g, ml_m=ml_m, score='partialling out')

Warning message:
The argument ml_g was renamed to ml_l. Please adapt the argument name accordingly. ml_g is redirected to ml_l.
The redirection will be removed in a future version.

For PLR with score='IV-type' and keyword arguments ml_g and ml_m (old API naming), the learner provided for ml_g is also used for ml_l and a warning is issued. (Note it is first redirected to ml_l and then cloned to ml_g)

dml_plr_obj = DoubleMLPLR$new(plr_data, ml_g=ml_g, ml_m=ml_m, score='IV-type')

Warning messages:
1: The argument ml_g was renamed to ml_l. Please adapt the argument name accordingly. ml_g is redirected to ml_l.
The redirection will be removed in a future version. 
2: For score = 'IV-type', learners ml_l and ml_g should be specified. Set ml_g = ml_l$clone().

For PLR with score='IV-type' and only two learners as positional arguments, the learner provided for ml_g is used for ml_l and a warning is issued.

dml_plr_obj = DoubleMLPLR$new(plr_data, ml_l, ml_m, score='IV-type')

Warning message:
For score = 'IV-type', learners ml_l and ml_g should be specified. Set ml_g = ml_l$clone().

For PLR & PLIV with score score='partialling out', the methods set_ml_nuisance_params and tune redirect ml_g to ml_l.

dml_plr_obj = DoubleMLPLR$new(plr_data, ml_l, ml_m, score='partialling out')
dml_plr_obj$set_ml_nuisance_params('ml_g', 'd', list(num.trees = 10, max.depth = 2))

Warning message:
Learner ml_g was renamed to ml_l. Please adapt the argument learner accordingly. The provided parameters are set for ml_l. The redirection will be removed in a future version.

Miscellaneous

When the score is set to a function(), it will in the future be called with keyword-arguments only (instead of positional arguments). This way is "safer" and in some way indirectly checks (up to a certain degree) that the signature of the function is as expected (see docu entry of the argument score for the expected signature). This was implemented for all model classes PLR, PLIV, IRM & IIVM
The website, user guide, etc will get an update to reflect the changes of this PR: Update of the basics of DML article; new score for PLIV; adaption due to changed API of DoubleMLPLR & DoubleMLPLIV doubleml-docs#73

PR Checklist

The title of the pull request summarizes the changes made.
The PR contains a detailed description of all changes and additions.
The code passes R CMD check and all (unit) tests (see our contributing guidelines for details).
Enhancements or new feature are equipped with unit tests.
The changes adhere to the "mlr-style" standards (see our contributing guidelines for details).

…t 1)

…PLR model

…score

…ng parameter where measure is set to NULL

… warning checks

…-pliv-iv-type

…ollowed by positional arguments

suggest test for functional initializer for IV-type score

tests/testthat/test-double_ml_pliv_two_way_cluster.R

PhilippBach

Hi @MalteKurz ,

thanks for the PR fixing the IV-type score for the PLR and implementing it for PLIV (case with 1 instrument and partialling out X).

I only have minor comments and have I suggestion for an additional test (5dfe618) covering the functional initialization of the PLIV partial X with score = "IV-type" .

Feel free to integrate this or to drop it as you like. The other changes only refer to the exception handling in case user provide ml_g with score = "partialling out" (no strong opinion on this) and a minor change to the format of the documentation

Overall this looks good and is ready to be merged (subject to anything you'd like to change)

R/double_ml_pliv.R

Co-authored-by: PhilippBach <[email protected]>

…ore partialling out

… out (see c793960)

…ling out (see c793960)

MalteKurz · 2022-06-10T12:10:25Z

@PhilippBach Thanks for the review. I adapted the code accordingly. Additionally, I also adapted the corresponding Python PR such that the newly introduced warning is also present there. I also checked your suggested new unit test: I opened a corresponding PR in order to integrate it into this PR and added a comment with a suggestion / extension, see #162.

test for initializer with IV-type score PLIV

Unit test for functional initializer for PLIV

PhilippBach

I think this looks good now. Thanks @MalteKurz for incorporating these additional changes

drop mtry parameter for bonus example, adjust naming of learner according to #161

MalteKurz added 30 commits April 22, 2022 10:28

adapt nuisance estimation for the IV type score in the PLR model (par…

4f7617e

…t 1)

ignore NA's for the estimation of the initial theta guess

9aba914

adapt unit test after 4f7617e

baeb9dd

tuning with adapted nuisance estimation for the IV type score in the …

2ea12e1

…PLR model

apply styler

eacca32

update in the API documentation

da37e64

extend the prediction export unit test to g_hat for PLR with IV-type …

5956dd6

…score

PLR: Rename ml_g to ml_L; Add additional learner ml_g for IV-type score

ff9547c

apply styler

9ba0841

add some deprecation warning

c66447e

bug fix; typo in pkg name

f322f0f

bug fix; typo in pkg name

4aac7d6

bug fix; Method tune() should be callable with the default tune_setti…

5aac1a2

…ng parameter where measure is set to NULL

transfer the bug fix 5aac1a2 which is also needed for the deprecation…

b59add5

… warning checks

add unit tests for the deprecation warnings

9d3d06d

refactor the check and set learner part of the initializer

0ab343a

minor adaptions and fixes in the unit tests

96fd8aa

rename learner ml_g into ml_l for the PLIV model

71f3223

add deprecation warning for renamed learner

0792b49

apply styler

8555c7f

apply styler

be116a6

Merge branch 'm-plr-api' of github.com:DoubleML/doubleml-for-r into m…

553e8b5

…-pliv-iv-type

implementation of the IV-type score for the PLIV model

bf73b8f

complement renaming of ml_g to ml_l at some more places

a5d69f6

Merge branch 'm-plr-api' of github.com:DoubleML/doubleml-for-r into m…

ad790af

…-pliv-iv-type

prefer to not have positional arguments followed by named arguments f…

e7ccf69

…ollowed by positional arguments

apply styler

b6316ce

add unit tests for the new IV-type score of the PLIV model

9697bf0

finalize IV-type score implementation & apply styler

4fd4d4a

add unit tests for new deprecation warnings for the PLIV model

b82b074

5dfe618

suggest test for functional initializer for IV-type score

PhilippBach reviewed Jun 9, 2022

View reviewed changes

tests/testthat/test-double_ml_pliv_two_way_cluster.R Outdated Show resolved Hide resolved

PhilippBach reviewed Jun 9, 2022

View reviewed changes

MalteKurz commented Jun 10, 2022

View reviewed changes

R/double_ml_pliv.R Outdated Show resolved Hide resolved

MalteKurz and others added 3 commits June 10, 2022 07:50

Apply suggestions from code review: remove redundant z assignments

0b7b8f9

Co-authored-by: PhilippBach <[email protected]>

Consistent format if score is a character

c9ecd39

Co-authored-by: PhilippBach <[email protected]>

Consistent format if score is a character

a2e2326

Co-authored-by: PhilippBach <[email protected]>

MalteKurz mentioned this pull request Jun 10, 2022

Unit test for functional initializer for PLIV #162

Merged

MalteKurz added 7 commits June 10, 2022 08:06

re-build docu

dbb2a75

apply styler

f4362f8

add a warning if a learner ml_g is specified (but not needed) with sc…

c793960

…ore partialling out

adapt unit test for new warning if ml_g is set with score partialling…

53ca073

… out (see c793960)

add a unit test for the new warning if ml_g is set with score partial…

1a508dd

…ling out (see c793960)

fix unit test

9449286

apply styler

ecd05c2

PhilippBach mentioned this pull request Jun 10, 2022

Reminder: Adapt new example notebooks according to change in 'IV-type' score DoubleML/doubleml-docs#78

Open

PhilippBach and others added 3 commits June 10, 2022 21:09

bfc9c91

test for initializer with IV-type score PLIV

Merge pull request #162 from DoubleML/p-suggest-change-pliv

46b5929

Unit test for functional initializer for PLIV

apply styler

b96ac05

This was referenced Jun 13, 2022

Rename abstract methods #163

Merged

Release notes for the R and Python pkg in version 0.5.0 DoubleML/doubleml-docs#79

Merged

PhilippBach reviewed Jun 13, 2022

View reviewed changes

MalteKurz merged commit fd2dce8 into master Jun 14, 2022

MalteKurz mentioned this pull request Jun 14, 2022

Updates for the adapted nuisance est for the IV-type score (PLR) & the new IV-type score for PLIV DoubleML/doubleml-py-vs-r#12

Merged

MalteKurz deleted the m-pliv-iv-type branch June 15, 2022 07:31

This was referenced Jul 4, 2022

Failing jobs based on PLR DoubleML/DoubleMLReplicationCode#1

Closed

Fixes in Code Chunks and Simulation Examples DoubleML/DoubleMLReplicationCode#2

Merged

PhilippBach added a commit that referenced this pull request Feb 13, 2023

194e049

drop mtry parameter for bonus example, adjust naming of learner according to #161

PhilippBach mentioned this pull request Feb 13, 2023

Updates to Vignette #179

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adapt nuisance est for `IV-type` score (PLR) & new score `IV-type` for PLIV #161

Adapt nuisance est for `IV-type` score (PLR) & new score `IV-type` for PLIV #161

Uh oh!

MalteKurz commented May 20, 2022 •

edited

Loading

Uh oh!

Uh oh!

PhilippBach left a comment

Uh oh!

Uh oh!

MalteKurz commented Jun 10, 2022

Uh oh!

PhilippBach left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Adapt nuisance est for IV-type score (PLR) & new score IV-type for PLIV #161

Adapt nuisance est for IV-type score (PLR) & new score IV-type for PLIV #161

Uh oh!

Conversation

MalteKurz commented May 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

PLR

PLIV

API changes

PLR

PLIV

Depreciation warnings for the API changes for DoubleMLPLR and DoubleMLPLIV

Miscellaneous

PR Checklist

Uh oh!

Uh oh!

PhilippBach left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MalteKurz commented Jun 10, 2022

Uh oh!

PhilippBach left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Adapt nuisance est for `IV-type` score (PLR) & new score `IV-type` for PLIV #161

Adapt nuisance est for `IV-type` score (PLR) & new score `IV-type` for PLIV #161

MalteKurz commented May 20, 2022 •

edited

Loading

Depreciation warnings for the API changes for `DoubleMLPLR` and `DoubleMLPLIV`