 # tax-microdata
 
-This repository contains all working files for a project to develop a
-general-purpose validated microdata file for use in
-[PolicyEngine-US](https://github.com/PolicyEngine/policyengine-us) and
-[Tax-Calculator](https://github.com/PSLmodels/Tax-Calculator). The
-development will proceed in several phases.
+This repository contains all working files for a project to develop
+validated input files for use in
+[Tax-Calculator](https://github.com/PSLmodels/Tax-Calculator).
 
-The **current version is 1.2.0**, which was released March 1, 2026,
+The **current version is 2.0.0**, which was released on March 29, 2026,
 and includes the following significant improvements:
-- impute values for three variables used in new OBBBA deductions:
-[#397](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/397)
-- fix long-standing bug in IRS PUF data handling:
-[#408](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/408)
+
+- generate national, state, and Congressional district, input files
+for **2022**:
+[#470](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/470)
+[#471](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/471)
+[#472](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/472)
+[#473](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/473)
+[#474](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/474)
+- improve the selection of CPS tax units to represent nonfilers:
+[#438](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/438)
 - vastly improve the reweighting algorithm:
 [#416](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/416)
-
+- impute values for three variables used in new OBBBA deductions:
+[#397](https://github.com/PSLmodels/tax-microdata-benchmarking/pull/397)
 
 ## Usage instructions
 
@@ -31,25 +36,10 @@ To generate the TMD files from the PUF files, do this:
 4. Run `make clean` in the repository's top-level folder
 5. Run `make data` in the repository's top-level folder
 
-The `make data` command creates and tests the three `tmd*csv*` data
-files, which are located in the `tmd/storage/output` folder. Read
-[this
+The `make data` command creates and tests the three national
+`tmd*csv*` data files, which are located in the `tmd/storage/output`
+folder. Read [this
 documentation](https://taxcalc.pslmodels.org/usage/data.html#irs-public-use-data-tmd-csv)
 on how to use these three files with Tax-Calculator. Also, you can
 look at the tests in this repository to see Python code that uses the
 TMD files with Tax-Calculator.
-
-## Examination results
-
-To assess, review the data examination results that compare federal
-agency tax microsimulation estimates for 2023 and 2026 with those
-generated using the national microdata files created in each project
-phase:
-* [phase 1 results](./tmd/examination/results1.md)
-* [phase 2 results](./tmd/examination/results2.md)
-* [phase 3 results](./tmd/examination/results3.md)
-* [phase 4+ results](./tmd/examination/results4.md)
-
-2022 TMD estimates have also been compared with IRS/SOI data on actual
-income tax returns for 2022 in [this
-document](./tmd/examination/2022/results.md).