Connor Boyle

Rep. Paul Gosar’s Claims about OPT Contain Major Errors

2025-05-23T00:00:00+00:00

U.S. Congressional Representative Paul Gosar of Arizona¹ recently reintroduced proposed legislation to ban the Optional Practical Training program (OPT).² OPT is a program that allows international students attending college & grad school to legally remain in the United States and work in their field of study for 1 year after completing their degrees.³ Students majoring in an approved science, technology, engineering, and/or math (STEM) field are eligible to apply for a 2-year extension at the end of their initial 1-year OPT periods.

As a former volunteer at my alma mater’s international student program, I happen to have some knowledge about OPT and the policies concerning student (F-1) visa holders that led me to suspect there were major errors in Paul Gosar’s statement regarding the bill, as well as in the articles from advocacy groups that he cites. In addition to some superficial and obvious errors, such as claiming that OPT was “expanded by three years by the Obama Administration” (it was in fact expanded only by seven months), Gosar’s post claims that:

These foreign workers are exempt from payroll taxes[,] making them at least 10-15 percent [sic] cheaper than a comparable American worker. NumbersUSA reports OPT costs the Social Security and Medicare trust fund [sic] $4 billion annually.

The document from NumbersUSA (an immigration restrictionist advocacy group) cites a 2024 article from the Center for Immigration Studies (CIS, another restrictionist group), which in turn simply scales up an earlier estimate from a 2015 CIS article.

The author of the 2015 article, David North, tallies the number of OPT & OPT STEM extension approvals, then multiplies by 12 or 17 months respectively (the STEM extension was only 17 months long at the time) to determine the number of years an OPT worker will have worked in the United States without being liable for Social Security and Medicare taxes. He proposes an estimated average OPT salary based on that of the average college graduate, and estimates the total loss to Social Security & Medicare like so:

\[(\textrm{FICA-exempt worker-years}) \cdot (\textrm{avg. salary}) \cdot (\textrm{total FICA rate}) = \textrm{total loss to Soc. Sec. \& Medicare}\] \[= \textrm{524,021 years} \cdot \textrm{\$50,000/yr.} \cdot 15.3\% = \textrm{\$4,008,760,600}\]
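As a quick check, North's arithmetic can be reproduced in a few lines of Python. This is only a sketch of the formula above; the function name `fica_loss_estimate` is my own, and the inputs are simply the figures quoted from the 2015 article.

```python
def fica_loss_estimate(worker_years, avg_salary, fica_rate):
    """Worker-years x average salary x combined FICA rate (the formula above)."""
    return worker_years * avg_salary * fica_rate


# Inputs as quoted from the 2015 CIS article.
estimate = fica_loss_estimate(worker_years=524_021, avg_salary=50_000, fica_rate=0.153)
print(f"${estimate:,.0f}")  # roughly $4.0 billion
```

Note that this treats every one of those worker-years as FICA-exempt, which is exactly the assumption examined in the next section.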

Many OPT Workers do in Fact Pay FICA

There are a few problems with this estimate, but I will start with the most glaring one: it is not true that all or even nearly all post-completion OPT workers (and their employers) are exempt from FICA. The reason that some OPT workers are exempt from FICA is that they are nonresident aliens. And while F-1 student visa holders (including OPT workers) are always considered nonresident aliens for immigration purposes, they are only treated as such for tax purposes until they pass the “substantial presence” test.⁴

The “substantial presence” test is rather complicated, but in a typical case, an international student will not be considered substantially present for their first 5 calendar years in the United States, due to a policy exempting F-1 visa holders for that period. The visa holder will usually come to be considered “substantially present”—assuming they spend a majority of that year in the United States—beginning January 1st of their 6th calendar year in the United States. The visa holder is then treated as a resident alien for tax purposes and is thus subject to the payroll taxes funding Social Security & Medicare, a.k.a. FICA. The only likely way for an international student not to be considered a resident alien is if they spend fewer than 183 days in the United States in their final calendar year in the United States.
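The typical case described above can be sketched in Python. This is a simplified illustration rather than tax advice: it uses the IRS's 31-day and 183-day weighted-day thresholds, assumes the student is an "exempt individual" for their first 5 calendar years, and ignores the closer connection exception, treaty provisions, and residency start-date rules. The names `counted_days` and `is_tax_resident` are hypothetical.

```python
EXEMPT_CALENDAR_YEARS = 5  # F-1 students: days in the first 5 calendar years are not counted


def counted_days(days_in_us, first_year, year):
    """Days that count toward the test in `year` (0 while the student is still exempt)."""
    if year < first_year + EXEMPT_CALENDAR_YEARS:
        return 0
    return days_in_us.get(year, 0)


def is_tax_resident(days_in_us, first_year, year):
    """Simplified substantial presence test for an F-1 holder who arrived in `first_year`."""
    current = counted_days(days_in_us, first_year, year)
    # Weighted count: all days this year, 1/3 of last year's, 1/6 of the year before that.
    weighted = (
        current
        + counted_days(days_in_us, first_year, year - 1) / 3
        + counted_days(days_in_us, first_year, year - 2) / 6
    )
    return current >= 31 and weighted >= 183


# A student who arrived in August 2018 and stayed through 2023.
days = {2018: 140, 2019: 365, 2020: 366, 2021: 365, 2022: 365, 2023: 365}
print(is_tax_resident(days, first_year=2018, year=2022))  # False: still an exempt year
print(is_tax_resident(days, first_year=2018, year=2023))  # True: 6th calendar year
```

Because the exempt years contribute zero counted days, the weighted sum in the 6th calendar year reduces to the days spent in the United States that year, which is why leaving before roughly early July keeps the student a nonresident.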

Substantial presence is not some niche exception; it ultimately applies to a very large number (perhaps a majority) of post-completion OPT workers at some point in their OPT period. Keep in mind that degree programs generally start and end mid-year rather than on calendar-year boundaries, so the partial calendar years at either end still count toward the 5 exempt years. This pushes F-1 holders/OPT workers into resident alien tax status sooner than if their programs were aligned to the calendar year.⁵

Degree type	Typical length	Max Yrs. as NRA post-completion (STEM / non-STEM)	Explanation	Exceptions
Bachelor’s (B.A. / B.S.)	4 years	0.5 / 0.5	Degree completed in 5th calendar year, likely begins post-completion OPT in June or July. Resident alien tax status kicks in January of the following calendar year	If the OPT worker leaves the US before early July of the following year, they will remain a nonresident alien for tax purposes
Master’s (M.A. / M.S.)	2 years	2.5 / 1	Degree completed in 3rd calendar year; resident alien tax status kicks in January 1 of 6th calendar year, roughly two and a half years after completion	Many master’s students were previously bachelor’s students in the US and thus would have 0 years of nonresident alien tax status post-completion
Law (J.D.)	3 years	N.A. / 1	Degree completed in 4th calendar year; resident alien tax status would kick in January 1 of 6th calendar year, roughly one and a half years after completion; however, law students do not qualify for the STEM extension, so their OPT ends after 1 year	Many law students were previously bachelor’s students in the US and thus would have 0 years of nonresident alien tax status post-completion
Doctorate (Ph.D.)	5+ years	0 / 0	Degree completed in 6th calendar year (optimistic for many fields); resident alien tax status kicks in during last year of degree (or sooner, if the student had previously studied in the US)
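The "Max Yrs. as NRA" column can be reproduced with a small Python sketch. It assumes, as the table does, that the student has no prior US presence, that the degree is completed mid-year, and that resident alien status begins on January 1 of the 6th calendar year. The function name and parameters are hypothetical.

```python
def max_nra_years_post_completion(degree_years, opt_years, exempt_calendar_years=5):
    """Upper bound on post-completion OPT time spent as a nonresident alien for tax purposes."""
    # Time elapsed since January 1 of the first calendar year, assuming a
    # fall start and a completion roughly halfway through the final calendar year.
    completion = degree_years + 0.5
    # Resident alien status starts once the exempt calendar years have passed.
    nra_window = max(0.0, exempt_calendar_years - completion)
    # The OPT period itself may end before the exempt years run out.
    return min(nra_window, opt_years)


STEM_OPT_YEARS = 3      # 1 year of OPT plus the 2-year STEM extension
NON_STEM_OPT_YEARS = 1

for name, years in [("Bachelor's", 4), ("Master's", 2), ("Doctorate", 5)]:
    stem = max_nra_years_post_completion(years, STEM_OPT_YEARS)
    non_stem = max_nra_years_post_completion(years, NON_STEM_OPT_YEARS)
    print(f"{name}: {stem} / {non_stem}")
```

For a J.D. (3 years, no STEM extension), the same function gives 1 year, since the 1.5-year nonresident window is capped by the 1-year OPT period.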

Unfortunately, I couldn’t find public reporting from ICE or USCIS that breaks down the share of OPT participants by level of degree attained. However, this report from the Niskanen Center, which obtained additional statistics on OPT educational attainment levels via a FOIA request, shows the levels of education attained by OPT workers:

From Niskanen Center

I have attempted to combine the Niskanen data, official ICE-reported numbers on OPT & OPT STEM extension approvals in 2022, and no small number of naïve and arbitrary assumptions⁶ to estimate the total number of years worked by post-completion OPT workers as nonresident aliens (NRAs) vs. as resident aliens (RAs) for tax purposes.

I estimated that in 2022, a narrow majority—about 53%—of all post-completion OPT time was worked while in resident alien status. In other words, if this estimate is accurate, most OPT workers and their employers at any given time are paying into the Social Security and Medicare trust funds at exactly the same rate as a US citizen and their employer would be. This estimate comes with much uncertainty; the result can shift quite dramatically depending on how many master’s graduates we assume to be substantially present by the time of graduation. That said, I think this estimation is useful as an exercise showing how complicated it is to estimate the amount of FICA paid by OPT workers, as well as a demonstration of the magnitude of CIS, NumbersUSA, and Paul Gosar’s error.

Nonresident Alien Tax Status is not Always Advantageous

Apart from the FICA exemption, nonresident alien status generally leads to a higher tax burden. Nonresident aliens other than Indian residents⁷ cannot take the standard deduction, and almost no nonresident aliens⁸ can take the earned income tax credit, the American opportunity tax credit, the lifetime learning credit, or many other deductions and credits that an American worker would be able to take.

I generated this chart here

(*) Assuming the nonresident alien cannot use the standard deduction, which is not true for Indian residents

(†) If the worker is a full-time student working for a college or university, that income is exempt from FICA, so the "FICA not paid" would instead be zero

In the above chart I have plotted the additional tax burden caused by a lack of standard deduction compared to the tax burden avoided by being exempt from FICA.⁹ Note that the lower one’s income, the smaller the tax advantage conferred by nonresident alien status. At income levels of around $20,000 or less, the increased tax burden due to a lack of standard deduction more than wipes out the entire employee FICA advantage for NRAs. You may think it unlikely that a post-completion OPT worker would make only $20,000 (or less) in a year; if so, I think you would be correct (see next section).
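The comparison in the chart can be approximated with a short Python sketch. It assumes 2024 single-filer federal brackets and the 2024 standard deduction of $14,600 (the chart itself may have used a different tax year), considers only the employee share of FICA (7.65%), and ignores treaties, credits, and state taxes. All function names are hypothetical.

```python
STANDARD_DEDUCTION = 14_600  # 2024, single filer (assumed tax year)
# (upper bound of bracket, marginal rate) for 2024 single filers, lower brackets only
BRACKETS = [(11_600, 0.10), (47_150, 0.12), (100_525, 0.22), (191_950, 0.24)]
EMPLOYEE_FICA_RATE = 0.0765  # 6.2% Social Security + 1.45% Medicare


def income_tax(taxable):
    """Federal income tax on `taxable` income (this sketch covers up to $191,950)."""
    if taxable > BRACKETS[-1][0]:
        raise ValueError("income above the brackets covered by this sketch")
    tax, lower = 0.0, 0
    for upper, rate in BRACKETS:
        if taxable <= lower:
            break
        tax += (min(taxable, upper) - lower) * rate
        lower = upper
    return tax


def extra_tax_without_standard_deduction(wages):
    """Additional income tax owed by a filer who cannot take the standard deduction."""
    return income_tax(wages) - income_tax(max(0, wages - STANDARD_DEDUCTION))


def employee_fica_avoided(wages):
    """Employee-side FICA not paid by a FICA-exempt nonresident alien."""
    return wages * EMPLOYEE_FICA_RATE


for wages in (20_000, 60_000):
    extra = extra_tax_without_standard_deduction(wages)
    avoided = employee_fica_avoided(wages)
    print(f"${wages:,}: extra income tax ${extra:,.0f}, FICA avoided ${avoided:,.0f}")
```

Under these assumptions, the extra income tax at $20,000 of wages slightly exceeds the FICA avoided, while at $60,000 the FICA exemption is worth noticeably more than the lost deduction.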

However, many international students likely earn wages in this range when they work on-campus while enrolled as a student. International students (like their domestic peers) can take part-time jobs at their home institutions, often helping to run facilities such as libraries and cafeterias or serving as teaching assistants (or even instructors, in the case of PhD students). In the case of these student workers, nonresident alien status loses its main advantage (that is, FICA exemption), because all student workers at colleges and universities—including U.S. citizens—are universally exempt from FICA. In other words, F-1 student visa holders are most likely to have nonresident alien status precisely when it is least advantageous, often increasing their total tax burden relative to a US citizen’s.

I would like to pause very briefly to bring up just how unfair the tax treatment of international students can be. Nonresident aliens are excluded from paying FICA for a specific reason—FICA specifically funds social insurance programs that most nonresident aliens will never be able to collect from (granted, this logic doesn’t particularly apply to their employers). What is the justification for why we take an extra 10% out of the wages of an international student paying their way through college by washing dishes in the cafeteria? If anyone can tell me a good reason, I’m all ears, but until then I’ll assume it’s nothing more nor less than “because we can”.

OPT Workers are Probably Paid Much More than the Average New College Graduate

In CIS’s 2015 estimate of lost FICA income due to OPT workers, author David North estimates the average income of an OPT worker as $50,000 per year, based on a 2011 report showing the average new college grad salary as $50,034.¹⁰ I’m a bit surprised he didn’t try to find a newer estimate or at least adjust for general wage growth, as this would have allowed him to claim that OPT was depriving Social Security and Medicare of an even larger amount of money.

On the other hand, I think this would contradict a larger narrative that Rep. Gosar and CIS seem to believe about OPT. Namely, they assert that OPT is fundamentally a sham program to funnel “inexpensive foreign labor” (in Paul Gosar’s words) into American jobs. Listening to immigration restrictionists, you’d get the impression that the average OPT worker has “graduated” from some fly-by-night for-profit diploma-cum-visa mill, or at best an un-selective associate’s degree program, in order to exploit the OPT “loophole” with their sham degree and work in some low-skill job with no real connection to their supposed field of study. As David North wrote for CIS in 2018, “many a pizza place is staffed with OPT workers”.

This certainly does not resemble the experiences of my international classmates at Macalester College, who often out-earned me at renowned firms such as Google, Ernst & Young, or Merck, to name a few. But perhaps my view from a selective liberal arts college is not representative of the typical OPT worker and their career. This spreadsheet published by ICE showing the top schools for active OPT records in 2022 might give us some insight into what kind of new college and university graduates are participating in OPT. Here are the top 20 schools:

#	Campus Name	Graduates Employed through OPT
1	Northeastern University	4,593
2	Columbia University	4,127
3	University of Southern California	3,520
4	New York University	2,996
5	Arizona State University	2,663
6	University of California at Berkeley	2,363
7	Carnegie Mellon University	2,134
8	The University of Texas at Dallas	2,096
9	Boston University	2,017
10	University of Illinois, Urbana-Champaign	1,996
11	The University of Texas at Arlington	1,894
12	Purdue University	1,797
13	University of Washington - Seattle	1,758
14	State University of New York at Buffalo	1,754
15	University of California San Diego	1,673
16	University of Michigan - Ann Arbor	1,635
17	University of California, Los Angeles	1,620
18	University of Pennsylvania	1,553
19	Harvard University	1,497
20	Georgia Institute of Technology	1,469
	…	…
	Total for Top 100 Schools	106,060
	Full Total (2022)	171,635

Not only are these schools reputable, legitimate institutions of higher learning, many of them are the most prestigious universities in the entire world!¹¹ The top 100 schools for OPT authorizations include almost the entire Ivy League (with the exceptions of Dartmouth and Princeton), Stanford, MIT, Duke, Johns Hopkins, and the University of Chicago. This is remarkable, considering that these exceptionally prestigious schools are generally not particularly large; for example, Harvard’s total enrollment of 21,278 is smaller than that of 11 California State University campuses, only one of which is among the top 100 OPT schools.

Using data on the top schools for OPT authorizations in 2022, I compared several schools’ share of all degrees awarded to their share of OPT workers. In that year, compared to the general population of new bachelor’s degree recipients, an OPT authorization recipient was:

over 7 times as likely to have graduated from Stanford,
over 11 times as likely to have graduated from MIT, and
over 13 times as likely to have graduated from Harvard

This discrepancy is due in part to the fact that these institutions grant far more master’s degrees than similarly sized institutions. However, even compared to the average master’s recipient, an OPT worker is:

2.23 times as likely to have graduated from Stanford,
2.77 times as likely to have graduated from MIT, and
1.92 times as likely to have graduated from Harvard
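Each "times as likely" figure above is a ratio of two shares: a school's share of OPT workers divided by its share of all degrees awarded. A minimal Python sketch of that ratio follows; the function name is hypothetical, and the numbers in the example are made-up placeholders chosen only to show the calculation, not real data for any school.

```python
def overrepresentation(opt_from_school, opt_total, degrees_from_school, degrees_total):
    """Ratio of a school's share of OPT workers to its share of all degrees awarded."""
    opt_share = opt_from_school / opt_total
    degree_share = degrees_from_school / degrees_total
    return opt_share / degree_share


# PLACEHOLDER inputs for illustration only: a school with 2% of OPT workers
# and 0.5% of degrees awarded is overrepresented by a factor of about 4.
ratio = overrepresentation(
    opt_from_school=20, opt_total=1_000,
    degrees_from_school=5, degrees_total=1_000,
)
print(f"{ratio:.2f}x")
```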

Sources such as this article from the St. Louis Federal Reserve Bank, as well as common sense, tell us that graduates of more selective universities tend to significantly out-earn graduates of less selective institutions. We also know that most OPT recipients are master’s degree holders; according to this report from the Bureau of Labor Statistics, master’s degree holders out-earned bachelor’s degree holders by 15% in 2022. Contrary to Paul Gosar’s description of OPT workers as “inexpensive foreign labor”, I suspect that OPT workers significantly out-earn the average new college graduate.

Gosar et al.’s Assumptions about Employment are Likely Wrong

Representative Gosar, NumbersUSA, and CIS seem to assume that OPT workers’ only impact on the U.S. economy is to take jobs away from U.S. citizens and green card holders who could have worked these jobs and now must be unemployed as a result. In their counterfactual analysis, these critics of OPT don’t consider the possibility that the quantity and quality of labor available could itself influence the rate of formation, expansion, or closure for firms. They also don’t consider that the quantity and quality of labor available could affect the quality and price of goods & services, and by extension the real incomes of U.S. citizens. They do not entertain the possibility that some of the jobs worked by OPT workers could have otherwise been outsourced, taken by a foreign competitor, or simply gone unfilled. Never mind that many former international students have founded multi-billion dollar companies that employ many thousands of U.S. citizens.

There is a large body of economics research analyzing the effects of skilled immigration on the US economy and workers. I am not by any means an expert in economics. However, the University of Chicago, itself home to one of the most highly regarded economics departments in the world, conducts polls of economists on various economics questions. For example, on the question of whether reducing the number of H-1(b) visas would increase employment opportunities for American workers, 45% of polled economists responded “disagree”, 36% responded “strongly disagree”, and 19% responded “uncertain”; 0% of economists responded “agree” or “strongly agree”. Put differently, the polled economists generally believe that H-1(b), a program that allows foreign skilled workers similar to those in OPT to work temporarily in the United States, does not significantly harm the employment prospects of American workers.

I would be interested to see whether these experts feel similarly about OPT as they do H-1(b); if so (and if they are not all wrong), then each OPT worker may in fact not be simply displacing an American worker, but instead adding new value to the American economy that wouldn’t have existed otherwise. In that case, I think it is only fair to consider some share of the FICA paid by OPT workers to be a net gain relative to the counterfactual in which those OPT workers were not allowed to work in the United States. Rather than depriving Social Security & Medicare of badly needed funds, OPT workers may be contributing more than would otherwise be present.

This paper, which studies the effects of the random lottery for H-1(b) visas on firms, estimates that each H-1(b) lottery win for a firm increases that firm’s total employment by 0.83. This is only a firm-level estimate; the effect on total employment in the United States economy may be greater or lesser than this number.

Out of a desire to estimate the net effect of OPT on the Social Security & Medicare trust funds, I’ll split the difference between:

Rep. Gosar, CIS, and NumbersUSA, who implicitly assume that each OPT worker adds roughly 0 new employment to the United States, and
the 42 expert economists polled by the University of Chicago, who seem to think that each skilled worker (in their case, in the H-1(b) program) adds approximately 1 (or more) total comparably-employed workers to the United States economy.

Therefore, I will assume that each OPT worker only increases total employment (comparable in compensation to the job worked by an OPT worker) in the United States by 0.5. Put differently, I’ll suppose, for the sake of estimation, that OPT workers really do displace American workers (for the duration of their OPT periods), but because they a) increase firm profitability, b) lower the cost of goods & services to American consumers, and c) work some jobs that may have otherwise gone unfilled or been taken by a foreign competitor, they generate enough additional economic value in the U.S. that the number of displaced workers is only half of the number of OPT workers.

Conclusion

To estimate the net gain or loss to the Social Security and Medicare trust funds, we must compare the reality of the US with OPT workers to the counterfactual without OPT workers. First, the FICA revenue generated in reality by OPT workers (directly or indirectly):

\[(\textrm{OPT}_{RA} + \textrm{OPT}_{total} \cdot \textrm{NewEmployment}) \cdot \textrm{AvgSalary} \cdot \textrm{FICA}\]

minus the counterfactual (in which OPT does not exist):

\[\textrm{OPT}_{total} \cdot \textrm{AvgSalary} \cdot \textrm{FICA}\]

where:

$\textrm{OPT}_{total}$ is the total number of OPT worker-years
$\textrm{OPT}_{RA}, \textrm{OPT}_{NRA}$ are the numbers of OPT worker-years spent in resident or nonresident alien status, respectively
$\textrm{NewEmployment}$ is the total (worker-years of) employment added to the US economy by each worker-year of OPT
$\textrm{AvgSalary}$ is the average salary of OPT workers
$\textrm{FICA}$ is the total FICA rate (employer and employee shares combined)

Subtracting the counterfactual from reality gives us:

\[(\textrm{OPT}_{RA} + \textrm{OPT}_{total} \cdot \textrm{NewEmployment} - \textrm{OPT}_{total}) \cdot \textrm{AvgSalary} \cdot \textrm{FICA}\] \[= (\textrm{OPT}_{total} \cdot \textrm{NewEmployment} - \textrm{OPT}_{NRA}) \cdot \textrm{AvgSalary} \cdot \textrm{FICA}\]

Now I’ll substitute the values we estimated earlier. This report from the National Association of Colleges and Employers says that new bachelor’s graduates in 2022 took home a median starting salary of $60,028. To give a very rough estimate of the salary of OPT workers (who, on average, are master’s degree holders), I’ll increase that by 15%, to get an average salary of $69,032.20.¹²

\[= (\textrm{246,989 worker-years} \cdot 0.5 - \textrm{115,508 worker-years}) \cdot \textrm{\$69,032.20} \cdot 15.3\%\] \[= \textrm{7,686.5 worker-years} \cdot 15.3\% \cdot \textrm{\$69,032.20}\] \[= \textrm{\$81,184,248.81}\]
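The formula can also be written as a short Python sketch, which makes it easy to see that the "reality minus counterfactual" form and the simplified form agree, and to try other assumptions for the employment effect. The function names are hypothetical. The last lines reproduce only the final multiplication step, using the 7,686.5 net worker-years and the salary figure stated in the text.

```python
def net_fica_effect(opt_total, opt_nra, new_employment, avg_salary, fica_rate):
    """FICA revenue in reality minus FICA revenue in the counterfactual without OPT."""
    opt_ra = opt_total - opt_nra
    reality = (opt_ra + opt_total * new_employment) * avg_salary * fica_rate
    counterfactual = opt_total * avg_salary * fica_rate
    return reality - counterfactual


def net_fica_effect_simplified(opt_total, opt_nra, new_employment, avg_salary, fica_rate):
    """The same quantity after cancelling terms, as in the derivation above."""
    return (opt_total * new_employment - opt_nra) * avg_salary * fica_rate


FICA_RATE = 0.153
avg_salary = 60_028 * 1.15      # NACE median starting salary plus 15%
net_worker_years = 7_686.5      # net worker-years figure used in the text

net_gain = net_worker_years * FICA_RATE * avg_salary
print(f"average salary: ${avg_salary:,.2f}")
print(f"net gain: ${net_gain:,.2f}")
```

Setting `new_employment` to 0 recovers the implicit assumption of Gosar, CIS, and NumbersUSA, in which case the result is simply the FICA not paid on nonresident worker-years, as a negative number.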

I estimate that the total net gain to the Social Security & Medicare trust funds due to the existence of OPT could be $81,184,248.81. I don’t expect anyone to take this estimate too seriously; I had to make arbitrary guesses for many very important values.⁶

That said, this estimate is at least as valid as that of CIS, which, through a chain of blindly trusting references, ended up being cited by a member of the United States Congress as if it were fact, despite a blatant error that anyone could easily have uncovered through a simple web search.

OPT workers are by and large graduates of quite selective colleges and universities, who in fact do contribute significantly to American social insurance programs from which they ultimately might never collect. Even from the most self-interested, nationalist perspective, these are among the most desirable people in the world to have in any country. When I read Paul Gosar describe OPT workers—which many of my closest friends have been at one time or another—as “cheap foreigners”, I was more than a little insulted on their behalf. I hope this blog post serves to correct the record. Most OPT workers are anything but cheap, low-quality labor incentivized by unfair tax breaks; they are some of the best and brightest of our workforce.

Rather than making yet another futile attempt to eliminate the OPT program, I would suggest that Congressman Gosar instead propose legislation to reduce or eliminate the time (currently 5 calendar years) that F-1 student visa holders are exempt from the substantial presence test. If the years of exemption were eliminated entirely, this would cause virtually all OPT workers to be subject to FICA for the entirety of their OPT periods, making international students contribute even more to the coffers of Social Security and Medicare than they already do. However, this would come at a cost to the United States Treasury, increasing our national deficit, as international student workers would then be able to take credits and deductions otherwise denied to them.

Addendum: No Minor Blunder

David North, the author of the 2015 CIS article providing the original framework for estimating FICA revenue supposedly lost to OPT, writes that he volunteered to help graduate students—including international students—file their taxes. And yet, he writes:

The assumption is that all these years of tax-free status were used; in reality, probably a small fraction of the tax-free status was not used because the recent graduate either moved on to another visa category, left the nation, or, in a very few cases, died.

North gives no indication that a typical OPT worker would, in fact, be fully liable for FICA for a significant share (possibly most or all) of their OPT time. I’m tempted to presume that he knew how many OPT workers used OPT after getting master’s degrees, which in some cases could allow them to be FICA-exempt for nearly the entirety of 1-year OPT and a STEM extension (assuming they did not get their bachelor’s degrees in the US, too). However, he specifically uses the average salary of new college graduates as the estimated salary of OPT workers, indicating he assumes they are primarily bachelor’s degree holders.

This is no small blunder! Determining tax residency is literally the first (and probably most important) step in filing taxes as a noncitizen. It determines whether the noncitizen filer should submit a standard Form 1040 (resident) or a Form 1040-NR (nonresident), as well as which tax filing software the filer can use (e.g. TurboTax vs. Sprintax). So either:

North was woefully incompetent to assist any noncitizen living in the U.S. with their taxes, possibly causing them to file completely incorrectly,
North lied about having helped international students with their taxes, or
North knew that many OPT workers would not be exempt from FICA, but intentionally lied to his readers by writing that nearly all OPT workers are exempt from FICA

What’s more, North did not just make this misrepresentation in a blog post (that was ultimately read and cited by a member of the United States Congress), he made this same claim in an amicus brief submitted to a federal court in 2019!¹³ This level of carelessness from a professional think-tank writer who had written well over a dozen articles on this topic before submitting this amicus brief is remarkable, especially considering this information was easily available to the public on the IRS website as early as 2013.

Thank you to my friend Stephanie Hou, who helped to proofread this post. Any mistakes are my own.

Footnotes:

In an earlier version of this post, I mistakenly wrote that Paul Gosar is the representative from Nebraska instead of Arizona. A friend helpfully pointed out this error to me. ↩
I will mostly discuss post-completion OPT in this post, since post-completion OPT seems to specifically be the part of the program that Paul Gosar and other restrictionists criticize, as opposed to pre-completion OPT or Curricular Practical Training (CPT). ↩
F-1 visa holders can (and often do) pick a start date for their post-completion OPT period as late as sixty (60) days after the end of their program. The OPT work permit lasts for 365 days, and, assuming the student maintains valid employment for the entire period, ends with a 60-day grace period where the student cannot work but can legally remain in the United States. ↩
The “substantial presence” test has an exception: the “closer connection” test (TODO: write more about this and why it’s unlikely to apply to OPT holders) ↩
Many international students set their OPT start date as late as possible; they can set it up to 60 days after their graduation. They do this for two reasons:
- USCIS can be very slow to process OPT applications, taking 90-120 days in some cases (note that F-1 visa holders can only apply at most 90 days in advance of graduation!!). A late start date ensures that the OPT worker maximizes their allowed work time and none is spent waiting for approval.
- Once their OPT period starts, F-1 visa holders can only be unemployed for a maximum of 90 days (although the STEM extension adds 60 unemployment days) before they will be marked out-of-status and be required to leave the country; since students have to set their OPT start date when they apply, they often set it late to ensure they will be able to find a job before their allowed unemployment time runs out.
↩
I made the following assumptions when calculating post-completion OPT time in RA vs. NRA tax status:
- that the share of degrees represented in OPT approvals in 2022 was identical to that in the 2004-2017 period
- that no OPT workers finished their degrees early or late
- that no bachelor’s degree OPT workers had previously studied or lived in the United States
- that 25% (an arbitrary figure that sounded plausible to me) of master’s degree holders had completed their bachelor’s degree in the US or would otherwise be considered substantially present by the time of graduation
- that master’s degrees take 2 years
- that all OPT workers completed their entire OPT (and STEM extension if so approved) without early termination or moving to another visa type (e.g. H-1b)
- that OPT STEM extensions are proportionately distributed among degree levels (excluding associate’s degrees, for which the STEM extension is not allowed)
See the spreadsheet I used for estimation here ↩ ↩²
Indian residents are allowed to take the standard deduction even when they are nonresident aliens (for tax purposes) due to a tax treaty between the United States and India. Some other countries have tax treaties as well, such as China, whose residents can take a deduction of $5,000 (while in NRA tax status), compared to the standard deduction of $14,600 for single filers in 2024. See Sprintax for more examples. ↩
For example, if a married couple, one of whom is a U.S. resident and one is not (for tax purposes), files jointly, the nonresident spouse can choose to be treated as a resident for tax purposes. ↩
The extra tax burden paid by nonresident aliens due to not being able to take the standard deduction and other deductions and tax credits is paid to the U.S. treasury, rather than to the Social Security and Medicare trust funds, as would be the case with FICA. This is a distinction without a difference in my opinion. ↩
In Jon Feere’s 2024 estimate he continues to use this figure ($50,000 / year) as the estimated wage for OPT workers. I’m particularly baffled by him continuing to use this decade-old number, especially since it would help his argument to update it! Inflation alone from 2015 to 2024 was 35%—average wages for new college graduates must have changed a lot, too! ↩
While the full “top 100” list contains overwhelmingly reputable institutions, I would say that at least one, possibly two, schools on the list do not quite fit most people’s idea of an ideal institution of higher learning. That said, the inclusion of these unorthodox schools doesn’t change the more important point: that OPT is very disproportionately used by graduates of the most prestigious schools in America. ↩
I believe $69,032.20 / year for the average OPT worker may still be an underestimate because: 1) we want the mean, not the median, and 2) it does not account for the more selective schools and higher-earning majors that OPT workers disproportionately choose. ↩
In the amicus brief, not only does North misrepresent the exemption from FICA as being universal for OPT workers, he also seems to misattribute the reason for the exemption. He conflates the F-1 “substantial presence” exception (the real reason why some OPT workers are exempt from FICA) with the “student” exemption to FICA, which only applies to students working at their home institution where they are currently enrolled full-time. In this later post from 2023, he seems to recognize that OPT workers’ exemption from FICA stems from their nonresident status due to exemption from the substantial presence test. ↩

I Found Out I’m Colorblind, So I Made a Program to Generate Images That I Can’t Read

2025-04-07T00:00:00+00:00

After realizing I was mildly colorblind, I made this program to generate colorblindness tests

I recently (read: several months ago) watched a video about so-called “color-corrective lenses” that supposedly enable colorblind people to see the full range of colors that people with full color vision can see. The video’s creator argues—successfully, in my opinion—that these color corrective lenses are essentially a scam. They cannot restore full color vision, and only assist in distinguishing pairs of colors by entirely blocking the light of one of the colors. They may have some practical use, but only at significant cost; for example, green traffic lights may appear completely black, as occurred for one reporter who tried on the glasses and drove his car (!!!) with them on.

What really got my attention was the video creator showing some of the colorblind tests that he, a colorblind person, had tried and failed. These tests, known as Ishihara test plates, consist of circles filled with irregularly sized and placed dots. I noticed that I, too, often could not read these test plates. Somewhat alarmed, I took several online colorblindness tests (including one furnished by a color corrective lens company). These tests generally indicated that I had mild deutan colorblindness. The cone receptors in my retinas that should be activated by green light are lacking in quantity or quality, and therefore my ability to distinguish red from green is significantly worse than that of someone with full color vision.

An example of an Ishihara plate, showing "74" in green dots surrounded by orange dots (from Wikipedia)

I was particularly shaken when I showed one of the Ishihara test plates to my friends & girlfriend. The plate above is the number 74 in green dots surrounded by orange dots; my friends & girlfriend told me they could easily perceive it as 74. This 74 was not (and still isn’t) clear at all to me! I can see that there’s a number, but I originally thought it might be a 21, and still can’t help but see it as a 21 sometimes.

I can't read this shirt. You can buy it from here, if you think it's funny

I became fascinated with Ishihara test plates and things like them (see above). I tried a few programs that I found to make my own Ishihara plates but none of them quite satisfied me in terms of power and customizability, so I decided to make my own using Rust and WebAssembly. This was my first time using WebAssembly, and I found the Conway’s Game of Life WASM tutorial very helpful, as well as Carl M. Kadie’s post Nine Rules for Running Rust in the Browser.

Creating the Algorithm

I decided to start with the simplest generation algorithm I could think of. First, we load the image as an array of pixels; we decide which pixels in the image should be “in” (or “on”) versus “out” (or “off”) depending on whether the pixel has a luma value greater or lesser than 0x7F (i.e. 127, or 50% of maximum illumination). Then we generate dots with random radii and coordinates within the image. If a dot doesn’t overlap with any already-added dot, we add the dot to the image. The dot’s color depends on whether more than 50% of the pixels inside of it are marked as “on”; if so, it is drawn with the “on” color, otherwise it is drawn with the “off” color.
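The steps above can be sketched in a few lines of Python. This is only an illustration of the idea, not the Rust code behind the actual program; the function name place_dots, the radius range, and the number of attempts are all made up for the example, and the image is assumed to have already been reduced to a grid of booleans (True for “on” pixels).

```python
import random


def place_dots(mask, n_attempts=2000, min_r=2.0, max_r=6.0, seed=0):
    """Naive dot placement. mask[y][x] is True for "on" pixels.

    Returns a list of (x, y, r, is_on) tuples. All parameter names and
    defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)
    height, width = len(mask), len(mask[0])
    dots = []
    for _ in range(n_attempts):
        r = rng.uniform(min_r, max_r)
        x = rng.uniform(r, width - r)
        y = rng.uniform(r, height - r)
        # Reject the candidate if it overlaps any dot added so far.
        # This linear scan is what makes the naive version quadratic.
        if any((x - dx) ** 2 + (y - dy) ** 2 < (r + dr) ** 2
               for dx, dy, dr, _ in dots):
            continue
        # Count how many pixels inside the dot are "on".
        inside = on = 0
        for py in range(int(y - r), int(y + r) + 1):
            for px in range(int(x - r), int(x + r) + 1):
                in_image = 0 <= px < width and 0 <= py < height
                if in_image and (px - x) ** 2 + (py - y) ** 2 <= r * r:
                    inside += 1
                    on += mask[py][px]
        if inside == 0:
            continue
        # The dot takes the "on" color if more than half its pixels are "on".
        dots.append((x, y, r, on * 2 > inside))
    return dots
```

Each returned tuple could then be drawn as a filled circle in either the “on” or the “off” color.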

This algorithm worked surprisingly well, except that it slowed down very quickly as the number of dots grew; the total number of overlap checks grew quadratically with the number of dots, i.e. $O(n^2)$. To cut down on required operations, I kept the list of added dots sorted by x-coordinate and used binary search to narrow down the dots to check for overlap to just those whose x-coordinate could possibly be in range of the new candidate dot.
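Here is a rough Python sketch of that optimization, using the standard library’s bisect module. The helper names overlaps_any and add_dot are hypothetical, and the sketch assumes every dot’s radius is at most max_r, so that only dots within r + max_r of the candidate’s x-coordinate can possibly overlap it.

```python
import bisect


def overlaps_any(sorted_dots, x, y, r, max_r):
    """sorted_dots is a list of (x, y, r) tuples kept sorted by x.

    max_r is the largest radius any dot may have (an assumption of this
    sketch), which bounds how far away an overlapping dot's center can be.
    """
    reach = r + max_r
    # Binary search for the slice of dots whose x lies in [x - reach, x + reach].
    lo = bisect.bisect_left(sorted_dots, (x - reach,))
    hi = bisect.bisect_right(sorted_dots, (x + reach, float("inf")))
    return any((x - dx) ** 2 + (y - dy) ** 2 < (r + dr) ** 2
               for dx, dy, dr in sorted_dots[lo:hi])


def add_dot(sorted_dots, x, y, r, max_r):
    """Insert the dot (keeping the list sorted) unless it overlaps another."""
    if overlaps_any(sorted_dots, x, y, r, max_r):
        return False
    bisect.insort(sorted_dots, (x, y, r))
    return True
```

Only the dots in the narrowed slice get the full distance check, so the work per candidate depends on how many dots share its vertical strip rather than on the total number of dots.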

The text "HELLO WORLD", with (bottom) and without (top) a set maximum share of the dot's area that can cross the on/off boundary

I also noticed that the “in” and “out” dots sometimes crossed the “in/out” boundary quite significantly, which made the outline of the represented text or number unclear. To make up for this, I added a user-set “tolerance” parameter, which defines the maximum share of a dot that can contain pixels of the “wrong” on/off value.
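A minimal sketch of how such a tolerance check could work, assuming we already know how many of a candidate dot’s pixels are “on” (the function name accept_dot and its signature are invented for this example):

```python
def accept_dot(on_pixels, total_pixels, tolerance):
    """Return (accepted, is_on) for a candidate dot.

    tolerance is the maximum share (0.0 to 1.0) of the dot's pixels that
    may have the "wrong" on/off value relative to the dot's own color.
    """
    is_on = on_pixels * 2 > total_pixels
    # Pixels that disagree with the color the dot would be drawn in.
    wrong = total_pixels - on_pixels if is_on else on_pixels
    return wrong <= tolerance * total_pixels, is_on
```

With a tolerance of 0.0, only dots that sit entirely on one side of the boundary are accepted; a candidate that straddles the boundary is simply thrown away, like one that overlaps an existing dot.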

Playing with the Ishihara Generator

I still don’t have a good pipeline for generating the black-and-white text images to use as input for the Ishihara test generator; my quick-and-dirty solution is to make a Google Doc with very large font bold text and take a screenshot of that. I’ve noticed that if you set padding (the minimum space between dots) and tolerance very low, you can end up with images where you can see the outline of text even with identically-colored dots:

Even without any coloring, you can sometimes see the outline of text if the tolerance parameter is set very low. Here, the tolerance is set to 0% and padding is set to 0

Technically, you don’t have to input a black-and-white image of text. It’s also fun to play around with wacky colors on any image with significant numbers of high-luma and low-luma pixels. For example, check out this cool visual output from a picture of an astronaut in Earth orbit:

Trippy!

Here’s the link to my program that I used to make all these images. The only thing you should need to use it is a modern web browser. Thanks to the power of WebAssembly, computation happens on the client-side, so you don’t even need a persistent network connection.

Flipping Coins in 100,000 Universes Wouldn’t Be as Close as the Polls in Wisconsin

2024-11-05T00:00:00+00:00

I just read Nate Silver’s blog post, where he writes that pollsters are systematically altering their data to roughly match the average of existing polls. According to Silver, rather than releasing their findings as-is, they’re worried they’ll look uniquely wrong, and so they’re settling for blending in with the crowd. He infers this bias from the numbers that the pollsters themselves report; the margins in several swing states are too consistently close to be plausible, even if the election truly is a dead tie among decided voters.

Other than a passing mention of the binomial distribution, Silver doesn’t “show his work” with much detail. Since probability math can be really easy to get wrong (at least for me!), I thought I’d take a stab at trying the brute force option of simulating polls in a hypothetical dead tie, i.e. exactly 50% of decided voters plan to vote for each of the major candidates, Harris & Trump (I also happen to be a computer programmer by hobby and profession, so maybe I’m just a hammer looking for a nail).

This little project was made possible thanks to Nate Silver’s blog, Silver Bulletin, collecting and distributing poll results. Here are the data files containing the poll results that I used for Wisconsin, Pennsylvania, and New Hampshire.¹

Simulating Wisconsin polls²

If the exact same number of likely or registered voters (depending on which poll) plan to vote for Harris as Trump, we can easily simulate the act of surveying them by flipping a coin. Even more easily, we can run the random number generator on my computer and check whether the output floating point number is greater than 0.5; if it is, that’s a Trump voter. Otherwise, that’s a Harris voter.
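As a rough illustration (not the code from my notebook), a single simulated poll might look like this in Python. The function name simulate_poll, the sample size of 800, and the sign convention (positive means a Trump lead) are choices made just for this sketch.

```python
import random


def simulate_poll(sample_size, rng):
    """Survey sample_size decided voters in a perfectly tied race.

    Returns the signed margin as a fraction of respondents;
    positive means Trump leads, negative means Harris leads.
    """
    # Each respondent is one "coin flip": a draw above 0.5 is a Trump voter.
    trump = sum(rng.random() > 0.5 for _ in range(sample_size))
    harris = sample_size - trump
    return (trump - harris) / sample_size


rng = random.Random(2024)  # seeded so the example is repeatable
print(simulate_poll(800, rng))
```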

After we simulate our polls, let’s extract our statistic of interest: the mean absolute margin. For example, if I have three polls with margins:³

\[\text{Trump} \space \text{+5%}\] \[\text{Harris} \space \text{+2%}\] \[\text{Harris} \space \text{+3%}\]

then their absolute margins are:

\[\text{0.05}\] \[\text{0.02}\] \[\text{0.03}\]

and the mean absolute margin for this universe of polls is:

\[\frac{0.05 + 0.02 + 0.03}{3} \approx 0.03333333333\]
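The same arithmetic as a tiny Python helper (the name mean_absolute_margin is mine; here Trump leads are positive and Harris leads negative, though the sign is discarded anyway):

```python
def mean_absolute_margin(margins):
    """Average of the absolute values of a list of signed poll margins."""
    return sum(abs(m) for m in margins) / len(margins)


# Trump +5%, Harris +2%, Harris +3%
print(mean_absolute_margin([0.05, -0.02, -0.03]))
```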

Here’s what the actual polls⁴ in real world Wisconsin done by real pollsters look like:⁵

An average poll of Wisconsin has one candidate beating the other (some of them Trump beating Harris, some of them vice versa) by about 2% (or ~~0.203~~ 0.0203, as shown in the graph). While the trendline is not terribly strong, we do find that the absolute margin of a poll goes down as sample size goes up, as we’d expect if the race were truly tied.

And here’s a simulation of what those polls could look like in an alternate universe where pollsters perfectly randomly sample the same number of people in a perfectly matched race between Donald Trump and Kamala Harris:

The output of this simulation certainly differs from our observed results–the simulated mean absolute margin is a full percentage point higher than the observed one. That doesn’t prove anything on its own, though; maybe this simulation of the Wisconsin polls just happened to result in a high mean absolute margin by chance.

Simulating a multiverse of polls

What happens if we run that simulation many, many times, keeping track of the resulting mean absolute margin for each simulation? Let’s look at the histogram we get when we do that:

Whoa! Our observed mean absolute margin of polls (the dashed red line to the left) is way lower than any of the MAMs in the multiverse where Harris and Trump are neck-and-neck. In fact, the lowest MAM out of 100,000 simulated universes is 0.02092 or 2.092%, still 0.06 percentage points higher than our observed MAM. Does this mean something is wrong? Well, I can’t think of any way these polls could get consistently closer margins than our simulations while still remaining scientifically valid. It’s hard to get a low variance estimate of a mean without increasing your sample size; that’s why the sample size $n$ is so important in scientific papers.
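For anyone who wants to try the multiverse idea without opening the notebook, here is a simplified Python sketch. It is not the notebook’s code: the function names are invented, and the sample sizes you pass in would need to come from the real poll data. It relies on the fact that the number of 1 bits in n random bits is distributed like the number of heads in n fair coin flips.

```python
import random


def simulate_universe(sample_sizes, rng):
    """Mean absolute margin of one universe of tied-race polls."""
    total = 0.0
    for n in sample_sizes:
        # Count of 1 bits among n random bits = heads in n fair coin flips.
        trump = bin(rng.getrandbits(n)).count("1")
        harris = n - trump
        total += abs(trump - harris) / n
    return total / len(sample_sizes)


def share_at_or_below(observed_mam, sample_sizes, n_universes=1000, seed=0):
    """Return (share of universes with MAM <= observed_mam, lowest MAM seen)."""
    rng = random.Random(seed)
    mams = [simulate_universe(sample_sizes, rng) for _ in range(n_universes)]
    at_or_below = sum(m <= observed_mam for m in mams)
    return at_or_below / n_universes, min(mams)
```

Calling share_at_or_below with the observed mean absolute margin and the real list of sample sizes gives the fraction of simulated universes that look at least as “tied” as the real polls, which acts as an empirical p-value for the dead-tie hypothesis.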

Recall also that we generously assumed the candidates had exactly even shares of decided voters. The more imbalanced the share of voters between the candidates, the higher we would expect the mean absolute margin to be. If you’re not convinced, look at this graph of simulations with varied shares for the candidates:

So, assuming the presidential race in Wisconsin isn’t exactly tied, the poll margins would look even more suspiciously close to zero than they already do!

With all that in mind, it seems hard to deny that there could be some systemic bias distorting these polls away from being true random samples of their populations–possibly herding driven by an aversion to publishing too strong of a poll for one or more of the candidates.

Some other states’ poll margins

Nate Silver noted that he observed herding in Pennsylvania as well, and our simulations show the same:

(the unnatural bias towards a tie looks even worse for Pennsylvania than it did for Wisconsin). However, the polls in New Hampshire apparently don’t suffer from herding:

You can see that not only is the mean absolute margin for New Hampshire not well below (to the left of) the simulation’s distribution, it is actually far above (to the right of) it. This makes sense; New Hampshire appears to be nowhere near tied, with nearly all polls giving a strong Harris lead. Note that our simulations actually wouldn’t be able to detect herding if it’s not occurring around a near-tie polling average; therefore all we can say is that Nate Silver could be right to acquit New Hampshire pollsters of the herding accusation.

How is this happening?

To be clear, no individual poll–even one with a very close margin–is by itself indicative of foul play by the pollster who created it. Rather, the aggregation of poll results for each of multiple swing states indicates systemic bias. I know almost nothing about political polls, but I recently read a great book about systemic problems in modern science called Science Fictions and it seems like there are a lot of ways to manipulate your data–even without intending to or realizing that you are doing it.

It’s totally plausible to me that pollsters are just focusing a lot more scrutiny on any result that shows a strong swing toward one candidate or another; maybe they’re more likely to throw out outliers or keep collecting more data if they start to see “too” wide of a margin. These hypotheses may sound very foolish to people more familiar with how polls are typically conducted; I’ll stop speculating before I make too much of a fool of myself, but suffice it to say there are a lot of ways for data to get distorted in any field of science and I would expect no less of political polling.

Why does this Matter?

After election results (either from exit polls or counting the votes) are announced, it may turn out that one or more of the swing states goes to either Trump or Harris by a very wide margin. Some people might look back at these polls and conclude that they constitute evidence of interference, cheating, voter suppression, fraud, or the like. After all, the pollsters nearly all agreed that these swing states were right on the margin. However, the high level of agreement between pollsters is not evidence that we know what the results will be, but rather that we can’t trust these polls, and therefore should be very uncertain about the outcome of this race.

Footnotes:

I had to delete a row representing a YouGov poll from each of the Wisconsin and Pennsylvania data files. For some reason, these polls had their sample sizes listed as 0, which is both logically impossible and impossible to simulate. I don’t believe they could have made a significant difference; each one being only one of 134 (Pennsylvania) or 100 (Wisconsin) polls, these YouGov polls could have at most impacted the observed or simulated mean absolute margin by a 100th of their corresponding values. ↩
I used a Jupyter notebook to simulate these polls, which can be found here ↩
In order to simplify the problem, I transformed each poll into a strictly binary poll consisting of only those respondents who responded that they intended to vote for Trump or Harris. This introduces some numerical error, since we have to infer the number of strict Trump-&-Harris-only respondents by dividing by the sum of the percentages for each candidate. Out of generosity to the quality of the polls, we consistently round up the inferred sample size to the nearest whole number. ↩
This whole post rests on the assumption that the polls on Silver Bulletin represent well the full distribution of seemingly “good” polls. Since Silver is complaining about and drawing attention to herding among pollsters, I have assumed that he himself is not consciously or unconsciously selecting specifically for closer polls in swing states. But technically, he or his blog staff could be responsible for 100% of the apparent herding if they are doing this! ↩
This figure in a previous version of this blog post had the observed mean absolute margin at completely the wrong value (just in the chart, not in the text of the blog post). A helpful Redditor pointed this out to me and I corrected this around 2024-11-05T21:59 UTC. ~~I show the full, transparent edit history of my entire website on this GitHub repo.~~ (EDIT 2024-11-16: I no longer do this; I’m currently figuring out a good way to show edit history without exposing my work-in-progress posts) ↩

How to build & push a Docker image directly to Minikube

2024-08-17T00:00:00+00:00

The other day, I was attempting to develop a Knative service and try it out on my local development set-up, which was a Minikube cluster. I assumed (incorrectly) that I could build a Docker image on the host machine and it would be automatically available to Minikube. However, this is not true, because Minikube has its own Docker daemon, inside of its own virtual machine (which, if your set-up is like mine, is itself running in a container on top of the host’s Docker daemon). While there is an easy and simple method that allows building a Docker image and pushing directly to your Minikube cluster’s Docker daemon, I don’t believe it is well-documented anywhere on the public web, so I thought I would write my own walkthrough.

The following walkthrough assumes that you have a running Minikube cluster and have installed kubectl.

Un-installing Snap Docker

First, if you are on Ubuntu, you need to make sure that you are not running the Snap version of Docker; the Docker client on your host machine will need to authenticate to the Docker daemon on the Minikube host using a cert file that is inaccessible to the Snap version of Docker, due to that Snap’s containment policy. So make sure you have installed Docker Desktop from the downloadable .deb file, or add Docker’s package repository and install Docker CE using Apt.

Connecting to the Minikube’s Docker daemon

In order to connect to the Docker daemon inside the Minikube VM, we will need to change the values of certain environment variables. Luckily, Minikube makes it easy for us to get these values with the following command:

minikube docker-env

(you will need to instead run minikube -p profile-name docker-env, with profile-name replaced by the name of your profile, if you want to connect to a Minikube profile other than the currently activated one)

This should return an output similar to the following:

export DOCKER_TLS_VERIFY="1"
export DOCKER_HOST="tcp://192.168.58.2:2376"
export DOCKER_CERT_PATH="/home/your-username/.minikube/certs"
export MINIKUBE_ACTIVE_DOCKERD="profile-name"

# To point your shell to minikube's docker-daemon, run:
# eval $(minikube -p profile-name docker-env)

Export these values to your current terminal’s environment by running the command described in the last line of the output, i.e.:

eval $(minikube -p profile-name docker-env)

(note: profile-name will likely be a different value when run on your machine; you should copy & run the output of your own minikube docker-env command, not the one on this webpage)
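If you would rather drive Docker from a script than eval the output in your shell, the export lines are easy to parse. The following Python sketch is one possible way to do it; the function name parse_docker_env is hypothetical, and it assumes the output keeps the export KEY="value" shape shown above.

```python
import shlex


def parse_docker_env(output):
    """Turn the `export KEY="value"` lines of `minikube docker-env` into a dict."""
    env = {}
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the trailing "# To point your shell..." comments.
        if not line.startswith("export "):
            continue
        # shlex handles the shell-style quoting around each value.
        for token in shlex.split(line[len("export "):]):
            key, sep, value = token.partition("=")
            if sep:
                env[key] = value
    return env
```

The resulting dict can be merged into os.environ and passed as the env argument of subprocess.run when invoking docker, so that only that one child process talks to Minikube’s Docker daemon.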

You can verify that your Docker client has successfully connected to the Minikube VM Docker daemon by running:

$ docker images
REPOSITORY                                         TAG                                        IMAGE ID       CREATED         SIZE
registry.k8s.io/kube-apiserver                     v1.30.0                                    c42f13656d0b   4 months ago    117MB
registry.k8s.io/kube-controller-manager            v1.30.0                                    c7aad43836fa   4 months ago    111MB
registry.k8s.io/kube-scheduler                     v1.30.0                                    259c8277fcbb   4 months ago    62MB
registry.k8s.io/kube-proxy                         v1.30.0                                    a0bf559e280c   4 months ago    84.7MB
...

your output should similarly contain several images from the Kubernetes official registry.

NOTE: the above will have to be re-run every time you open a new terminal, open a new SSH session, restart the computer, etc.

Running the Image on Minikube

To test that we can actually build an image to the Minikube VM’s Docker daemon, let’s start by making a directory named test-docker, then make a Dockerfile in it with the following contents:

FROM python
CMD python -c "print('Hello, world. This is Python, inside a Docker container, possibly on a Kubernetes cluster')"

In a terminal that has connected to the Minikube VM’s Docker daemon (by following the instructions above), cd into the parent directory of test-docker, then run:

docker build --tag my-python test-docker/

Now check that the image is available by running:

$ docker images
REPOSITORY                                         TAG                                        IMAGE ID       CREATED         SIZE
my-python                                          latest                                     17f99b663100   10 days ago     1.02GB
registry.k8s.io/kube-apiserver                     v1.30.0                                    c42f13656d0b   4 months ago    117MB
registry.k8s.io/kube-scheduler                     v1.30.0                                    259c8277fcbb   4 months ago    62MB
registry.k8s.io/kube-controller-manager            v1.30.0                                    c7aad43836fa   4 months ago    111MB
...

Now run a pod using this image with the following command:

$ kubectl run --image my-python --image-pull-policy Never my-python-pod
pod/my-python-pod created

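(the --image-pull-policy Never is necessary because an image without an explicit tag is treated as :latest, and for :latest images Kubernetes defaults to a pull policy of Always, so it tries to pull my-python from a remote registry instead of using the copy already in Minikube’s Docker daemon)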

Check that the pod has run:

$ kubectl get pods
NAME                                             READY   STATUS                   RESTARTS         AGE
...
my-python-pod                                    0/1     Completed                2 (13s ago)      15s

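(the pod’s STATUS may eventually change to CrashLoopBackOff; this is because kubectl run creates pods with a restart policy of Always by default, so Kubernetes restarts the container every time it exits, even after a successful run. Adding --restart=Never to the kubectl run command should avoid this)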
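The same pod can also be described declaratively. As a sketch, the following Python snippet prints a JSON pod manifest that sets both the image pull policy and the restart policy to Never (restartPolicy and imagePullPolicy are standard Pod spec fields; the helper name one_shot_pod_manifest is just for this example).

```python
import json


def one_shot_pod_manifest(name, image):
    """Build a Pod manifest for a locally built image that runs once and exits."""
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            # Don't restart the container after its command finishes.
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,
                    # Use the image already in the node's Docker daemon.
                    "imagePullPolicy": "Never",
                }
            ],
        },
    }


print(json.dumps(one_shot_pod_manifest("my-python-pod", "my-python"), indent=2))
```

Saving the printed JSON to a file and passing it to kubectl apply -f should create an equivalent pod, since kubectl accepts JSON manifests as well as YAML.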

You can see that the pod has completed the command described in the Dockerfile’s CMD directive by running:

$ kubectl logs my-python-pod
Hello, world. This is Python, inside a Docker container, possibly on a Kubernetes cluster

And there you have it! A Docker image built and run on your local Minikube cluster.

Scikit-Learn’s F-1 calculator is broken

2023-12-17T00:00:00+00:00

TL;DR: if you are using scikit-learn 1.3.X and use f1_score() or classification_report() with the argument zero_division=1.0 or zero_division=np.nan¹, then there’s a chance that the output of that function is wrong (possibly by any amount up to 100%, depending on the number of classes in your dataset). E.g. for zero_division=1.0:

>>> sklearn.__version__
'1.3.0'
>>> sklearn.metrics.f1_score(y_true=list(range(104)), y_pred=list(range(100)) + [101, 102, 103, 104], average='macro', zero_division=1.0)
0.9809523809523809  # incorrect

compare to (the exact same expression in an earlier version of Scikit-Learn):

>>> sklearn.__version__
'1.2.2'
>>> sklearn.metrics.f1_score(y_true=list(range(104)), y_pred=list(range(100)) + [101, 102, 103, 104], average='macro', zero_division=1.0)
0.9523809523809523  # correct

Similar cases for zero_division=np.nan (which was introduced in 1.3.0, so I can’t directly compare to the output in 1.2.2):

>>> sklearn.metrics.f1_score([0, 1], [1, 0], average='macro', zero_division=np.nan)
nan  # should be 0.0
>>> sklearn.metrics.f1_score([0, 1, 2], [1, 0, 2], average='macro', zero_division=np.nan)
1.0  # should be ~0.33

Both the Scikit-Learn maintainers and I consider the behavior in 1.3.X to be incorrect. While a pull request to fix this behavior was just merged, the fix has not yet shipped in any released version of Scikit-Learn. Therefore, the easiest solution to this specific problem is to revert to Scikit-Learn 1.2.2, or use zero_division=0.0 if possible, while being careful to understand how this parameter change will affect precision, recall, & F-1 (see below for an explainer on the purpose and function of the zero_division parameter).

(EDIT 2024-01-24: Scikit-Learn 1.4.0 has been released as of a week ago and contains a fix for this bug. Go and update now!)

The problem is that F-1 for an individual class is getting calculated as 1.0 or np.nan when precision & recall are both 0.0 (which is not the desired behavior for the zero_division parameter).

How did this happen?

Let’s take a look at some formulae for classification metrics:

\[\textrm{precision} = \frac{\textrm{true positive}}{\textrm{true positive} + \textrm{false positive}}\] \[\textrm{recall} = \frac{\textrm{true positive}}{\textrm{true positive} + \textrm{false negative}}\] \[\textrm{F}_1 = \frac{2 \cdot \textrm{precision} \cdot \textrm{recall}}{\textrm{precision} + \textrm{recall}}\]

There are three different places here where a division by zero can occur:

in precision, if true positive + false positive = 0 (the classifier made no positive predictions for the class)
in recall, if true positive + false negative = 0 (there are no truly positive examples of the class in the dataset)
in F-1, if precision = recall = 0 (the classifier has made a nonzero number of exclusively incorrect predictions)

Two of these are interesting cases where reasonable people could disagree on what the correct behavior should be:

When the classifier has made zero positive predictions for the class, should that count as a precision of 1.0? If “perfect precision” is interpreted as “no false positives”, then this is totally reasonable behavior.
When the gold dataset has zero true positive examples of the class, should that count as a recall of 1.0? This is a much more unusual scenario than the “zero positive predictions” example–a good evaluation dataset should almost never be entirely missing a class. However, this can realistically occur when evaluating on subsets of a large multiclass dataset. Again, if the definition of “perfect recall” is taken as “no false negatives”, then assigning a recall of 1.0 in this case is totally reasonable behavior.

For F-1, however, the “division by zero” case is not interesting or controversial in any way. If a classifier has achieved a recall of 0.0 (every truly positive example was missed) and a precision of 0.0 (all positive predictions are false), I don’t think any reasonable person would disagree about what the F-1 score should be: 0.0. Indeed, this is exactly how Scikit-Learn calculated F-1 right up to (and including) version 1.2.2, regardless of the value of the zero_division parameter.
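To make the intended behavior concrete, here is a small from-scratch sketch in plain Python (this is not Scikit-Learn’s implementation, and the function names are my own). The zero_division value is only used where precision or recall is undefined; F-1 is simply 0.0 whenever precision and recall are both zero.

```python
def per_class_f1(y_true, y_pred, zero_division=0.0):
    """Per-class F-1, applying zero_division to precision and recall only."""
    scores = {}
    pairs = list(zip(y_true, y_pred))
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        # Undefined precision/recall fall back to the zero_division value.
        precision = tp / (tp + fp) if tp + fp else zero_division
        recall = tp / (tp + fn) if tp + fn else zero_division
        if precision + recall == 0:
            # Both are zero: the classifier got nothing right for this class.
            scores[label] = 0.0
        else:
            scores[label] = 2 * precision * recall / (precision + recall)
    return scores


def macro_f1(y_true, y_pred, zero_division=0.0):
    """Unweighted mean of the per-class F-1 scores."""
    scores = per_class_f1(y_true, y_pred, zero_division)
    return sum(scores.values()) / len(scores)


# The example from the TL;DR: 100 of 105 classes are perfect, the rest score 0.
y_true = list(range(104))
y_pred = list(range(100)) + [101, 102, 103, 104]
print(macro_f1(y_true, y_pred, zero_division=1.0))  # about 0.952
```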

However, in Scikit-Learn 1.3.0, the zero_division parameter was turned into a kind of monkey’s paw that defines the behavior of any division-by-zero that happens to occur during the calculation of an F-1 score, leading to the bizarre scenario where a 100% wrong classifier can get an F-1 score of 100%:²

>>> sklearn.__version__
'1.3.0'
>>> print(sklearn.metrics.classification_report(y_true=[0, 1, 2, 3, 4], y_pred=[1, 2, 3, 4, 0], zero_division=1.0))
              precision    recall  f1-score   support

           0       0.00      0.00      1.00       1.0
           1       0.00      0.00      1.00       1.0
           2       0.00      0.00      1.00       1.0
           3       0.00      0.00      1.00       1.0
           4       0.00      0.00      1.00       1.0

    accuracy                           1.00       5.0
   macro avg       0.00      0.00      1.00       5.0
weighted avg       0.00      0.00      1.00       5.0

Why? Because precision and recall are both 0, which means the denominator of the F-1 formula is 0, and zero_division=1.0 now (as of Scikit-Learn 1.3.0) applies to the F-1 calculation itself, so that means F-1 is calculated (incorrectly) as 1.0!

Why does this matter?

I don’t know if there are rigorous statistics on this, but I’d wager that macro average F-1 is the most commonly used metric for multiclass classification by a wide margin. Scikit-Learn’s f1_score() function is in turn very likely the most commonly used implementation of F-1. Try asking Google or ChatGPT how to calculate F-1; the first results will very likely tell you to use this exact function in Scikit-Learn.

The kinds of tasks F-1 could be used for range from low-risk, like sentiment analysis on customer reviews, to some conceivably really safety-critical things. Imagine a researcher at an autonomous car company thinks their computer vision system is performing really well at recognizing all categories of objects & entities on the road. But actually, their classifier is completely missing every single example of a few classes!

Ideally, any machine learning practitioner should notice this bug well before a classifier is put into production or its results are reported in a submitted journal paper. On the other hand, you really would not expect the definition of F-1 to change from one version of Scikit-Learn to the next! While just about any programmer should be able to implement an F-1 calculator in very little time, most of us prefer to just import Scikit-Learn’s implementation, specifically to avoid gotcha edge cases like this one.

What should I do now?

If your project:

is using, has used, or may have used any Scikit-Learn version starting with 1.3.0 (released 2023-06-30)
contains any call to classification_report(), f1_score(), or fbeta_score(), and
that call contains the parameter zero_division=1.0 or zero_division=np.nan

it may have been affected by this bug. To determine if any particular F-1 score calculation was impacted by this bug, first change that F-1 score calculation to a classification_report() if possible. If any class in that classification report contains a precision of 0.0, a recall of 0.0, and an f1-score of 1.0 or nan, then the F-1 score for this classifier has been calculated incorrectly.
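That check is easy to automate. The sketch below assumes you have the report as a dictionary, which classification_report() returns when called with output_dict=True; the helper name suspect_classes is made up for this example.

```python
import math


def suspect_classes(report):
    """List classes with precision 0.0, recall 0.0, and an F-1 of 1.0 or nan.

    report is a dict shaped like classification_report(..., output_dict=True).
    """
    flagged = []
    for label, row in report.items():
        # "accuracy" maps to a bare float; the "... avg" rows are summaries.
        if not isinstance(row, dict) or label.endswith(" avg"):
            continue
        f1 = row["f1-score"]
        if (row["precision"] == 0.0 and row["recall"] == 0.0
                and (f1 == 1.0 or math.isnan(f1))):
            flagged.append(label)
    return flagged
```

A non-empty result means at least one per-class F-1 was inflated, so any macro or weighted average built from those scores is suspect too.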

Any call using zero_division=1.0 can be fixed by reverting to Scikit-Learn version 1.2.2. Unfortunately, the parameter zero_division=np.nan did not exist in Scikit-Learn 1.2.2, and I don’t believe there is any easy way to replicate it.

(EDIT 2024-01-24: Scikit-Learn 1.4.0 has been released, and you should update to it ASAP!)

Footnotes:

In this post, np.nan refers to numpy.nan ↩

A completely wrong classifier can also get an F-1 score of 0.0 in Scikit-Learn 1.3.X, for example:

>>> print(sklearn.metrics.classification_report(y_true=[0, 0, 0], y_pred=[1, 1, 1], zero_division=1.0))
              precision    recall  f1-score   support

           0       1.00      0.00      0.00       3.0
           1       0.00      1.00      0.00       0.0

    accuracy                           1.00       3.0
   macro avg       0.50      0.50      0.00       3.0
weighted avg       1.00      0.00      0.00       3.0

(correctly) receives an F-1 of 0.0, because in each class, either precision or recall (but never both) is zero, which means that the denominator of the F-1 score for each class is nonzero. ↩
