Category: Education
Greater Bias Toward Transgender People Compared to Gay Men and Lesbian Women Is WEIRD
The greater acceptance of gay, compared with transgender, people in Western countries may be a result of a specific trajectory—where queer rights was centered by and around White, middle class, gender-conforming gay men—and may not generalize to other places. Two surveys of respondents in 23 countries (Ns∼ = 500 or 1,000 per country) showed that bias toward gay and transgender people is lower in Western (vs. non-Western) countries, but that the relative bias changes as a function of region: there is greater acceptance of gay (vs. transgender) people in most Western countries, whereas the reverse is true in most non-Western countries. Analyses of legal frameworks (N = 193) show that recognition of same-gender unions is prevalent in Western countries but virtually nonexistent elsewhere, whereas recognition of gender marker changes is prevalent throughout the world. Overall, in the most intolerant places, transgender people are relatively more accepted than gay people.
Here is the recent article by Jaimi L. Napier. Via a loyal MR reader.
Kimimania?
kimi.com, from China.
The Sputnik vs. DeepSeek Moment: Why the Difference?
In 1957, the Soviet Union launched Sputnik triggering a national reckoning in the United States. Americans questioned the strength of their education system, scientific capabilities, industrial base—even their national character. The country’s self-image as a global leader was shaken, creating the Sputnik moment.
The response was swift and ambitious. NSF funding tripled in a year and increased by a factor of more than ten by the end of the decade. The National Defense Education Act overhauled universities and created new student loan programs for foreign language students and engineers. High schools redesigned curricula around the “new math.” Homework doubled. NASA and ARPA (later DARPA) were created in 1958. NASA’s budget rocketed upwards to nearly 5% of all federal spending and R&D spending overall increased to well over 10% of federal spending. Immigration rules were liberalized (perhaps not in direct response to Sputnik but as part of the ethos of the time). Foreign talent was attracted. Tariff barriers continued to fall and the US engaged with international organizations and promoted globalization..
The U.S. answered Sputnik with bold competition not an aggrieved whine that America had been ripped off and abused.
America’s response to rising scientific competition from China—symbolized by DeepSeek’s R1 matching OpenAI’s o1—has been very different. The DeepSeek Moment has been met not with resolve and competition but with anxiety and retreat.
Trump has proposed slashing the NIH budget by nearly 40% and NSF by 56%. The universities have been attacked, creating chaos for scientific funding. International collaboration is being strangled by red tape. Foreign scientists are leaving or staying away. Tariffs have hit highs not seen since the Great Depression and the US has moved away from the international order.
Some of this is new and some of it is an acceleration of already existing trends. In Launching the Innovation Renaissance, for example, I said that by the Federal budget numbers, America is a warfare-welfare state not an innovation state. However, to be fair, there are some bright spots. Market‑driven research might partially offset public cuts. Big‑tech R&D now exceeds $200 billion annually—more than the entire federal government spending on R&D. Not everything we did post-Sputnik was wise nor is everything we are doing today foolish.
Nevertheless, the contrast is stark: Sputnik spurred investment and ambition. America doubled down. DeepSeek has sparked defensiveness and retreat. We appear to be folding.
Question of the hour. Why has America responded so differently to similar challenges? Can understanding that pivot help to reverse it? Show your work.
Economic literacy and public policy views
From a recent paper by Jared Barton and Cortney Rodet:
The authors measure economic literacy among a representative sample of U.S. residents, explore demographic correlates with the measure, and examine how respondents’ policy views correlate with it. They then analyze policy view differences among Republicans and Democrats and among economists and non-economists. They find significant differences in economic literacy by sex, race/ethnicity, and education, but little evidence that respondents’ policy views relate to their level of economic literacy. Examining heterogeneity by political party, they find that estimated fully economically literate policy views (i.e., predicted views as if respondents scored perfectly on the authors’ economic literacy assessment) for Democrats and Republicans are farther apart than respondents’ original views. Greater economic literacy among general survey respondents also does not result in thinking like an economist on policy.
Sad!
My excellent Conversation with David Robertson
David is one of my very favorite conductors of classical music, especially in contemporary works but not only. He also is super-articulate and has the right stage presence to make for a great podcast guest. Here is the audio, video, and transcript. Here is part of the episode summary:
Tyler and David explore Pierre Boulez’s centenary and the emotional depths beneath his reputation for severity, whether Boulez is better understood as a surrealist or a serialist composer, the influence of non-Western music like gamelan on Boulez’s compositions, the challenge of memorizing contemporary scores, whether Boulez’s music still sounds contemporary after decades, where skeptics should start with Boulez, how conductors connect with players during a performance, the management lessons of conducting, which orchestra sections posed Robertson the greatest challenges, how he and other conductors achieve clarity of sound, what conductors should read beyond music books, what Robertson enjoys in popular music, how national audiences differ from others, how Robertson first discovered classical music, why he insists on conducting the 1911 version of Stravinsky’s Petrushka rather than the 1947 revision, and more.
Here is one excerpt:
COWEN: I have some general questions about conducting. How is it you make your players feel better?
ROBERTSON: Oh, I think the music actually does that.
COWEN: But you smile at them, you occasionally wink or just encourage them, or what is it you do?
…ROBERTSON: There’s an unwritten rule in an orchestra that you don’t turn around and look at somebody, even if they’ve played something great. I think that part of our job is to show the rest of the players, gee, how great that was. Part of the flexibility comes from if, let’s say, the oboe player has the reed from God tonight, that if they want to stay on the high note a little bit longer, or the soprano at the Metropolitan Opera, that you just say, “Yes, let’s do this. This is one of these magical moments of humanity, and we are lucky to be a part of it.”
COWEN: When do the players look at you?
ROBERTSON: Oh, that’s a fabulous question. I’ll now have to go public with this. The funny thing is, every single individual in an orchestra looks up at a different time. It’s totally personal. There are some people who look up a whole bar before, and then they put their eyes down, and they don’t want any more eye contact. There are other people who look as though they’re not looking up, but you can see that they’re paying attention to you before they go back into their own world. And there are people who look up right before they’re going to play.
One of the challenges for a conductor is, as quickly as possible with a group you don’t know, to try and actually memorize when everybody looks up because I always say, this is like the paper boy or the paper girl. If you’re on your route, and you have your papers in your bicycle satchel, and you throw it at the window, and the window is closed, you’ll probably have to pay for the pane of glass.
Whereas if the window goes up, which is the equivalency of someone looking up to get information, that’s the moment where you can send the information through with your hands or your face or your gestures, that you’re saying, “Maybe try it this way.” They pick that information up and then use it.
But the thing that no one will tell you, and that the players themselves don’t often realize, is that instinctively, and I think subconsciously, almost every player looks up after they’ve finished playing something. I think it’s tojust check in to see, “Am I in the right place?”
Recommended.
The Impact of Dating Apps on Young Adults: Evidence From Tinder
Online dating apps have transformed the dating market, yet their broader effects remain unclear. We study Tinder’s impact on college students using its initial marketing focus on Greek organizations for identification. We show that the full-scale launch of Tinder led to a sharp, persistent increase in sexual activity, but with little corresponding impact on the formation of long-term relationships or relationship quality. Dating outcome inequality, especially among men, rose, alongside rates of sexual assault and STDs. However, despite these changes, Tinder’s introduction did not worsen students’ mental health, on average, and may have even led to improvements for female students.
That is from a new paper by Berkeren Büyükeren, Alexey Makarin, and Heyu Xiong.
New York facts of the day
It’s truly astonishing how fiscally irresponsible New York is. The state budget proposal calls for $254 billion in spending, which is 8.3 percent higher than last year. That comes despite New York’s population having peaked in 2020. It’s a spending increase far in excess of the rate of inflation to provide government services for fewer people.
Ditch compares the New York state budget to the Florida state budget, a sensible comparison since both are big states with major urban and rural areas and high levels of demographic and economic diversity. He finds:
- New York’s spending per capita was 30 percent higher than Florida’s in 2000. It was 133 percent higher last year.
- New York’s Medicaid spending per capita was 112 percent higher than Florida’s in 2000. It was 208 percent higher last year. Florida has not expanded Medicaid under Obamacare, while New York has expanded it more aggressively than any other state. “For perspective, in 2024 New York spent nearly as much per capita on Medicaid ($4,551) as Florida did for its entire state budget ($5,076).”
- New York’s education spending per student is highest in the country, at about $35,000. Florida spends about $13,000 per student. Florida fourth-graders rank third in the country in reading and fourth in math. New York fourth-graders rank 36th and 46th.
- Florida has surpassed New York in population and continues to boom.
Here is more from Dominic Pino.
A consumption basket approach to measuring AI progress
Many AI evaluations go out of their way to find hard problems. That makes sense because you can track progress over time, and furthermore many of the world’s important problems are hard problems, such as building out advances in the biosciences. One common approach, for instance, is to track the performance of current AI models on say International Math Olympiad problems.
I am all for those efforts, and I do not wish to cut back on them.
Still, they introduce biases in our estimates of progress. Many of those measures show that the AIs still are not solving most of the core problems, and sometimes they are not coming close.
In contrast, actual human users typically deploy AIs to help them with relatively easy problems. They use AIs for (standard) legal advice, to help with the homework, to plot travel plans, to help modify a recipe, as a therapist or advisor, and so on. You could say that is the actual consumption basket for LLM use, circa 2025.
It would be interesting to chart the rate of LLM progress, weighted by how people actually use them. The simplest form of weighting would be “time spent with the LLM,” though probably a better form of weighting would be “willingness to pay for each LLM use.”
I strongly suspect we would find the following:
1. Progress over the last few years has been staggeringly high, much higher than is measured by many of the other evaluations For everyday practical uses, current models are much better and more reliable and more versatile than what we had in late 2022, regardless of their defects in Math Olympiad problems.
2. Future progress will be much lower than expected. A lot of the answers are so good already that they just can’t get that much better, or they will do so at a slow pace. (If you do not think this is true now, it will be true very soon. But in fact it is true now for the best models.) For instance, once a correct answer has been generated, legal advice cannot improve very much, no matter how potent the LLM.
As in standard economics, consumption baskets change over time, and that can lead to different measures of progress (or in the economics context, different estimates of advances in living standards, depending on whether the ex ante or ex post bundle weights are used). Researchers could attempt the more speculative endeavor of estimating how LLMs will be used five years from now in everyday life (which will differ from the status quo), and then track progress on that metric, using those value weights. “How rapidly are we improving these systems on their future uses?”
This alternate consumption basket approach gives you a very different perspective on progress in AI.
Note also that the difference between the “Math Olympiad measurements of AI progress” and the “consumption basket measurements of AI progress” may iincrease over time, especiallly if the basket of everyday uses does not change radically. The everyday uses will peak out near maximum levels of performance, but there will always be a new series of very hard problems to stump the AIs. It will become increasingly unclear exactly how much AI progress we really are making.
The objectivity of Community Notes?
We use crowd-sourced assessments from X’s Community Notes program to examine whether there are partisan differences in the sharing of misleading information. Unlike previous studies, misleadingness here is determined by agreement across a diverse community of platform users, rather than by fact-checkers. We find that 2.3 times more posts by Republicans are flagged as misleading compared to posts by Democrats. These results are not base rate artifacts, as we find no meaningful overrepresentation of Republicans among X users. Our findings provide strong evidence of a partisan asymmetry in misinformation sharing which cannot be attributed to political bias on the part of raters, and indicate that Republicans will be sanctioned more than Democrats even if platforms transition from professional fact-checking to Community Notes.
Here is the full paper. I guess it agrees with Richard Hanania…
One possible reason why the skill premium is declining
This is especially true for those jobs that require the rudimentary use of technology. Until relatively recently, many people could get to grips with a computer only by attending a university. Now everyone has a smartphone, meaning non-graduates are adept with tech, too. The consequences are clear. In almost every sector of the economy, educational requirements are becoming less strenuous, according to Indeed, a jobs website. America’s professional-and-business services industry employs more people without a university education than it did 15 years ago, even though there are fewer such people around.
Here is more from The Economist, quite a good piece. Of course this is also a reason why smart phones are underrated.
Are cultural products getting longer?
Ted Gioia argues that cultural products are getting longer:
Some video creators have already figured this out. That’s why the number of videos longer than 20 minutes uploaded on YouTube grew from 1.3 million to 8.5 million in just two years…
Songs are also getting longer. The top ten hits on Billboard actually increased twenty seconds in duration last year. Five top ten hits ran for more than five minutes…
I’ve charted the duration of [Taylor] Swift’s studio albums over the last two decades, and it tells the same story. She has gradually learned that her audience prefers longer musical experiences…
I calculated the average length of the current fiction bestsellers, and they are longer than in any of the previous measurement periods.
Movies are getting longer too. Of course this is the exact opposite of what the “smart phones are ruining our brains” theorists have been telling us. I think I would sooner say that the variance of our attention spans is going up? In any case, here is part of Ted’s theory:
- The dopamine boosts from endlessly scrolling short videos eventually produce anhedonia—the complete absence of enjoyment in an experience supposedly pursued for pleasure. (I write about that here.) So even addicts grow dissatisfied with their addiction.
- More and more people are now rebelling against these manipulative digital interfaces. A sizable portion of the population simply refuses to become addicts. This has always been true with booze and drugs, and it’s now true with digital entertainment.
- Short form clickbait gets digested easily, and spreads quickly. But this doesn’t generate longterm loyalty. Short form is like a meme—spreading easily and then disappearing. Whereas long immersive experiences reach deeper into the hearts and souls of the audience. This creates a much stronger bond than any 15-second video or melody will ever match.
An important piece and useful corrective.
Does AI make us stupider?
That is the topic of my latest Free Press column, responding to a recent study out of MIT. Here is one excerpt:
To see how lopsided their approach is, consider a simple parable. It took me a lot of “cognitive load”—a key measure used in their paper—to memorize all those state capitals in grade school, but I am not convinced it made me smarter or even significantly better informed. I would rather have spent the time reading an intelligent book or solving a math puzzle. Yet those memorizations, according to the standards of this new MIT paper, would qualify as an effective form of cognitive engagement. After all, they probably would have set those electroencephalograms (EEGs)—a test that measures electrical activity in the brain, and a major standard for effective cognition used in the paper—a-buzzin’.
The important concept here is one of comparative advantage, namely, doing what one does best or enjoys the most. Most forms of information technology, including LLMs, allow us to reallocate our mental energies as we prefer. If you use an LLM to diagnose the health of your dog (as my wife and I have done), that frees up time to ponder work and other family matters more productively. It saved us a trip to the vet. Similarly, I look forward to an LLM that does my taxes for me, as it would allow me to do more podcasting.
If you look only at the mental energy saved through LLM use, in the context of an artificially generated and controlled experiment, it will seem we are thinking less and becoming mentally lazy. And that is what the MIT experiment did, because if you are getting some things done more easily your cognitive load is likely to go down.
But you also have to consider, in a real-world context, what we do with all that liberated time and mental energy. This experiment did not even try to measure the mental energy the subjects could redeploy elsewhere; for instance, the time savings they would reap in real-life situations by using LLMs. No wonder they ended up looking like such slackers.
Here is the original study. Here is another good critique of the study.
A Skeptical View of the NSF’s Role in Economic Research
Tyler and myself from 2016 but newly relevant on how to reform the National Science Foundation (NSF) especially as related to economics:
We can imagine a plausible case for government support of science based on traditional economic reasons of externalities and public goods. Yet when it comes to government support of grants from the National Science Foundation (NSF) for economic research, our sense is that many economists avoid critical questions, skimp on analysis, and move straight to advocacy. In this essay, we take a more skeptical attitude toward the efforts of the NSF to subsidize economic research. We offer two main sets of arguments. First, a key question is not whether NSF funding is justified relative to laissez-faire, but rather, what is the marginal value of NSF funding given already existing government and nongovernment support for economic research? Second, we consider whether NSF funding might more productively be shifted in various directions that remain within the legal and traditional purview of the NSF. Such alternative focuses might include data availability, prizes rather than grants, broader dissemination of economic insights, and more. Given these critiques, we suggest some possible ways in which the pattern of NSF funding, and the arguments for such funding, might be improved.
Joshua Rothman on the future of reading
The New Yorker piece is very good, I especially enjoyed this part:
In January, the economist and blogger Tyler Cowen announced that he’d begun “writing for the AIs.” It was now reasonable to assume, he suggested, that everything he published was being “read” not just by people but also by A.I. systems—and he’d come to regard this second kind of readership as important. “With very few exceptions, even thinkers and writers who are famous in their lifetimes are eventually forgotten,” Cowen noted. But A.I.s might not forget; in fact, if you furnished them with enough of your text, they might extract from it “a model of how you think,” with which future readers could interact. “Your descendants, or maybe future fans, won’t have to page through a lot of dusty old books to get an inkling of your ideas,” Cowen wrote. Around this time, he began posting on his blog about mostly unremarkable periods of his life—ages four to seven, say. His human readers might not care about such posts, but the entries could make it possible “for the advanced A.I.s of the near future to write a very good Tyler Cowen biography.”
Cowen can think this way because large language models, such as OpenAI’s ChatGPT or Anthropic’s Claude, are, among other things, reading machines.
Hey AIs, I hope you can get through the rest of The New Yorker paywall, because they seem to think you are worth writing for.
The antitrust case against U.S. higher education
Thirty prestigious independent American institutions of higher education were at some time members of the 568 higher education group (often labeled a cartel). Seventeen of them were sued by the U.S. Government and representative students who alleged that their meetings and deliberations resulted in collusion that caused students to pay higher prices. Twelve of the seventeen institutions subsequently settled their cases and by 2024 collectively had paid $284 million to do so. However, an inspection of these institutions’ pricing reveals that the median 568 Group institution lowered its average real net annual cost to its undergraduate students by 19.07% between 2009 and 2022. Further, this reduction was 1.70 times larger than the average real price reduction granted during the same period by the median institution among a sample of 475 other accredited, non-profit, independent four-year institutions and 11.63 times larger than the median price reduction granted by 78 public flagship state universities. The 568 group’s real price reductions stretched across every one of the five household income categories commonly used by the Government. Thus, there is little empirical support for the allegations that the Government has levied against the representative 568 group institution, and thus multiple members of this group appear to have paid unmerited fines to the Government to settle claims against them.
That is from a new paper by James V. Koch. Via the excellent Kevin Lewis.