Last month, I wrote a blog post warning that if you follow popular trends in NLP, you can easily end up making a classifier that is pretty racist without meaning to. To demonstrate this, I included the very simple code, as a "cautionary tutorial".

The post got a fair amount of reaction. Much of it was positive and took the problem seriously, so thanks for that. But eventually I heard from some detractors. Of course there were the fully expected "I'm not racist but what if racism is correct" retorts that I knew I'd have to face. But there were also people who couldn't believe that anyone does NLP this way. They said I was talking about a non-problem that doesn't show up in serious machine learning, or projecting my own bad NLP ideas, or something.

Well. Here's Perspective API, made by an offshoot of Google. They believe they are going to use it to fight "toxicity" online. And by "toxicity" they mean "saying anything with negative sentiment". And by "negative sentiment" they mean "whatever word2vec thinks is bad". It works exactly like the hypothetical system that I cautioned against.

On this blog, we've just looked at what word2vec (or GloVe) thinks is bad. It includes black people, Mexicans, Islam, and given names that don't usually belong to white Americans. You can type my examples into Perspective API, and it will respond that the ones that are less white-sounding are more "likely to be perceived as toxic".
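For anyone who would rather run that comparison from a script than type sentences into the demo page, here is a minimal sketch. It assumes the `comments:analyze` endpoint and the request and response shapes described in Perspective API's public documentation, which may have changed since. The function names, the `doNotStore` setting, and the choice of English are my own illustrative choices, and you would need to supply your own API key and your own sentences.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint, as described in Perspective API's public documentation.
ANALYZE_URL = "https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze"


def build_request(text):
    """Build the JSON body asking for a TOXICITY score for one comment."""
    return {
        "comment": {"text": text},
        "languages": ["en"],
        "requestedAttributes": {"TOXICITY": {}},
        "doNotStore": True,  # ask the service not to keep the test sentence
    }


def extract_toxicity(response):
    """Pull the overall TOXICITY score out of a parsed response."""
    return response["attributeScores"]["TOXICITY"]["summaryScore"]["value"]


def score(text, api_key):
    """Send one sentence to the API and return its toxicity score."""
    url = ANALYZE_URL + "?" + urllib.parse.urlencode({"key": api_key})
    request = urllib.request.Request(
        url,
        data=json.dumps(build_request(text)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as reply:
        return extract_toxicity(json.load(reply))


def compare(sentences, api_key):
    """Score several sentences so they can be compared side by side."""
    return {sentence: score(sentence, api_key) for sentence in sentences}
```

Calling `compare([...], api_key)` with sentences that differ only in a name or a nationality returns one score per sentence, which makes the kind of gap described above easy to check for yourself.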