Quantcast
Channel: Planet Python
Viewing all articles
Browse latest Browse all 22462

A. Jesse Jiryu Davis: 72% Of The People I Follow On Twitter Are Men

$
0
0

Description: Black and white photo. A boy stands behind a very large abacus that fills the image. He looks up at the ball he is moving on one of the abacus's wires, above his eye-level. Behind him are two schoolchildren and a chalkboard with indistinct writing and diagrams.

At least, that's my estimate. Twitter does not ask users their gender, so I have written a program that guesses based on their names. Among those who follow me, the ratio is even worse: 83% are men.

The way to fix the first ratio is not mysterious: I should notice and seek more women experts tweeting about my interests, and follow them.

The second ratio, on the other hand, I can merely influence, but I intend to improve it as well. My network on Twitter should represent of the software industry's diverse future, not its unfair present.


How Did I Measure It?

I set out to estimate the gender ratio of the people I follow—my "friends" in Twitter's jargon—and found it surprisingly hard. Twitter analytics readily shows me the converse, an estimate of my followers' gender ratio:

Description: Chart titled,

But Twitter doesn't tell me the ratio of my friends. That which is measured improves, so I searched for a service that would measure this number for me, and found FollowerWonk.

FollowerWonk guesses my friends are 71% men. Is this a good guess? For the sake of validation, I compare FollowerWonk's estimate of my followers to Twitter's estimate:

Twitter analytics
 
menwomenFollowers83%17%FollowerWonk menwomenFollowers81%19%Friends I follow72%28%

My followers show up 81% male here, close to the Twitter analytics number. So far so good. If FollowerWonk and Twitter agree on the gender ratio of my followers, that suggests FollowerWonk's estimate of the people I follow (which Twitter doesn't analyze) is reasonably good. With it, I can make a habit of measuring my ratio, and improve it.

At $30 a month, however, checking my ratio with FollowerWonk is a pricey habit. I don't need all its features anyhow. Can I solve only the gender-ratio problem economically?

Since FollowerWonk's numbers seem reasonable, I tried to reproduce them. Using Python and some nice Philadelphians' Twitter API wrapper, I began downloading the profiles of all my friends and followers. I immediately found that Twitter's rate limits are miserly, so I randomly sampled only a subset of users instead.

The best gender-correlated information in a Twitter profile is the "name" field: for example, @gvanrossum's name field is "Guido van Rossum", and the first name "Guido" suggests that @gvanrossum is male. I decided to use first names to estimate my ratio.

I wrote a script that passes parts of each name to the SexMachine library to guess gender. SexMachine has predictable downfalls, like mistaking "Brooklyn Zen Center" for a woman named "Brooklyn", but its estimates are as good as FollowerWonk's and Twitter's:

 menwomenundetermined
Friends I follow16866173
 72%28% 
Followers459108433
 81%19% 

(Based on all 407 friends and a sample of 1000 followers.)

Know Your Number

I want you to check your ratios, too, so I've deployed "My Ratio" to PythonAnywhere's handy service for $10 a month:

The application may rate-limit you or otherwise fail, so use it gently. The code is on GitHub. It includes a command-line tool, as well.

Who is represented in your network on Twitter? Are you speaking and listening to the same unfairly distributed group who have been talking about software for the last few decades, or does your network look like the software industry of the future? Let's know our numbers and improve them.


Image: Cyclopedia of Photography 1975.


Viewing all articles
Browse latest Browse all 22462

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>