In analysing usage of social media sites such as Twitter one of the categories often used is male/female. On this scale there are some sites with a preponderance of males (Slashdot, Google+ and Reddit), others where they are roughly equal (Facebook and Twitter) and maybe some where females are in the majority (possibly MySpace and Bebo).
This raises the question, how do you count the numbers of males and females on a social media site? Take Twitter as an example. There is nothing about gender on a user's profile and so the analyst can only deduce the gender from the name or information in the profile. It's often said (e.g. http://www.sysomos.com/insidetwitter/mostactiveusers/#males-vs-female) that a user's gender can be found by looking up the first name in lists and databases. I decided to try this approach by getting the genders of the users I'm currently following (my followees) on Twitter. Hardly a representative or large sample but a simple starting point.
The results changed my thinking:
Genders of my followees on Twitter |
In my case there were considerably more more males than females, but the surprise was the number of users who were neither. In some cases it wasn't possible to identify gender even with the help of those lists of names. Of course some names such as Lesley are inherently ambiguous. Others are not on the lists, being unusual or nicknames. I could identify gender for some users from their profile photos but have not included these in the male/female numbers as I was trying to replicate what an automated system based upon text analysis might do.
No comments:
Post a Comment