Computer Algorithm Can Beat Drunk Tweeter

Drunk tweets, long considered an unfortunate, yet ubiquitous, a by product of the social media age, have finally been put to good use.

With the help of a machine-learning algorithm, researchers from the University of Rochester cross-referenced tweets mentioning alcohol consumption with geo-tagging information to broadly analyze human drinking behavior. They were able to estimate where and when people imbibed, and, to a limited extent, how they behaved under the influence. The experiment is more than a social critique — the algorithm helps researchers spot drinking patterns that could inform public health decisions and could be applied to a range of other human behaviors.

#Drunk Tweeting

To begin with, the researchers sorted through a selection of tweets from both New York City and rural New York with the help of Amazon’s Mechanical Turk. Users identified tweets related to drinking and picked out keywords, such as “drunk,” “vodka” and “get wasted,” to train an algorithm.

They put each relevant tweet through a series of increasingly stringent questions to home in on tweets that not only referenced the author drinking but indicated that they were doing so while sending the tweet. That way, they could determine whether a person was actually tweeting and drinking, or just sending tweets about drinking. Once they had built up a dependable database of keywords, they were able to fine-tune their algorithm so it could recognize words and locations that likely proved people were drinking.

To get tweeters’ locations, they used only tweets that had been geo-tagged with Twitter’s “check-in” feature. They then approximated users’ home locations by checking where they were when they sent tweets in the evenings, in addition to tweets containing words like “home” or “bed.” This lets them know whether users’ preferred to drink at home or out at bars or restaurants.

Combining these two datasets gave the researchers a broad idea of how many people in a given area or at a given time were drinking. Not surprisingly, they found a correlation between the number of bars and how much people drank — more bars meant more drunk people. New York City saw a stronger correlation between the two, proving that people in the big city really do like to drink more. Rather paradoxically, their data also showed that city dweller were more likely to tweet about drinking at home as well.

Their work builds on previous studies that attempted to tie people’s tweets to specific activities and locations. By using the check-in feature, they say that their system is much more accurate than others, and can reliably place people within a block of their actual location. They published their work on the pre-print server arXiv.


0 komentar:

Post a Comment