From where do tweets originate? - A GIS approach for user location inference

Qunying Huang, Guofeng Cao, Caixia Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

19 Scopus citations

Abstract

A number of natural language processing and text-mining algorithms have been developed to extract the geospatial cues (e.g., place names) to infer locations of content creators from publicly available information, such as text content, online social profiles, and the behaviors or interactions of users from social networks. These studies, however, can only successfully infer user locations at city levels with relatively decent accuracy, while much higher resolution is required for meaningful spatiotemporal analysis in geospatial fields. Additionally, geographical cues exploited by current text-based approaches are hidden in the unreliable, unstructured, informal, ungrammatical, and multilingual data, and therefore are hard to extract and make meaningful correctly. Instead of using such hidden geographic cues, this paper develops a GIS approach that can infer the true origin of tweets down to the zip code level by using and mining spatial (geo-tags) and temporal (timestamps when a message was posted) information recorded on user digital footprints. Further, individual major daily activity zones and mobility can be successfully inferred and predicted. By integrating GIS data and spatiotemporal clustering methods, this proposed approach can infer individual daily physical activity zones with spatial resolution as high as 20 m by 20 m or even higher depending on the number of digit footprints collected for social media users. The research results with detailed spatial resolution are necessary and useful for various applications such as human mobility pattern analysis, business site selection, disease control, or transportation systems improvement.

Original languageEnglish
Title of host publicationProceedings of the 7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN 2014 - Held in conjunction with the 22nd ACM SIGSPATIAL GIS 2014
EditorsAlexei Pozdnoukhov
PublisherAssociation for Computing Machinery, Inc
Pages1-8
Number of pages8
ISBN (Electronic)9781450331401
DOIs
StatePublished - Nov 4 2014
Event7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN 2014 - Dallas, United States
Duration: Nov 4 2014 → …

Publication series

NameProceedings of the 7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN 2014 - Held in conjunction with the 22nd ACM SIGSPATIAL GIS 2014

Conference

Conference7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN 2014
CountryUnited States
CityDallas
Period11/4/14 → …

Keywords

  • Big data
  • Geography
  • Human mobility
  • Spatial clustering
  • Spatiotemporal clustering

Fingerprint Dive into the research topics of 'From where do tweets originate? - A GIS approach for user location inference'. Together they form a unique fingerprint.

  • Cite this

    Huang, Q., Cao, G., & Wang, C. (2014). From where do tweets originate? - A GIS approach for user location inference. In A. Pozdnoukhov (Ed.), Proceedings of the 7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN 2014 - Held in conjunction with the 22nd ACM SIGSPATIAL GIS 2014 (pp. 1-8). (Proceedings of the 7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN 2014 - Held in conjunction with the 22nd ACM SIGSPATIAL GIS 2014). Association for Computing Machinery, Inc. https://doi.org/10.1145/2755492.2755494