I am interested in collecting and analyzing two types of data: 1) social media data from Facebook, Twitter, Weibo etc. (so-called big data, digital trace data, or heterogeneous data); and 2) digital platform and infrastructure data including specific platforms (e.g., the technical architectures, digital interfaces, protocols) and national infrastructures (e.g., credit scoring systems).
Social Media Datasets
Liang, F. (In-progress). The effect of labeling propaganda on Twitter [Dataset: 384,000 cases and 28 features]. Ann Arbor, MI. Project on Chinese Media on Twitter
Liang, F. (2020). The discussion of Chinese politics on Twitter, 2017-2020. [Dataset: 2,307,313 cases and 18 features]. Ann Arbor, MI. Project on Chinese Astroturf on Twitter.
Liang, F & Campbell, S. (2020). The discussion of 5G technology on Weibo and Twitter. [Dataset: 11,313 cases and 11 features from Weibo, 87,586 cases and 17 features from Twitter]. Ann Arbor, MI. Project on Imagining 5G.
Liang, F. (2017). News coverage produced by China’s official media on Facebook, 2009-2017. [Dataset: 266,772 cases and 51 features]. Ann Arbor, MI. Project on Authoritarian Media Bias on Facebook.
Digital Platform and Infrastructure Datasets:
Liang, F. (In-progress). The global expansion of China’s AI surveillance firms. [Dataset: 87 cases and 13 features]. Ann Arbor, MI. Project on The Globalization of China’s AI surveillance and facial recognition products.
Liang, F. (In-progress). The scoring and ranking systems behind personal credit platforms. [Dataset: 59 cases and 21 features]. Ann Arbor, MI. Project on Automating Citizen Classification.
Hussain, M. M., Das, V., Liang, F., Kostyuk, N., Chen, W. (2017). The development of global big data surveillance systems. [Dataset: 175 cases and 22 features]. Ann Arbor, MI. Big Data Innovation and Governance.