Svoboda | Graniru | BBC Russia | Golosameriki | Facebook
skip to main content
10.1145/3366423.3380022acmconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Predicting Drug Demand with Wikipedia Views: Evidence from Darknet Markets.

Published: 20 April 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Rapid changes in illicit drug demand, such as the Fentanyl epidemic, are a major public health issue. Policymakers currently rely on annual surveys to monitor public consumption, which are arguably too infrequent to detect rapid shifts in drug use. We present a novel method to predict drug use based on high-frequency sales data from darknet markets. We show that models based on historic trades alone cannot accurately predict drug demand. However, augmenting these models with data on Wikipedia page views for each drug greatly improves predictive accuracy, particularly for less popular drugs, suggesting such models may be particularly useful for detecting newly emerging substances. These results hold out-of-sample at high time frequency, across a range of drugs and countries. Therefore Wikipedia data may enable us to build a high-frequency measure of drug demand, which could help policymakers respond more quickly to future drug crises.

    References

    [1]
    Duilio Balsamo, Paolo Bajardi, and Andre Panisson. 2019. Firsthand Opiates Abuse on Social Media: Monitoring Geospatial Patterns of Interest Through a Digital Cohort. In The World Wide Web Conference(WWW ’19). ACM, New York, NY, USA, 2572–2579. https://doi.org/10.1145/3308558.3313634
    [2]
    Monica J. Barratt and Judith Aldridge. 2016. Everything you always wanted to know about drug cryptomarkets* (*but were afraid to ask). International Journal of Drug Policy 35 (Sept. 2016), 1–6. https://doi.org/10.1016/j.drugpo.2016.07.005
    [3]
    Johan Bollen, Huina Mao, and Xiao-Jun Zeng. 2011. Twitter mood predicts the stock market. Journal of Computational Science 2, 1 (March 2011), 1–8. https://doi.org/10.1016/j.jocs.2010.12.007
    [4]
    Yan Carriere-Swallow and Felipe Labbe. 2013. Nowcasting with Google Trends in an Emerging Market: Nowcasting with Google Trends in an Emerging Market. Journal of Forecasting 32, 4 (July 2013), 289–298. https://doi.org/10.1002/for.1252
    [5]
    Yin-Wong Cheung and Kon S. Lai. 1995. Lag Order and Critical Values of the Augmented Dickey Fuller Test. Journal of Business & Economic Statistics 13, 3 (July 1995), 277–280. https://doi.org/10.1080/07350015.1995.10524601
    [6]
    Zhi Da, Joseph Engelberg, and Pengjie Gao. 2011. In Search of Attention. The Journal of Finance 66, 5 (Oct. 2011), 1461–1499. https://doi.org/10.1111/j.1540-6261.2011.01679.x
    [7]
    Jakob Demant, Rasmus Munksgaard, David Decary-Hetu, and Judith Aldridge. 2018. Going Local on a Global Platform: A Critical Analysis of the Transformative Potential of Cryptomarkets for Organized Illicit Drug Crime. International Criminal Justice Review 28, 3 (Sept. 2018), 255–274. https://doi.org/10.1177/1057567718769719
    [8]
    Martin Dittus, Joss Wright, and Mark Graham. 2018. Platform Criminalism: The ’Last-Mile’ Geography of the Darknet Market Supply Chain. Proceedings of the 2018 World Wide Web Conference on World Wide Web - WWW ’18 (2018), 277–286. https://doi.org/10.1145/3178876.3186094 arXiv: 1712.10068.
    [9]
    Abeer ElBahrawy, Laura Alessandretti, and Andrea Baronchelli. 2019. Wikipedia and Digital Currencies: Interplay Between Collective Attention and Market Performance. arXiv:1902.04517 [physics, q-fin] (Feb. 2019). http://arxiv.org/abs/1902.04517 arXiv: 1902.04517.
    [10]
    EMCDDA. 2019. European Drug Report 2019: Trends and Developments. Technical Report. European Monitoring Centre for Drugs and Drug Addiction. 94 pages. http://www.emcdda.europa.eu/publications/edr/trends-developments/2019_en
    [11]
    Nicholas Generous, Geoffrey Fairchild, Alina Deshpande, Sara Y. Del Valle, and Reid Priedhorsky. 2014. Global Disease Monitoring and Forecasting with Wikipedia. PLoS Computational Biology 10, 11 (Nov. 2014), e1003892. https://doi.org/10.1371/journal.pcbi.1003892
    [12]
    Domenico Giannone, Lucrezia Reichlin, and David Small. 2008. Nowcasting: The real-time informational content of macroeconomic data. Journal of Monetary Economics 55, 4 (May 2008), 665–676. https://doi.org/10.1016/j.jmoneco.2008.05.010
    [13]
    Jeremy Ginsberg, Matthew H. Mohebbi, Rajan S. Patel, Lynnette Brammer, Mark S. Smolinski, and Larry Brilliant. 2009. Detecting influenza epidemics using search engine query data. Nature 457, 7232 (Feb. 2009), 1012–1014. https://doi.org/10.1038/nature07634
    [14]
    Mark Graham, Stefano De Sabbata, and Matthew A. Zook. 2015. Towards a study of information geographies: (im)mutable augmentations and a mapping of the geographies of information.Geo: Geography and Environment 2, 1 (June 2015), 88–105. https://doi.org/10.1002/geo2.8
    [15]
    Scott Higham, Sari Horwitz, and Katie Zezima. 2019. Obama officials failed to focus as fentanyl burned its way across America - Washington Post. https://www.washingtonpost.com/graphics/2019/national/fentanyl-epidemic-obama-administration/
    [16]
    Ladislav Kristoufek. 2013. BitCoin meets Google Trends and Wikipedia: Quantifying the relationship between phenomena of the Internet era. Scientific Reports 3, 1 (Dec. 2013). https://doi.org/10.1038/srep03415
    [17]
    Kristy Kruithof, Judith Aldridge, David Decary Hetu, Megan Sim, Elma Dujso, and Stijn Hoorens. 2016. Internet-facilitated drugs trade: An analysis of the size, scope and the role of the Netherlands. Product Page. RAND Corporation. https://www.rand.org/pubs/research_reports/RR1607.html
    [18]
    Allen Yilun Lin, Justin Cranshaw, and Scott Counts. 2019. Forecasting U.S. Domestic Migration Using Internet Search Queries. In The World Wide Web Conference on - WWW ’19. ACM Press, San Francisco, CA, USA, 1061–1072. https://doi.org/10.1145/3308558.3313667
    [19]
    Maimuna S. Majumder, Mauricio Santillana, Sumiko R. Mekaru, Denise P. McGinnis, Kamran Khan, and John S. Brownstein. 2016. Utilizing Nontraditional Data Sources for Near Real-Time Estimation of Transmission Dynamics During the 2015-2016 Colombian Zika Virus Disease Outbreak. JMIR public health and surveillance 2, 30 (2016). https://doi.org/10.2196/publichealth.5814
    [20]
    Connor McMahon, Isaac L. Johnson, and Brent J. Hecht. 2017. The Substantial Interdependence of Wikipedia and Google: A Case Study on the Relationship Between Peer Production Communities and Information Technologies. In ICWSM, Vol. 11. AAAI Publications.
    [21]
    Marton Mestyan, Taha Yasseri, and Janos Kertesz. 2013. Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data. PLoS ONE 8, 8 (Aug. 2013), e71226. https://doi.org/10.1371/journal.pone.0071226
    [22]
    Helen Susannah Moat, Chester Curme, Adam Avakian, Dror Y. Kenett, H. Eugene Stanley, and Tobias Preis. 2013. Quantifying Wikipedia Usage Patterns Before Stock Market Moves. Scientific Reports 3, 1 (Dec. 2013). https://doi.org/10.1038/srep01801
    [23]
    David Molnar, Serge Egelman, and Nicolas Christin. 2010. This is our data on drugs: lessons computer security can learn from the drug war. In Proceedings of the 2010 workshop on New security paradigms - NSPW ’10. ACM Press, Concord, Massachusetts, USA, 143. https://doi.org/10.1145/1900546.1900566
    [24]
    Donald R. Olson, Kevin J. Konty, Marc Paladini, Cecile Viboud, and Lone Simonsen. 2013. Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales. PLoS Computational Biology 9, 10 (Oct. 2013), e1003256. https://doi.org/10.1371/journal.pcbi.1003256
    [25]
    National Institute on Drug Abuse. 2019. Overdose Death Rates. https://www.drugabuse.gov/related-topics/trends-statistics/overdose-death-rates
    [26]
    United Nations Office on Drugs and Crime. 2019. World Drug Report. Technical Report. United Nations. https://wdr.unodc.org/wdr2019/
    [27]
    Robert Todd Perdue, James Hawdon, and Kelly M. Thames. 2018. Can Big Data Predict the Rise of Novel Drug Abuse?Journal of Drug Issues 48, 4 (Oct. 2018), 508–518. https://doi.org/10.1177/0022042618772294
    [28]
    T. Preis and H. S. Moat. 2014. Adaptive nowcasting of influenza outbreaks using Google searches. Royal Society Open Science 1, 2 (Oct. 2014). https://doi.org/10.1098/rsos.140095
    [29]
    Silas W. Smith and Fiona M. Garlich. 2013. Chapter 3 - Availability and Supply of Novel Psychoactive Substances. In Novel Psychoactive Substances, Paul I. Dargan and David M. Wood (Eds.). Academic Press, Boston, 55–77. https://doi.org/10.1016/B978-0-12-415816-0.00003-1
    [30]
    Kyle Soska and Nicolas Christin. 2015. Measuring the Longitudinal Evolution of the Online Anonymous Marketplace Ecosystem. In SEC’15 Proceedings of the 24th USENIX Conference on Security Symposium(SEC’15), Vol. 15. 33–48. https://doi.org/978-1-931971-232
    [31]
    Ning Su, Jiyin He, Yiqun Liu, Min Zhang, and Shaoping Ma. 2018. User Intent, Behaviour, and Perceived Satisfaction in Product Search. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, WSDM 2018, Marina Del Rey, CA, USA, February 5-9, 2018. 547–555. https://doi.org/10.1145/3159652.3159714
    [32]
    Derek K Tracy, David M Wood, and David Baumeister. 2017. Novel psychoactive substances: identifying and managing acute and chronic harmful use. British Medical Journal356 (Jan. 2017), i6814. https://doi.org/10.1136/bmj.i6814
    [33]
    David Tsurel, Dan Pelleg, Ido Guy, and Dafna Shahaf. 2017. Fun Facts: Automatic Trivia Fact Extraction from Wikipedia. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining(WSDM ’17). ACM, New York, NY, USA, 345–354. https://doi.org/10.1145/3018661.3018709 event-place: Cambridge, United Kingdom.
    [34]
    United Nations Office on Drugs and Crime. 2017. Global Synthetic Drugs Assessment. Technical Report. United Nations. https://www.unodc.org/documents/scientific/Global_Synthetic_Drugs_Assessment_2017.pdf
    [35]
    Rachel S. Wightman, Jeanmarie Perrone, and Lewis S. Nelson. 2017. Comparative Analysis of Opioid Queries on Erowid.org: An Opportunity to Advance Harm Reduction. Substance Use & Misuse 52, 10 (Aug. 2017), 1315–1319. https://doi.org/10.1080/10826084.2016.1276600
    [36]
    Wikipedia. 2019. Wikipedia API: Main Page. https://www.mediawiki.org/wiki/API:Main_page
    [37]
    Mitsuo Yoshida, Yuki Arase, Takaaki Tsunoda, and Mikio Yamamoto. 2015. Wikipedia Page Views Reflect Web Search Trends. In Proceedings of the ACM Web Science Conference - WebSci ’15. ACM Press, Oxford, United Kingdom, 1–2. https://doi.org/10.1145/2786451.2786495
    [38]
    Jinhui Zhao, Tim Stockwell, and Scott Macdonald. 2009. Non-response bias in alcohol and drug population surveys: Non-response bias in surveys. Drug and Alcohol Review 28, 6 (May 2009), 648–657. https://doi.org/10.1111/j.1465-3362.2009.00077

    Cited By

    View all
    • (2023)Machine Learning Approaches for Region-level Prescription Demand Forecasting2023 IEEE Smart World Congress (SWC)10.1109/SWC57546.2023.10449058(1-6)Online publication date: 28-Aug-2023
    • (2022)Minimum Prediction Error at an Early Stage in Darknet AnalysisDark Web Pattern Recognition and Crime Analysis Using Machine Intelligence10.4018/978-1-6684-3942-5.ch002(18-30)Online publication date: 13-May-2022
    • (2022)Un modelo para predecir la demanda en farmaciasRedmarka. Revista de Marketing Aplicado10.17979/redma.2022.26.1.900726:1(1-14)Online publication date: 30-Jun-2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    WWW '20: Proceedings of The Web Conference 2020
    April 2020
    3143 pages
    ISBN:9781450370233
    DOI:10.1145/3366423
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 April 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. deep web
    2. nowcasting
    3. policy support
    4. web search
    5. web traffic

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WWW '20
    Sponsor:
    WWW '20: The Web Conference 2020
    April 20 - 24, 2020
    Taipei, Taiwan

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)33
    • Downloads (Last 6 weeks)2

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Machine Learning Approaches for Region-level Prescription Demand Forecasting2023 IEEE Smart World Congress (SWC)10.1109/SWC57546.2023.10449058(1-6)Online publication date: 28-Aug-2023
    • (2022)Minimum Prediction Error at an Early Stage in Darknet AnalysisDark Web Pattern Recognition and Crime Analysis Using Machine Intelligence10.4018/978-1-6684-3942-5.ch002(18-30)Online publication date: 13-May-2022
    • (2022)Un modelo para predecir la demanda en farmaciasRedmarka. Revista de Marketing Aplicado10.17979/redma.2022.26.1.900726:1(1-14)Online publication date: 30-Jun-2022
    • (2022)Macroscopic properties of buyer–seller networks in online marketplacesPNAS Nexus10.1093/pnasnexus/pgac2011:4Online publication date: 6-Oct-2022
    • (2022)Upside Down: Exploring the Ecosystem of Dark Web Data MarketsICT Systems Security and Privacy Protection10.1007/978-3-031-06975-8_28(489-506)Online publication date: 3-Jun-2022
    • (2021)Dark Web Marketplaces and COVID-19: before the vaccineEPJ Data Science10.1140/epjds/s13688-021-00259-w10:1Online publication date: 21-Jan-2021
    • (2021)Hidden Buyer Identification in Darknet Markets via Dirichlet Hawkes Process2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671406(581-589)Online publication date: 15-Dec-2021
    • (undefined)The COVID-19 Online Shadow EconomySSRN Electronic Journal10.2139/ssrn.3703865

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media