Recent Posts

Twitter Bot Checker App Powered By Botometer

A Streamlit app for checking whether your Twitter followers are bots

Securing and Monitoring ShinyProxy Deployment of R Shiny Apps

This post provides a guide to securing ShinyProxy with Nginx, Certbot and AWS Cognito, and to monitoring usage statistics with InfluxDB, Telegraf and Grafana.

Global COVID-19 Deaths Tracker

Updated daily

The Effectiveness of Reducing Population Movement in Managing Coronavirus Outbreak

This post illustrates simulations of a coronavirus outbreak in the central Tokyo area, based on an SIR model and origin-destination flow data.

Data Analysis of COVID-19 (Coronavirus) Cases in Shenzhen - 16 Feb

Updated analysis of diagnosed cases, case counts and trends in each district, and a map of locations visited by known cases. Last updated on 16 Feb.


Data Scientist

Yihui works as a data scientist at a management consulting firm, where he helps businesses and policymakers understand and manage consumer decision-making, solve business problems, generate data-driven insights, and develop data products.

Yihui is also an AI / deep learning enthusiast and enjoys taking part in data science competitions and open source projects. If you find his posts interesting, please get in touch; he is happy to chat more!


  • Statistical Modelling
  • NLP and Deep Learning
  • Data Visualisation
  • Behavioural Science


  • MSc in Machine Learning

    Birkbeck, University of London

  • MSc in Quantitative Social Science

    University of Oxford

  • BA in Linguistics and Sociology

    University of Manchester