Recent Posts

Securing and monitoring ShinyProxy deployment of R Shiny apps

This post provides a guide to securing ShinyProxy with Nginx, Certbot and AWS Cognito, and monitoring usage statistics with InfluxDB, Telegraf and Grafana.

Simulating Coronavirus Outbreak in Cities with Origin-Destination Matrix and SEIR Model

This is a step-by-step guide to simulating and visualising the spread of coronavirus in the Greater Tokyo Area with R, based on an origin-destination matrix and an SEIR model.

Global COVID-19 Deaths Tracker

Updated daily

The Effectiveness of Reducing Population Movement in Managing Coronavirus Outbreak

This post illustrates simulations of a coronavirus outbreak in the central Tokyo area, based on an SIR model and origin-destination flow data.

Data Analysis of COVID-19 (Coronavirus) Cases in Shenzhen - 16 Feb

Updated analysis of diagnosed cases, case counts and trends for each district, and a map of locations visited by known cases. Last updated on 16 Feb.

Yihui Fan

Data Scientist

About me

I work as a data scientist in a management consulting firm. My job involves helping businesses and policymakers understand and manage consumer decision-making, solve business problems, generate data-driven insights, and develop data products.

I am also an AI / deep learning enthusiast and enjoy taking part in data science competitions and open source projects. If you find my posts interesting, please connect with me; I am happy to chat more!

Interests

  • Statistical Modelling
  • Natural Language Processing and Deep Learning
  • Data Visualisation
  • Behavioural Science

Education

  • MSc in Machine Learning, 2018

    Birkbeck, University of London

  • MSc in Quantitative Social Science, 2014

    University of Oxford

  • BA in Linguistics and Sociology, 2013

    University of Manchester