Skip to main content Skip to navigation

Yiming Ma

Hi , my name is Yiming Ma.

I currently work at King's College London (School of Biomedical Engineering & Imaging Sciences) as a postdoctoral researcher. I recently completed my PhD in the MathSys CDT at the University of Warwick and was part of the Signal and Information Processing (SIP) Lab. My PhD research focuses on machine learning and computer vision, especially crowd counting / density estimation and multimodal representation learning.

⚠️ Notice: This page is archived and no longer actively maintained. For the latest information, please visit yiming-m.github.ioLink opens in a new window.


Preprints

Publications

  • 2022 – ICIP: FusionCount: Efficient Crowd Counting via Multiscale Feature FusionLink opens in a new window
    • Motivation: Encoder-decoder counters underuse low-level features and add heavy multiscale modules.
    • Method: Contrast-aware group-wise fusion of encoder features plus a dual-branch channel-reduction decoder (1×1 + dilated conv).
    • Results: ShanghaiTech-B MAE 6.9 / RMSE 11.8 with ~815GFLOPs, surpassing or matching VGG-based peers (CSRNet, CAN, BL, DM-Count) at lower compute.

Experience

Research Associate, King's College London; London, UK — 2025–Now
  • Multimodal patient fingerprinting: Built a Multi-Modal Fingerprint (MMF) by integrating imaging, demographic, clinicopathological variables, and radiology reports to support patient-level risk stratification and personalised surveillance planning.
  • Longitudinal AI-driven clinical decision support: Developed and benchmarked deep learning models for AS enrolment/risk profiling, automated prostate/lesion assessment on bp-MRI, and longitudinal progression modelling; prioritised robustness to scanner/site shift and incomplete follow-up data.
  • Clinical translation & reporting: Prototyped a web-based standardised report aligned with PRECISE-style longitudinal assessment to streamline clinical review and improve consistency of follow-up decisions.
Research Assistant, University of Warwick; Coventry, UK — 2022–2023
  • Data curation: Refined and extended DAD annotations, adding 9 non-driving-related activities; prepared data for robust benchmarking.
  • Multiview multimodal fusion: Designed a multi-view multimodal driver monitoring system based on masked multi-head self attention; improved AUC-ROC from 88% to 97% on DAD and increased robustness to view/modality collapse.
Teaching Assistant, University of Warwick; Coventry, UK — 2023
  • Lab sessions: Assisted delivery of an undergraduate Python & Introductory ML module; led labs and tutorials guiding students to implement regression, classification, and neural networks in Python.
  • Tutoring: Provided one-to-one and small-group academic support, clarifying core programming/ML concepts and troubleshooting code and experiment design.

Education Background

2021~2025 (Doctor of Philosophy): University of WarwickLink opens in a new window, Coventry, UK 🇬🇧.

2020~2021 (Master of Science): University of WarwickLink opens in a new window, Coventry, UK 🇬🇧.

2016~2020 (Bachelor of Science): Southern University of Science and TechnologyLink opens in a new window, Shenzhen, China 🇨🇳.

This is a photo of Yiming Ma


📰 Recent News

2026-01-05: Joined King's College London and started to work with Dr Michela AntonelliLink opens in a new window as a Research Associate.

2025-07-31: Implemented a HuggingFace Space for ZIPLink opens in a new window.

2025-07-31: Released the code of ZIPLink opens in a new window on GitHub.

2025-07-31: Released a new paper ZIP: Scalable Crowd Counting via Zero-Inflated Poisson ModelingLink opens in a new window on arXiv.

2025-07-04: Attended IEEE ICME 2025Link opens in a new window @ Nantes, France.

2025-03-20: CLIP-EBC and Interact with me got accepted by IEEE ICME 2025Link opens in a new window.

2025-02-03: Released the code of Interact with meLink opens in a new window on GitHub.

2024-12-21: Released a new paper Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social ActionsLink opens in a new window (co-author) on arXiv.

2024-07-17: Released the code of CLIP-EBCLink opens in a new window on GitHub.

2024-03-14: Released a new paper CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise ClassificationLink opens in a new window on arXiv.

2023-06-18: Attended CVPR 2023Link opens in a new window online.

2023-04-13: Released the code and dataset of MHSALink opens in a new window.

2023-04-13: Uploaded the paper of MHSALink opens in a new window on arXiv.

2023-03-21: The paper Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention (MHSA) got accepted by MULA WorkshopLink opens in a new window at CVPR 2023.

2022-10-19: Attended IEEE ICIP 2022Link opens in a new window online.

2022-10-17: Released a new paper Real-Time Driver Monitoring Systems through Modality and View AnalysisLink opens in a new window on arXiv.

2022-06-20: FusionCount got accepted by ICIP 2022.

2022-04-05: Released the code of FusionCountLink opens in a new window on GitHub.

2022-02-27: Released a new paper FusionCount: Efficient Crowd Counting via Multiscale Feature FusionLink opens in a new window on arXiv.

2021-10-04: Started my PhD journey at Mathsys CDT of University of Warwick.

⚙️ Services

Reviewer for TNNLS, TMM, SPL, ECCV, ACM MM, CVPRW, ICME, WACV, BMVC, ICIP.


🧰 Skills

Deep Learning Concepts: Attention Mechanism, ViT, Prompt Tuning, CLIP, Contrastive Learning, Multimodal Alignment, Multimodal Fusion.

PyTorch: TIMM, OpenCLIP, Transformers, TensorBoard, Optuna.

Python: NumPy, SciPy, Scikit-learn, Pandas, OpenCV, Matplotlib / Seaborn.

Maths & Stats: Probability Theory, Statistical Inference, Optimization, Stochastic Processes, Time Series Modeling, Survival Analysis, Computational Statistics, Real / Complex / Functional / Fourier Analysis, Measure Theory.

Development & Tools: SSH, Git, Linux, LaTeX, Markdown, MS Word.

Languages: Mandarin Chinese (native), English (IELTS: 8.0/9.0).

Let us know you agree to cookies