Skills

Data Modalities

  • Electronic health records and clinical data
    • Clinical trials (BAI)
    • UK Biobank (BAI)
    • Retrospective studies (Owkin - MOSAIC)
  • Knowledge graphs and text (BAI)
  • Bulk RNAseq
    • Raw processing and QC (Owkin - MOSAIC)
      • STAR/ STAR fusion
      • Kallisto
      • Arriba
      • FastQC/ MultiQC
    • Downstream applications (PhD)
      • DGE analysis
      • GSEA/ ORA
      • Deconvolution
      • Motif enrichment
      • Dimensionality reduction
      • Feature selection
      • Classification and regression tasks
      • Unsupervised modelling
      • Co-expressing network modelling
  • Whole exome sequencing
    • Processing data QC and integrated QC pipeline building (Owkin - MOSAIC)
    • Downstream applications
      • SNV/Indel variant calls (PhD - cell line KRAS mutants)
      • Tumor purity
      • ML applications (PhD, BAI, Owkin)
  • Single cell RNA sequencing
    • QC pipeline (Owkin - MOSAIC)
    • Cell annotation
    • Doublet detection
    • Data integration
    • Cell communication analysis
    • Genetics integrated analysis
  • Spatial transcriptomics
    • QC pipeline (Owkin - MOSAIC)
    • Patient-matched scRNAseq data integration
    • Target ID
    • Biomarker identification
    • Network analysis
    • Feature extraction and selection
    • Spatial statistics
    • Pathology analysis integration
  • Mass spec proteomics
    • Downstream analysis (BAI, Owkin)
      • Coexpression network
      • mRNA to protein correlation
  • Multipliex IF
    • Computational pathology assessment using HALO (PhD)
  • Histology
    • H&E/ IHC QC (Owkin)

Technical

  • Python (advanced) - sklearn, pytorch, tensorflow, scipy, numpy, pandas, RDkit
  • R (advanced) - tidyverse
  • bash
  • git
  • Docker
  • Sagemaker
  • Redshift
  • S3
  • PostgresSQL
  • Cypher
  • Databricks
  • Neo4J
  • Kubeflow
  • Shiny

Leadership

  • Leading the construction of end-to-end ML solutions for drug discovery.
  • Tech lead on 2 internal target discovery programmes at Benevolent AI, and 1 external programme with AstraZeneca.
  • Line managed 2 interns
    • one project on multi’omics integration with Benevolent AI.
    • one project on machine learning for drug positioning at Owkin.
  • Managed 12 successful A level student projects across 4 years with Manchester Access Programme.

Teamwork

  • Track record of connecting teams internally and fostering external relationships in academia and industry.
  • I’m passionate about team science - the most enjoyable projects are those when you get to work in a close knit team with complementary skillsets where you have a clear mission.

Therapy Areas

Oncology

This is my main area of expertise including my most recent work at Owkin where I have been learning how to develop advanced computational methods for target ID that retain explainability. I’m fascinated by the intersection of cancer evolution, inflammation and immunity. Indeed, for my PhD I studied the interplay of pro- and anti-tumourigenic inflammatory phenotypes in cancer, focussing on translating the findings from our basic immunology lab into patient multi-modal ’omics data. We were particularly interested in biomarker development at the time, and indeed I’ve continued learning about patient stratification approaches from within industry.

Muscular disorders

A long time ago during my degree I worked with yeast, creating point mutations in genes involved in movement and monitoring changes in yeast phenotypes. Fast forward maybe 8 years and I had the chance to work on a therapeutic program for a muscle-wasting disorder. Given that it was a disease of old age, it was fascinating to consider how co-morbidities might influence the disease course. This was my first exposure to the UK Biobank as a tool for drug discovery. I learnt about how to design GWAS and PheWAS experiments. Ever since, I’ve kept an eye on the field and on publications arising from UKBB.