PGP public key
Fingerprint B829 F851 00B7 9CB7 6897 DCA7 A881 5DA8 5150 5ABF
Topconfects: Bioconductor R package. Top differential expression by confident effect size. A more intuitive way to use "TREAT".
Weitrix: Bioconductor R package. Tools for working with a matrix of data measured with varying precision or with missing values.
Varistran: R package. Variance stabilizing transformation for RNA-Seq data. This is a useful pre-processing step prior to visualization, and also makes clustering and statistical analysis more straightforward. Includes a Shiny app for assessing RNA-Seq data quality with a heatmap and biplot.
Tail-Tools: Analysis pipeline for PAT-Seq data.
Nesoni: Tools for processing bacterial high-throughput sequencing data. (No longer maintained.)
Plasmodium Autocount and Semiautocount: Cell counting on blood smears. (Of historical interest only; use a convolutional neural network instead.)
2017-06-14 slides: Introduction to logistic regression and decision trees, and a description of my Melbourne Datathon 2017 Kaggle entry.
Publications on Google Scholar.
Programming and tidy data analysis in R (2019) (based on an earlier workshop, R more (2016)). A workshop to go from beginner to intermediate R usage, which I developed and presented with other members of the Monash Bioinformatics Platform and the Monash Data Fluency initiative.
Linear models in R (2018). Workshop material introducing linear models in R. Linear models are a very broadly useful statistical tool, and also necessary background knowledge for using Bioconductor packages such as limma.