I am interested in understanding and improving the performance of complex architectures. My current research focuses on making these architectures easier to program while maintaining high performance. I have worked on code generation targeting various parallel architectures, and I have created methodologies and built tools to better understand the behavior and performance of GPUs. I am currently working on generating code for graph applications on a manycore architecture that uses high-bandwidth memory.
Taming the Zoo: The Unified GraphIt Compiler Framework for Novel Architectures.
In Intl. Symposium on Computer Architecture (ISCA) (2021).
paper (pdf), bibtex
Parallelizing Instance-Based Data Classifiers.
In Intl. Florida Artificial Intelligence Research Society Conference (FLAIRS) (2016).