About

I am a Ph.D. student in Computer Science at Stanford University working at the intersection of biomedicine and AI. I am very fortunate to have been working with Jure Leskovec, Emma Lundberg, Anshul Kundaje and Aaron Newman. My current research focuses on 1) tissue cellular composition and 2) subcellular protein localization, where I develop machine learning models for multi-omics data, with the goal of improving our understanding of the mechanism of diseases, like cancer, and enabling personalized treatment. Previously I have also worked on graph representation learning, natural language processing and regulatory genomics.

Before Stanford, I was an AI Resident at Google Research working with Michael Collins on pretraining multimodal vision+language models. I was also a research intern at Google Research in summer 2019 working with Dan Roth and Ian Tenney on probing pretrained language models.

I received my B.S. in computer science at UIUC in 2019. I did research with Jiawei Han on data mining and heterogenous graph representation learning.

xikunz2@cs.[university].edu Google Scholar Profile Resume GitHub LinkedIn Twitter

Selected Publications

How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities
Charlotte Bunne*, Yusuf Roohani*, Yanay Rosen*, Ankit Gupta, Xikun Zhang, Marcel Roed, Theo Alexandrov, Mohammed AlQuraishi, Patricia Brennan, Daniel B. Burkhardt, Andrea Califano, Jonah Cool, Abby F. Dernburg, Kirsty Ewing, Emily B. Fox, Matthias Haury, Amy E. Herr, Eric Horvitz, Patrick D. Hsu, Viren Jain, Gregory R. Johnson, Thomas Kalil, David R. Kelley, Shana O. Kelley, Anna Kreshuk, Tim Mitchison, Stephani Otte, Jay Shendure, Nicholas J. Sofroniew, Fabian Theis, Christina V. Theodoris, Srigokul Upadhyayula, Marc Valer, Bo Wang, Eric Xing, Serena Yeung-Levy, Marinka Zitnik, Theofanis Karaletsos, Aviv Regev, Emma Lundberg, Jure Leskovec, Stephen R. Quake (*equal contribution)
To appear in Cell
PDF Abstract Bibtex

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering
Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D Manning, Jure Leskovec
ICLR 2022 (Spotlight, Top 5%)
PDF Abstract Bibtex Code Video

On the opportunities and risks of foundation models
Rishi Bommasani, …, Xikun Zhang, …, Percy Liang (116 authors)
arXiv 2021
PDF Abstract Bibtex Website

Do Language Embeddings Capture Scales?
Xikun Zhang*, Deepak Ramachandran*, Ian Tenney, Yanai Elazar, Dan Roth (*equal contribution)
Findings of EMNLP 2020
EMNLP BlackboxNLP workshop 2020

PDF Abstract Bibtex Code Blog

Predicting Embedding Trajectories for Temporal Interaction Networks
Srijan Kumar, Xikun Zhang, Jure Leskovec
KDD 2019 (Oral, Top 9%)
PDF Abstract Bibtex Code Website Video

Other Publications

Deep Bidirectional Language-Knowledge Graph Pretraining
Michihiro Yasunaga, Antoine Bosselut, Hongyu Ren, Xikun Zhang, Christopher D Manning, Percy Liang*, Jure Leskovec* (*equal contribution)
NeurIPS 2022
AAAI DLG workshop 2023 (Best Paper Award)

PDF Abstract Bibtex Code

CLEVRER-Humans: Describing Physical and Causal Events the Human Way
Jiayuan Mao*, Xuelin Yang*, Xikun Zhang, Noah Goodman, Jiajun Wu (*equal contribution)
NeurIPS Datasets and Benchmarks Track 2022
PDF Abstract Bibtex Code Website

Similarity Modeling on Heterogeneous Networks via Automatic Path Discovery
Carl Yang, Mengxiong Liu, Frank He, Xikun Zhang, Jian Peng, Jiawei Han
ECML-PKDD 2018
PDF Abstract Bibtex Code

TruePIE: Discovering Reliable Patterns in Pattern-Based Information Extraction
Qi Li*, Meng Jiang*, Xikun Zhang, Meng Qu, Timothy Hanratty, Jing Gao, Jiawei Han (*equal contribution)
KDD 2018 (Long Presentation, Top 11%)
PDF Abstract Bibtex Code Video


Academic Services


Awards