Wanrong Zhu

Wanrong Zhu

CS Ph.D. Candidate

University of California, Santa Barbara


Hi, I am a final-year Ph.D. candidate in the Natural Language Processing Group at UCSB, advised by William Wang. I am honored and humbled to be named a 2023 Rising Stars in Machine Learning by University of Maryland. Before joining UCSB, I received my B.S. degree in Computer Science from Peking University.


  wanrongzhu [at] cs.ucsb.edu
   Henley Hall, UCSB


Education

  • University of California, Santa Barbara

    Ph.D. in Computer Science

    Sep. 2019 - Present

  • Peking University

    B.S. in Computer Science

    Sep. 2015 - July 2019

Interests

  • Vision-and-Language
  • Multimodal Study
  • Text Generation

Experience

Publications & Preprints

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Preprint (arXiv 2311.07562)
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

The Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Multimodal Procedural Planning via Dual Text-Image Prompting

Preprint (arXiv 2305.01795)
Multimodal Procedural Planning via Dual Text-Image Prompting

Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text

The Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS D&B 2023)
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text

OpenFlamingo: An Open-Source Framework for Training Vision-Language Models with In-Context Learning

Preprint (arXiv 2308.01390)
OpenFlamingo: An Open-Source Framework for Training Vision-Language Models with In-Context Learning

CLIP also Understands Text: Prompting CLIP for Phrase Understanding

Preprint (arXiv 2210.05836)
CLIP also Understands Text: Prompting CLIP for Phrase Understanding

Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation

The 57th Annual Meeting of the Association for Computational Linguistics:System Demonstrations (ACL 2019 System Demonstration)
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation

Text Infilling

Preprint (arXiv 1901.00158)
Text Infilling