Harman Singh

I am a PhD student at UC Berkeley. I am working on Reasoning, Agents and Diffusion Language Models.

Previously, I did research at Google DeepMind, working on Gemini with Dr. Partha Talukdar in the Languages team. I focused on Alignment, Reasoning, and Multimodal Modeling. I also contributed to improving Multilinguality. My work has been used in Gemini 3.0, Gemma 3, Gemini 2.5 Pro and 2.0 models.

Before that, I was an AI Resident at FAIR, Meta where I worked on reasoning abilities of Vision Language Models. I completed my undergrad from Indian Institute of Technology, Delhi, advised by Prof. Parag Singla. During this time, I was a research intern at InkLab, USC, advised by Prof. Xiang Ren and a research intern at IBM Research AI.

Email / CV / Google Scholar / Semantic Scholar / Twitter / Github

	Gemini 2.5 Pro and Gemini 2.0 Gemini Team (Including Harman Singh) Contributed to Multimodality and Multilinguality Gemini 2.5 Pro Technical Report \| Google Blog (Gemini 2.5 Pro) \| Google Blog (Gemini 2.0)
	Gemma 3 Gemma Team (Including Harman Singh) Contributed to Multilinguality Gemma 3 Technical Report \| Google Blog
	Robust Reward Modeling via Causal Rubrics Pragya Srivastava, Harman Singh, Rahul Madhavan, Gandharv Patil, Sravanti Addepalli, Arun Suggala, Rengarajan Aravamudhan, Soumya Sharma, Anirban Laha, Aravindan Raghuveer, Karthikeyan Shanmugam, Doina Precup Under Review, 2025* DataWorld Workshop and Models of Human Feedback for AI Alignment (MoFA) Workshop at ICML 2025 Developing robust reward models having reduced reliance on spurious attributes and higher sensitivity to causal attributes (rubrics).
	IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar ACL 2024 (long paper, main conference) Dataset Links: Github \| HuggingFace IndicGenBench is a multilingual, multi-way parallel benchmark for measuring language generation capabilities across diverse user-facing tasks in 29 Indic languages spanning 13 writing scripts and 4 language families.
	Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang, Wenhan Xiong, Jingfei Du, Yu Chen EMNLP 2023 (long paper, main conference) Oral acceptance to CLVL Workshop at ICCV 2023 Oral acceptance to SpLU-RoboNLP Workshop at EMNLP 2023 Improving compositional reasoning capabilities of SOTA Vision-Language Models through a new Coarse-to-Fine contrastive learning technique as well as effective hard negative mining.
	Cross-Lingual Multi-Hop Knowledge Editing Aditi Khandelwal, Harman Singh, Hengrui Gu, Tianlong Chen, Kaixiong Zhou [Equal Contribution] EMNLP 2024 (long paper, findings)* Benchmarking and improving retrieval augmented knoweledge editing of LLMs, in a cross-lingual and multi-hop setting.
	Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Parag Singla, Dinesh Garg EMNLP 2023 (long paper, main conference) New datasets and a modular method for weakly-supervised instruction guided image manipulations.
	FaiRR: Faithful and Robust Deductive Reasoning over Natural Language Soumya Sanyal, Harman Singh, Xiang Ren ACL 2022 (long paper, main conference) Proposed a modular method (FaiRR) for logical reasoning over natural language rule bases. Our methods ensure model faithfulness by assured causal relation from the proof step to the inference reasoning. FaiRR is more interpretable, efficient as compared to baselines, and generalizes better to OOD logical reasoning tasks.
	STAB: Speech Tokenizer Assessment Benchmark Shikhar Vashisht, Harman Singh, Shikhar Bharadwaj, Sriram Ganapathy, Chulayuth Asawaroengchai, Kartik Audhkhasi, Andrew Rosenberg, Ankur Bapna, Bhuvana Ramabhadran [Equal Contribution] Under Review A systematic evaluation framework designed to assess speech tokenizers comprehensively and shed light on their inherent characteristics, cheaply, without having to train large speech foundation models.

Past Work (Bioinformatics Research)
	Unlocking capacities of genomics for the COVID-19 response and future pandemics Sergey Knyazev, Karishma Chhugan, Harman Singh, Varuni Sarwal, Ram Ayyala, ..., Serghei Mangul Nature Methods*
	A Novel Network Representation of SARS-CoV-2 Sequencing Datas Sergey Knyazev, Daniel Novikov, Mark Grinshpon, Harman Singh, ..., Serghei Mangul International Symposium on Bioinformatics Research and Applications 2021

Reviewer for ICLR 2023, NeurIPS 2023, EMNLP 2023, MLRC 2022 (Outstanding Reviewer Award), COLM 2024, ACL Rolling Review (ARR) 2024 (Feb, April, June)

Teaching Assitant for Machine Learning (Dr. Sumeet Agarwal and Dr. Jayadeva, Fall 2021)

Teaching Assitant for Intro. to EE (Dr. Anuj Dhawan, Fall 2021)

Demo Leader, NeurIPS Education Outreach Program (for 240+ high school students), NeurIPS 2022

Website template by Jon Barron

Past Work (Bioinformatics Research)