News and Travel

  • Sept 2024: My paper, From Local Concepts to Universals has been accepted to EMNLP'24. Excited to present and travel to Miami in November 2024.
  • Sept 2024: Shifted to Montreal, starting my PhD at McGill University & MILA with Prof. Siva Reddy.
  • Aug 2024: Completed my Masters Thesis Presentation at UBC.
  • April 2024: Travelling to Barbados for the Bellairs Invitational Workshop on Contemporary, Foreseeable and Catastrophic Risks of Large Language Models.
  • Feb 2024: I was LIVE news on Global News! Thanks Vered for the opportunity.
  • Dec 2023: Virtually presented my paper GD-COMET at EMNLP'23 held in Singapore.
  • July 2023: Attended DLRL Summer School in Montreal.
  • July 2023: Travelling to Toronto for ACL'24.
  • Sept 2022: Shifted to Vancouver and starting my Masters (Thesis) in Computer Science at UBC Vancouver. I will be working with Prof. Vered Shwartz.

Selected Publications

Complete List: Google Scholar

Accepted to EMNLP'24
Despite recent advancements in vision-language models, their performance remains suboptimal on images from non-western cultures due to underrepresentation in training datasets. Various benchmarks have been proposed to test models' cultural inclusivity, but they have limited coverage of cultures and do not adequately assess cultural diversity across universal as well as culture-specific local concepts. To address these limitations, we introduce the GlobalRG benchmark ...

In Submission
Frontier large language models (LLMs) are developed by researchers and practitioners with skewed cultural backgrounds and on datasets with skewed sources. However, LLMs' (lack of) multicultural knowledge cannot be effectively assessed with current methods for developing benchmarks. Existing multicultural evaluations primarily rely on expensive and restricted human annotations or potentially outdated internet resources. Thus, they struggle to capture the intricacy, dynamics, and diversity of cultural norms ...

Accepted to EMNLP'23
With the increasing integration of AI into everyday life, it's becoming crucial to design AI systems that serve users from diverse backgrounds by making them culturally aware. In this paper, we present GD-COMET, a geo-diverse version of the COMET commonsense inference model. GD-COMET goes beyond Western commonsense knowledge and is capable of generating inferences pertaining to a broad range of cultures. We demonstrate the effectiveness of GD-COMET ...

Blog

Natural Language Processing (NLP) applications are now ubiquitous and used by millions of individuals worldwide on a daily basis. Nevertheless, these applications can be overwhelmingly brittle and biased. For example, it has been seen that the accuracy of syntactic parsing models drops by at least 20 percent on African-American vernacular English when compared to textbook-like English (how it is commonly spoken by the more privileged class of Americans). Further, sentiment analyzers fail on language originating from different time periods, question-answering systems fail on British English, conversational assistants struggle to interact with millions of elderly people with speech disabilities, and hate speech detection systems are biased and more likely to classify language from specific demographics incorrectly as offensive. In short, NLP models and applications work well only for a minority of the population, effectively excluding a significant majority that uses such applications exactly as often. It is shocking to see that roughly 6500 languages are spoken in the world today, however, the advancement in NLP in academia and industry focuses on a minuscule subset...

Contact