<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Neeladri Bhuiya</title><link>https://zawedcvg.github.io/</link><description>Recent content on Neeladri Bhuiya</description><generator>Hugo -- 0.152.0</generator><language>en</language><lastBuildDate>Fri, 06 Feb 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://zawedcvg.github.io/index.xml" rel="self" type="application/rss+xml"/><item><title>Prompt2Box</title><link>https://zawedcvg.github.io/papers/paper3/</link><pubDate>Fri, 06 Feb 2026 00:00:00 +0000</pubDate><guid>https://zawedcvg.github.io/papers/paper3/</guid><description>This paper uses prompts to model prompts.</description></item><item><title>PLAGUE: Plug-And-Play framework for life-long adaptive generation of multi-turn exploits</title><link>https://zawedcvg.github.io/papers/paper2/</link><pubDate>Sun, 28 Sep 2025 00:00:00 +0000</pubDate><guid>https://zawedcvg.github.io/papers/paper2/</guid><description>This paper studies the pulmonary efficiency of sausage dogs. Published in the ICLR 2026.</description></item><item><title>Seemingly Plausible Distractors</title><link>https://zawedcvg.github.io/papers/paper1/</link><pubDate>Tue, 12 Nov 2024 00:00:00 +0000</pubDate><guid>https://zawedcvg.github.io/papers/paper1/</guid><description>This paper shows that LLMs struggle with challenging multi-hop reasoning, but in more subtle ways than fine-tuned models</description></item><item><title>Experience</title><link>https://zawedcvg.github.io/experience/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://zawedcvg.github.io/experience/</guid><description>&lt;h2 id="industry"&gt;Industry&lt;/h2&gt;
&lt;h3 id="ai-safety-intern--a10-networks"&gt;AI Safety Intern · A10 Networks&lt;/h3&gt;
&lt;h5 id="june-2025--august-2025"&gt;June 2025 – August 2025&lt;/h5&gt;
&lt;ul&gt;
&lt;li&gt;Reduced peak memory usage of the existing GCG attack by 50% (2× memory efficiency), enabling attacks with larger number of prompts.&lt;/li&gt;
&lt;li&gt;Designed and implemented PLAGUE, a novel agentic multi-turn jailbreak attack framework that improved the attack success rate (ASR) by up to 40% points compared to the prior SOTA in models like Claude Opus 4.1.&lt;/li&gt;
&lt;li&gt;Designed an orchestrator to coordinate multiple agents and integrated a RAG-based module for lifelong retrieval of strategies and in-context learning for PLAGUE.&lt;/li&gt;
&lt;li&gt;Published paper: &lt;em&gt;PLAGUE: Plug-and-play Framework for Lifelong Adaptive Generation of Multi-turn Exploits&lt;/em&gt; | First Author | &lt;a href="https://iclr.cc/virtual/2026" target="_blank"&gt;ICLR 2026&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;
&lt;h2 id="research"&gt;Research&lt;/h2&gt;
&lt;h3 id="extern-graduate-researcher--ibm"&gt;Extern Graduate Researcher · IBM&lt;/h3&gt;
&lt;h5 id="january-2025--april-2025"&gt;January 2025 – April 2025&lt;/h5&gt;
&lt;ul&gt;
&lt;li&gt;Led a team of 4 to conduct research on creating a correctness classifier using LLM&amp;rsquo;s uncertainty estimation for code generation under the supervision of Dr. Andrew McCallum and Dr. Veronika Thost.&lt;/li&gt;
&lt;li&gt;Engineered an optimized TokenSAR implementation with Tree-sitter, improving runtime efficiency and boosting correctness prediction accuracy by 15% over uncertainty-based baselines.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="graduate-researcher--ciir-and-iesl"&gt;Graduate Researcher · CIIR and IESL&lt;/h3&gt;
&lt;h5 id="december-2024--present"&gt;December 2024 – Present&lt;/h5&gt;
&lt;ul&gt;
&lt;li&gt;Conducting research under Dr. Andrew McCallum (IESL) and Dr. Hamed Zamani (CIIR) on leveraging box embeddings for evaluating LLM performance.&lt;/li&gt;
&lt;li&gt;Proposed modeling LLM prompts as box embeddings — axis-aligned hyperrectangles that jointly encode semantic relevance and specificity, going beyond the topical similarity captured by standard vector embeddings.&lt;/li&gt;
&lt;li&gt;Developed BoxSNE, an nD-to-2D visualization technique for box embeddings, improving interpretability. Demonstrated strong correlation between box volume and instruction complexity.&lt;/li&gt;
&lt;li&gt;Results show improved retrieval performance and interesting scaling behaviors. See &lt;a href="https://arxiv.org/pdf/2603.21438" target="_blank"&gt;Prompt2Box&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="researcher--national-university-of-singapore"&gt;Researcher · National University of Singapore&lt;/h3&gt;
&lt;h5 id="august-2023--may-2024"&gt;August 2023 – May 2024&lt;/h5&gt;
&lt;ul&gt;
&lt;li&gt;Conducted a final year research project under the supervision of Dr. Stefan Winkler and Dr. Viktor Schlegel, leading to a deeper understanding of LLM&amp;rsquo;s performance under strong adversarial attacks.&lt;/li&gt;
&lt;li&gt;Investigated multi-hop reasoning failures in LLMs by designing a dependency-parsing-based adversarial attack to expose systematic shortcutting behavior during inference.&lt;/li&gt;
&lt;li&gt;Developed a novel adversarial attack which affects state-of-the-art LLM&amp;rsquo;s performance up to 45%.&lt;/li&gt;
&lt;li&gt;Published paper: &lt;em&gt;Seemingly Plausible Distractors in Multi-Hop Reasoning&lt;/em&gt; | First Author | &lt;a href="https://aclanthology.org/2024.emnlp-main.147/" target="_blank"&gt;EMNLP 2024&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;
&lt;h2 id="education"&gt;Education&lt;/h2&gt;
&lt;h3 id="ms-computer-science--umass-amherst"&gt;M.S. Computer Science · UMass Amherst&lt;/h3&gt;
&lt;h5 id="2024--2026--gpa-4040"&gt;2024 – 2026 · GPA: 4.0/4.0&lt;/h5&gt;
&lt;h3 id="bs-computer-science--national-university-of-singapore"&gt;B.S. Computer Science · National University of Singapore&lt;/h3&gt;
&lt;h5 id="2020--2024"&gt;2020 – 2024&lt;/h5&gt;</description></item></channel></rss>