AI For Security Research Areas


LLM Jailbreak Attacks & Defenses
Developing systematic frameworks for evaluating and hardening LLMs against jailbreak attacks. Covers both attack generation (multi-agent RL-based obfuscation) and defense strategies, with emphasis on black-box and white-box threat models. Current work includes cipher-based attacks, multi-agent coordination patterns, and time-persistent jailbreak methodologies.


Vulnerability Detection & Analysis


Offensive Security Benchmarks


VLM Hallucination Mitigation


Efficient AI for Code Generation


AI-driven vulnerability detection and analysis for C/C++ code and binary programs. Combines retrieval-augmented generation (RAG) with multi-agent LLM pipelines for memory corruption analysis. Current work focuses on scalable detection across large codebases and fine-tuning specialized models for vulnerability reporting.
Building scalable, open-source benchmark datasets for evaluating LLM capabilities in offensive security tasks. Covers Capture The Flag (CTF) challenges, penetration testing scenarios, and vulnerability exploitation. Focused on creating reproducible evaluation standards for the security community.
Paper 1: NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Paper 2: An Empirical Evaluation of LLMs for Solving Offensive Security Challenges
Grounding vision-language models (VLMs) with object detection to mitigate counting hallucinations. Develops modular architectures that combine traditional computer vision detectors with VLMs for reliable visual reasoning. Aims to make VLMs trustworthy for security-critical visual analysis tasks.
Cost-efficient and reliable code generation using cascaded multi-model frameworks with self-testing. Reduces inference costs while maintaining code quality through intelligent model routing and verification. Applicable to both general code completion and security-sensitive code generation tasks.
Team Awards


Best Paper Nomination in the 18th International Conference on Control, Automation, Robotics and Vision (ICARCV) 2024.
© 2026 eBRAIN
