Skip to content
View aayambansal's full-sized avatar
🏠
In SF
🏠
In SF

Highlights

  • Pro

Block or report aayambansal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aayambansal/README.md

Hello.

Profile views


About me.

  • Currently building Synthetic Sciences
  • Reach me at aayambansal@gmail.com
  • Published at ICML, ICLR, NeurIPS, CVPR, AAAI, ... Worked at Kelis Lab, MIT CSAIL; Carnegie Mellon; NUS & Oxford.
  • Z Fellows, Exited aisock (patented in US, Sin, Ind) & spent the money on a $8k spec'd out macbook & chipotle over 2 years. (also funded my break even poker games)

🌐 Connect with me.

Twitter    LinkedIn    Instagram

Pinned Loading

  1. ConsistencyBench ConsistencyBench Public

    We benchmark 18 frontier LLMs on cross-query logical consistency, reveal universal 36-57pp gaps between individual accuracy and set-level consistency, and propose a training-free method (CGD) that …

    Python

  2. VIAR-EECV26 VIAR-EECV26 Public

    We discover and characterize the visual neglect zone, a systematic pattern in vision-language models (VLMs) where middle transformer layers allocate disproportionately low attention to visual token…

    Python

  3. OpenDiscoveryTrace OpenDiscoveryTrace Public

    Process traces for evaluating AI scientist workflows | ICML 2026 AI4Science Dataset Competition | 432 trajectories from GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro

    TeX

  4. CER-Bench CER-Bench Public

    CER-Bench is a condition-sensitive retrieval benchmark for biomedical literature comprising 304+ tasks across eight families, designed to evaluate retrieval capabilities that working scientists act…

    Python

  5. KINS KINS Public

    Knowledge is Not Saying: Some work on Abstention

    Python