Our mission is to enable trust between humans and AI agents through language technology, making human-machine teaming possible in real-world applications. Our research centers on natural language processing, with broader contributions to machine learning, multimodal models, and artificial intelligence.

AI for Formal Mathematics

AI for Science In progress

We aim to develop an agentic AI system that automates key aspects of mathematical reasoning, including theorem decomposition, lemma identification, novel proof strategy generation, seamless translation between natural language and formal proof systems, and automatic conjecture discovery.

Multimodal Interactive Learning

Multimodal In progress

We develop multimodal systems that learn new concepts through language, visual grounding, interaction, and feedback, with emphasis on recognizing unfamiliar objects, understanding richer descriptions, robust video-language understanding, and more reliable human-machine collaboration.

Related Projects

  • NOVA: A Neuro-Symbolic Vision-Language Framework for Multimodal Human-Machine Interactions
  • MIRACLE: Multimodal InteRActive Conceptual Learning

Sponsors

DARPA ONR

Neuro-Symbolic Reasoning

Reasoning In progress

We integrate neural prediction with probabilistic inference and symbolic constraints so models can produce structured outputs that are accurate and verifiable. We apply these approaches to problems ranging from improving LLM safety to enhancing model reasoning capabilities.
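One common way to integrate symbolic constraints with neural prediction is to compile a logical constraint into a differentiable penalty on the model's output probabilities. The sketch below is a minimal, hypothetical illustration of that idea (it is not the PYLON API): an implication constraint A → B is scored by the probability that the model's predictions satisfy it, and violating predictions receive a large loss.

```python
import numpy as np

def sigmoid(z):
    # Map a logit to a probability.
    return 1.0 / (1.0 + np.exp(-z))

def implication_loss(logit_a, logit_b):
    """Differentiable penalty for the symbolic constraint A -> B.

    Under independence, the constraint is violated only when A is true
    and B is false, so P(constraint) = 1 - P(A) * (1 - P(B)).
    The loss is the negative log-probability of satisfying it.
    """
    p_a, p_b = sigmoid(logit_a), sigmoid(logit_b)
    p_satisfied = 1.0 - p_a * (1.0 - p_b)
    return -np.log(p_satisfied)

# Consistent predictions (A likely, B likely) incur little penalty;
# predicting A without B is penalized heavily.
consistent = implication_loss(4.0, 4.0)
violating = implication_loss(4.0, -4.0)
```

Added to a standard task loss, a term like this nudges the network toward structured outputs that respect the constraint while remaining fully trainable by gradient descent.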

Related Projects

  • PYLON: An Integrated Semantic Framework for Probabilistic Neuro-Symbolic Learning and Reasoning
  • CRII: Learning Structured Prediction Model with Auxiliary Supervision

Sponsors

DARPA NSF

LLM Safety and Guardrails

TrustNLP In progress

We build detection, evaluation, guardrail, and mitigation methods for unsafe behavior, over-refusal, sleeper agents, jailbreaks, and customized multimodal safety policies.

Related Projects

  • SLES: Verifying and Enforcing Safety Constraints in AI-based Sequential Generation
  • Customized robust and controllable text processing
  • Safety reasoning and red-teaming for LLMs and multimodal systems

Sponsors

NSF Amazon Google

Multimodal AI Agents, Tool Use, and Long-Term Memory

Agent In progress

We develop open-source language agents, tool-use workflows, long-term memory benchmarks, and data-analysis agents for complex multimodal interactive tasks.

Related Projects

  • Enhancing the Reasoning Capabilities of Multimodal Large Language Models
  • Learning to Reason Better Than Your Teacher for Adaptive Multimodal Agents

Sponsors

Apple Amazon Google

Trustworthy Medical AI Agent

Information In progress

We develop trustworthy AI solutions for healthcare applications, from matching patients to clinical trials, to clinical report analysis, radiology summarization, and patient-centered medical decision support.

Related Projects

  • Medical Vision-Language Foundation Models for Clinical Report Analysis
  • Co-designing ethical multimodal AI systems for mapping T1D progression

Sponsors

NIH Optum Labs

Information Extraction for Pandemic Prevention

Information Expired

We develop NLP approaches that detect early signs of emerging infectious diseases, predict their spread, and monitor risk factors through multilingual social media posts.

Related Projects

  • PIPP Phase 1: An end-to-end pandemic early warning system by harnessing open-source intelligence
  • Online news trend-watching via linguistic analysis

Sponsors

NSF Taboola

Governing Bias and Human-Centered AI

TrustNLP Expired

We study how bias appears in representations, generation, recommendations, and social text, and design human-centered interventions for more equitable AI systems.

Related Projects

  • AI-DCL: Governing Bias in AI System with Humans in the Decision Loop
  • Discerning Group Biases in Online Communities via Linguistic Analysis
  • Sloan Research Fellowship on fairness, robustness, and inclusion

Sponsors

NSF DARPA Sloan Foundation Okawa Foundation

Machine Common Sense

Multimodal Expired

This project studies commonsense knowledge from video, images, text, and knowledge bases, with benchmarks and models for multimodal social and scientific reasoning.

Related Projects

  • Discovering Common Sense from Video, Images, Text and Knowledge Bases

Sponsors

DARPA