Zixian Ma

m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Mar 21, 2024
Via arXiv

SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality

Jun 26, 2023
Via arXiv

Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design

Mar 06, 2023
Via arXiv

CREPE: Can Vision-Language Foundation Models Reason Compositionally?

Dec 13, 2022
Via arXiv

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward

Oct 09, 2022
Via arXiv

MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing

Jan 11, 2022
Via arXiv