Picture for Xuri Ge

Xuri Ge

Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition

Add code
May 26, 2024
Viaarxiv icon

3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

Add code
Apr 26, 2024
Viaarxiv icon

IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT

Add code
Apr 11, 2024
Viaarxiv icon

Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries

Add code
Feb 28, 2024
Viaarxiv icon

The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection

Add code
Jul 07, 2023
Figure 1 for The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection
Figure 2 for The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection
Figure 3 for The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection
Figure 4 for The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection
Viaarxiv icon

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Add code
Oct 17, 2022
Figure 1 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 2 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 3 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 4 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Viaarxiv icon

MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection

Add code
Apr 08, 2022
Figure 1 for MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection
Figure 2 for MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection
Figure 3 for MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection
Figure 4 for MGRR-Net: Multi-level Graph Relational Reasoning Network for Facial Action Units Detection
Viaarxiv icon

Automatic Facial Paralysis Estimation with Facial Action Units

Add code
Mar 30, 2022
Figure 1 for Automatic Facial Paralysis Estimation with Facial Action Units
Figure 2 for Automatic Facial Paralysis Estimation with Facial Action Units
Figure 3 for Automatic Facial Paralysis Estimation with Facial Action Units
Figure 4 for Automatic Facial Paralysis Estimation with Facial Action Units
Viaarxiv icon

Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation

Add code
Mar 12, 2022
Figure 1 for Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
Figure 2 for Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
Figure 3 for Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
Viaarxiv icon

Differentiated Relevances Embedding for Group-based Referring Expression Comprehension

Add code
Mar 12, 2022
Figure 1 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Figure 2 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Figure 3 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Figure 4 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Viaarxiv icon