Picture for Bo Ren

Bo Ren

On decoder-only architecture for speech-to-text and large language model integration

Add code
Jul 14, 2023
Figure 1 for On decoder-only architecture for speech-to-text and large language model integration
Figure 2 for On decoder-only architecture for speech-to-text and large language model integration
Figure 3 for On decoder-only architecture for speech-to-text and large language model integration
Viaarxiv icon

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Add code
May 12, 2023
Figure 1 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 2 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 3 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 4 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Viaarxiv icon

Multi-Space Neural Radiance Fields

Add code
May 07, 2023
Figure 1 for Multi-Space Neural Radiance Fields
Figure 2 for Multi-Space Neural Radiance Fields
Figure 3 for Multi-Space Neural Radiance Fields
Figure 4 for Multi-Space Neural Radiance Fields
Viaarxiv icon

Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections

Add code
Apr 18, 2023
Figure 1 for Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections
Figure 2 for Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections
Figure 3 for Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections
Figure 4 for Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections
Viaarxiv icon

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

Add code
Mar 26, 2023
Figure 1 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Figure 2 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Figure 3 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Figure 4 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Viaarxiv icon

Turning a CLIP Model into a Scene Text Detector

Add code
Mar 01, 2023
Figure 1 for Turning a CLIP Model into a Scene Text Detector
Figure 2 for Turning a CLIP Model into a Scene Text Detector
Figure 3 for Turning a CLIP Model into a Scene Text Detector
Figure 4 for Turning a CLIP Model into a Scene Text Detector
Viaarxiv icon

SLAN: Self-Locator Aided Network for Cross-Modal Understanding

Add code
Dec 08, 2022
Figure 1 for SLAN: Self-Locator Aided Network for Cross-Modal Understanding
Figure 2 for SLAN: Self-Locator Aided Network for Cross-Modal Understanding
Figure 3 for SLAN: Self-Locator Aided Network for Cross-Modal Understanding
Figure 4 for SLAN: Self-Locator Aided Network for Cross-Modal Understanding
Viaarxiv icon

FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning

Add code
Dec 01, 2022
Figure 1 for FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
Figure 2 for FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
Figure 3 for FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
Figure 4 for FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
Viaarxiv icon

Grafting Pre-trained Models for Multimodal Headline Generation

Add code
Nov 14, 2022
Figure 1 for Grafting Pre-trained Models for Multimodal Headline Generation
Figure 2 for Grafting Pre-trained Models for Multimodal Headline Generation
Figure 3 for Grafting Pre-trained Models for Multimodal Headline Generation
Figure 4 for Grafting Pre-trained Models for Multimodal Headline Generation
Viaarxiv icon

Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning

Add code
Oct 10, 2022
Figure 1 for Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning
Figure 2 for Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning
Figure 3 for Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning
Figure 4 for Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning
Viaarxiv icon