Weichong Yin

ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Nov 09, 2022

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts

Oct 27, 2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

Oct 14, 2022

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

Sep 30, 2022

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

Sep 18, 2022

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Dec 31, 2021

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph

Jun 30, 2020