Unsupervised Graphic Layout Grouping With Transformers

  • Published: 28. 01. 2024
  • Authors: Jialiang Zhu; Danqing Huang; Chunyu Wang; Mingxi Cheng; Ji Li; Han Hu; Xin Geng; Baining Guo
    Description: Graphic design conveys messages through the combination of text, images, and other visual elements. Unstructured designs, such as overloaded social media graphics, may fail to communicate their intended messages effectively. Layout grouping addresses this issue by organizing design elements into perceptual groups. While most methods rely on heuristic Gestalt principles, they often lack the context modeling ability needed to handle complex layouts. In this work, we reformulate the layout grouping task as a set prediction problem, using Transformers to learn a set of group tokens at various hierarchies, which enables the model to reason about element membership more effectively. The self-attention mechanism in Transformers strengthens its context modeling ability, allowing it to handle complex layouts more accurately. To reduce annotation costs, we also propose an unsupervised learning strategy that pre-trains on noisy pseudo-labels induced by a novel heuristic algorithm, then bootstraps to self-refine those labels, further improving the accuracy of our model. Extensive experiments demonstrate the effectiveness of our method, which outperforms existing state-of-the-art approaches in both accuracy and efficiency.
  • Science & Technology
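The description stays high-level, so here is a minimal sketch of the set-prediction idea it mentions: a fixed set of learned group tokens cross-attend to element embeddings, and each element is assigned to the group token that attends to it most strongly. All names, shapes, and the single-head attention here are hypothetical illustrations, not the paper's actual architecture.

```python
import numpy as np

def group_elements(element_feats, group_tokens):
    """Assign each layout element to one of K group tokens via
    scaled dot-product attention scores (illustrative sketch only,
    not the authors' implementation)."""
    d = element_feats.shape[-1]
    # attention scores: (K groups) x (N elements)
    scores = group_tokens @ element_feats.T / np.sqrt(d)
    # each element joins the group whose token scores it highest
    return scores.argmax(axis=0)

rng = np.random.default_rng(0)
elements = rng.normal(size=(6, 16))   # 6 design elements, 16-dim features
tokens = rng.normal(size=(3, 16))     # 3 hypothetical group tokens
assignment = group_elements(elements, tokens)
print(assignment)  # one group id (0..2) per element
```

In the paper's formulation the tokens and element features would be produced and refined by Transformer layers, with self-attention among elements providing the context modeling the description emphasizes; this sketch only shows the final membership-assignment step.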
