Cryptocurrency, Bitcoin, and Behind Dark Web Technology
LayoutDiffusion: Controllable Diffusion Model

LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation. Recently, diffusion models have achieved great success in image synthesis. ...

Accelerating Vision-Language Pretraining

Accelerating Vision-Language Pretraining with Free Language Modeling. The state of the art in vision-language pretraining (VLP) achieves exemplary performance ...

OSRT: Omnidirectional Image Super-Resolution

OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer. Omnidirectional images (ODIs) have attracted considerable research interest for ...

Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation. As an important and challenging problem in computer vision, PAnoramic ...

Task-Aware Dual-Representation Network

Task-Aware Dual-Representation Network for Few-Shot Action Recognition. Few-shot action recognition has attracted increasing attention in recent years, but it ...

Delete the Artifacts of GAN-based Real-World Super-Resolution Models

DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models. Image super-resolution (SR) with generative adversarial networks (GAN) ...

Pi-Tuning: Transferring Multimodal Foundation Models

Pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation. Foundation models have achieved great advances in multi-task ...

NeRF-Texture: Texture Synthesis With Neural Radiance Fields

NeRF-Texture: Texture Synthesis With Neural Radiance Fields. Texture synthesis is a fundamental problem in computer graphics that would benefit various ...

Binary Embedding-based Retrieval at Tencent

Binary Embedding-based Retrieval at Tencent. Large-scale embedding-based retrieval (EBR) is the cornerstone of search-related industrial applications. Given a ...

Prosody Modeling with 3D Visual Information

Prosody Modeling with 3D Visual Information for Expressive Video Dubbing. The automatic video dubbing task is proposed to meet personal and industrial demands ...

Unleashing Vanilla Vision Transformer with Masked Image Modeling

Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection. We present an approach to efficiently and effectively adapt a masked ...

Tune-A-Video

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation. To replicate the success of text-to-image (T2I) generation, recent works ...

Exploring Model Transferability through the Lens of Potential Energy

Transfer learning has become crucial in various computer vision tasks, benefiting from the vast availability of pre-trained deep learning models. ...

Order-Prompted Tag Sequence Generation for Video Tagging

Video tagging aims to infer multiple tags that cover the relevant content of a given video. Typically, video tags are freely defined and uploaded by a variety of ...

MasaCtrl: Tuning-free Mutual Self-Attention Control

MasaCtrl: Tuning-free Mutual Self-Attention Control for Consistent Image Synthesis and Editing. Despite the success in large-scale text-to-image generation and ...

Speech2Lip: High-fidelity Speech to Lip Generation

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video. Synthesizing realistic videos according to a given speech is still an open ...

OmniZoomer: Learning to Move and Zoom

OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution. Omnidirectional images (ODIs) have become increasingly popular, as their large ...

HOSNeRF: Dynamic Human-Object-Scene

HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video. We introduce HOSNeRF, a novel 360° free-viewpoint rendering method that ...

VMesh: Hybrid Volume-Mesh Representation

VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis. With the emergence of neural radiance fields (NeRFs), view synthesis quality has reached ...

CL-NeRF: Continual Learning of Neural Radiance

CL-NeRF: Continual Learning of Neural Radiance Fields for Evolving Scene Representation. Existing methods for adapting Neural Radiance Fields (NeRFs) to scene ...
