MikuProj
MikuProj is a WebPlayer who works with Hatsune Miku’s songs. All of the songs have their respective credits in the layout, and in licenses.txt. MikuProj uses...
Browse medical articles by letter, category, and search. Built for large health libraries.
MikuProj is a WebPlayer who works with Hatsune Miku’s songs. All of the songs have their respective credits in the layout, and in licenses.txt. MikuProj uses...
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval Dominant pre-training work for video-text retrieval mainly adopt the “dual-encoder” architectures to enable efficient retrieval,...
Mitigating Artifacts in Real-World Video Super-Resolution Models with More Cheap Hidden States and Selective Cross Attention The recurrent structure is a prevalent framework for the task...
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community....
MM-RealSR: Metric Learning based Interactive Modulation for Real-World Super-Resolution Interactive image restoration aims to restore images by adjusting several controlling coefficients, which determine the restoration strength....
In the world of online anonymity and privacy, proxies play a crucial role in ensuring secure and private browsing experiences. However, buying proxies with traditional payment...
Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding Tag inference is an important task in the business of video platforms with wide applications such...
Many people think that taxation is something that came with banks and the Industrial Revolution. The truth is that it existed long before in ancient civilizations,...
NeRF-Texture: Texture Synthesis With Neural Radiance Fields Texture synthesis is a fundamental problem in computer graphics that would benefit various applications. Existing methods are effective in...
Any-to-any singing voice conversion is confronted with a significant challenge of “timbre leakage” issue caused by inadequate disentanglement between the content and the speaker timbre. To...
Object-aware Video-language Pre-training for Retrieval Recently, by introducing large-scale dataset and strong transformer network, video-language pre-training has shown great success especially for retrieval. Yet, existing video-language...
OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution Omnidirectional images (ODIs) have become increasingly popular, as their large field-of-view (FoV) can offer the...