Jukebox is a brand new component introduced to Project Lyricova as a full-fledged music management system – again – focused on music of the ...
Lyricova as a lyrics blog has been redesigned to have a more modern look, encompassed by modern web features like dynamic Open Graph cover, advanced typography ...
The GNU Project has two principal licenses to use for libraries. One is the GNU Lesser GPL; the other is the ordinary GNU GPL. The choice of license makes a ...
Real Time Singing Synthesizer project made from sinsy-NG. The idea was to generate vocal audio samples on real time easily for live coding performances. ...
This script relies on the sinsy.jp website from the Nagoya Institute of Technology which implements a HMM-based Singing Voice Synthesis System. You can find a ...
VocaDB provides a public REST API for accessing artist, album and song information, and more. Endpoint for the most recent version of the API ...
OpenUtau is a free, open-source editor made for the UTAU community. It is strongly recommended that you read these Github wiki pages before using the ...
Feature Augmented Memory with Global Attention Network for VideoQA Recently, Recurrent Neural Network (RNN) based methods and Self-Attention (SA) based ...
Fast Video Object Segmentation using Global Context Module We developed a real-time, high-quality semi-supervised video object segmentation algorithm. Its ...
Dual Semantic Fusion Network for Video Object Detection Video object detection is a tough task due to the deteriorated quality of video sequences captured ...
Detecting Interactions from Neural Networks via Topological Analysis Detecting statistical interactions between input features is a crucial and challenging ...
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning Having access to multi-modal cues (eg vision and audio) empowers some cognitive tasks ...
Semantic-Guided Relation Propagation Network for Few-shot Action Recognition Few-shot action recognition has drawn growing attention as it can recognize novel ...
Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding Tag inference is an important task in the business of video platforms ...
Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization Weakly supervised temporal action localization (WS-TAL) is a challenging task ...
Towards Vivid and Diverse Image Colorization with Generative Color Prior Colorization has attracted increasing interest in recent years. Classic ...
Instances as Queries Recently, query based object detection frameworks achieve comparable performance with previous state-ofthe-art object detectors. However, ...
Crossover Learning for Fast Online Video Instance Segmentation Modeling temporal visual context across frames is critical for video instance segmentation ...
Open-book Video Captioning with Retrieve-Copy-Generate Network In this paper, we convert traditional video captioning task into a new paradigm, ie, Open-book ...
GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior Blind face restoration usually relies on facial priors, such as facial ...
- « Previous Page
- 1
- …
- 11
- 12
- 13
- 14
- 15
- …
- 18
- Next Page »