1

Learning to Floorplan like Human Experts via Reinforcement Learning

Ege-unet: an efficient group enhanced unet for skin lesion segmentation

LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming

Open-domain dialogue systems have made promising progress in recent years. While the state-of-the-art dialogue agents are built upon large-scale text-based social media data and large pre-trained models, there is no guarantee these agents could also …

DynaSlim: Dynamic Slimming for Vision Transformers

AV-TAD: Audio-Visual Temporal Action Detection with Transformer

CC-PoseNet: Towards human pose estimation in crowded classrooms

MALUNet: A Multi-Attention and Light-weight UNet for Skin Lesion Segmentation

Recently, some pioneering works have preferred applying more complex modules to improve segmentation performances. However, it is not friendly for actual clinical environments due to limited computing resources. To address this challenge, we propose …

Spatial Attention Guided Local Facial Attribute Editing

Facial attribute manipulation has attracted great attention from the public due to its wide range of applications. Aiming to smoothly manipulate the attributes of real facial images, it is critical to search a proper latent code aligns with the …

GOS: A Large-Scale Annotated Outdoor Scene Synthetic Dataset

Scene editing has attracted increasing research interests owing to its valuable applications in the field of photography, entertainment. With style-based GAN being proposed, images can be reasonably edited on specific semantic by manipulating in …

SynPose: A Large-scale and Densely Annotated Synthetic Dataset for Human Pose Estimation in Classroom

Deep learning-based methods for human pose estimation require large volumes of training data to achieve superior performance. However, data acquisition in classroom environments raises privacy concerns, which will undoubtedly hinder the development …