4129 |
Appearance-Preserving 3D Convolution for Video-based Person Re-identification
|
Xinqian Gu, Hong Chang, Bingpeng Ma, Hongkai Zhang, Xilin Chen |
4069 |
BorderDet: Border Feature for Dense Object Detection
|
Han Qiu, Yuchen Ma, Zeming Li, Songtao Liu, Jian Sun |
4032 |
Conditional Convolutions for Instance Segmentation
|
Zhi Tian, Chunhua Shen, Hao Chen |
4094 |
Content-Aware Unsupervised Deep Homography Estimation
|
Jirong Zhang, Chuan Wang, Shuaicheng Liu, Lanpeng Jia, Nianjin Ye, Jue Wang, Ji Zhou, Jian Sun |
4004 |
DeepFit: 3D Surface Fitting via Neural Network Weighted Least Squares
|
Yizhak Ben-Shabat, Stephen Gould |
4145 |
DeepHandMesh: A Weakly-supervised Deep Encoder-Decoder Framework for High-fidelity Hand Mesh Modeling
|
Gyeongsik Moon, Takaaki Shiratori, Kyoung Mu Lee |
3239 |
Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset
|
Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie |
4206 |
ForkGAN: Seeing into the Rainy Night
|
Ziqiang Zheng, Yang Wu, Xinran Han, Jianbo Shi |
4092 |
Gradient Centralization: A New Optimization Technique for Deep Neural Networks
|
Hongwei Yong, Jianqiang Huang, Xiansheng Hua, Lei Zhang |
4197 |
Hybrid Models for Open Set Recognition
|
Hongjie Zhang, Ang Li, Jie Guo, Yanwen Guo |
4068 |
Learn to Recover Visible Color for Video Surveillance in a Day
|
Guangming Wu, Yinqiang Zheng, Zhiling Guo, Zekun Cai, Xiaodan Shi, Xin Ding, Yifei Huang, Yimin Guo, Ryosuke Shibasaki |
4204 |
Learning to Localize Actions from Moments
|
Fuchen Long, Ting Yao, Zhaofan Qiu, Xinmei Tian, Jiebo Luo, Tao Mei |
4127 |
Motion Capture from Internet Videos
|
Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao |
3274 |
Multitask Learning Strengthens Adversarial Robustness
|
Chengzhi Mao, Amogh Gupta, Vikram Nitin, Baishakhi Ray, Shuran Song, Junfeng Yang, Carl Vondrick |
4111 |
Prototype Rectification for Few-Shot Learning
|
Jinlu Liu, Liang Song, Yongqiang Qin |
4029 |
Segment as Points for Efficient Online Multi-Object Tracking and Segmentation
|
Zhenbo Xu, Wei Zhang, Xiao Tan, Wei Yang, Huan Huang, Shilei Wen, Errui Ding, Liusheng Huang |
3287 |
V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction
|
Tsun-Hsuan Wang, Sivabalan Manivasagam, Ming Liang, Bin Yang, Wenyuan Zeng, Raquel Urtasun |
3227 |
A Unified Framework of Surrogate Loss by Refactoring and Interpolation
|
Lanlan Liu, Mingzhe Wang, Jia Deng |
3296 |
SoundSpaces: Audio-Visual Navigation in 3D Environments
|
Changan Chen, Unnat Jain, Carl Schissler, Sebastia Vicenc Amengual Gari, Ziad Al-Halah, Vamsi Krishna Ithapu, Philip Robinson, and Kristen Grauman |
3266 |
BLSM: A Bone-Level Skinned Model of the Human Mesh
|
Haoyang Wang, Riza Alp G"uler, Iasonas Kokkinos, George Papandreou, Stefanos Zafeiriou |
4161 |
Bounding-box Channels for Visual Relationship Detection
|
Sho Inayoshi, Keita Otani, Antonio Tejero-de-Pablos, Tatsuya Harada |
4086 |
CPGAN: Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis
|
Jiadong Liang, Wenjie Pei, Feng Lu |
4154 |
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
|
Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung, Gunhee Kim |
4043 |
Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis
|
Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot |
4140 |
Collaborative Video Object Segmentation by Foreground-Background Integration
|
Zongxin Yang, Yunchao Wei, Yi Yang |
3271 |
Contact and Human Dynamics from Monocular Video
|
Davis Rempe, Leonidas J. Guibas, Aaron Hertzmann, Bryan Russell, Ruben Villegas, Jimei Yang |
4024 |
DSA: More Efficient Budgeted Pruning via Differentiable Sparsity Allocation
|
Xuefei Ning, Tianchen Zhao, Wenshuo Li, Peng Lei, Yu Wang, Huazhong Yang |
4105 |
Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking
|
Jianfeng Yan, Zizhuang Wei, Hongwei Yi, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai |
4173 |
Filter Style Transfer between Photos
|
Jonghwa Yim, Jisung Yoo, Won-joon Do, Beomsu Kim, Jihwan Choe |
3254 |
Generative Sparse Detection Networks for 3D Single-shot Object Detection
|
JunYoung Gwak, Christopher Choy, Silvio Savarese |
4171 |
Guided Deep Decoder: Unsupervised Image Pair Fusion
|
Tatsumi Uezato, Danfeng Hong, Naoto Yokoya, Wei He |
4046 |
Human Interaction Learning on 3D Skeleton Point Clouds for Video Violence Recognition
|
Yukun Su, Guosheng Lin, Jinhui Zhu, Qingyao Wu |
4165 |
Invertible Neural BRDF for Object Inverse Rendering
|
Zhe Chen, Shohei Nobuhara, Ko Nishino |
4010 |
Learning Delicate Local Representations for Multi-Person Pose Estimation
|
Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, Xiangyu Zhang, Xinyu Zhou, Erjin Zhou, Jian Sun |
4123 |
Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification
|
Liuyu Xiang, Guiguang Ding, Jungong Han |
4013 |
Learning Open Set Network with Discriminative Reciprocal Points
|
Guangyao Chen, Limeng Qiao, Yemin Shi, Peixi Peng, Jia Li, Tiejun Huang, Shiliang Pu, Yonghong Tian |
3262 |
Learning to Factorize and Relight a City
|
Andrew Liu, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros, Noah Snavely |
3234 |
Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry
|
He Chen, Pengfei Guo, Pengfei Li, Gim Hee Lee, Gregory Chirikjian |
4078 |
Negative Margin Matters: Understanding Margin in Few-shot Classification
|
Bin Liu, Yue Cao, Yutong Lin, Qi Li, Zheng Zhang, Mingsheng Long, Han Hu |
3281 |
Occupancy Anticipation for Efficient Exploration and Navigation
|
Santhosh K. Ramakrishnan, Ziad Al-Halah, Kristen Grauman |
4209 |
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
|
Eunhyeok Park, Sungjoo Yoo |
4079 |
Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References
|
Ruizheng Wu, Xin Tao, Yingcong Chen, Xiaoyong Shen, Jiaya Jia |
4192 |
Photon-Efficient 3D Imaging with A Non-Local Neural Network
|
Jiayong Peng, Zhiwei Xiong, Xin Huang, Zheng-Ping Li, Dong Liu, Feihu Xu |
4167 |
Practical Deep Raw Image Denoising on Mobile Devices
|
Yuzhi Wang, Haibin Huang, Qin Xu, Jiaming Liu, Yiqun Liu, Jue Wang |
3273 |
Reconstructing NBA Players
|
Luyang Zhu, Konstantinos Rematas, Brian Curless, Steven M. Seitz, Ira Kemelmacher-Shlizerman |
4054 |
RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera
|
Zhuo Su, Lan Xu, Zerong Zheng, Tao Yu, Yebin Liu, Lu Fang |
3256 |
Rotationally-Temporally Consistent Novel View Synthesis of Human Performance Video
|
Youngjoong Kwon, Stefano Petrangeli, Dahun Kim, Haoliang Wang, Eunbyung Park, Viswanathan Swaminathan, Henry Fuchs |
4077 |
SF-Net: Single-Frame Supervision for Temporal Action Localization
|
Fan Ma, Linchao Zhu, Yi Yang, Shengxin Zha, Gourab Kundu, Matt Feiszli, Zheng Shou |
4210 |
SODA: Story Oriented Dense Video Captioning Evaluation Framework
|
Soichiro Fujita, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata |
4109 |
Short-Term and Long-Term Context Aggregation Network for Video Inpainting
|
Ang Li, Shanshan Zhao, Xingjun Ma, Mingming Gong, Jianzhong Qi, Rui Zhang, Dacheng Tao, Ramamohanarao Kotagiri |
4074 |
Side-Aware Boundary Localization for More Precise Object Detection
|
Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin |
4182 |
Efficient Spatio-Temporal Recurrent Neural Network for Video Deblurring
|
Zhihang Zhong, Ye Gao, Yinqiang Zheng, Bo Zheng |
3316 |
The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
|
William Peebles, John Peebles, Jun-Yan Zhu, Alexei Efros, Antonio Torralba |
4033 |
Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach
|
Zerui Chen, Yan Huang, Hongyuan Yu, Bin Xue, Ke Han, Yiru Guo, Liang Wang |
3260 |
Tracking Objects as Points
|
Xingyi Zhou, Vladlen Koltun, Philipp Kr"ahenb"uhl |
4198 |
3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection
|
Jin Hyeok Yoo, Yecheol Kim, Jisong Kim, Jun Won Choi |
3305 |
A Broader Study of Cross-Domain Few-Shot Learning
|
Yunhui Guo, Noel C. Codella, Leonid Karlinsky, James V. Codella, John R. Smith, Kate Saenko, Tajana Rosing, Rogerio Feris |
4019 |
Sequential Convolution and Runge-Kutta Residual Architecture for Image Compressed Sensing\thanks{This research is supported by National Natural Science Foundation of China grant No.61772232.
|
Runkai Zheng, Yinqi Zhang, Daolang Huang, Qingliang Chen |
4193 |
AE-OT-GAN: Training GANs from data specific latent distribution
|
Dongsheng An, Yang Guo, Min Zhang, Xin Qi, Na Lei, Xianfang Gu |
3282 |
APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection
|
A. Braunegg, Amartya Chakraborty, Michael Krumdick, Nicole Lape, mboxSara Leary, Keith Manville, Elizabeth Merkhofer, Laura Strickhart, Matthew Walmer |
4117 |
Accurate RGB-D Salient Object Detection via Collaborative Learning
|
Wei Ji, Jingjing Li, Miao Zhang, Yongri Piao, Huchuan Lu |
4130 |
Accurate Polarimetric BRDF for Real Polarization Scene Rendering
|
Yuhi Kondo, Taishi Ono, Legong Sun, Yasutaka Hirasawa, Jun Murayama |
4132 |
Acquiring Dynamic Light Fields through Coded Aperture Camera
|
Kohei Sakai, Keita Takahashi, Toshiaki Fujii, Hajime Nagahara |
3279 |
Active Crowd Counting with Limited Supervision
|
Zhen Zhao, Miaojing Shi, Xiaoxiao Zhao, Li Li |
4156 |
Active Visual Information Gathering for Vision-Language Navigation
|
small Hanqing Wang, Letter Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen |
4042 |
Adapting Object Detectors with Conditional Domain Normalization
|
Peng Su, Kun Wang, Xingyu Zeng, Shixiang Tang, Dapeng Chen, Di Qiu, Xiaogang Wang |
3245 |
Adversarial Continual Learning
|
Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach |
4075 |
Adversarial Ranking Attack and Defense
|
Mo Zhou, Zhenxing Niu, Le Wang, Qilin Zhang, Gang Hua |
4001 |
Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition
|
Chenyang Si, Xuecheng Nie, Wei Wang, Liang Wang, Tieniu Tan, Jiashi Feng |
4050 |
An Analysis of Sketched IRLS for Accelerated Sparse Residual Regression
|
Daichi Iwata, Michael Waechter, Wen-Yan Lin, Yasuyuki Matsushita |
4176 |
An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension
|
Liangcheng Li, Feiyu Gao, Jiajun Bu, Yongpan Wang, Zhi Yu, Qi Zheng |
3291 |
Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images
|
Haomin Chen, Yirui Wang, Kang Zheng, Weijian Li, Chi-Tung Chang, Adam P. Harrison, Jing Xiao, Gregory D. Hager, Le Lu, Chien-Hung Liao, Shun Miao |
3258 |
Associative3D: Volumetric Reconstruction from Sparse Views
|
Shengyi Qian, Linyi Jin, David F. Fouhey |
4076 |
Asynchronous Interaction Aggregation for Action Detection
|
Jiajun Tang, Jin Xia, Xinzhi Mu, Bo Pang, Cewu Lu |
4202 |
Attentive Prototype Few-shot Learning with Capsule Network-based Embedding
|
Fangyu Wu, Jeremy S.Smith, Wenjin Lu, Chaoyi Pang, Bailing Zhang |
4072 |
Attract, Perturb, and Explore: Learning a Feature Alignment Network for Semi-supervised Domain Adaptation
|
Taekyung Kim, Changick Kim |
4061 |
AutoTrajectory: Label-free Trajectory Extraction and Prediction from Videos using Dynamic Points
|
Yuexin Ma, Xinge Zhu, Xinjing Cheng, Ruigang Yang, Jiming Liu, Dinesh Manocha |
3297 |
BIRNAT: Bidirectional Recurrent Neural Networks with Adversarial Training for Video Snapshot Compressive Imaging
|
Ziheng Cheng, Ruiying Lu, Zhengjue Wang, Hao Zhang, Bo Chen, Ziyi Meng, Xin Yuan |
3241 |
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
|
Samuel Albanie, G"ul Varol, Liliane Momeni, Triantafyllos Afouras, Joon Son Chung, Neil Fox, Andrew Zisserman |
4012 |
Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction
|
Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li |
4021 |
Blind Face Restoration via Deep Multi-scale Component Dictionaries
|
Xiaoming Li, Chaofeng Chen, Shangchen Zhou, Xianhui Lin, Wangmeng Zuo, Lei Zhang |
4081 |
Boosting Decision-based Black-box Adversarial Attacks with Random Sign Flip
|
Weilun Chen, Zhaoxiang Zhang, Xiaolin Hu, Baoyuan Wu |
4014 |
Bottom-Up Temporal Action Localization with Mutual Regularization
|
Peisen Zhao, Lingxi Xie, Chen Ju, Ya Zhang, Yanfeng Wang, Qi Tian |
4174 |
Boundary-Aware Cascade Networks for Temporal Action Segmentation
|
Zhenzhi Wang, Ziteng Gao, Limin Wang, Zhifeng Li, Gangshan Wu |
4073 |
Boundary-preserving Mask R-CNN
|
Tianheng Cheng, Xinggang Wang, Lichao Huang, Wenyu Liu |
3269 |
Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation
|
Jeevan Devaranjan, Amlan Kar, Sanja Fidler |
4023 |
BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
|
Yonghyun Kim, Wonpyo Park, Jongju Shin |
4213 |
ByeGlassesGAN: Identity Preserving Eyeglasses Removal for Face Images
|
Yu-Hui Lee, Shang-Hong Lai |
4100 |
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization
|
Yuxi Li, Weiyao Lin, John See, Ning Xu Shugong Xu, Ke Yan, Cong Yang |
3267 |
Captioning Images Taken by People Who Are Blind
|
Danna Gurari, Yinan Zhao, Meng Zhang, Nilavra Bhattacharya |
4048 |
Cascade Graph Neural Networks for RGB-D Salient Object Detection
|
Ao Luo, Xin Li, Fan Yang, Zhicheng Jiao, Hong Cheng, Siwei Lyu |
4011 |
Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection
|
Dongzhan Zhou, Xinchi Zhou, Hongwen Zhang, Shuai Yi, Wanli Ouyang |
4083 |
Clustering Driven Deep Autoencoder for Video Anomaly Detection
|
Yunpeng Chang, Zhigang Tu, Wei Xie, Junsong Yuan |
4118 |
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection
|
Ganlong Zhao, Guanbin Li, Ruijia Xu, Liang Lin |
4003 |
Colorization of Depth Map via Disentanglement
|
Chung-Sheng Lai, Zunzhi You, Ching-Chun Huang, Yi-Hsuan Tsai, Wei-Chen Chiu |
4008 |
Component Divide-and-Conquer for Real-World Image Super-Resolution
|
Pengxu Wei, Ziwei Xie, Hannan Lu, Zongyuan Zhan, Qixiang Ye, Wangmeng Zuo, Liang Lin |
3255 |
Comprehensive Image Captioning via Scene Graph Decomposition
|
Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li |
3268 |
Conditional Entropy Coding for Efficient Video Compression
|
Jerry Liu, Shenlong Wang, Wei-Chiu Ma, Meet Shah, Rui Hu, Pranaab Dhawan, Raquel Urtasun |
4022 |
Conditional Image Repainting via Semantic Bridge and Piecewise Value Function
|
Shuchen Weng, Wenbo Li, Dawei Li, Hongxia Jin, Boxin Shi |
4062 |
Conditional Sequential Modulation for Efficient Global Image Retouching
|
Jingwen He, Yihao Liu, Yu Qiao, Chao Dong |
4162 |
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching
|
Haoran Wang, Ying Zhang, Zhong Ji, Yanwei Pang, Lin Ma |
4002 |
Controllable Image Synthesis via SegVAE
|
Yen-Chi Cheng, Hsin-Ying Lee, Min Sun, Ming-Hsuan Yang |
3265 |
Count- and Similarity-aware R-CNN for Pedestrian Detection
|
Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Mubarak Shah |
4168 |
Cross-Identity Motion Transfer for Arbitrary Objects through Pose-Attentive Video Reassembling
|
Subin Jeon, Seonghyeon Nam, Seoung Wug Oh, Seon Joo Kim |
4164 |
Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition
|
Di Hu, Xuhong Li, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, Dejing Dou |
3229 |
Curriculum DeepSDF
|
Yueqi Duan, Haidong Zhu, He Wang, Li Yi Ram Nevatia, Leonidas J. Guibas |
4091 |
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending
|
Hang Xu, Shaoju Wang, Xinyue Cai, Wei Zhang, Xiaodan Liang, Zhenguo Li |
4203 |
DA4AD: End-to-End Deep Attention-based Visual Localization for Autonomous Driving
|
Yao Zhou, Guowei Wan, Shenhua Hou, Li Yu, Gang Wang, Xiaofei Rui, Shiyu Song |
3249 |
DRG: Dual Relation Graph for Human-Object Interaction Detection
|
Chen Gao, Jiarui Xu, Yuliang Zou, Jia-Bin Huang |
4147 |
DVI: Depth Guided Video Inpainting for Autonomous Driving
|
Miao Liao, Feixiang Lu, Dingfu Zhou, Sibo Zhang, Wei Li, Ruigang Yang |
4016 |
Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification
|
Guangyi Chen, Yuhao Lu, Jiwen Lu, Jie Zhou |
4020 |
Deep Hough Transform for Semantic Line Detection
|
Qi Han, Kai Zhao, Jun Xu, Ming-Ming Cheng |
4126 |
Deep Learning-based Pupil Center Detection for Fast and Accurate Eye Tracking System
|
Kang Il Lee, Jung Ho Jeon, Byung Cheol Song |
3253 |
Deep Multi Depth Panoramas for View Synthesis
|
Kai-En Lin, Zexiang Xu, Ben Mildenhall, Pratul P. Srinivasan, Yannick Hold-Geoffroy, Stephen DiVerdi, Qi Sun, Kalyan Sunkavalli, Ravi Ramamoorthi |
3261 |
Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches
|
Shuai Yang, Zhangyang Wang, Jiaying Liu, Zongming Guo |
4028 |
Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis
|
Ruixuan Yu, Xin Wei, Federico Tombari, Jian Sun |
4099 |
Deep Reinforced Attention Learning for Quality-Aware Visual Recognition
|
Duo Li, Qifeng Chen |
4037 |
Deep Space-Time Video Upsampling Networks
|
Jaeyeon Kang, Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim |
4009 |
Deep near-light photometric stereo for spatially varying reflectances
|
Hiroaki Santo, Michael Waechter, Yasuyuki Matsushita |
4065 |
Defocus Blur Detection via Depth Distillation
|
Xiaodong Cun, Chi-Man Pun |
4148 |
Dense RepPoints: Representing Visual Objects with Dense Point Sets
|
Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang Raquel Urtasun, Liwei Wang, Stephen Lin, Han Hu |
4181 |
Detail Preserved Point Cloud Completion via Separated Feature Aggregation
|
Wenxiao Zhang, Qingan Yan, Chunxia Xiao |
4114 |
Differentiable Feature Aggregation Search for Knowledge Distillation
|
Yushuo Guan, Pengyu Zhao, Bingxuan Wang, Yuanxing Zhang, Cong Yao, Kaigui Bian, Jian Tang |
3311 |
Differentiable Joint Pruning and Quantization for Hardware Efficiency
|
Ying Wang, Yadong Lu, Tijmen Blankevoort |
4025 |
Discriminability Distillation in Group Representation Learning
|
Manyuan Zhang, Guanglu Song, Hang Zhou, Yu Liu |
4212 |
Distance-Normalized Unified Representation for Monocular 3D Object Detection
|
Xuepeng Shi, Zhixiang Chen, Tae-Kyun Kim |
4015 |
Domain-Specific Mappings for Generative Adversarial Style Transfer
|
Hsin-Yu Chang, Zhixiang Wang, Yung-Yu Chuang |
4172 |
Dual Adversarial Network for Deep Active Learning
|
Shuo Wang, Yuexiang Li, Kai Ma, Ruhui Ma, Haibing Guan, Yefeng Zheng |
4143 |
Dual Refinement Underwater Object Detection Network
|
Baojie Fan, Wei Chen, Yang Cong, Jiandong Tian |
4107 |
Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification
|
Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo |
3284 |
EGDCL: An Adaptive Curriculum Learning Framework for Unbiased Glaucoma Diagnosis
|
Rongchang Zhao, Xuanlin Chen, Zailiang Chen, Shuo Li |
4096 |
Early Exit Or Not: Resource-Efficient Blind Quality Enhancement for Compressed Images
|
Qunliang Xing, Mai Xu, Tianyi Li, Zhenyu Guan |
4047 |
Edge-aware Graph Representation Learning and Reasoning for Face Parsing
|
Gusi Te, Yinglu Liu, Wei Hu, Hailin Shi, Tao Mei |
3294 |
Efficient Scale-Permuted Backbone with Learned Resource Distribution
|
Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Yin Cui Mingxing Tan, Quoc Le, Xiaodan Song |
4031 |
Efficient Semantic Video Segmentation with Per-frame Inference
|
Yifan Liu, Chunhua Shen, Changqian Yu, Jingdong Wang |
3289 |
End-to-End Low Cost Compressive Spectral Imaging with Spatial-Spectral Self-Attention
|
Ziyi Meng, Jiawei Ma, Xin Yuan |
4055 |
Event Enhanced High-Quality Image Recovery
|
Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang |
3244 |
Explainable Face Recognition
|
Jonathan R. Williford, Brandon B. May, Jeffrey Byrne |
3303 |
Extending and Analyzing Self-Supervised Learning Across Domains
|
Bram Wallace, Bharath Hariharan |
4214 |
Extract and Merge: Superpixel Segmentation with Regional Attributes
|
Jianqiao An, Yucheng Shi, Yahong Han, Meijun Sun, Qi Tian |
4134 |
Representation Sharing for Fast Object Detector Search and Beyond
|
Yujie Zhong, Zelu Deng, Sheng Guo, Matthew R. Scott, Weilin Huang |
4186 |
A universal framework for training low-bit DNNs via Feature Transfer
|
Kunyuan Du, Ya Zhang, Haibing Guan, Qi Tian, Shenggan Cheng, James Lin |
3250 |
Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards
|
Xuewen Yang, Heming Zhang, Di Jin, Yingru Liu, Chi-Hao Wu, Jianchao Tan, Dongliang Xie, Jue Wang, Xin Wang |
4026 |
Federated Visual Classification with Real-World Data Distribution
|
Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown |
4185 |
Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors
|
Mateusz Michalkiewicz, Sarah Parisot, Stavros Tsogkas, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky |
4137 |
Few-shot Compositional Font Generation with Dual Memory
|
Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee |
4071 |
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning
|
Xiangxi Shi, Xu Yang, Jiuxiang Gu, Shafiq Joty, Jianfei Cai |
4041 |
Funnel Activation for Visual Recognition
|
Ningning Ma, Xiangyu Zhang, Jian Sun |
3300 |
GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering
|
Chuang Niu, Jun Zhang, Ge Wang, Jimin Liang |
4103 |
GINet: Graph Interaction Network for Scene Parsing
|
Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, MingWu, Zhanyu Ma, Guodong Guo |
3246 |
G-LBM:Generative Low-dimensional Background Model Estimation from Video Sequences
|
Behnaz Rezaei, Amirreza Farnoosh, Sarah Ostadabbas |
4088 |
GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-aware Supervision
|
Lei Ke, Shichao Li, Yanan Sun, Yu-Wing Tai, Chi-Keung Tang |
4052 |
Generating Handwriting via Decoupled Style Descriptors
|
Atsunobu Kotani, Stefanie Tellex, James Tompkin |
4006 |
Global Distance-distributions Separation for Unsupervised Person Re-identification
|
Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen |
4179 |
Global and Local Enhancement Networks for Paired and Unpaired Image Enhancement
|
Han-Ul Kim, Young Jun Koh, Chang-Su Kim |
4188 |
Globally-Optimal Event Camera Motion Estimation
|
Xin Peng, Yifu Wang, Ling Gao, Laurent Kneip |
4101 |
Guessing State Tracking for Visual Dialogue
|
Wei Pang, Xiaojie Wang |
4059 |
Guided Collaborative Training for Pixel-wise Semi-Supervised Learning
|
Zhanghan Ke, Di Qiu, Kaican Li, Qiong Yan, Rynson W.H. Lau |
4207 |
Guided Saliency Feature Learning for Person Re-identification in Crowded Scenes
|
Lingxiao He, Wu Liu |
3247 |
H3DNet: 3D Object Detection Using Hybrid Geometric Primitives
|
Zaiwei Zhang, Bo Sun, Haitao Yang, Qixing Huang |
4146 |
Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language
|
Shaoxiang Chen, Yu-Gang Jiang |
4125 |
High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling
|
Yu Zeng, Zhe Lin, Jimei Yang, Jianming Zhang, Eli Shechtman, Huchuan Lu |
4000 |
Highly Efficient Salient Object Detection with 100K Parameters
|
Shang-Hua Gao, Yong-Qiang Tan, Ming-Ming Cheng, Chengze Lu, Yunpeng Chen, Shuicheng Yan |
4005 |
How Can I See My Future? FvTraj: Using First-person View for Pedestrian Trajectory Prediction
|
Huikun Bi, Ruisi Zhang, Tianlu Mao, Zhigang Deng, Zhaoqi Wang |
3263 |
How does Lipschitz Regularization Influence GAN Training?
|
Yipeng Qin, Niloy Mitra, Peter Wonka |
4196 |
Reducing the Sim-to-Real Gap for Event Cameras
|
Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony |
4149 |
Identity-Aware Multi-Sentence Video Description
|
Jae Sung Park, Trevor Darrell, Anna Rohrbach |
3313 |
Imaging Behind Occluders Using Two-Bounce Light
|
Connor Henley, Tomohiro Maeda, Tristan Swedish, Ramesh Raskar |
3230 |
Improved Adversarial Training via Learned Optimizer
|
Yuanhao Xiong, Cho-Jui Hsieh |
4124 |
Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems
|
Kailai Zhou, Linsen Chen, Xun Cao |
4177 |
Improving Query Efficiency of Black-box Adversarial Attack
|
Yang Bai, Yuyuan Zeng, Yong Jiang, Yisen Wang, Shu-Tao Xia, Weiwei Guo |
4158 |
Improving the Transferability of Adversarial Examples with Resized-Diverse-Inputs, Diversity-Ensemble and Region Fitting
|
Junhua Zou, Zhisong Pan, Junyang Qiu, Xin Liu, Ting Rui, Wei Li |
3314 |
Info3D: Representation Learning on 3D Objects using Mutual Information Maximization and Contrastive Learning
|
Aditya Sanghi |
3237 |
InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling
|
Jun Wang, Shiyi Lan, Mingfei Gao, Larry S. Davis |
4110 |
Interactive Video Object Segmentation Using Global and Local Transfer Modules
|
Yuk Heo, Yeong Jun Koh, Chang-Su Kim |
4201 |
Interpretable Foreground Object Search As Knowledge Distillation
|
Boren Li, Po-Yu Zhuang, Jian Gu, Mingyang Li, Ping Tan |
3264 |
Invertible Zero-Shot Recognition Flows
|
Yuming Shen, Jie Qin, Lei Huang, Li Liu, Fan Zhu, Ling Shao |
3278 |
Iterative Feature Transformation for Fast and Versatile Universal Style Transfer
|
Tai-Yin Chiu, Danna Gurari |
3252 |
JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans
|
Fengze Liu, Jinzheng Cai, Yuankai Huo, Chi-Tung Cheng, Ashwin Raju, Dakai Jin, Jing Xiao, Alan Yuille, Le Lu, ChienHung Liao, Adam P. Harrison |
4119 |
Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions
|
Noa Garcia, Yuta Nakashima |
3270 |
A Generic Visualization Approach for Convolutional Neural Networks
|
Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis |
4155 |
Privacy Preserving Visual SLAM
|
Mikiya Shibuya, Shinya Sumikura, Ken Sakurada |
4053 |
LEED: Label-Free Expression Editing via Disentanglement
|
Rongliang Wu, Shijian Lu |
4120 |
LIRA: Lifelong Image Restoration from Unknown Blended Distortions
|
Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen |
4035 |
LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform
|
Lida Li, Kun Wang, Shuai Li, Xiangchu Feng, Lei Zhang |
3310 |
Label-similarity Curriculum Learning
|
"Ur"un Dogan, Aniket Anand Deshmukh, Marcin Bronislaw Machura, Christian~Igel |
4224 |
Learning 3D Part Assembly from a Single Image
|
Yichen Li, Kaichun Mo, Lin Shao, Minhyuk Sung, Leonidas Guibas |
4152 |
Learning Connectivity of Neural Networks from a Topological Perspective
|
Kun Yuan, Quanquan Li, Jing Shao, Junjie Yan |
4195 |
Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation
|
Mingmin Zhen, Shiwei Li, Lei Zhou, Jiaxiang Shang, Haoan Feng, Tian Fang, Long Quan |
4157 |
Learning Memory Augmented Cascading Network for Compressed Sensing of Images
|
Jiwei Chen, Yubao Sun, Qingshan Liu, Rui Huang |
4112 |
Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection
|
Jing Zhang, Jianwen Xie, Nick Barnes |
4153 |
Ocean: Object-aware Anchor-free Tracking
|
Zhipeng Zhang hrefhttps://houwenpeng.com/index.htmltextcolorblackHouwen Peng Jianlong Fu Bing Li, Weiming Hu |
3280 |
Learning Propagation Rules for Attribution Map Generation
|
Yiding Yang, Jiayan Qiu, Mingli Song, Dacheng Tao, Xinchao Wang |
4056 |
Learning Semantic Neural Tree for Human Parsing
|
Ruyi Ji, Dawei Du, Libo Zhang, Longyin Wen, Yanjun Wu, Chen Zhao, Feiyue Huang, Siwei Lyu |
4093 |
Learning Where to Focus for Efficient Video Object Detection
|
Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan |
3243 |
Learning to Count in the Crowd from Limited Labeled Data
|
Vishwanath A. Sindagi, Rajeev Yasarla, Deepak Sam Babu, R. Venkatesh Babu, Vishal M. Patel |
4066 |
Learning with Noisy Class Labels for Instance Segmentation
|
Longrong Yang, Fanman Meng, Hongliang Li, Qingbo Wu, Qishang Cheng |
4169 |
Learning with Privileged Information for Efficient Image Super-Resolution
|
Wonkyung Lee, Junghyup Lee, Dohyung Kim, Bumsub Ham |
4064 |
Length-Controllable Image Captioning
|
Chaorui Deng, Ning Ding, Mingkui Tan, Qi Wu |
3293 |
LevelSet R-CNN: A Deep Variational Method for Instance Segmentation
|
Namdar Homayounfar quad Yuwen Xiong quad Justin Liang quad Wei-Chiu Ma quad Raquel Urtasun smalltextttnamdar,yuwen,justin.liang,weichiu,urtasun@uber.com |
4082 |
Matching Guided Distillation
|
Kaiyu Yue, Jiangfan Deng, Feng Zhou |
4136 |
Meta-Learning with Network Pruning
|
Hongduan Tian, Bo Liu, Xiao-Tong Yuan, Qingshan Liu |
4151 |
Mining Inter-Video Proposal Relations for Video Object Detection
|
Mingfei Han, Yali Wang, Xiaojun Chang, Yu Qiao |
4034 |
Modeling 3D Shapes by Reinforcement Learning
|
Cheng Lin, Tingxiang Fan, Wenping Wang, Matthias Niess ner |
3317 |
Modeling the Space of Point Landmark Constrained Diffeomorphisms
|
Chengfeng Wen, Yang Guo, Xianfeng Gu |
4018 |
Monocular 3D Object Detection via Feature Domain Adaptation
|
Lele Chen, Guofeng Cui, Celong Liu, Zhong Li, Ziyi Kou, Yi Xu, Chenliang Xu |
4142 |
Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior
|
Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang |
4097 |
MotionSqueeze: Neural Motion Feature Learning for Video Understanding
|
Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho |
4030 |
MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
|
Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia |
4116 |
Multi-Loss Rebalancing Algorithm for Monocular Depth Estimation
|
Jae-Han Lee, Chang-Su Kim |
4098 |
Multi-Scale Positive Sample Refinement for Few-Shot Object Detection
|
Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang |
4166 |
NeuRoRA: Neural Robust Rotation Averaging
|
Pulak Purkait, Tat-Jun Chin, Ian Reid |
4049 |
Nighttime Defogging Using High-Low Frequency Decomposition and Grayscale-Color Networks
|
Wending Yan, Robby T. Tan, Dengxin Dai |
3308 |
NoiseRank: Unsupervised Label Noise Reduction with Dependence Models
|
Karishma Sharma, Pinar Donmez, Enming Luo, Yan Liu, I. Zeki Yalniz |
4200 |
Novel View Synthesis on Unpaired Data by Conditional Deformable Variational Auto-Encoder
|
Mingyu Yin, Li Sun, Qingli Li |
4184 |
OID: Outlier Identifying and Discarding in Blind Image Deblurring
|
Liang Chen, Faming Fang, Jiawei Zhang, Jun Liu, Guixu Zhang |
3283 |
Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots
|
Qi Chen, Lin Sun, Zhixin Wang, Kui Jia, Alan Yuille |
4144 |
Occlusion-Aware Siamese Network for Human Pose Estimation
|
Lu Zhou, Yingying Chen, Yunze Gao, Jinqiao Wang, Hanqing Lu |
3312 |
On Transferability of Histological Tissue Labels in Computational Pathology
|
Mahdi S. Hosseini, Lyndon Chan, Weimin Huang, Yichen Wang, Danial Hasan, Corwyn Rowsell, Savvas Damaskinos, Konstantinos N. Plataniotis |
3276 |
Online Ensemble Model Compression using Knowledge Distillation
|
Devesh Walawalkar, Zhiqiang Shen, Marios Savvides |
4038 |
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
|
Xihui Liu, Zhe Lin, Jianming Zhang, Handong Zhao, Quan Tran, Xiaogang Wang, Hongsheng Li |
4115 |
Open-set Adversarial Defense
|
Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel |
3301 |
PL$_1$P - Point-line Minimal Problems under Partial Visibility in Three Views
|
Timothy Duff, Kathlén Kohn, Anton Leykin, Tomas Pajdla |
4138 |
PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling
|
Yue Qian, Junhui Hou, Sam Kwong, Ying He |
3298 |
Pairwise Similarity Knowledge Transfer for Weakly Supervised Object Localization
|
Amir Rahimi, Amirreza Shaban, Thalaiyasingam Ajanthan, Richard Hartley, Byron Boots |
4215 |
Physics-based Feature Dehazing Networks
|
Jiangxin Dong, Jinshan Pan |
4150 |
Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation
|
Mengyao Zhai, Lei Chen, Jiawei He, Megha Nawhal, Frederick Tung, Greg Mori |
3272 |
Pix2Surf: Learning Parametric 3D Surface Models of Objects from Images
|
Jiahui Lei, Srinath Sridhar, Paul Guerrero, Minhyuk Sung, Niloy Mitra, Leonidas J.~Guibas |
4122 |
Polynomial Regression Network for Variable-Number Lane Detection
|
Bingke Wang, Zilei Wang, Yixin Zhang |
4139 |
Polysemy Deciphering Network for Human-Object Interaction Detection
|
Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao |
3290 |
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
|
Ren Wang, Gaoyuan Zhang, Sijia Liu, Pin-Yu Chen, Jinjun Xiong, Meng Wang |
4189 |
Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification
|
Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan |
4040 |
Procedure Planning in Instructional Videos
|
Chien-Yi Chang, De-An Huang, Danfei Xu, Ehsan Adeli, Li Fei-Fei, Juan Carlos Niebles |
4211 |
Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations
|
Sungheon Park, Minsik Lee, Nojun Kwak |
4159 |
Progressive Refinement Network for Occluded Pedestrian Detection
|
Xiaolin Song hfill Kaili Zhao hfill Wen-Sheng Chu hfill Honggang Zhang hfill Jun Guo hfill |
4135 |
Propagating Over Phrase Relations for One-Stage Visual Grounding
|
Sibei Yang, Guanbin Li, Yizhou Yu |
4190 |
RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax
|
Xiao Zhang, Rui Zhao, Yu Qiao, Hongsheng Li |
3257 |
ReDro: Efficiently Learning Large-sized SPD Visual Representation
|
Saimunur Rahman, Lei Wang, Changming Sun, Luping Zhou |
4027 |
Reflection Backdoor: A Natural Backdoor Attack on Deep Neural Networks
|
Yunfei Liu, Xingjun Ma, James Bailey, Feng Lu |
3231 |
Regression of Instance Boundary by Aggregated CNN and GCN
|
Yanda Meng, Wei Meng, Dongxu Gao, Yitian Zhao, Xiaoyun Yang, Xiaowei Huang, Yalin Zheng |
4089 |
Resolution Switchable Networks for Runtime Efficient Image Recognition
|
Yikai Wang, Fuchun Sun, Duo Li, Anbang Yao |
4113 |
Rethinking Image Deraining via Rain Streaks and Vapors
|
Yinglong Wang, Yibing Song, Chao Ma, Bing Zeng |
4057 |
Rethinking Pseudo-LiDAR Representation
|
Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang |
3232 |
RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval
|
Hung-Yu Tseng, Hsin-Ying Lee, Lu Jiang, Ming-Hsuan Yang, Weilong Yang |
4180 |
Context-Aware RCNN: A Baseline for Action Detection in Videos
|
Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu |
3226 |
Image Stitching and Rectification for Hand-Held Cameras
|
Bingbing Zhuang, Quoc-Huy Tran |
3315 |
S3Net: Semantic-Aware Self-supervised Depth Estimation with Monocular Videos and Synthetic Data
|
Bin Cheng, Inderjot Singh Saggu, Raunak Shah, Gaurav Bansal, Dinesh Bharadia |
4160 |
SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-based Symptom Relation Embedding
|
Sangmin Lee, Jung Uk Kim, Hak Gu Kim, Seongyeop Kim, Yong Man Ro |
4090 |
SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation
|
Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou |
3307 |
SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction
|
Sriram N N, Buyu Liu, Francesco Pittaluga, Manmohan Chandraker |
4217 |
SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection
|
Rui Fan, Hengli Wang, Peide Cai, Ming Liu |
4121 |
SOLO: Segmenting Objects by Locations
|
Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang, Lei Li |
4178 |
SPARK: Spatial-aware Online Incremental Attack Against Visual Tracking
|
Qing Guo, Xiaofei Xie, Felix Juefei-Xu, Lei Ma, Zhongguo Li, Wanli Xue, Wei Feng, Yang Liu |
4183 |
SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds
|
Xinge Zhu~~~~~ Yuexin Ma~~~~~ Tai Wang~~~~~ Yan Xu Jianping Shi~~~~~ Dahua Lin |
4036 |
Scene Text Image Super-resolution in the wild
|
Wenjia Wang, Enze Xie, Xuebo Liu, Wenhai Wang, Ding Liang, Chunhua Shen, Xiang Bai |
4063 |
Segmenting Transparent Objects in the Wild
|
Enze Xie, Wenjia Wang, Wenhai Wang, Mingyu Ding, Chunhua Shen, Ping Luo |
3240 |
Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification
|
Nikita Dvornik, Cordelia Schmid, Julien Mairal |
4216 |
Self-Paced Deep Regression Forests with Consideration on Underrepresented Examples
|
Lili Pan, Shijie Ai, Yazhou Ren, Zenglin Xu |
3288 |
Self-Prediction for Joint Instance and Semantic Segmentation of Point Clouds
|
Jinxian Liu, Minghui Yu, Bingbing Niprotectfootnotemark[4], Ye Chen |
4067 |
Self-supervised Motion Representation via Scattering Local Motion Cues
|
Yuan Tian, Zhaohui Che, Wenbo Bao, Guangtao Zhai, Zhiyong Gao |
4141 |
Semantic Line Detection Using Mirror Attention and Comparative Ranking and Matching
|
Dongkwon Jin, Jun-Tae Lee, Chang-Su Kim |
3306 |
SemifreddoNets: Partially Frozen Neural Networks for~Efficient~Computer~Vision~Systems
|
Leo F Isikdogan, Bhavin V Nayak, Chyuan-Tyng Wu, Joao Peralta Moreira, Sushma Rao, Gilad Michael |
3259 |
Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks
|
Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, Yinjie Lei |
4045 |
Semi-supervised Learning with a Teacher-student Network for Generalized Attribute Prediction
|
Minchul Shin |
4170 |
SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates
|
Zhizhong Han, Guanhui Qiao, Yu-Shen Liu, Matthias Zwicker |
3248 |
Shape Adaptor: A Learnable Resizing Module
|
Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns |
3277 |
Single-Shot Neural Relighting and SVBRDF Estimation
|
Shen Sang, Manmohan Chandraker |
3286 |
Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction
|
Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li |
4163 |
Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising
|
Guanting Dong, Yueyi Zhang, Zhiwei Xiong |
3236 |
Spatially Aware Multimodal Transformers for TextVQA
|
Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal |
4106 |
Spatiotemporal Attacks for Embodied Agents
|
Aishan Liu, Tairan Huang, Xianglong Liu, Yitao Xu, Yuqing Ma, Xinyun Chen, Stephen J. Maybank, Dacheng Tao |
4133 |
Spherical Feature Transform for Deep Metric Learning
|
Yuke Zhu, Yan Bai, Yichen Wei |
4084 |
Stochastic Bundle Adjustment for Efficient and Scalable 3D Reconstruction
|
Lei Zhou, Zixin Luo, Mingmin Zhen, Tianwei Shen, Shiwei Li, Zhuofei Huang, Tian Fang, Long Quan |
4194 |
Structure-Aware Generation Network for Recipe Generation from Images
|
Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao |
4102 |
Suppressing Mislabeled Data via Grouping and Self-Attention
|
Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu Qiao |
4085 |
TANet: Towards Fully Automatic Tooth Arrangement
|
Guodong Wei, Zhiming Cui, Yumeng Liu, Nenglun Chen, Runnan Chen, Guiqing Li, Wenping Wang |
4199 |
TP-LSD: Tri-Points Based Line Segment Detector
|
Siyu Huang, Fangbo Qin, Pengfei Xiong, Ning Ding, Yijia He, Xiao Liu |
3304 |
Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images
|
Matthew Purri, Kristin Dana |
4104 |
Tensor Low-Rank Reconstruction for Semantic Segmentation
|
Wanli Chen, Xinge Zhu, Ruoqi Sun, Junjun He, Ruiyu Li, Xiaoyong Shen, Bei Yu |
3302 |
Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction
|
Kelvin Wong, Qiang Zhang, Ming Liang, Bin Yang, Renjie Liao, Abbas Sadat, Raquel Urtasun |
3238 |
TexMesh: Reconstructing Detailed Human Texture and Geometry from RGB-D Video
|
Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa G. Narasimhan, Minh Vo |
4070 |
Latent Topic-aware Multi-Label Classification
|
Jianghong Ma, Yang Liu |
4131 |
Topology-Preserving Class-Incremental Learning
|
Xiaoyu Tao, Xinyuan Chang, Xiaopeng Hong, Xing Wei, Yihong Gong |
4128 |
Toward Faster and Simpler Matrix Normalization via Rank-1 Update
|
Tan Yu, Yunfeng Cai, Ping Li |
4175 |
Towards Content-Independent Multi-Reference Super-Resolution: Adaptive Pattern Matching and Feature Aggregation
|
Xu Yan, Weibing Zhao, Kun Yuan, Ruimao Zhang, Zhen Li, Shuguang Cui |
4039 |
Towards Real-Time Multi-Object Tracking
|
Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, Shengjin Wang |
3228 |
Towards Unique and Informative Captioning of Images
|
Zeyu Wang, Berthy Feng, Karthik Narasimhan, Olga Russakovsky |
3275 |
Transformation Consistency Regularization -- A Semi-Supervised Paradigm for Image-to-Image Translation
|
Aamir Mustafa, Rafal K. Mantiuk |
4080 |
URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark
|
Seonguk Seo, Joon-Young Lee, Bohyung Han |
4087 |
UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection
|
Bumsoo Kim, Taeho Choi, Jaewoo Kang, Hyunwoo J. Kim |
4044 |
Self-supervised Bayesian Deep Learning for Image Recovery with Applications to Compressive Sensing
|
Tongyao Pang, Yuhui Quan, Hui Ji |
4007 |
Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition
|
Wen Ji, Kelei He, Jing Huo, Zheng Gu, Yang Gao |
4187 |
VCNet: A Robust Approach to Blind Image Inpainting
|
Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia |
3285 |
VQA-LOL: Visual Question Answering under the Lens of Logic
|
Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang |
4208 |
Variational Connectionist Temporal Classification
|
Linlin Chao, Jingdong Chen, Wei Chu |
3292 |
Variational Diffusion Autoencoders with Random Walk Sampling
|
Henry Li, Ofir Lindenbaum, Xiuyuan Cheng, Alexander Cloninger |
3235 |
Vectorizing World Buildings: Planar Graph Reconstruction by Primitive Detection and Relationship Inference
|
Nelson Nauata, Yasutaka Furukawa |
4051 |
Video Super-Resolution with Recurrent Structure-Detail Network
|
Takashi Isobe, Xu Jia, Shuhang Gu, Songjiang Li, Shengjin Wang, Qi Tian |
3299 |
Virtual Multi-view Fusion for 3D Semantic Segmentation
|
Abhijit Kundu, Xiaoqi Yin, Alireza Fathi, David Ross, Brian Brewington, Thomas Funkhouser, Caroline Pantofaru |
4205 |
Visual-Relation Conscious Image Generation from Structured-Text
|
Duc Minh Vo, Akihiro Sugimoto |
3251 |
Wavelet-Based Dual-Branch Network for Image Demoir\'{eing
|
Lin Liu, Jianzhuang Liu, Shanxin Yuan, Gregory Slabaugh, Alevs Leonardis, Wengang Zhou, Qi Tian |
4060 |
Weakly Supervised 3D Object Detection from Lidar Point Cloud
|
Qinghao Meng, Wenguan Wang, Tianfei Zhou, Jianbing Shen, Luc Van Gool, Dengxin Dai |
4017 |
Webly Supervised Image Classification with Self-Contained Confidence
|
Jingkang Yang, Litong Feng, Weirong Chen, Xiaopeng Yan, Huabin Zheng, Ping Luo, Wayne Zhang |
4191 |
Weight Decay Scheduling and Knowledge Distillation for Active Learning
|
Juseung Yun, Byungjoo Kim, Junmo Kim |
3242 |
Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop
|
Benjamin Biggs, Oliver Boyne, James Charles, Andrew Fitzgibbon, Roberto Cipolla |
3233 |
World-Consistent Video-to-Video Synthesis
|
Arun Mallya, Ting-Chun Wang, Karan Sapra, Ming-Yu Liu |
4095 |
Yet Another Intermediate-Level Attack
|
Qizhang Li, Yiwen Guo, Hao Chen |
4108 |
Zero-Shot Image Super-Resolution with Depth Guided Internal Degradation Learning
|
Xi Cheng, Zhenyong Fu, Jian Yang |