4341 |
A Competence-aware Curriculum for Visual Concepts Learning via Question Answering
|
Qing Li, Siyuan Huang, Yining Hong, Song-Chun Zhu |
4320 |
Aligning and Projecting Images to Class-conditional Generative Networks
|
Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, Aaron Hertzmann |
4248 |
Crowdsampling the Plenoptic Function
|
Zhengqi Li, Wenqi Xian, Abe Davis, Noah Snavely |
4251 |
DeepSFM: Structure From Motion Via Deep Bundle Adjustment
|
Xingkui Wei, Yinda Zhang, Zhuwen Li, Yanwei Fu, Xiangyang Xue |
4232 |
Describing Textures using Natural Language
|
Chenyun Wu, Mikayla Timm, Subhransu Maji |
4316 |
The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization
|
Jingjing Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew Fitzgibbon, Jamie Shotton |
4333 |
In-Home Daily-Life Captioning Using Radio Signals
|
Lijie Fan, Tianhong Li, Yuan Yuan, Dina Katabi |
4376 |
It is not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction
|
Karttikeya Mangalam, Harshayu Girase, Shreyas Agarwal, Kuan-Hui Lee, Ehsan Adeli, Jitendra Malik, Adrien Gaidon |
4353 |
Learning Lane Graph Representations for Motion Forecasting
|
Ming Liang, Bin Yang, Rui Hu, Yun Chen, Renjie Liao, Song Feng, Raquel Urtasun |
4318 |
Learning Stereo from Single Images
|
Jamie Watson, Oisin Mac Aodha, Daniyar Turmukhambetov, Gabriel J. Brostow, Michael Firman |
4275 |
Long-term Human Motion Prediction with Scene Context
|
Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik |
4278 |
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
|
Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng |
4328 |
Post-Training Piecewise Linear Quantization for Deep Neural Networks
|
Jun Fang, Ali Shafiee, Hamzah Abdel-Aziz, David Thorsley, Georgios Georgiadis, Joseph H. Hassoun |
4348 |
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
|
Zachary Teed, Jia Deng |
4267 |
Rewriting a Deep Generative Model
|
David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba |
4340 |
Self-Challenging Improves Cross-Domain Generalization
|
Zeyi Huang, Haohan Wang, Eric P. Xing, Dong Huang |
4357 |
Synthesis and Completion of Facades from Satellite Imagery
|
Xiaowei Zhang, Christopher May, Daniel Aliaga |
4242 |
Synthesize then Compare: Detecting Failures and Anomalies for Semantic Segmentation
|
Yingda Xia, Yi Zhang, Fengze Liu, Wei Shen, Alan L. Yuille |
4371 |
TextCaps: a Dataset for Image Captioning with Reading Comprehension
|
Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh |
4417 |
TopoGAN: A Topology-Aware Generative Adversarial Network
|
Fan Wang, Huidong Liu, Dimitris Samaras, Chao Chen |
4349 |
Towards Streaming Perception
|
Mengtian Li, Yu-Xiong Wang, Deva Ramanan |
4324 |
Visual Memorability for Robotic Interestingness via Unsupervised Online Learning
|
Chen Wang, Wenshan Wang, Yuheng Qiu, Yafei Hu, Sebastian Scherer |
4354 |
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
|
Unnat Jain, Luca Weihs, Eric Kolve, Ali Farhadi, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander Schwing |
4261 |
REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets
|
Angelina Wang, Arvind Narayanan, Olga Russakovsky |
4379 |
Active Perception using Light Curtains for Autonomous Driving
|
Siddharth Ancha, Yaadhav Raaj, Peiyun Hu, Srinivasa G. Narasimhan, David Held |
4281 |
Adaptive Computationally Efficient Network for Monocular 3D Hand Pose Estimation
|
Zhipeng Fan, Jun Liu, Yao Wang |
4439 |
Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models
|
Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu |
4268 |
Contrastive Learning for Weakly Supervised Phrase Grounding
|
Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem |
4343 |
Deep Feedback Inverse Problem Solver
|
Wei-Chiu Ma, Shenlong Wang, Jiayuan Gu, Sivabalan Manivasagam, Antonio Torralba, Raquel Urtasun |
4230 |
Deep Reflectance Volumes: Relightable Reconstructions from Multi-View Photometric Images
|
Sai Bi, Zexiang Xu, Kalyan Sunkavalli, Milovs} Havs}an, Yannick Hold-Geoffroy, David Kriegman, Ravi Ramamoorthi |
4378 |
DeepGMR: Learning Latent Gaussian Mixture Models for Registration
|
Wentao Yuan, Benjamin Eckart, Kihwan Kim, Varun Jampani, Dieter Fox, Jan Kautz |
4405 |
Directional Temporal Modeling for Action Recognition
|
Xinyu Li, Bing Shuai, Joseph Tighe |
4336 |
Few-Shot Scene-Adaptive Anomaly Detection
|
Yiwei Lu, Frank Yu, Mahesh Kumar Krishna Reddy, Yang Wang |
4398 |
GeLaTO: Generative Latent Textured Objects
|
Ricardo Martin-Brualla, Rohit Pandey, Sofien Bouaziz, Matthew Brown, Dan B Goldman |
4344 |
Hallucinating Visual Instances in Total Absentia
|
Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao |
4404 |
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
|
Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra |
4425 |
Jointly learning visual motion and confidence from local patches in event cameras
|
Daniel R. Kepple, Daewon Lee, Colin Prepsius, Volkan Isler, Il Memming Park, Daniel D. Lee |
4419 |
Learning Multi-layer Latent Variable Model via Variational Optimization of Short Run MCMC for Approximate Inference
|
Erik Nijkamp, Bo Pang, Tian Han, Linqi Zhou, Song-Chun Zhu, Ying Nian Wu |
4283 |
Learning to Scale Multilingual Representations for Vision-Language Tasks
|
Andrea Burns, Donghyun Kim, Derry Wijaya, Kate Saenko, Bryan A. Plummer |
4231 |
Memory-augmented Dense Predictive Coding for Video Representation Learning
|
Tengda Han, Weidi Xie, Andrew Zisserman |
4427 |
A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses
|
Malik Boudiaf, J'er^ome Rony, Imtiaz Masud Ziko, Eric Granger, Marco Pedersoli, Pablo Piantanida, Ismail Ben Ayed |
4237 |
Neural Design Network: Graphic Layout Generation with Constraints
|
Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yang |
4253 |
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding
|
Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas Guibas, Or Litany |
4329 |
PointPWC-Net: Cost Volume on Point Clouds for (Self-)Supervised Scene Flow Estimation
|
Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin |
4367 |
Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings
|
Anita Rau, Guillermo Garcia-Hernando, Danail Stoyanov, Gabriel J. Brostow, Daniyar Turmukhambetov |
4311 |
Region Graph Embedding Network for Zero-Shot Learning
|
Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao |
4409 |
Shonan Rotation Averaging: Global Optimality by Surfing $SO(p)^n$
|
Frank Dellaert, David M. Rosen, Jing Wu, Robert Mahony, Luca Carlone |
4259 |
Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks
|
Jeffrey O. Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik |
4288 |
Surface Normal Estimation of Tilted Images via Spatial Rectifier
|
Tien Do, Khiem Vuong, Stergios I. Roumeliotis, Hyun Soo Park |
4393 |
Joint Semantic Instance Segmentation on Graphs with the Semantic Mutex Watershed
|
Steffen Wolf, Yuyan Li, Constantin Pape, Alberto Bailoni, Anna Kreshuk, Fred A. Hamprecht |
4309 |
Transporting Labels via Hierarchical Optimal Transport for Semi-Supervised Learning
|
Fariborz Taherkhani, Ali Dabouei, Sobhan Soleymani, Jeremy Dawson, Nasser M. Nasrabadi |
4345 |
Weakly-supervised 3D Shape Completion in the Wild
|
Jiayuan Gu, Wei-Chiu Ma, Sivabalan Manivasagam, Wenyuan Zeng, Zihao Wang, Yuwen Xiong, Hao Su, Raquel Urtasun |
4424 |
Visual Relation Grounding in Videos
|
Junbin Xiao, Xindi Shang, Xun Yang, Sheng Tang, Tat-Seng Chua |
4312 |
3D Fluid Flow Reconstruction Using Compact Light Field PIV
|
Zhong Li, Yu Ji, Jingyi Yu, Jinwei Ye |
4250 |
3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning
|
Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, L'aszl'o A. Jeni, Fernando De la Torre |
4236 |
3PointTM: Faster Measurement of High-Dimensional Transmission Matrices
|
Yujun Chen, Manoj Kumar Sharma, Ashutosh Sabharwal, Ashok Veeraraghavan, Aswin C. Sankaranarayanan |
4258 |
A Comprehensive Study of Weight Sharing in Graph Networks for 3D Human Pose Estimation
|
Kenkun Liu, Rongqi Ding, Zhiming Zou, Le Wang, Wei Tang |
4327 |
A Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval Tasks with Deep Neural Networks
|
Sangpil Kim, Hyung-gun Chi, Xiao Hu, Qixing Huang, Karthik Ramani |
4413 |
A Recurrent Transformer Network for Novel View Action Synthesis
|
Kara Marie Schatz, Erik Quintanilla, Shruti Vyas, Yogesh S Rawat |
4412 |
A Simple and Effective Framework for Pairwise Deep Metric Learning
|
Qi Qi, Yan Yan, Zixuan Wu, Xiaoyu Wang, Tianbao Yang |
4247 |
AUTO3D: Novel view synthesis through unsupervisely learned variational viewpoint and global 3D representation
|
Xiaofeng Liu, Tong Che, Yiqun Lu, Chao Yang, Site Li, Jane You |
4375 |
Accelerating Deep Learning with Millions of Classes
|
Zhuoning Yuan, Zhishuai Guo, Xiaotian Yu, Xiaoyu Wang, Tianbao Yang |
4299 |
Action Localization through Continual Predictive Learning
|
Sathyanarayanan Aakur, Sudeep Sarkar |
4298 |
Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization
|
Kyle Min, Jason J. Corso |
4436 |
Adversarial Data Augmentation via Deformation Statistics
|
Sahin Olut, Zhengyang Shen, Zhenlin Xu, Samuel Gerber, Marc Niethammer |
4408 |
An Efficient Training Framework for Reversible Neural Architectures
|
Zixuan Jiang, Keren Zhu, Mingjie Liu, Jiaqi Gu, David Z. Pan |
4291 |
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
|
Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Wujie Wen, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang |
4350 |
AssembleNet++: Assembling Modality Representations via Attention Connections
|
Michael S. Ryoo, AJ Piergiovanni, Juhana Kangaspunta, Anelia Angelova |
4227 |
Atlas: End-to-End 3D Scene Reconstruction from Posed Images
|
Zak Murez, Tarrence van As, James Bartolozzi, Ayan Sinha, Vijay Badrinarayanan, Andrew Rabinovich |
4420 |
Attention-Based Query Expansion Learning
|
Albert Gordo, Filip Radenovic, Tamara Berg |
4317 |
Attentive Normalization
|
Xilai Li, Wei Sun, Tianfu Wu |
4363 |
AutoSimulate: (Quickly) Learning Synthetic Data Generation
|
Harkirat Singh Behl, Atilim Güneş Baydin, Ran Gal, Philip H.S. Torr, Vibhav Vineet |
4368 |
BATS: Binary ArchitecTure Search
|
Adrian Bulat, Brais Martinez, Georgios Tzimiropoulos |
4279 |
BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network
|
Deng-Ping Fan, Yingjie Zhai, Ali Borji, Jufeng Yang, Ling Shao |
4418 |
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
|
Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee |
4395 |
Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
|
Yuanyi Zhong, Jianfeng Wang, Jian Peng, Lei Zhang |
4410 |
Box2Seg: Attention Weighted Loss and Discriminative Feature Learning for Weakly Supervised Segmentation
|
Viveka Kulharia, Siddhartha Chandra, Amit Agrawal, Philip Torr, Ambrish Tyagi |
4396 |
B\'ezierSketch: A generative model for scalable vector sketches
|
Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song |
4406 |
Channel selection using Gumbel Softmax
|
Charles Herrmann, Richard Strong Bowen, Ramin Zabih |
4369 |
Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency
|
Shasha Li, Shitong Zhu, Sudipta Paul, Amit Roy-Chowdhury, Chengyu Song, Srikanth Krishnamurthy, Ananthram Swami, Kevin S Chan |
4277 |
Contrastive Multiview Coding
|
Yonglong Tian, Dilip Krishnan, Phillip Isola |
4262 |
Coupling Explicit and Implicit Surface Representations for Generative 3D Modeling
|
Omid Poursaeed, Matthew Fisher, Noam Aigerman, Vladimir G. Kim |
4249 |
Structured Landmark Detection via Topology-Adapting Deep Graph Learning
|
Weijian Li, Yuhang Lu, Kang Zheng, Haofu Liao, Chihung Lin, Jiebo Luo, Chi-Tung Cheng, Jing Xiao, Le Lu, Chang-Fu Kuo, Shun Miao |
4416 |
DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search
|
Xiyang Dai, Dongdong Chen, Mengchen Liu, Yinpeng Chen, Lu Yuan |
4352 |
DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition
|
Matthew Korban, Xin Li |
4356 |
DSDNet: Deep Structured self-Driving Network
|
Wenyuan Zeng, Shenlong Wang, Renjie Liao, Yun Chen, Bin Yang, Raquel Urtasun |
4386 |
Deep FusionNet for Point Cloud Semantic Segmentation
|
Feihu Zhang quadquad Jin Fangquadquad Benjamin Wah quadquad Philip Torr |
4435 |
Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction
|
Rohan Chabra, Jan E. Lenssen, Eddy Ilg, Tanner Schmidt, Julian Straub, Steven Lovegrove, Richard Newcombe |
4385 |
Deep Shape from Polarization
|
Yunhao Ba, Alex Gilbert, Franklin Wang, Jinfa Yang, Rui Chen, newline Yiqin Wang, Lei Yan, Boxin Shi, Achuta Kadambi |
4366 |
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
|
Ye Zhu, Yu Wu, Yi Yang, Yan Yan |
4301 |
3D Human Shape Reconstruction from a Polarization Image
|
Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chi Xu, Minglun Gong, Li Cheng |
4355 |
Disambiguating Monocular Depth Estimation with a Single Transient
|
Mark Nishimura, David B. Lindell, Christopher Metzler, Gordon Wetzstein |
4397 |
Domain Adaptation Through Task Distillation
|
Brady Zhou, Nimit Kalra, Philipp Kr"ahenb"uhl |
4254 |
Domain Adaptive Semantic Segmentation Using Weak Labels
|
Sujoy Paul, Yi-Hsuan Tsai, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker |
4433 |
Dual Mixup Regularized Learning for Adversarial Domain Adaptation
|
Yuan Wu, Diana Inkpen, Ahmed El-Roby |
4347 |
Dynamic ReLU
|
Yinpeng Chen, Xiyang Dai, Mengchen Liu, Dongdong Chen, Lu Yuan, Zicheng Liu |
4346 |
Efficient Residue Number System Based Winograd Convolution
|
Zhi-Gang Liu, Matthew Mattina |
4392 |
Embedding Propagation: Smoother Manifold for Few-Shot Classification
|
Pau Rodr'iguez, Issam Laradji, Alexandre Drouin, Alexandre Lacoste |
4383 |
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
|
Xin Eric Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi[2] |
4302 |
Example-Guided Image Synthesis using Masked Spatial-Channel Attention and Self-Supervision
|
Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo |
4339 |
RadarNet: Exploiting Radar for Robust Perception of Dynamic Objects
|
Bin Yang, Runsheng Guo, Ming Liang, Sergio Casas, Raquel Urtasun |
4407 |
Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification
|
Dripta S. Raychaudhuri, Amit K. Roy-Chowdhury |
4271 |
DataMix: Efficient Privacy-Preserving Edge-Cloud Inference
|
Zhijian Liu, Zhanghao Wu, Chuang Gan, Ligeng Zhu, Song Han |
4234 |
Faster Person Re-Identification
|
Guan'an Wang, Shaogang Gong, Jian Cheng, Zengguang Hou |
4319 |
Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes
|
Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu |
4276 |
Foley Music: Learning to Generate Music from Videos
|
Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba |
4372 |
From Image to Stability: Learning Dynamics from Human Pose
|
Jesse Scott, Bharadwaj Ravichandran, Christopher Funk, Robert T. Collins, Yanxi Liu |
4266 |
From Shadow Segmentation to Shadow Removal
|
Hieu Le, Dimitris Samaras |
4432 |
Fully Embedding Fast Convolutional Networks on Pixel Processor Arrays
|
Laurie Bose, Piotr Dudek, Jianing Chen, Stephen J. Carey, Walterio W. Mayol-Cuevas |
4331 |
GAN-based Garment Generation Using Sewing Pattern Images
|
Yu Shen, Junbang Liang, Ming C. Lin |
4289 |
GSIR: Generalizable 3D Shape Interpretation and Reconstruction
|
Jianren Wang, Zhaoyuan Fang |
4360 |
Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection
|
Yuliang Guo, Guang Chen, Peitao Zhao, Weide Zhang, Jinghao Miao, Jingao Wang, Tae Eun Choe |
4280 |
Generating Videos of Zero-Shot Compositions of Actions and Objects
|
Megha Nawhal, Mengyao Zhai, Andreas Lehrmann, Leonid Sigal, Greg Mori |
4300 |
Generative View-Correlation Adaptation for Semi-Supervised Multi-View Learning
|
Yunyu Liu, Lichen Wang, Yue Bai, Can Qin, Zhengming Ding, Yun Fu |
4387 |
Hand-Transformer: Non-Autoregressive Structured Modeling for 3D Hand Pose Estimation
|
Lin Huang, Jianchao Tan, Ji Liu, Junsong Yuan |
4323 |
Hierarchical Kinematic Human Mesh Recovery
|
Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Kovseck'a, Ziyan Wu |
4264 |
Hierarchical Style-based Networks for Motion Synthesis
|
Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell |
4241 |
Image Classification in the Dark using Quanta Image Sensors
|
Abhiram Gnanasambandam, Stanley H. Chan |
4373 |
Implicit Latent Variable Model for Scene-Consistent Motion Forecasting
|
Sergio Casas, Cole Gulino, Simon Suo, Katie Luo, Renjie Liao, Raquel Urtasun |
4434 |
Improving Object Detection with Selective Self-Supervised Self-Training
|
Yandong Li, Di Huang, Danfeng Qin, Liqiang Wang, Boqing Gong |
4381 |
Improving Face Recognition by Clustering Unlabeled Faces in the Wild
|
Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker |
4322 |
Interactive Annotation of 3D Object Geometry using 2D Scribbles
|
Tianchang Shen, Jun Gao, Amlan Kar, Sanja Fidler |
4382 |
Iterative Distance-Aware Similarity Matrix Convolution with Mutual-Supervised Point Elimination for Efficient Point Cloud Registration
|
Jiahao Li, Changhao Zhang, Ziyao Xu, Hangning Zhou, Chi Zhang |
4337 |
JNR: Joint-based Neural Rig Representation for Compact 3D Face Modeling
|
Noranart Vesdapunt, Mitch Rundle, HsiangTao Wu, Baoyuan Wang |
4238 |
Joint Bilateral Learning for Real-time Universal Photorealistic Style Transfer
|
Xide Xia, Meng Zhang, Tianfan Xue, Zheng Sun, Hui Fang, Brian Kulis, Jiawen Chen |
4429 |
Jointly De-biasing Face Recognition and Demographic Attribute Estimation
|
Sixue Gong, Xiaoming Liu, Anil K. Jain |
4390 |
Journey Towards Tiny Perceptual Super-Resolution
|
Royson Lee, Lukasz Dudziak, Mohamed Abdelfattah, Stylianos I. Venieris, Hyeji Kim, Hongkai Wen, Nicholas D. Lane |
4321 |
Key Frame Proposal Network for Efficient Pose Estimation in Videos
|
Yuexi Zhang, Yin Wang, Octavia Camps, Mario Sznaier |
4365 |
Kinematic 3D Object Detection in Monocular Video
|
Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele |
4400 |
LEMMA: A Multi-view Dataset for \underline{L\underline{Earning \underline{Multi-agent \underline{Multi-task \underline{Activities
|
Baoxiong Jia, Yixin Chen, Siyuan Huang, Yixin Zhu, Song-Chun Zhu |
4415 |
Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation
|
Jinyu Yang, Weizhi An, Sheng Wang, Xinliang Zhu, Chaochao Yan, Junzhou Huang |
4334 |
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
|
Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das |
4438 |
Laying the Foundations of Deep Long-Term Crowd Flow Prediction
|
Samuel S. Sohn, Honglu Zhou, Seonghyeon Moon, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia |
4403 |
Learn distributed GAN with Temporary Discriminators
|
Hui Qu, Yikai Zhang, Qi Chang, Zhennan Yan, Chao Chen, Dimitris Metaxas |
4252 |
Learnable Cost Volume Using the Cayley Representation
|
Taihong Xiao, Jinwei Yuan, Deqing Sun, Qifei Wang Xin-Yu Zhang, Kehan Xu, Ming-Hsuan Yang |
4224 |
Learning 3D Part Assembly from a Single Image
|
Yichen Li, Kaichun Mo, Lin Shao, Minhyuk Sung, Leonidas Guibas |
4308 |
Learning Attentive and Hierarchical Representations for 3D Shape Recognition
|
Jiaxin Chen, Jie Qin, Yuming Shen, Li Liu, Fan Zhu, Ling Shao |
4389 |
Learning Enriched Features for Real Image Restoration and Enhancement
|
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao |
4226 |
Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction
|
Yiming Qian, Yasutaka Furukawa |
4306 |
Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling
|
Yuliang Zou, Pan Ji, Quoc-Huy Tran, Jia-Bin Huang, Manmohan Chandraker |
4290 |
Learning Object Placement by Inpainting for Compositional Data Augmentation
|
Lingzhi Zhang, Tarmily Wen, Jie Min, Jiancong Wang, David Han, Jianbo Shi |
4374 |
Learning Visual Commonsense for Robust Scene Graph Generation
|
Alireza Zareian, Zhecan Wang, Haoxuan You, Shih-Fu Chang |
4244 |
Learning to Combine: Knowledge Aggregation for Multi-Source Domain Adaptation
|
Hang Wang, Minghao Xu, Bingbing Ni, Wenjun Zhang |
4335 |
Learning to Generate Grounded Visual Captions without Localization Supervision
|
Chih-Yao~Ma, Yannis~Kalantidis, Ghassan~AlRegib, Peter~Vajda, Marcus~Rohrbach, Zsolt~Kira |
4315 |
Learning to Generate Novel Domains for Domain Generalization
|
Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, Tao Xiang |
4431 |
Learning to Learn Words from Visual Scenes
|
D'idac Sur'is, Dave Epstein, Heng Ji, Shih-Fu Chang, Carl Vondrick |
4364 |
Least squares surface reconstruction on arbitrary domains
|
Dizhong Zhu, William A. P. Smith |
4297 |
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D
|
Jonah Philion, Sanja Fidler |
4421 |
Explaining Image Classifiers using Statistical Fault Localization
|
Youcheng~Sun, Hana~Chockler, Xiaowei~Huang, Daniel~Kroening |
4285 |
Low Light Video Enhancement using Synthetic Data Produced with an Intermediate Domain Mapping
|
Danai Triantafyllidou, Sean Moran, Steven McDonagh, Sarah Parisot, Gregory Slabaugh |
4233 |
Meshing Point Clouds with Predicted Intrinsic-Extrinsic Ratio Guidance
|
Minghua Liu, Xiaoshuai Zhang, Hao Su |
4305 |
MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation
|
Benlin Liu, Yongming Rao, Jiwen Lu, Jie Zhou, Cho-Jui Hsieh |
4394 |
Mining self-similarity: Label super-resolution with epitomic representations
|
Nikolay Malkin, Anthony Ortiz, Nebojsa Jojic |
4326 |
Modeling Artistic Workflows for Image Generation and Editing
|
Hung-Yu Tseng, Matthew Fisher, Jingwan Lu, Yijun Li, Vladimir Kim, Ming-Hsuan Yang |
2359 |
Monocular Real-Time Volumetric Performance Capture
|
Ruilong Li, Yuliang Xiu, Shunsuke Saito, Zeng Huang, Kyle Olsewski, Hao Li |
4228 |
Multiple Class Novelty Detection Under Data Distribution Shift
|
Poojan Oza, Hien V. Nguyen, Vishal M. Patel |
4414 |
Multi-view Action Recognition using Cross-view Video Prediction
|
Shruti Vyas, Yogesh S Rawat, Mubarak Shah |
4256 |
Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
|
Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng, Maxwell D. Collins, Ekin D. Cubuk, Barret Zoph, Hartwig Adam, Jonathon Shlens |
4428 |
Representation Learning on Visual-Symbolic Graphs for Video Understanding
|
Effrosyni Mavroudi, Benjam'in B'ejar Haro, Ren'e Vidal |
4437 |
Neural Predictor for Neural Architecture Search
|
Wei Wen, Hanxiao Liu, Yiran Chen, Hai Li, Gabriel Bender, Pieter-Jan Kindermans |
4296 |
Object Detection with a Unified Label Space from Multiple Datasets
|
Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu |
4310 |
Omni-sourced Webly-supervised Learning for Video Recognition
|
Haodong Duan, Yue Zhao, Yuanjun Xiong, Wentao Liu, Dahua Lin |
4338 |
On Disentangling Spoof Trace for Generic Face Anti-Spoofing
|
Yaojie Liu, Joel Stehouwer, Xiaoming Liu |
4442 |
On Diverse Asynchronous Activity Anticipation
|
He Zhao, Richard P. Wildes |
4411 |
One-Pixel Signature: Characterizing CNN Models for Backdoor Detection
|
Shanjiaoyang Huang, Weiqi Peng, Zhiwei Jia, Zhuowen Tu |
4314 |
Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation
|
Da Li, Timothy Hospedales |
4423 |
Orderly Disorder in Point Cloud Domain
|
Morteza~Ghahremani, Bernard~Tiddeman, Yonghuai~Liu, and Ardhendu~Behera |
4446 |
Object-Semantics Aligned Pre-training for Vision-Language Tasks
|
Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang, Lijuan Wang, Houdong Hu, Li Dong, Furu Wei, Yejin Choi, Jianfeng Gao |
4225 |
PT2PC: Learning to Generate 3D Point Cloud Shapes from Part Tree Conditions
|
Kaichun Mo, He Wang, Xinchen Yan, Leonidas Guibas |
4399 |
PatchAttack: A Black-box Texture-based Attack with Reinforcement Learning
|
Chenglin Yang, Adam Kortylewski, Cihang Xie, Yinzhi Cao, Alan Yuille |
4260 |
People as Scene Probes
|
Yifan Wang, Brian L. Curless, Steven M. Seitz |
4370 |
Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations
|
Abbas Sadat, Sergio Casas, Mengye Ren, Xinyu Wu, Pranaab Dhawan, Raquel Urtasun |
4361 |
Pillar-based Object Detection for Autonomous Driving
|
Yue Wang, Alireza Fathi, Abhijit Kundu, David A. Ross, Caroline Pantofaru, Tom Funkhouser, Justin Solomon |
4377 |
PointTriNet: Learned Triangulation of 3D Point Sets
|
Nicholas Sharp, Maks Ovsjanikov |
4313 |
Polarized Optical-Flow Gyroscope
|
Masada Tzabari, Yoav Y. Schechner |
4402 |
Practical Poisoning Attacks on Neural Networks
|
Junfeng Guo, Cong Liu |
4273 |
Progressive Transformers for End-to-End Sign Language Production
|
Ben Saunders, Necati Cihan Camgoz, Richard Bowden |
4401 |
Proposal-based Video Completion
|
Yuan-Ting Hu, Heng Wang, Nicolas Ballas, Kristen Grauman, Alexander G. Schwing |
4384 |
ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis
|
Eu Wern Teh, Terrance DeVries, Graham W. Taylor |
4235 |
Quantization Guided JPEG Artifact Correction
|
Max Ehrlich, Ser-Nam Lim, Larry Davis, Abhinav Shrivastava |
4240 |
REMIND Your Neural Network to Prevent Catastrophic Forgetting
|
Tyler L. Hayes, Kushal Kafle, Robik Shrestha, Manoj Acharya, Christopher Kanan |
4294 |
ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions
|
Zechun Liu, Zhiqiang Shen, Marios Savvides, Kwang-Ting Cheng |
4443 |
Representative-Discriminative Learning for Open-set Land Cover Classification of Satellite Imagery
|
Razieh Kaviani Baghbaderani, Ying Qu, Hairong Qi, Craig Stutts |
4257 |
RhyRNN: Rhythmic RNN for Recognizing Events in Long and Complex Videos
|
Tianshu Yu, Yikang Li, Baoxin Li |
4292 |
SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
|
Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao |
4388 |
SOLAR: Second-Order Loss and Attention for Image Retrieval
|
Tony Ng, Vassileios Balntas, Yurun Tian, Krystian Mikolajczyk |
4358 |
SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization
|
Xuefeng Hu, Zhihan Zhang, Zhenye Jiang, Syomantak Chaudhuri, Zhenheng Yang, Ram Nevatia |
4265 |
SPOT: Selective Point Cloud Voting for Better Proposal in Point Cloud Object Detection
|
Hongyuan Du, Linjun Li, Bo Liu, Nuno Vasconcelos |
4380 |
Sat2Graph: Road Graph Extraction through Graph-Tensor Encoding
|
Songtao He, Favyen Bastani, Satvat Jagwani, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Mohamed M. Elshrif, Samuel Madden, Mohammad Amin Sadeghi |
4269 |
Single View Metrology in the Wild
|
Rui Zhu, Xingyi Yang, Yannick Hold-Geoffroy, Federico Perazzi, Jonathan Eisenmann, Kalyan Sunkavalli, Manmohan Chandraker |
4287 |
ScribbleBox: Interactive Annotation Framework for Video Object Segmentation
|
Bowen Chen, Huan Ling, Xiaohui Zeng, Jun Gao, Ziyue Xu, Sanja Fidler |
4245 |
Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization
|
Haibao Yu, Qi Han, Jianbo Li, Jianping Shi, Guangliang Cheng, Bin Fan |
4426 |
Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
|
Haotian Tang, Zhijian Liu, Shengyu Zhao, Yujun Lin, Ji Lin, Hanrui Wang, Song Han |
4441 |
Self-supervision with Superpixels: Training Few-shot Medical Image Segmentation without Annotation
|
Cheng Ouyang, Carlo Biffi, Chen Chen, Turkay Kart, Huaqi Qiu, Daniel Rueckert |
4362 |
Self-supervised Outdoor Scene Relighting
|
Ye Yu, Abhimitra Meka, Mohamed Elgharib, Hans-Peter Seidel, Christian Theobalt, William A. P. Smith |
4304 |
Self-supervised Single-view 3D Reconstruction via Semantic Consistency
|
Xueting Li, Sifei Liu, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz |
4330 |
Self-Supervised Learning of Audio-Visual Objects from Video
|
Triantafyllos Afouras, Andrew Owens, Joon Son Chung, Andrew Zisserman |
4282 |
Semantic View Synthesis
|
Hsin-Ping Huang, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang |
4293 |
SemanticAdv: Generating Adversarial Examples \\via Attribute-conditioned Image Editing
|
Haonan Qiu, Chaowei Xiao, Lei Yang, Xinchen Yan, Honglak Lee, Bo Li |
4307 |
Shape and Viewpoint without Keypoints
|
Shubham Goel, Angjoo Kanazawa, Jitendra Malik |
4284 |
Shuffle and Attend: Video Domain Adaptation
|
Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang |
4286 |
SimAug: Learning Robust Representations from Simulation for Trajectory Prediction
|
Junwei Liang, Lu Jiang, Alexander Hauptmann |
4447 |
Solving Phase Retrieval with a Learned Reference
|
Rakib Hyder, Zikui Cai, M. Salman Asif |
4239 |
AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification
|
Xiaofang Wang, Xuehan Xiong, Maxim Neumann, AJ Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua |
4430 |
Spike-FlowNet: Event-based Optical Flow Estimation with Energy-Efficient Hybrid Neural Networks
|
Chankyu Lee, Adarsh Kumar Kosta, Alex Zihao Zhu, Kenneth Chaney, Kostas Daniilidis, Kaushik Roy |
4444 |
Structure-Aware Human-Action Generation
|
Ping Yu, Yang Zhao, Chunyuan Li, Junsong Yuan, Changyou Chen |
4332 |
Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach
|
Chaitanya Ahuja, Dong Won Lee, Yukiko I. Nakano, Louis-Philippe Morency |
4274 |
Sub-center ArcFace: Boosting Face Recognition by Large-scale Noisy Web Faces
|
Jiankang Deng, Jia Guo, Tongliang Liu, Mingming Gong, Stefanos Zafeiriou |
1189 |
Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks
|
Baris Gecer, Alexandros Lattas, Stylianos Ploumpis, Jiankang Deng, Athanasios Papaioannou, Stylianos Moschoglou, Stefanos Zafeiriou |
4359 |
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
|
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal |
4246 |
Talking-head Generation with Rhythmic Head Motion
|
Lele Chen, Guofeng Cui, Celong Liu, Zhong Li, Ziyi Kou, Yi Xu, Chenliang Xu |
4342 |
Towards causal benchmarking of bias in face analysis algorithms
|
Guha Balakrishnan, Yuanjun Xiong, Wei Xia, Pietro Perona |
4445 |
Towards Efficient Coarse-to-Fine Networks for Action and Gesture Recognition
|
Niamul Quader, Juwei Lu, Peng Dai, Wei Li |
4325 |
Two Stream Active Query Suggestion for Active Learning in Connectomics
|
Zudi Lin, Donglai Wei, Won-Dong Jang, Siyan Zhou, Xupeng Chen, Xueying Wang, Richard Schalek, Daniel Berger, Brian Matejek, Lee Kamentsky, Adi Peleg, Daniel Haehn, Thouis Jones, Toufiq Parag, Jeff Lichtman, Hanspeter Pfister |
4351 |
Unifying Deep Local and Global Features for Image Search
|
Bingyi Cao, Andr'e Araujo, Jack Sim |
4263 |
Unsupervised Deep Metric Learning with Transformed Attention Consistency and Contrastive Clustering Loss
|
Yang Li, Shichao Kan, Zhihai He |
4422 |
Unsupervised Monocular Depth Estimation for Night-time Images using Adversarial Domain Feature Adaptation
|
Madhu Vankadari, Sourav Garg, Anima Majumder, Swagat Kumar, Ardhendu Behera |
4303 |
Unsupervised Video Object Segmentation with Joint Hotspot Tracking
|
Lu Zhang, Jianming Zhang, Zhe Lin, Radom'ir Mvech, Huchuan Lu, You He |
4295 |
Video Object Detection via Object-level Temporal Aggregation
|
Chun-Han Yao, Chen Fang, Xiaohui Shen, Yangyue Wan, Ming-Hsuan Yang |
4255 |
Spatial Image Representation Learning through Echolocation
|
Ruohan Gao, Changan Chen, Ziad Al-Halah, Carl Schissler, Kristen Grauman |
4440 |
Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning
|
Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu |
4391 |
What makes fake images detectable? Understanding properties that generalize
|
Lucy Chai, David Bau, Ser-Nam Lim, Phillip Isola |
4229 |
When Does Self-supervision Improve Few-shot Learning?
|
Jong-Chyi Su, Subhransu Maji, Bharath Hariharan |
4272 |
Why do These Match? Explaining the Behavior of Image Similarity Models
|
Bryan A. Plummer, Mariya I. Vasileva, Vitali Petsiuk, Kate Saenko, David Forsyth |
4243 |
n-Reference Transfer Learning for Saliency Prediction
|
Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao |