default search action
International Journal of Computer Vision, Volume 132
Volume 132, Number 1, January 2024
- Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji:
Towards Language-Guided Visual Recognition via Dynamic Convolutions. 1-19 - Xin Luo, Wei Chen, Zhengfa Liang, Longqi Yang, Siwei Wang, Chen Li:
Crots: Cross-Domain Teacher-Student Learning for Source-Free Domain Adaptive Semantic Segmentation. 20-39 - Pia Bideau, Erik G. Learned-Miller, Cordelia Schmid, Karteek Alahari:
The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields. 40-55 - Junda Cheng, Gangwei Xu, Peng Guo, Xin Yang:
Coatrsnet: Fully Exploiting Convolution and Attention for Stereo Matching by Region Separation. 56-73 - Hyeongmin Lee, Taeoh Kim, Hanbin Son, Sangwook Baek, Minsu Cheon, Sangyoun Lee:
A Nonlinear, Regularized, and Data-independent Modulation for Continuously Interactive Image Processing Network. 74-94 - Gang Fu, Qing Zhang, Lei Zhu, Qifeng Lin, Yihao Wang, Siyuan Fan, Chunxia Xiao:
Towards High-Resolution Specular Highlight Detection. 95-117 - Ruize Han, Wei Feng, Feifan Wang, Zekun Qian, Haomin Yan, Song Wang:
Benchmarking the Complementary-View Multi-human Association and Tracking. 118-136 - Shijie Wang, Zhihui Wang, Haojie Li, Jianlong Chang, Wanli Ouyang, Qi Tian:
Accurate Fine-Grained Object Recognition with Structure-Driven Relation Graph Networks. 137-160 - Lingkun Luo, Shiqiang Hu, Liming Chen:
Discriminative Noise Robust Sparse Orthogonal Label Regression-Based Domain Adaptation. 161-184 - Jingjing Jiang, Ziyi Liu, Nanning Zheng:
Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering. 185-207 - Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang:
Context Autoencoder for Self-supervised Representation Learning. 208-223 - Yidong Wang, Zhuohao Yu, Jindong Wang, Qiang Heng, Hao Chen, Wei Ye, Rui Xie, Xing Xie, Shikun Zhang:
Exploring Vision-Language Models for Imbalanced Learning. 224-237 - Haocong Rao, Cyril Leung, Chunyan Miao:
Hierarchical Skeleton Meta-Prototype Contrastive Learning with Hard Skeleton Mining for Unsupervised Person Re-identification. 238-260 - Chunbo Lang, Gong Cheng, Binfei Tu, Junwei Han:
Few-Shot Segmentation via Divide-and-Conquer Proxies. 261-283 - Libo Zhang, Lutao Jiang, Ruyi Ji, Heng Fan:
Correction: PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection. 284 - Wenfeng Song, Xinyu Zhang, Yuting Guo, Shuai Li, Aimin Hao, Hong Qin:
Correction: Automatic Generation of 3D Scene Animation Based on Dynamic Knowledge Graphs and Contextual Encoding. 285
Volume 132, Number 2, February 2024
- Samu Koskinen, Erman Acar, Joni-Kristian Kämäräinen:
Single Pixel Spectral Color Constancy. 287-299 - Tianlun Zheng, Zhineng Chen, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang:
CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition. 300-318 - Zhong Zhuang, Taihui Li, Hengkang Wang, Ju Sun:
Blind Image Deblurring with Unknown Kernel Size and Substantial Noise. 319-348 - Da Chen, Jean-Marie Mirebeau, Huazhong Shu, Laurent D. Cohen:
A Region-Based Randers Geodesic Approach for Image Segmentation. 349-391 - Wenhao Wu, Zhun Sun, Yuxin Song, Jingdong Wang, Wanli Ouyang:
Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective. 392-409 - Chaoyu Zhao, Jianjun Qian, Shumin Zhu, Jin Xie, Jian Yang:
Learning Robust Facial Representation From the View of Diversity and Closeness. 410-427 - Liang Chen, Jiawei Zhang, Zhenhua Li, Yunxuan Wei, Faming Fang, Jimmy S. J. Ren, Jinshan Pan:
Deep Richardson-Lucy Deconvolution for Low-Light Image Deblurring. 428-445 - Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva:
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization. 446-465 - Xiangtai Li, Jiangning Zhang, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Dacheng Tao:
Sfnet: Faster and Accurate Semantic Segmentation Via Semantic Flow. 466-489 - Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, Jian Yang, Joost van de Weijer:
MineGAN++: Mining Generative Models for Efficient Knowledge Transfer to Limited Data Domains. 490-514 - Lujia Jin, Qing Guo, Shi Zhao, Lei Zhu, Qian Chen, Qiushi Ren, Yanye Lu:
One-Pot Multi-frame Denoising. 515-536 - Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi:
The Curious Layperson: Fine-Grained Image Recognition Without Expert Labels. 537-554 - Skylar Sutherland, Bernhard Egger, Joshua B. Tenenbaum:
Building 3D Generative Models from Minimal Data. 555-580 - Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. 581-595 - Yifei Ming, Yixuan Li:
How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models? 596-609
Volume 132, Number 3, March 2024
- Yushi Lan, Chen Change Loy, Bo Dai:
Correspondence Distillation from NeRF-Based GAN. 611-631 - Editor's Note: Special Issue on Physics-Based Vision Meets Deep Learning. 632
- Jun Tu, Gangshan Wu, Limin Wang:
Dual Graph Networks for Pose Estimation in Crowded Scenes. 633-653 - Song Tang, An Chang, Fabian Zhang, Xiatian Zhu, Mao Ye, Changshui Zhang:
Source-Free Domain Adaptation via Target Prediction Distribution Searching. 654-672 - 周和宇, 刘安安, 张晨宇, 朱平, 张千义, Mohan S. Kankanhalli:
用于少数镜头 3D 模型分类的多模态元转移融合网络. 673-688 - Soumya Suvra Ghosal,Yixuan Li:
视觉转换器对伪相关性是否具有鲁棒性? 689-709 元 - Yanan Sun、Chi-Keung Tang、Yu-Wing Tai:
语义图像遮罩:一般和特定语义。 710-730 - Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou:
SCT:通过显著通道进行参数高效微调的简单基线。 731-749 - Wei Zhai, Pingyu Wu, Kai Zhu, Yang Cao, Feng Wu, Zheng-Jun Zha:
弱监督对象定位和语义分割的背景激活抑制。 750-775 - Gani Rahmon、Kannappan Palaniappan、Imad Eddine Toubal、Filiz Bunyak、Raghuveer Rao、Guna Seetharaman:
DeepFTSG:多流非对称 USE-Net 网格编码器,具有用于视频运动分割的共享解码器功能融合架构。 邮编 776-804 - Yunfei Guo, Wei Feng, Fei Yin, Cheng-Lin Liu:
SignParser:交通标志理解的端到端框架。 805-821 - Kaiyang 周, Yongxin Yang, Yu Qiao, Tao Xiang:
用于域泛化和适应的 MixStyle 神经网络. 822 页 836 - Yuyang Zhao、Zhun Zhong、Na Zhao、Nicu Sebe、Gim Hee Lee:
风格幻觉双重一致性学习:视觉领域泛化的统一框架。 837-853 号 - Bolin Lai、Miao Liu、Fiona Ryan、James M. Rehg:
在变形金刚的眼中:以自我为中心的凝视估计及其他的全球-局部相关性。 854-871 - Shiyu 胡, Xin Zhao, Kaiqi Huang:
SOTVerse:用户定义的单个目标跟踪任务空间。 872-930 元 - Alexander Lehner、Stefano Gasperini、Alvaro Marcos-Ramiro、Michael Schmidt、Nassir Navab、Benjamin Busam、Federico Tombari:
用于稳健域外预测的 3D 对抗性增强。 931-963 元 - Avishek Siris、Jianbo Jiao、Gary K. L. Tam、Xianghua Xie、Rynson W. H. Lau:
推断显着实例排名的注意力转移。 964-986 - 薛峰, 张一聪, 王天熙, 周宇, 明安龙:
通过单目相机在反射地面上发现室内障碍物. 987-1007 元
第 132 卷,第 4 期,2024 年 4 月
- 周开扬,刘紫薇,翟晓华,李春元,凯特·萨恩科:
客座社论:关于大型视觉模型的前景和危险的特刊。 邮编:1009-1011 - Mochu Xiang, Yuchao Dai, Feiyu Zhang, Jiawei Shi, Xinyu Tian, Zhensong Zhang:
迈向鲁棒单目深度估计的统一网络:网络架构、训练策略和数据集。 1012-1028 元 - Wu Wang, Liang-Jian 邓, 冉然, Gemine Vivone:
用于图像融合的具有细节保留条件可逆网络的通用范式. 1029 元至 1054 元 - Zhiwei Lin, Tingting Liang, Taihong Xiao, Yongtao Wang, Ming-Hsuan Yang:
FlowNAS:用于光流估计的神经架构搜索。 公元 1055 年至 1074 年 - Shengyu Hao, Peiyuan Liu Liu, 詹一兵, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang:
DIVOTrack:一种用于 DIVerse Open 场景中跨视图多对象跟踪的新型数据集和基线方法。 公元 1075 年至 1090 年 - Shuang Liu、Masanori Suganuma、Takayuki Okatani:
用于具身视觉导航的对称感知神经架构。 1091 元至 1107 元 - Adrian Bulat,Georgios Tzimiropoulos:
语言感知软提示:V &L 模型的少数和零镜头适应的文本到文本优化。 1108-1125 元 - Bowen Zhang, Liyang Liu, Minh Hieu Phan, Zhi Tian, Chunhua Shen, Yifan Liu:
SegViT v2:使用普通视觉转换器探索高效和持续的语义分割。 1126 元 1147 元 - Pramod Rao, Mallikarjun B. R., Gereon Fox, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Fangneng Zhan, Ayush Tewari, ChristianTheobalt, Mohamed Elgharib:
体积可重射人脸的更深入分析。 1148 元至 1166 元 - Soohyun Kim、Jongbeom Baek、Jihye Park、Eunjae Ha、Homin Jung、Taeyoung Lee、Seungryong Kim:
InstaFormer++:使用 Transformer 进行多域实例感知图像到图像转换。 1167 年至 1186 年 - Libo Zhang, Xin Gu, Congcong Lio, Tiejian Luo, Heng Fan:
用于通用事件边界检测的本地压缩视频流学习。 1187 年至 1204 年 - Burak Tasdemir、Mustafa Goktan Gudukbay、Dogac Eldenk、Adil Meric、Aysegul Dundar:
学习无人监督部分的肖像画。 1205-1218 年 - Cong Yang, Bipin Indurkhya, John See, Bo Gao, Yan Ke, Zeyd Boukhers, Zhenyu Yang, Marcin Grzegorzek:
骨架真实提取:方法、注释工具和基准测试。 1219 元 1241 元 - Yaokun Li, Guang Tan, Chao Gou:
用于联合预测面部特征点、遮挡概率和头部姿势的级联迭代转换器。 1242 元至 1257 元 - 林峰、胡文泽、王耀伟、田永红、卢光明、陈方林、徐勇、王晓宇:
使用大视觉模型的通用目标检测。 1258 年至 1276 年 - 刘伟德、吴中华、赵阳、方玉明、福传生、程军、林国胜:
协调基础类和新类:广义少数镜头分割的类对比方法。 1277-1291 年 - 徐跃聪、曹浩志、尹建雄、陈正华、李小丽、李正国、徐倩文、杨剑飞:
深入识别黑暗环境中的行为:A综合基准研究。 1292 年至 1309 年 - Nishant Jain、Suryansh Kumar、Luc Van Gool:
从未摆姿势的图像中学习神经辐射场的稳健多尺度表示。 公元 1310 元至 1335 年 - Nan Yang, Xin Luan, Huidi Jia, Zhi Han, Xiaofeng Li, Yandong Tang:
CCR:具有连续性、一致性和可逆性的面部图像编辑。 公元 1336 年至 1349 年 - Daniel Wilson、Xiaohan Zhang、Waqas Sultani、Safwan Wshah:
图像和对象地理定位。 1350-1392 年 - 翁廷宇、肖军、潘浩、江海勇:
PartCom:用于 3D Open-Set 识别的部件合成学习。 1393 年至 1416 年 - 谢美雪、李爽、龚开雄、王玉林、黄高:
原型约束下通过面向目标的可转移语义增强跨域适应。 1417-1441 年
第 132 卷,第 5 期,2024 年 5 月
- Weitao Feng, Lei Bai, Yongqiang Yao, Fengwei Yu, Wanli Ouyang:
迈向与帧速率无关的多目标跟踪。 1443 年至 1462 年 - Chongwei Liu, Haojie Li, Zhi-Hui Wang:
FastTrack:一种具有并行卡尔曼滤波器的基于 GPU 的高效通用多目标跟踪方法。 1463 年至 1483 年 - 吴荣成, 王明哲, 李志东, 周建龙, 陈芳, 王轩, 孙长明:
基于自适应递归网络的高域适应性少镜头立体匹配. 1484 年至 1501 年 - Dong Zhang, Yi Lin, Jinhui Tang, Kwang-ting Cheng:
CAE-GReaT:用于密集图像预测的卷积辅助高效图推理转换器。 1502 年至 1520 年 - Wei-Hong Li, Xialei Liu, Hakan Bilen:
Universal Representations: A Unified Look at Multiple Task and Domain Learning. 1521-1545 - Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. 1546-1556 - Yang Yang, Chaoyue Wang, Xiaojie Guo, Dacheng Tao:
Robust Unpaired Image Dehazing via Density and Depth Decomposition. 1557-1577 - Jie Ma, Jun Liu, Qi Chai, Pinghui Wang, Jing Tao:
Diagram Perception Networks for Textbook Question Answering via Joint Optimization. 1578-1591 - Yifan Zhang, Junhui Hou, Yixuan Yuan:
A Comprehensive Study of the Robustness for LiDAR-Based 3D Object Detectors Against Adversarial Attacks. 1592-1624 - Huafeng Li, Junyu Liu, Yafei Zhang, Yu Liu:
A Deep Learning Framework for Infrared and Visible Image Fusion Without Strict Registration. 1625-1644 - Shaochuan Zhao, Tianyang Xu, Xiaojun Wu, Josef Kittler:
A Spatio-Temporal Robust Tracker with Spatial-Channel Transformer and Jitter Suppression. 1645-1658 - Xin Zhao, Shiyu Hu, Yipei Wang, Jing Zhang, Yimin Hu, Rongshuai Liu, Haibin Ling, Yin Li, Renshu Li, Kun Liu, Jiadong Li:
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision. 1659-1684 - Xiaofeng Mao, Yufeng Chen, Xiaojun Jia, Rong Zhang, Hui Xue, Zhao Li:
Context-Aware Robust Fine-Tuning. 1685-1700 - Marcella Cornia, Lorenzo Baraldi, Giuseppe Fiameni, Rita Cucchiara:
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets. 1701-1720 - Youfa Liu, Bo Du, Yongyong Chen, Lefei Zhang, Mingming Gong, Dacheng Tao:
Convex-Concave Tensor Robust Principal Component Analysis. 1721-1747 - Jinyuan Liu, Runjia Lin, Guanyao Wu, Risheng Liu, Zhongxuan Luo, Xin Fan:
CoCoNet: Coupled Contrastive Learning Network with Multi-level Feature Ensemble for Multi-modality Image Fusion. 1748-1775 - Editor's Note: Special Issue on BMVC 2021. 1776
- Kongming Liang, Zijin Yin, Min Min, Yan Liu, Zhanyu Ma, Jun Guo:
Learning Dynamic Prototypes for Visual Pattern Debiasing. 1777-1799 - Yifan Wang, Lin Zhang, Ran Song, Hongliang Li, Paul L. Rosin, Wei Zhang:
Exploiting Inter-Sample Affinity for Knowability-Aware Universal Domain Adaptation. 1800-1816 - Wenqi Ren, Senyou Deng, Kaihao Zhang, Fenglong Song, Xiaochun Cao, Ming-Hsuan Yang:
Fast Ultra High-Definition Video Deblurring via Multi-scale Separable Network. 1817-1834 - Azin Jahedi, Maximilian Luz, Marc Rivinius, Lukas Mehl, Andrés Bruhn:
MS-RAFT+: High Resolution Multi-Scale RAFT. 1835-1856 - Jiqing Zhang, Bo Dong, Yingkai Fu, Yuanchen Wang, Xiaopeng Wei, Baocai Yin, Xin Yang:
A Universal Event-Based Plug-In Module for Visual Object Tracking in Degraded Conditions. 1857-1879 - Shiyu Hu, Xin Zhao, Kaiqi Huang:
Correction: SOTVerse: A User-Defined Task Space of Single Object Tracking. 1880
Volume 132, Number 6, June 2024
- Aishan Liu, Shiyu Tang, Xinyun Chen, Lei Huang, Haotong Qin, Xianglong Liu, Dacheng Tao:
Towards Defending Multiple ℓ p-Norm Bounded Adversarial Perturbations via Gated Batch Normalization. 1881-1898 - Xiang Wang, Shiwei Zhang, Jun Cen, Changxin Gao, Yingya Zhang, Deli Zhao, Nong Sang:
CLIP-guided Prototype Modulating for Few-shot Action Recognition. 1899-1912 - Chang Liu, Gaurav Mittal, Nikolaos Karianakis, Victor Fragoso, Ye Yu, Yun Fu, Mei Chen:
HyperSTAR: Task-Aware Hyperparameter Recommendation for Training and Compression. 1913-1927 - Xingxing Wei, Jie Yu, Yao Huang:
Infrared Adversarial Patches with Learnable Shapes and Locations in the Physical World. 1928-1944 - Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao:
Grounded Affordance from Exocentric View. 1945-1969 - Yuki Fujimura, Masaaki Iiyama, Takuya Funatomi, Yasuhiro Mukaigawa:
Deep Depth from Focal Stack with Defocus Model for Camera-Setting Invariance. 1970-1985 - Moira Shooter, Charles Malleson, Adrian Hilton:
SyDog-Video: A Synthetic Dog Video Dataset for Temporal Pose Estimation. 1986-2002 - Minglang Qiao, Yufan Liu, Mai Xu, Xin Deng, Bing Li, Weiming Hu, Ali Borji:
Joint Learning of Audio-Visual Saliency Prediction and Sound Source Localization on Multi-face Videos. 2003-2025 - Sachit Menon, Ishaan Preetam Chandratreya, Carl Vondrick:
Task Bias in Contrastive Vision-Language Models. 2026-2040 - Namhyuk Ahn, Jaejun Yoo, Kyung-Ah Sohn:
Data Augmentation for Low-Level Vision: CutBlur and Mixture-of-Augmentation. 2041-2059 - Yang Guo, Wei Gao, Ge Li:
Interpretable Task-inspired Adaptive Filter Pruning for Neural Networks Under Multiple Constraints. 2060-2076 - Yafei Yang, Bo Yang:
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-World Single Images. 2077-2113 - Liang Zhao, Yao Teng, Limin Wang:
Logit Normalization for Long-Tail Object Detection. 2114-2134 - Agniva Sengupta, Adrien Bartoli:
ToTem NRSfM: Object-Wise Non-rigid Structure-from-Motion with a Topological Template. 2135-2176 - Patrick Ruhkamp, Daoyi Gao, Nassir Navab, Benjamin Busam:
S2P3: Self-Supervised Polarimetric Pose Prediction. 2177-2194 - Chafic Abou Akar, Rachelle Abdel Massih, Anthony Yaghi, Joe Khalil, Marc Kamradt, Abdallah Makhoul:
Generative Adversarial Network Applications in Industry 4.0: A Review. 2195-2254 - Xianqiang Lyu, Junhui Hou:
Probabilistic-Based Feature Embedding of 4-D Light Fields for Compressive Imaging and Denoising. 2255-2275 - Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai:
Reliability-Adaptive Consistency Regularization for Weakly-Supervised Point Cloud Segmentation. 2276-2289 - Kohei Uehara, Tatsuya Harada:
Learning by Asking Questions for Knowledge-Based Novel Object Recognition. 2290-2309 - Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia:
Delving into Identify-Emphasize Paradigm for Combating Unknown Bias. 2310-2330 - Haipeng Li, Kunming Luo, Bing Zeng, Shuaicheng Liu:
GyroFlow+: Gyroscope-Guided Unsupervised Deep Homography and Optical Flow Learning. 2331-2349
Volume 132, Number 7, July 2024
- Di Yang, Yaohui Wang, Antitza Dantcheva, Lorenzo Garattoni, Gianpiero Francesca, François Brémond:
View-Invariant Skeleton Action Representation Learning via Motion Retargeting. 2351-2366 - Weize Quan, Jiaxi Chen, Yanli Liu, Dong-Ming Yan, Peter Wonka:
Deep Learning-Based Image and Video Inpainting: A Survey. 2367-2400 - Ke Xian, Zhiguo Cao, Chunhua Shen, Guosheng Lin:
Towards Robust Monocular Depth Estimation: A New Baseline and Benchmark. 2401-2419 - Xingxing Xie, Gong Cheng, Jiabao Wang, Ke Li, Xiwen Yao, Junwei Han:
Oriented R-CNN and Beyond. 2420-2442 - Bo Ke, Ruizhi Qiao, Xing Sun:
Multi-dataset Detection with Transformers. 2443-2449 - Petra Bevandic, Marin Orsic, Josip Saric, Ivan Grubisic, Sinisa Segvic:
Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation. 2450-2472 - Zhongyun Hu, Jiahao Li, Xue Wang, Qing Wang:
Spatially-Varying Illumination-Aware Indoor Harmonization. 2473-2492 - Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Puhua Chen:
Geometric Prior Guided Feature Representation Learning for Long-Tailed Classification. 2493-2510 - Mouxing Yang, Zhenyu Huang, Xi Peng:
Robust Object Re-identification with Coupled Noisy Labels. 2511-2529 - Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala:
HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer. 2530-2550 - Yinghao Huang, Omid Taheri, Michael J. Black, Dimitrios Tzionas:
InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction from Multi-view RGB-D Images. 2551-2566 - Hanna Ragnarsdóttir, Ece Ozkan, Holger Michel, Kieran Chin-Cheong, Laura Manduchi, Sven Wellmann, Julia E. Vogt:
Deep Learning Based Prediction of Pulmonary Hypertension in Newborns Using Echocardiograms. 2567-2584 - Sha Zhang, Jiajun Deng, Lei Bai, Houqiang Li, Wanli Ouyang, Yanyong Zhang:
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation. 2585-2599 - Zenglin Shi, Pascal Mettes, Cees G. M. Snoek:
Focus for Free in Density-Based Counting. 2600-2617 - Mirco Planamente, Chiara Plizzari, Simone Alberto Peirone, Barbara Caputo, Andrea Bottino:
Relative Norm Alignment for Tackling Domain Shift in Deep Multi-modal Classification. 2618-2638 - Numair Khan, Min H. Kim, James Tompkin:
Are Multi-view Edges Incomplete for Depth Estimation? 2639-2673 - Yan Xu, Chaoda Zheng, Ying Xue, Zhen Li, Shuguang Cui, Dengxin Dai:
Benchmarking the Robustness of LiDAR Semantic Segmentation Models. 2674-2697 - Tianyang Xu, Ze Kang, Xuefeng Zhu, Xiaojun Wu:
Learning Adaptive Spatio-Temporal Inference Transformer for Coarse-to-Fine Animal Visual Tracking: Algorithm and Benchmark. 2698-2712
Volume 132, Number 8, August 2024
- Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, Ziwei Liu:
ReliTalk: Relightable Talking Portrait Generation from a Single Video. 2713-2728 - Alan Lukezic, Ziga Trojer, Jirí Matas, Matej Kristan:
A New Dataset and a Distractor-Aware Architecture for Transparent Object Tracking. 2729-2742 - Lan Yang, Kaiyue Pang, Honggang Zhang, Yi-Zhe Song:
Annotation-Free Human Sketch Quality Assessment. 2743-2764 - Jinpeng Wang, Ziyun Zeng, Bin Chen, Yuting Wang, Dongliang Liao, Gongfu Li, Yiru Wang, Shu-Tao Xia:
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers. 2765-2797 - Yufan Liu, Jiajiong Cao, Bing Li, Weiming Hu, Jingting Ding, Liang Li, Stephen J. Maybank:
Cross-Architecture Knowledge Distillation. 2798-2824 - Gongjie Zhang, Zhipeng Luo, Jiaxing Huang, Shijian Lu, Eric P. Xing:
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion. 2825-2844 - Xuefeng Zhu, Tianyang Xu, Zongtao Liu, Zhangyong Tang, Xiaojun Wu, Josef Kittler:
UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-modal Learning. 2845-2860 - Jianan Fan, Dongnan Liu, Hang Chang, Tom Weidong Cai:
Learning to Generalize over Subpartitions for Heterogeneity-Aware Domain Adaptive Nuclei Segmentation. 2861-2884 - Zhi-Song Liu, Robin Courant, Vicky Kalogeiton:
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild. 2885-2906 - Shuyuan Lin, Feiran Huang, Taotao Lai, Jianhuang Lai, Hanzi Wang, Jian Weng:
Robust Heterogeneous Model Fitting for Multi-source Image Correspondences. 2907-2928 - Weixiang Hong, Wang Ren, Jiangwei Lao, Lele Xie, Liheng Zhong, Jian Wang, Jingdong Chen, Honghai Liu, Wei Chu:
Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer. 2929-2942 - Yuxuan Xue, Haolong Li, Stefan Leutenegger, Jörg Stückler:
Event-Based Non-rigid Reconstruction of Low-Rank Parametrized Deformations from Contours. 2943-2961 - Chenyi Jiang, Yuming Shen, Dubing Chen, Haofeng Zhang, Ling Shao, Philip H. S. Torr:
Estimation of Near-Instance-Level Attribute Bottleneck for Zero-Shot Learning. 2962-2988 - Julien Ducrocq, Guillaume Caron:
A Survey on Adaptive Cameras. 2989-3022 - Bo Wang, Yifan Zhang, Jian Li, Yang Yu, Zhenping Sun, Li Liu, Dewen Hu:
SplatFlow: Learning Multi-frame Optical Flow via Splatting. 3023-3045 - Quan Zhang, Jianhuang Lai, Zhan-Xiang Feng, Xiaohua Xie:
Uncertainty Modeling for Group Re-Identification. 3046-3066 - Xihang Hu, Fuming Sun, Jing Sun, Fasheng Wang, Haojie Li:
Cross-Modal Fusion and Progressive Decoding Network for RGB-D Salient Object Detection. 3067-3085 - Otto Brookes, Majid Mirmehdi, Colleen Stephens, Samuel Angedakin, Katherine Corogenes, Dervla Dowd, Paula Dieguez, Thurston C. Hicks, Sorrel Jones, Kevin Lee, Vera Leinert, Juan Lapuente, Maureen S. McCarthy, Amelia Meier, Mizuki Murai, Emmanuelle Normand, Virginie Vergnes, Erin G. Wessling, Roman M. Wittig, Kevin Langergraber, Nuria Maldonado, Xinyu Yang, Klaus Zuberbühler, Christophe Boesch, Mimi Arandjelovic, Hjalmar S. Kühl, Tilo Burghardt:
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition. 3086-3102 - George Martvel, Ilan Shimshoni, Anna Zamansky:
Automated Detection of Cat Facial Landmarks. 3103-3118 - Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin, Jinghe Hu, Jingping Shao:
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection. 3119-3138 - Huan Yin, Xuecheng Xu, Sha Lu, Xieyuanli Chen, Rong Xiong, Shaojie Shen, Cyrill Stachniss, Yue Wang:
A Survey on Global LiDAR Localization: Challenges, Advances and Open Problems. 3139-3171 - Kecheng Chen, Elena Gal, Hong Yan, Haoliang Li:
Domain Generalization with Small Data. 3172-3190 - Haoang Chi, Wenjing Yang, Feng Liu, Long Lan, Tao Qin, Bo Han:
Does Confusion Really Hurt Novel Class Discovery? 3191-3207 - Zhen Yang, Jun Yue, Pedram Ghamisi, Shiliang Zhang, Jiayi Ma, Leyuan Fang:
Open Set Recognition in Real World. 3208-3231 - Xiaoqi Zhao, Shijie Chang, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu:
Adaptive Multi-Source Predictor for Zero-Shot Video Object Segmentation. 3232-3250 - Guofeng Mei, Cristiano Saltori, Elisa Ricci, Nicu Sebe, Qiang Wu, Jian Zhang, Fabio Poiesi:
Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering. 3251-3269 - Lu Yang, Wenhe Jia, Shan Li, Qing Song:
Deep Learning Technique for Human Parsing: A Survey and Outlook. 3270-3301 - Timothy Duff, Kathlén Kohn, Anton Leykin, Tomás Pajdla:
PL1P: Point-Line Minimal Problems under Partial Visibility in Three Views. 3302-3323 - Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos:
One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space. 3324-3354 - Jiachen Lu, Junge Zhang, Xiatian Zhu, Jianfeng Feng, Tao Xiang, Li Zhang:
Softmax-Free Linear Transformers. 3355-3374
Volume 132, Number 9, September 2024
- Lin Zhu, Weihan Yin, Yiyao Yang, Fan Wu, Zhaoyu Zeng, Qinying Gu, Xinbing Wang, Chenghu Zhou, Nanyang Ye:
Vision-Language Alignment Learning Under Affinity and Divergence Principles for Few-Shot Out-of-Distribution Generalization. 3375-3407 - Yiming Qin, Nanxuan Zhao, Jiale Yang, Siyuan Pan, Bin Sheng, Rynson W. H. Lau:
UrbanEvolver: Function-Aware Urban Layout Regeneration. 3408-3427 - Peleg Harel, Ofir Itzhak Shahar, Ohad Ben-Shahar:
Pictorial and Apictorial Polygonal Jigsaw Puzzles from Arbitrary Number of Crossing Cuts. 3428-3462 - Han Liang, Wenqian Zhang, Wenxuan Li, Jingyi Yu, Lan Xu:
InterGen: Diffusion-Based Multi-human Motion Generation Under Complex Interactions. 3463-3483 - Pascal Mettes, Mina Ghadimi Atigh, Martin Keller-Ressel, Jeffrey Gu, Serena Yeung:
Hyperbolic Deep Learning in Computer Vision: A Survey. 3484-3508 - Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong Liu, Dacheng Tao:
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm. 3509-3536 - Jianbin Zheng, Daqing Liu, Chaoyue Wang, Minghui Hu, Zuopeng Yang, Changxing Ding, Dacheng Tao:
MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis. 3537-3565 - Qi Fan, Wei Zhuo, Chi-Keung Tang, Yu-Wing Tai:
FSODv2: A Deep Calibrated Few-Shot Object Detection Network. 3566-3585 - Yuhang Li, Shikuang Deng, Xin Dong, Shi Gu:
Error-Aware Conversion from ANN to SNN via Post-training Parameter Calibration. 3586-3609 - Han Xu, Hao Zhang, Xunpeng Yi, Jiayi Ma:
CRetinex: A Progressive Color-Shift Aware Retinex Model for Low-Light Image Enhancement. 3610-3632 - Haoru Tan, Chuang Wang, Sitong Wu, Xu-Yao Zhang, Fei Yin, Chenglin Liu:
Ensemble Quadratic Assignment Network for Graph Matching. 3633-3655 - Zhenyu Huang, Peng Hu, Guocheng Niu, Xinyan Xiao, Jiancheng Lv, Xi Peng:
Learning with Noisy Correspondence. 3656-3677 - Yihao Liu, Junyu Chen, Shuwen Wei, Aaron Carass, Jerry L. Prince:
On Finite Difference Jacobian Computation in Deformable Image Registration. 3678-3688 - Benteng Ma, Jing Zhang, Yong Xia, Dacheng Tao:
VNAS: Variational Neural Architecture Search. 3689-3713 - Guoxuan Xia, Christos-Savvas Bouganis:
Augmenting the Softmax with Additional Confidence Scores for Improved Selective Classification with Out-of-Distribution Data. 3714-3752 - Elisa Warner, Joonsang Lee, William Hsu, Tanveer F. Syeda-Mahmood, Charles E. Kahn Jr., Olivier Gevaert, Arvind Rao:
Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects. 3753-3769 - Valentin Gabeff, Marc Rußwurm, Devis Tuia, Alexander Mathis:
WildCLIP: Scene and Animal Attribute Retrieval from Camera Trap Data with Domain-Adapted Vision-Language Models. 3770-3786 - Yuzhen Liu, Qiulei Dong:
Descriptor Distillation: A Teacher-Student-Regularized Framework for Learning Local Descriptors. 3787-3805 - Muhammad Ferjad Naeem, Yongqin Xian, Luc Van Gool, Federico Tombari:
I2DFormer+: Learning Image to Document Summary Attention for Zero-Shot Image Classification. 3806-3822 - Lei Zhang, Xiaowei Fu, Fuxiang Huang, Yi Yang, Xinbo Gao:
An Open-World, Diverse, Cross-Spatial-Temporal Benchmark for Dynamic Wild Person Re-Identification. 3823-3846 - Yu Wang, Xinjie Yao, Pengfei Zhu, Weihao Li, Meng Cao, Qinghua Hu:
Integrated Heterogeneous Graph Attention Network for Incomplete Multi-modal Clustering. 3847-3866 - Xixi Wang, Xiao Wang, Bo Jiang, Jin Tang, Bin Luo:
MutualFormer: Multi-modal Representation Learning via Cross-Diffusion Attention. 3867-3888 - Md. Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce:
Position, Padding and Predictions: A Deeper Look at Position Information in CNNs. 3889-3910 - Dong Liang, Zhengyan Xu, Ling Li, Mingqiang Wei, Songcan Chen:
PIE: Physics-Inspired Low-Light Enhancement. 3911-3932 - Yuchen Hong, Yakun Chang, Jinxiu Liang, Lei Ma, Tiejun Huang, Boxin Shi:
Light Flickering Guided Reflection Removal. 3933-3953 - Xinyue Huo, Lingxi Xie, Hengtong Hu, Wengang Zhou, Houqiang Li, Qi Tian:
Domain-Agnostic Priors for Semantic Segmentation Under Unsupervised Domain Adaptation and Domain Generalization. 3954-3976 - Yifei Huang, Lijin Yang, Guo Chen, Hongjie Zhang, Feng Lu, Yoichi Sato:
Matching Compound Prototypes for Few-Shot Action Recognition. 3977-4002 - Ekaterina A. Nepovinnykh, Ilja Chelak, Tuomas Eerola, Veikka Immonen, Heikki Kälviäinen, Maksim Kholiavchenko, Charles V. Stewart:
Species-Agnostic Patterned Animal Re-identification by Aggregating Deep Local Features. 4003-4018 - Weijia Wu, Yuanqiang Cai, Chunhua Shen, Debing Zhang, Ying Fu, Hong Zhou, Ping Luo:
End-to-End Video Text Spotting with Transformer. 4019-4035 - Wei Yin, Yifan Liu, Chunhua Shen, Baichuan Sun, Anton van den Hengel:
Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings. 4036-4051 - Liang Chen, Yong Zhang, Yibing Song, Zhen Zhang, Lingqiao Liu:
A Causal Inspired Early-Branching Structure for Domain Generalization. 4052-4072 - Wenwei Song, Wenxiong Kang, Adams Wai-Kin Kong, Yufeng Zhang, Yitao Qiao:
L3AM: Linear Adaptive Additive Angular Margin Loss for Video-Based Hand Gesture Authentication. 4073-4090 - Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon, Piotr Koniusz:
Meet JEANIE: A Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment. 4091-4122 - Guang Yang, Angelica Aviles-Rivero, Yingying Fang, Zhenhua Feng, Gianluigi Ciocca, Yulia Hicks, Constantino Carlos Reyes-Aldasoro:
Guest Editorial: Special Issue on the British Machine Vision Conference 2022. 4123-4127 - Matteo Poggi, Federica Arrigoni, Andrea Fusiello, Stefano Mattoccia, Adrien Bartoli, Torsten Sattler, Tomás Pajdla:
Guest Editorial: Special Issue on Traditional Computer Vision in the Age of Deep Learning. 4128-4130
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.