Lin Song's Homepage

Last update at 02/01/2024.

Hello! My name is Lin Song (宋林). I am a Senior Researcher at Tencent AILab, Shenzhen, China. In July 2022, I received my PhD degree from College of Artificial Intelligence, Xi’an Jiaotong University, advised by Jian Sun and Hongbin Sun. My research interest mainly focus on Computer Vision, Machine Learning and Integrated Circuit.
Google Scholar/GitHub/Email


News

[02/2024] Three papers were accepted by CVPR2024
[02/2024] We proposed YOLO-World, a real-time model for detecting everything
[10/2023] Four papers were accepted by ICCV2023, NeurIPS2023 and ICLR2024
[05/2023] We introduce and release the code of GPT4Tools and BoxSnake.
[01/2023] One paper was accepted by ICLR 2023.
[04/2022] We won the 2rd Place on LVIS Challenges (Workshop at ICCV 2021).
[09/2021] One paper was accepted by NeurIPS 2021.
[06/2021] We won the 1st Place on Streaming Perception Challenges (Workshop on Autonomous Driving at CVPR 2021).

Selected Publications

Conference

* indicates equal contribution
^ indicates corresponding author
YOLO-World: Real-time Open-vocabulary Object Detection
Tianheng Cheng*, Lin Song*^, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[paper][code]
LoRA-Sparse: Low-Rank Approximation for Sparse Large Language Models
Lin Song*^, Yukang Chen*, Shuai Yang, Xiaohan Ding, Yixiao Ge, Ying-Cong Chen, Ying Shan
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model
Cheng Cheng*, Lin Song*, Ruoyi Xue, Hang Wang, Hongbin Sun, Yixiao Ge, Ying Shan
Conference on Neural Information Processing Systems (NeurIPS), 2023
[paper][code]
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Rui Yang*, Lin Song*^, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
Conference on Neural Information Processing Systems (NeurIPS), 2023
[paper][code]
BoxSnake: Polygonal Instance Segmentation with Box Supervision
Rui Yang*, Lin Song*^, Yixiao Ge, Xiu Li^
International Conference on Computer Vision (ICCV), 2023
[paper][code]
DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
Jinrong Yang*, Lin Song*, Songtao Liu, Zeming Li, Xiaoping Li, Hongbin Sun, Jian Sun, Nanning Zheng
International Conference on Learning Representations (ICLR), 2023
[paper][code]
Dynamic Grained Encoder for Vision Transformer
Lin Song*, Songyang Zhang*, Songtao Liu, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2021
[paper] [code]
End-to-End Object Detection with Fully Convolutional Network
Jianfeng Wang*, Lin Song*, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[paper] [code]
Fine-Grained Dynamic Head for Object Detection
Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2020
[paper] [code] [slides]
Rethinking Learnable Tree Filter for Generic Feature Transform
Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Xiangyu Zhang, Hongbin Sun, Jian Sun, Nanning Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2020
[paper] [code] [slides]
Learning Dynamic Routing for Semantic Segmentation
Yanwei Li, Lin Song, Yukang Chen, Zeming Li, Xiangyu Zhang, Xingang Wang, Jian Sun
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 (Oral)
[paper] [code] [slides]
Learnable Tree Filter for Structure-preserving Feature Transform
Lin Song*, Yanwei Li*, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2019
[paper] [code]
TACNet: Transition-aware Context Network for Spatio-temporal Action Detection
Lin Song*, Shiwei Zhang*, Gang Yu, Hongbin Sun
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
[paper]

Journal

GLNet: Global Local Network for Weakly Supervised Action Localization
Shiwei Zhang*, Lin Song*, Changxin Gao, Nong Sang
IEEE Transactions on Multimedia (TMM)
[paper]
NIPM-sWMF: Toward Efficient FPGA Design for High-Definition Large-Disparity Stereo Matching
Xuchong Zhang, Hongbin Sun, Shiqiang Chen, Lin Song, Nanning Zheng
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
[paper]

Experience

Tencent AILab (Vision Computing Center)
Areas: Visual Preception
Research Scientist, 2022.8 - Present

Megvii (Face++)
Areas: Object Detection, Image Segmentation and Action Localization
Full Time Research Intern, 2019.5 - 2022.6
Research Intern, 2017.11 - 2019.5

School of Microelectronics, Xidian University
Areas: Integrated Circuit
Research Intern, 2017.07 - 2017.09


Activity

Reviewer: CVPR 2021-2023, ICCV 2021-2023, NeurIPS 2021-2022, ICLR 2021-2023, AAAI 2022-2023, TPAMI, IJCV, TMM, TCSVT
Seminar talk: "Rethinking Learnable Tree Filter for Generic Feature Transform", 2020 [slides]


Award

LVIS Challenges (Workshop at ICCV 2021), 2022, 2rd
Streaming Perception Challenges (Workshop on Autonomous Driving at CVPR 2021), 2021, 1st
ActivityNet-AVA, 2018, 1st
OpenImage Object Detection, 2018, GOLD (solo)
National Undergraduate Electronics Design Contest, 2015, 1st
National Undergraduate Intelligent Car Competition, 2014, 1st


    ​ ​ ​ ​