Flag Counter

Yiwei Ma   马祎炜

First-year Ph. D Student at Xiamen University


Email: yiweima@stu.xmu.edu.cn
             [Github]   [Google Scholar]

[Biography] [Latest News] [Publications] [Projects] [Major Awards] [Patent] [Professional Activities]

Biography   [back top]

I am currently a first-year Ph.D student in Department of Artificial Intelligence, School of Informatics, Xiamen University, advised by Prof. Rongrong Ji and Prof. Xiaoshuai Sun .

My recent research interests are in (2D/3D) vision-and-language learning and AIGC.

Latest News   [back top]

Publications   [back top]

Journal

Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji
Towards Local Visual Modeling for Image Captioning
Pattern Recognition (PR), 2023
[PDF] [ArXiv] [Code]
Jiayi Ji, Yiwei Ma (co-frist author), Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Rongrong Ji
Knowing What to Learn: A Metric-oriented Focal Mechanism for Image Captioning
IEEE Transactions on Image Processing (TIP), 2022
[PDF] [Code]
Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji
Knowing what it is: Semantic-enhanced Dual Attention Transformer
IEEE Transactions on Multimedia (TMM), 2022
[PDF] [Code]

Conference

Sihan Liu, Yiwei Ma (co-frist author), Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Computer Vision and Pattern Recognition Conference (CVPR), 2024
[arXiv] [Code]
Changli Wu, Yiwei Ma (co-frist author), Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2024
[arXiv] [Code] [PDF comming]
Zhipeng Qian, Yiwei Ma (co-frist author), Jiayi Ji, Xiaoshuai Sun
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2024
[Code] [PDF comming]
Tianyu Guo, Haowei Wang, Yiwei Ma, Jiayi Ji , Xiaoshuai Sun
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
AAAI Conference on Artificial Intelligence (AAAI), 2024
[Code] [PDF comming]
Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Guannan Jiang, Weilin Zhuang, Rongrong Ji
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval
ACM International Conference on Multimedia (ACM MM), 2023
[PDF] [Code] [Project Page]
Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
ACM International Conference on Multimedia (ACM MM), 2023
[PDF] [arXiv] [Code]
Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji
Semi-Supervised Panoptic Narrative Grounding
ACM International Conference on Multimedia (ACM MM), 2023
[PDF] [arXiv] [Code]
Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, Rongrong Ji
X-Mesh:Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
IEEE International Conference on Computer Vision (ICCV), 2023
[PDF] [arXiv] [Code] [Project Page]
Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval
ACM International Conference on Multimedia (ACM MM), 2022
[PDF] [arXiv] [Code] [Project Page]

Preprint

Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji
X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
arXiv preprint arXiv:2312.00085 , 2023
[arXiv] [Code] [Project Page]
Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji
JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues
arXiv preprint arXiv:2310.09503 , 2023
[arXiv] [Code]

Projects   [back top]


External-Attention-pytorch
Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.
[github ] (10000+ stars)

Patent   [back top]

  • 基于动态文本引导的文本驱动3D风格化方法 - 公开号: CN116704090A - [详情]
  • 面向视频文本检索的端到端多粒度对比学习方法 - 公开号: CN115757713A - [详情]
  • 面向局部视觉建模的图像描述生成方法 - 公开号: CN115964530A - [详情]
  • 基于文本的人物检索的双向一对多嵌入对齐方法 - 公开号: CN116304145A - [详情]
  • 一种3D内容创建方法 - 公开号: CN117593469A - [详情]
  • 一种基于链式感知的指向性3D实例分割方法 - 公开号: CN117593527A - [详情]
  • 基于文本和视觉上下文关系时间融合的视频文本检索方法 - 公开号: CN117407561A - [详情]
  • Major Awards   [back top]

    Professional Activities   [back top]