Publications

2027

Zero-Shot Functional Affordances and Skill Recall for Robust Robot Manipulation.
Md Selim Sarowar, and Sungho Kim
IROS'27(To be submitted) Flagship in Robotics
arxiv | PDF

AGI: Agentic Intelligence in Latent World Model for Robot Manipulation
Md Selim Sarowar, Md Tanvir Islam, Sungho Kim, and Sangtae Anh
ICRA'27(To be submitted) Flagship in Robotics

arxiv | PDF

2026

GaussVLA: Geometry-Aware Spatial Reasoning for Vision-Language-Action Model
Md Selim Sarowar, Md Tanvir Islam, Sungho Kim and Sangtae Ahn
BMVC'26 Flagship in Vision
OpenReview | PDF

Hands or Not? Building a Robust Dataset for Dynamic Gesture Recognition
Ashikuzzaman, Md Selim Sarowar, Md Tanvir Islam, Sangtae Ahn, and Khan Muhammad
BMVC'26 Flagship in Vision
arxiv | PDF

GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
Md Selim Sarowar, and Sungho Kim
ICMLw'26
Openreview | PDF

C3G-VM6D: Data-Efficient C3G Vision Model Aided 6D Pose Estimation based on RGB-D Data
Md Selim Sarowar, Manar Alnaasan and Sungho Kim
IEEE Access(SCIE-Q1, IF: 3.9)
IEEE Access | PDF

What Matters: Datasets or Robust Frameworks in Modern Robot Learning?
Md Selim Sarowar
Artificial Intelligence Review(Under Review)
Preprint | PDF

Fusion VLM-Gait: RGB-D Fusion in Vision-Language Models for Explainable Multi-Task Parkinsonian Gait Analysis
Manar Alnaasan, Md Selim Sarowar and Sungho Kim
Scientific Reports((SCIE-Q1, IF: 3.9))
arxiv | PDF

Vision-Language-Action and Vision Language Models for Robot Manipulation: A Comprehensive Review Towards Real-World Applications
Md Selim Sarowar and Sungho Kim
PeerJ Computer Science(SCIE-Q1, IF:3)
PeerJ Computer Science | PDF

2025

VFM-VLM: Vision Foundation Model and Vision Language Model based Visual Comparison for 3D Pose Estimation
Md Selim Sarowar and Sungho Kim
IEIE'26 Conference, South Korea
arxiv | PDF

VLM6D: VLM based 6Dof Pose Estimation based on RGB-D Images
Md Selim Sarowar and Sungho Kim
IEIE'25 Conference, South Korea
IEIE | PDF

Hand Gesture Recognition Systems: A Review of Methods, Datasets, and Emerging Trends
*Md Selim Sarowar, and Nur E Jannatul Farjana et. all
International Journal of Computer Applications
IJCA Journal | PDF

2022

Improvement of Denoising in Images Using Generic Image Denoising Network (GID Net)
Md Selim Sarowar, Kaustav Dutta, and *Rasmita Lenka
IEEE 2nd International Conference on Applied Electromagnetics, Signal Processing and Communication (AESPC), Nov. 2021
IEEE Xplore | PDF