I'm Kei Katsumata.-image

I'm Kei Katsumata.

I am a Master's student advised by Prof. Komei Sugiura at Keio University in Japan.

Click to expand

My research interests lie at the intersection of vision and language, computer graphics, and autonomous driving, with a particular focus on building intelligent systems that understand and act upon multimodal inputs.

I am especially interested in enabling agents that can interpret visual scenes and generate natural language instructions in real-world settings.

As a full-stack engineer, I have extensive experience in building scalable applications using TypeScript, Go, and modern cloud technologies. I combine my research expertise with practical software engineering skills to create impactful solutions.

about-me-image

About me

I am a passionate researcher and full-stack engineer at Keio University, focusing on multimodal AI systems and their real-world applications. My work combines theoretical insights from machine learning research with practical software engineering to create impactful technology solutions. I have experience in both academic research, publishing at top-tier conferences/journals like IEEE RAL, and industry work, building scalable applications using modern technologies.

  • Location:Tokyo, Japan
  • Age:22
  • Nationality:Japanese
  • Interests:Vison and Language, Autonomous Driving, Computer Vision
  • Study:Keio University
  • Employment:Keio University

Publications

Journals

Mobile Manipulation Instruction Generation From Multiple Images With Automatic Metric Enhancement

Kei Katsumata, Motonari Kambara, Daichi Yashima, Ryosuke Korekata, Komei Sugiura

IEEE Robotics and Automation Letters, pp. 3022-3029, DOI: 10.1109/LRA.2025.3539086, 2025

Impact Factor (2023): 4.6, h5-index: 117

International Conferences

Mobile Manipulation Instruction Generation From Multiple Images With Automatic Metric Enhancement

Kei Katsumata, Motonari Kambara, Daichi Yashima, Ryosuke Korekata, Komei Sugiura

IEEE RA-L presented at International Conference on Robotics and Automation (ICRA), 2026

Viena, Austria

Domestic Conferences

NaiLIA: 多層的な依頼文に基づくネイルデザインのマルチモーダル検索

雨宮佳音, 小松拓実, 八島大地, 是方諒介, 勝又圭, 杉浦孔明

第28回画像の認識・理解シンポジウム, 2025

国立京都国際会館, 2025年7月

物体操作指示文生成モデルに基づくモバイルマニピュレーションのためのデータセット拡張

勝又圭, 神原元就, 八島大地, 是方諒介, 杉浦孔明

第39回人工知能学会全国大会, 2025

大阪国際会議場, 2025年5月

NaiLIA: 緩和損失に基づくネイルデザインのマルチモーダル検索

雨宮佳音, 小松拓実, 八島大地, 是方諒介, 勝又圭, 杉浦孔明

第39回人工知能学会全国大会, 2025

大阪国際会議場, 2025年5月

自動評価尺度を用いた強化学習およびマルチモーダル基盤モデルに基づく物体操作指示文生成

勝又圭, 神原元就, 杉浦孔明

第42回日本ロボット学会学術講演会, 2024

大阪工業大学梅田キャンパス, 2024年9月6日

AIC Databricksプロジェクト

西川誠人, 勝又圭, 好田駿成, 石川繁樹, 小林真里

AIC カンファレンス 2023, 2023

慶應義塾大学日吉キャンパス, 2023年3月4日

Education

High School

Seikei High SchoolApril 2018 - March 2021

B.E. in Information and Computer Science

Keio UniversityApril 2021 - March 2025

M.S. in Information and Computer Science

Keio UniversityMarch 2025 - Present

Work

Keio University

Research AssistantJan 2025 - Present

CyberAgent, Inc.

Backend Engineer InternNov 2023 - Dec 2023

DMM.com LLC

Backend Engineer InternAug 2023 - Sep 2023

looking up Inc.

Full Stack Engineer InternJan 2023 - Feb 2024

Keio University AI and Advanced Programming Consortium

Education InternshipJun 2022 - Mar 2024

Standard Inc.

Project Lead (Hait Lab 6th Term)Jul 2021 - Dec 2021

Coconala Inc.

Data Science InternJun 2021 - Apr 2023

Skills

Programming Languages
Python
TypeScript
Go
C# (Unity)
Languages
Japanese (Native)
English (Fluent)
German (Basic)
Development
Frontend: TS (Next.JS)
Backend: TS (Nest.JS / Express), Go (gRPC)
Infrastructure: AWS / GCP
RDBMS: PostgreSQL / MySQL
Data & ML
PyTorch
BigQuery
Databricks

Get in touch.

I am always looking for new opportunities to collaborate and learn. Feel free to reach out to me via email or LinkedIn.

Location
Tokyo, Japan
Github
agate106k
Provided by ReactResume© Copyright 2025 Tim Baker