NII Kurita Lab

Coffee machine manipulation demo with the bimanual robot Aloha Stationary

Kurita Lab at the National Institute of Informatics (NII) studies real-world applications of large language models and foundation models. The lab was established in April 2024 and began accepting students in April 2025. Our goal is to connect symbolic information such as text with physical, non-symbolic information obtained from cameras and other sensors, and to apply advanced reasoning technologies, exemplified by large language models, to real-world, physical environments. To this end, we pursue research projects on language models and vision-language models, embodied AI and robotic foundation models, and AI for Science.

News

Jun 17, 2026	Two papers have been accepted to IROS2026!
	Daichi Yashima, Koki Seno, Shuhei Kurita, Yusuke Oda, Komei Sugiura, “HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching”, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2026). [arXiv]
	Hisayuki Yokomizo, Taiki Miyanishi, Gang Yan, Shuhei Kurita, Nakamasa Inoue, Yusuke Iwasawa, “PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models”, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2026). [arXiv]
May 01, 2026	Chi Zhang joined our lab as a Specially Appointed Researcher. Welcome!
Feb 21, 2026	Our paper has been accepted to CVPR2026! Daichi Yashima, Shuhei Kurita, Yusuke Oda, Komei Sugiura, “ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding”, CVPR2026. [arXiv]
Feb 01, 2026	Our paper has been accepted to ICRA2026! Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, Shinsuke Mori, “Developing Vision-Language-Action Model from Egocentric Videos”, ICRA2026. [arXiv]
Sep 27, 2025	Our paper has been accepted to ACMMM2025! Mahiro Ukai, Shuhei Kurita, Nakamasa Inoue, “STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models”, ACMMM2025. [paper]

Latest posts

Mar 29, 2026	2025 Annual Report

Selected publications

IROS

HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching

Daichi Yashima, Koki Seno, Shuhei Kurita, Yusuke Oda, and Komei Sugiura

In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2026

To appear

arXiv
CVPR

ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding

Daichi Yashima, Shuhei Kurita, Yusuke Oda, and Komei Sugiura

In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

To appear

arXiv
ICRA

Developing Vision-Language-Action Model from Egocentric Videos

Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and Shinsuke Mori

In IEEE International Conference on Robotics and Automation (ICRA), 2026

To appear

arXiv
CVPR

Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision

Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and 1 more author

In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

Awarded arXiv HTML

Selected as Highlight.
ICCV

RefEgo: Referring Expression Comprehension Dataset from First-Person Perception of Ego4D

Shuhei Kurita, Naoki Katsura, and Eri Onami

In IEEE/CVF International Conference on Computer Vision (ICCV), 2023

arXiv Code
CVPR

ScanQA: 3D Question Answering for Spatial Scene Understanding

Daichi Azuma*, Taiki Miyanishi*, Shuhei Kurita*, and 1 more author

In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

* Equal contribution.

arXiv HTML Code
ACL

Neural Joint Model for Transition-based Chinese Syntactic Analysis

Shuhei Kurita, Daisuke Kawahara, and Sadao Kurohashi

In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), 2017

Awarded HTML

Selected as Outstanding Paper! (2% of submissions)