Multimodal Sensing


Pub Date: 2025-01-21 16:33


Our research group focuses on multimodal perception: acquiring and processing information across multiple modalities, including visual, auditory, tactile, linguistic, and other sensor data. Building on machine learning, especially deep learning, and driven by specific application needs, our research emphasizes applications of multimodal data in medicine and 3D modeling.

In smart healthcare, our group studies high-precision intelligent diagnostic methods based on multimodal medical data, focusing on three aspects: diagnosability, interpretability, and visualizability. In virtual reality, we research 3D scene reconstruction and 3D content generation, concentrating on signals from multiple sensors such as cameras and LiDAR. Key areas of focus include 3D semantic segmentation, voxel and point cloud completion, and 3D generation tasks.

In smart engineering, our group uses artificial intelligence to develop and optimize early warning and recognition systems, aiming to improve the safety and efficiency of engineering projects.