Heterogeneous and multimodal data are acquired from diverse sensors and stored in different formats such as text, au-dio, images, LiDAR, or biomedical signals, offering complementary perspectives that can significantly enrich analysis and model accuracy. Their integration enables a wide range of applications, from healthcare, where imaging, genomic, and clinical data contribute to diagnostics and personalised medicine, to traffic monitoring and simulation with LiDAR, cameras, radar, and GPS, as well as computer graphics scenarios involving 3D motion and scene understanding. How-ever, such data present several challenges, including interoperability across acquisition systems, missing or incomplete measurements, high computational demands for real-time processing, and the need for domain specialists to interpret the results. We address these issues by developing a unified and real-time framework for noise and artefact reduction, enhancement of low-resolution data, multimodal data fusion, and predictive analysis based on robust feature extraction. Our multimodal data processing pipeline deep learning and high-performance computing modules. In particular, we in-troduce a learning-based approach for tumour classification that integrates ultrasound segmentation and feature extrac-tion, techniques for action recognition in 3D depth-sensor videos using kinematic descriptors, and a novel method for graph recovery and prediction via graph convolutional networks. Our results are validated through quantitative metrics on international benchmarks, such as the TUS-REC dataset. As future work, we plan to validate the results with indus-trial and clinical experts to support the technological transfer of our framework in hospitals and clinical centres.
| Published in | Abstract Book of the CNR IMATI Workshop |
| Page(s) | 1-1 |
| Creative Commons |
This is an Open Access abstract, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
| Copyright |
Copyright © The Author(s), 2026. Published by Science Publishing Group |
Biomedical Data, Artificial Intelligence, Multimodal Data, Real-time Processing