Integrating Hierarchical Structures in Medical Data Classification: A Kernel-Based Method
DOI:
https://doi.org/10.29020/nybg.ejpam.v19i1.5825Keywords:
Kernel , Classification, Tree, Medical diagnosis, similarity, hierarchical kernelAbstract
Medical diagnosis systems frequently rely on structured information collected during physician–patient interviews. These data naturally follow a hierarchical organization, where general questions are followed by more specific sub-questions. Such a structure should be explicitly incorporated into similarity measures used in classification algorithms, as it reflects dependencies
between symptoms and contributes essential diagnostic information. In this work, we introduce a kernel that simultaneously accounts for (i) the hierarchical structure linking main questions to their subordinate items and (ii) interactions among sub-variables. The kernel is integrated into the pgpDA classification framework, allowing the method to embed prior
knowledge on how variables are organized and how symptoms interact. The proposed kernel is designed for binary data arranged in two-level tree structures and supports interaction modeling of any given order. Experiments conducted on simulated data and a real verbal autopsy dataset from Senegal demonstrate consistent improvements over classical kernels, and a deep-learning benchmark confirms that the structured kernel retains strong predictive power even in modern architectures. The methodology may be extended to mixed data types or adapted to graph-structured symptom networks.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Seydou Nourou Sylla

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Upon acceptance of an article by the European Journal of Pure and Applied Mathematics, the author(s) retain the copyright to the article. However, by submitting your work, you agree that the article will be published under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). This license allows others to copy, distribute, and adapt your work, provided proper attribution is given to the original author(s) and source. However, the work cannot be used for commercial purposes.
By agreeing to this statement, you acknowledge that:
- You retain full copyright over your work.
- The European Journal of Pure and Applied Mathematics will publish your work under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
- This license allows others to use and share your work for non-commercial purposes, provided they give appropriate credit to the original author(s) and source.