ViT-TB: Ensemble Learning Based ViT Model for Tuberculosis Recognition
Abstract
Dynamic modern healthcare systems rely heavily on the contributions of computer scientists. The diagnosis process is a team effort involving many people: patients, their families, healthcare providers, researchers, and policymakers. Computer technology plays a crucial role in supporting this effort by providing a number of essential services to all of these groups. In the early stages of many diseases, a diagnosis can be made automatically using a computer-aided system, with some degree of certainty. This paper presents a hybrid optimal deep learning-based model for tuberculosis disease recognition using MRI images. Several deep learning models are combined to extract the most relevant features from MRI images. In particular, we establish a combination between vision transformer (ViTs) and Efficient-Net models in order to maximize classification accuracy. We conducted experiments to investigate the accuracy of the proposed model using the Shenzhen and Montgomery data set, and found that it yielded substantially more accurate and better results than the state of-the-art works.