Minicursos do SBCAS 2026

Eduardo Luz; Rodrigo da Rosa Righi; Saul E. Delabrida Silva; Maria Tereza Fernandes Abrahão; Pablo Jorge Madril; Mateus de Lima Freitas; Marcos Silva de Mendonça; Letícia Martins Raposo; Vinicius Navega Stelet; Amanda Morais Almeida; Diêgo Farias de Freitas; Paulo Eduardo Ambrósio; Leonardo Augusto Ferreira; Marcelo A. P. Xavier; Andrea G. Campos; Walmir M. Caminhas; Frederico Gadelha Guimarães; James D. Sousa; Frank Cesár Lopes Véras; Karina dos Santos Machado; Adriano Velasque Werhli; Frederico Kremer; Rafael Junqueira Borges

doi:10.5753/sbc.19670.4

Authors

Eduardo Luz (ed)

UFOP

Rodrigo da Rosa Righi (ed)

UNISINOS

Saul E. Delabrida Silva (ed)

UFOP

DOI: https://doi.org/10.5753/sbc.19670.4

Keywords:

OMOP/OHDSI, Machine learning in healthcare, SHAP, Intelligent conversational agents, Cervical cancer, Federated learning, AI agents applied to healthcare, RAG, Drug discovery

Synopsis

The Short Courses of the 26th Brazilian Symposium on Computing Applied to Health (SBCAS 2026) brings together six chapters corresponding to the short courses selected for this edition of the event. The collection faithfully reflects the major themes at the forefront of contemporary technological innovation, with a strong emphasis on Artificial Intelligence and data interoperability. Readers will find in-depth discussions on evidence generation through the OMOP/OHDSI model, the development of conversational agents and intelligent agents for healthcare services, and the use of virtual reality in healthcare training. In addition, the volume addresses critical frontiers related to ethics and algorithmic efficiency, including the interpretation of machine learning models using SHAP, the application of Federated Learning in cancer screening, and the impact of Machine Learning on drug discovery. These topics not only define the current state of the art but also propose concrete solutions to real-world challenges faced by healthcare systems.

Chapters

1. OMOP/OHDSI Made Simple: From Theory to Evidence Generation in Healthcare

Maria Tereza Fernandes Abrahão, Pablo Jorge Madril, Mateus de Lima Freitas, Marcos Silva de Mendonça

DOI: https://doi.org/10.5753/sbc.19670.4.1

Chapter 1
2. Interpretation of Machine Learning Models Applied to Healthcare Using SHAP

Letícia Martins Raposo, Vinicius Navega Stelet

DOI: https://doi.org/10.5753/sbc.19670.4.2

Chapter 2
3. Building a Conversational Agent for Healthcare Applications: From Real-World Problems to Intelligent Solutions

Amanda Morais Almeida, Diêgo Farias de Freitas, Paulo Eduardo Ambrósio

DOI: https://doi.org/10.5753/sbc.19670.4.3

Chapter 3
4. Federated Learning in Practice: From Theory to Implementation in Cervical Cancer Screening

Leonardo Augusto Ferreira, Marcelo A. P. Xavier, Andrea G. Campos, Walmir M. Caminhas, Frederico Gadelha Guimarães

DOI: https://doi.org/10.5753/sbc.19670.4.4

Chapter 4
5. Artificial Intelligence Agents Applied to Healthcare: Fundamentals, Architectures, and Practical Implementation

James D. Sousa, Frank Cesár Lopes Véras

DOI: https://doi.org/10.5753/sbc.19670.4.5

Chapter 5
6. Machine Learning Applied to Drug Discovery

Karina dos Santos Machado, Adriano Velasque Werhli, Frederico Kremer, Rafael Junqueira Borges

DOI: https://doi.org/10.5753/sbc.19670.4.6

Chapter 6

Downloads

Download data is not yet available.

References

Abo El-Enen, M., Saad, S. and Nazmy, T. (2025) “A survey on retrieval-augmentation generation (RAG) models for healthcare applications”, Neural Computing and Applications, v. 37, p. 28191-28267.

Abrahão, M. T., Nobre, M. R., Madril, P. J., O estado da arte em pesquisa observacional de dados de saúde: A iniciativa OHDSI 2019. [link] [link]

Abramson, J., Adler, J., Dunger, J., Evans, R., Green, T., Pritzel, A., Ronneberger, O.,Willmore, L., Ballard, A. J., Bambrick, J., et al. (2024). Accurate structure prediction of biomolecular interactions with alphafold 3. Nature, 630:493–500.

Ackloo, S., Li, F., Szewczyk, M., et al. (2025). A target class ligandability evaluation of wd40 repeat-containing proteins. Journal of Medicinal Chemistry, 68(2).

Afolalu, O. O., Akpor, O. A. and Afolalu, S. A. (2025) “A systematic review of interventions for reducing and reporting adverse events in emergency departments: Multidisciplinary approaches and technological innovations”, Collegian, v. 32, p. 34-45. DOI: 10.1016/j.colegn.2024.12.001.

Ahmad, S., Xu, J., Feng, J., Hutchinson, A., Zeng, H., Ghiabi, P., Dong, A., Centrella, P., Clark, M., Guié, M.-A., et al. (2023). Discovery of a first-inclass small-molecule ligand for wdr91 using dna-encoded chemical library selection followed by machine learning. Journal of Medicinal Chemistry, 66(23):16051–16061.

Akheel, S. A. (2025) “Guardrails for Large Language Models: A Review of Techniques and Challenges”, Journal of Artificial Intelligence, Machine Learning and Data Science, v. 3, n. 1, p. 2504-2512. DOI: 10.51219/JAIMLD/syed-arhamakheel/536.

Alabi, R. O., Elmusrati, M., Leivo, I., Almangush, A., and Mäkitie, A. (2023). Machine learning explainability in nasopharyngeal cancer survival using lime and shap. Scientific Reports, 13.

Alammar, J. (2018) “The illustrated transformer”, The illustrated transformer–Jay Alammar–visualizing machine learning one concept at a time, v. 27, n. 1.

Aldhafeeri, F. (2025). Governing artificial intelligence in radiology: A systematic review of ethical, legal, and regulatory frameworks. Diagnostics, 15.

Allen, W. J., Balius, T. E., Mukherjee, S., Brozell, S. R., Moustakas, D. T., Lang, P. T., Case, D. A., Kuntz, I. D., and Rizzo, R. C. (2015). Dock 6: impact of new features and current docking performance. Journal of Computational Chemistry, 36(15):1132–1156.

ALOMARI, L. M. et al. Safety and accuracy of ai in triaging patients in the emergency department. International Journal of Emergency Medicine, Springer, v. 18, n. 1, p. 243, 2025.

Altschuh, D., Lesk, A. M., Bloomer, A. C., and Klug, A. (1987). Correlation of coordinated amino acid substitutions with function in viruses related to tobacco mosaic virus. Journal of Molecular Biology, 193(4):693–707.

Amann, J., Blasimme, A., Vayena, E., Frey, D., Madai, V. I., and Consortium, P. (2020). Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC medical informatics and decision making, 20(1):310.

Amaro, R. E., Baudry, J., Chodera, J., Demir, O., McCammon, J. A., Miao, Y., and Smith, J. C. (2018). Ensemble docking in drug discovery. Biophysical journal, 114(10):2271–2278.

ANTHROPIC. Model Context Protocol Introduction. 2024. [link]. Acesso em: 10 maio 2026.

Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., García, S., Gil-López, S., Molina, D., Benjamins, R., et al. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58:82–115.

Arrua, O. E., Aderhold, A., Werhli, A. V., and Machado, K. D. S. (2024). Rfl-score: random forest with lasso scoring function for protein-ligand molecular docking. In 2024 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), pages 1–8. IEEE.

Associação Nacional de Hospitais Privados (Anahp). A Inteligência Artificial já está sendo usada na saúde: descubra como isso te beneficia. 2025. Publicado com base nos dados apresentados no Showcase de IA nos Hospitais Brasileiros. Disponível em: [link].

ATLAS Tutorial: Explore Concept Sets Video: [link]

Auzine, M. M., Khan, M. H.-M., Baichoo, S., Sahib, N. G., Bissoonauth-Daiboo, P., Gao, X., and Heetun, Z. (2024). Development of an ensemble cnn model with explainable ai for the classification of gastrointestinal cancer. PLOS ONE, 19.

Bachmann, S. (2025). Efficient xai: A low-cost data reduction approach to shap interpretability. J. Artif. Intell. Res., 83.

Bajwa, A., Nosheen, N., Talpur, K., and Akram, S. (2023). A prospective study on diabetic retinopathy detection based on modify convolutional neural network using fundus images at sindh institute of ophthalmology and visual sciences. Diagnostics, 13.

Ballester, P. J. and Mitchell, J. B. (2010). A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Bioinformatics, 26(9):1169–1175.

BASTIAN, H.; GLASZIOU, P.; CHALMERS, I. Seventy-five trials and eleven systematic reviews a day: how will we ever keep up? PLoS medicine, Public Library of Science San Francisco, USA, v. 7, n. 9, p. e1000326, 2010.

Belle, V. and Papantonis, I. (2020). Principles and practice of explainable machine learning. Frontiers in Big Data, 4.

Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N., and Bourne, P. E. (2000). The protein data bank. Nucleic acids research, 28(1):235–242.

BHAVNANI, S. P.; NARULA, J.; SENGUPTA, P. P. Mobile technology and the digitization of healthcare. European heart journal, v. 37, n. 18, p. 1428, 2016. Citado na página page.2424.

Bifarin, O. O. (2022). Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification. PLOS ONE, 18.

Binzagr, F. (2024). Explainable ai-driven model for gastrointestinal cancer classification. Frontiers in medicine, 11:1349373.

Bishop, C. M. and Bishop, H. (2024). Deep Learning: Foundations and Concepts. Springer, Cham.

Bjerrum, E. J. (2017). Smiles enumeration as data augmentation for neural network modeling of molecules. arXiv preprint arXiv:1703.07076.

Bodria, F., Giannotti, F., Guidotti, R., Naretto, F., Pedreschi, D., and Rinzivillo, S. (2021). Benchmarking and survey of explanation methods for black box models. Data Mining and Knowledge Discovery, 37:1719–1778.

Branden, C. I. and Tooze, J. (2012). Introduction to protein structure. Garland Science.

Bray, F., Laversanne, M., Sung, H., Ferlay, J., Siegel, R. L., Soerjomataram, I., and Jemal, A. (2024). Global cancer statistics 2022: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 74(3):229–263.

Breiman, L. (2001). Random forests. Machine Learning, 45(1):5–32.

Brenning, A. (2021). Interpreting machine-learning models in transformed feature space with an application to remote-sensing classification. Machine Learning, 112:3455–3471.

Brown, T. B. et al. (2020) “Language models are few-shot learners”, Advances in Neural Information Processing Systems (NeurIPS), v. 33, p. 1877–1901.

Burley, S. K., Bhikadiya, C., Bi, C., Bittrich, S., Chen, L., Crichlow, G. V., Christie, C. H., Dalenberg, K., Di Costanzo, L., Duarte, J. M., et al. (2021). Rcsb protein data bank: powerful new tools for exploring 3d structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. Nucleic acids research, 49(D1):D437–D451.

Burrell, J. (2016). How the machine ’thinks’: Understanding opacity in machine learning algorithms. Big Data and Society, 3.

CACHE Initiative (2024). Cache challenge #2 results. Accessed March 2026.

Carhart, R. E., Smith, D. H., and Venkataraghavan, R. (1985). Atom pairs as molecular features in structure-activity studies: definition and applications. Journal of Chemical Information and Computer Sciences, 25(2):64–73.

Cereto-Massagué, A., Ojeda, M. J., Valls, C., Mulero, M., Garcia-Vallvé, S., and Pujadas, G. (2015). Molecular fingerprint similarity search in virtual screening. Methods, 71:58–63.

Cha, Y., Shin, J., Go, B., Lee, D.-S., Kim, Y., Kim, T., and Park, Y.-S. (2021). An interpretable machine learning method for supporting ecosystem management: Application to species distribution models of freshwater macroinvertebrates. Journal of environmental management, 291:112719.

Chai Discovery Team (2024). Chai-1: Decoding the molecular interactions of life. bioRxiv. Preprint.

Chen, C., Isa, N. A. M., and Liu, X. (2024). A review of convolutional neural network based methods for medical image classification. Computers in biology and medicine, 185:109507.

Chen, I. Y., Pierson, E., Rose, S., Joshi, S., Ferryman, K., and Ghassemi, M. (2021). Ethical machine learning in healthcare. Annual review of biomedical data science, 4(1):123–144.

Chen, T. and Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In KDD, pages 785–794.

Chithrananda, S., Grand, G., and Ramsundar, B. (2020). Chemberta: large-scale self-supervised pretraining for molecular property prediction. arXiv preprint arXiv:2010.09885.

COLLINS, F. S.; VARMUS, H. A new initiative on precision medicine. New England journal of medicine, Mass Medical Soc, v. 372, n. 9, p. 793–795, 2015. page.2323.

Columbia DBMI: OHDSI Sets Path for Enhancing Trust in Science, Engaging Global Community at 2025 Symposium. COLUMBIA UNIVERSITY, Department of Biomedical Informatics (DBMI). 2025. Disponível em: [link]

Concept Sets in Capr: [link]

Contreras, J., Winterfeld, A., Popp, J., and Bocklitz, T. (2024). Spectral zones-based shap/lime: Enhancing interpretability in spectral deep learning models through grouped feature analysis. Analytical Chemistry, 96:15588 – 15597.

Copeland, R. A., Pompliano, D. L., and Meek, T. D. (2006). Drug–target residence time and its implications for lead optimization. Nature Reviews Drug Discovery, 5(9):730–739.

Covert, I., Lundberg, S. M., and Lee, S.-I. (2020). Understanding global feature contributions with additive importance measures. Advances in neural information processing systems, 33:17212–17223.

Crampon, K., Giorkallos, A., Deldossi, N., Baud, S., and Steffenel, L. A. (2022). Machine-learning methods for ligand–protein molecular docking. Drug Discovery Today, 27(1):151–164.

Dalby, A., Nourse, J. G., Hounshell, W. D., Gushurst, A. K., Grier, D. L., Leland, B. A., and Laufer, J. (1992). Description of several chemical structure file formats used by computer programs developed at molecular design limited. Journal of chemical information and computer sciences, 32(3):244–255.

Daoud, H. G. and Bayoumi, M. (2019). Efficient epileptic seizure prediction based on deep learning. IEEE Transactions on Biomedical Circuits and Systems, 13:804–813.

Daylight Chemical Information Systems (2024). Daylight theory manual: Fingerprints. Accessed: 2026-05-11.

Dekkers OM, Egger M, Altman DG, Vandenbroucke JP. Distinguishing case series from cohort studies. Ann Intern Med. 2012 Jan 3;156(1 Pt 1):37-40. DOI: 10.7326/0003-4819-156-1-201201030-00006. PMID: 22213493.

Diniz, D. N., Rezende, M. T., Bianchi, A. G. C., Carneiro, C. M., Ushizima, D. M., Medeiros, F. N. S. d., and Souza, M. J. F. (2021). A hierarchical feature-based methodology to perform cervical cancer classification. Applied Sciences, 11(9):4091.

Distrito; Associação Brasileira de Startups de Saúde e HealthTechs (ABSS). HealthTech Recap 2024. [S.l.], 2025. Relatório lançado em fevereiro de 2025 com dados do exercício de 2024. Disponível em: [link].

Dong, Y. et al. (2024) “Building guardrails for large language models”, arXiv preprint arXiv:2402.01822.

Durant, J. L., Leland, B. A., Henry, D. R., and Nourse, J. G. (2002). Reoptimization of mdl keys for use in drug discovery. Journal of Chemical Information and Computer Sciences, 42(6):1273–1280.

Edwards, A. M. et al. (2025). Protein–ligand data at scale to support machine learning. Nature Reviews Chemistry, 9:634–645.

Elshawi, R., Al-Mallah, M., and Sakr, S. (2019). On the interpretability of machine learning-based model for predicting hypertension. BMC Medical Informatics and Decision Making, 19.

ESTEVA, A. et al. A guide to deep learning in healthcare. Nature medicine, Nature Publishing Group US New York, v. 25, n. 1, p. 24–29, 2019.

ESTEVA, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. nature, Nature Publishing Group UK London, v. 542, n. 7639, p. 115–118, 2017.

Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M., and Thrun, S. (2017). Dermatologist-level classification of skin cancer with deep neural networks. Nature, 542(7639):115–118.

FDA – regulatory information: Real-World Data: Assessing Electronic Health Records and Medical Claims Data to Support Regulatory Decision-Making for Drug and Biological Products Guidance for Industry Final version: [link] More info: [link]

FDA: Food and Drug Administration. Real-World Evidence. 2023–2025. [link]

Ferrari, A. M., Wei, B. Q., Costantino, L., and Shoichet, B. K. (2004). Soft docking and multiple receptor conformations in virtual screening. Journal of medicinal chemistry, 47(21):5076–5084.

Ferreira, L. B. et al. (2024) “Chatbots para Pré-triagem Odontológica: Validação Preditiva e Fluxo de Pacientes em Clínicas Universitárias”, Revista Brasileira de Informática em Saúde (RBIS), v. 21, n. 1.

Ferreira, L. G., dos Santos, R. N., Oliva, G., and Andricopulo, A. D. (2015). Molecular docking and structure-based drug design strategies. Molecules, 20(7):13384–13421.

Foley, E., Jacob, A. and Kapoor, R. (2024) “Generating Test Cases Through Large Language Models”, Major Qualifying Project Report, Worcester Polytechnic Institute, Worcester, MA.

Franzini, R. M., Neri, D., and Scheuermann, J. (2014). Dnaencoded chemical libraries: Advancing beyond conventional small-molecule libraries. Accounts of Chemical Research, 47(4):1247–1255.

Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. Annals of statistics, pages 1189–1232.

Ganesan, A., Coote, M. L., and Barakat, K. (2017). Molecular dynamics-driven drug discovery: leaping forward with confidence. Drug discovery today, 22(2):249–269.

Gevaert, A. and Saeys, Y. (2022). Pdd-shap: Fast approximations for shapley values using functional decomposition.

Ghasemi, A., Hashtarkhani, S., Schwartz, D. L., and Shaban-Nejad, A. (2024). Explainable artificial intelligence in breast cancer detection and risk prediction: A systematic scoping review. Cancer Innovation, 3.

Ghosh, S. K. and Khandoker, A. (2024). Investigation on explainable machine learning models to predict chronic kidney diseases. Scientific Reports, 14.

Gilson, M. K., Liu, T., Baitaluk, M., Nicola, G., Hwang, L., and Chong, J. (2015). Bindingdb in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Research, 44(D1):D1045–D1053.

Gironda-Martínez, A., Donckele, E. J., Samain, F., and Neri, D. (2021). Dna-encoded chemical libraries: A comprehensive review with successful stories and future challenges. ACS Pharmacology & Translational Science, 4(4):1265–1279.

Goldstein, B. A., Navar, A. M., Pencina, M. J., and Ioannidis, J. P. (2017). Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review. Journal of the American Medical Informatics Association, 24(1):198–208.

Gomes, K. A. S. (2023) “Gissa Chatbot: uma proposta de Agente Conversacional Inteligente RASA Open-Source para assistência no período gestacional”, Dissertação (Mestrado em Engenharia Elétrica e de Computação), Universidade Federal do Ceará, Sobral.

Gómez-Bombarelli, R., Wei, J. N., Duvenaud, D., Hernández-Lobato, J. M., Sánchez-Lengeling, B., Sheberla, D., Aguilera-Iparraguirre, J., Hirzel, T. D., Adams, R. P., and Aspuru-Guzik, A. (2018). Automatic chemical design using a data-driven continuous representation of molecules. ACS central science, 4(2):268–276.

GOODFELLOW, I. et al. Deep learning. [S.l.]: MIT press Cambridge, 2016. v. 1.

Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning. MIT Press, Cambridge, MA.

Goodman, B. and Flaxman, S. (2017). European union regulations on algorithmic decision-making and a “right to explanation”. AI Magazine, 38(3):50–57.

Goodnow, R. A., Dumelin, C. E., and Keefe, A. D. (2016). Dnaencoded chemistry: enabling the deeper sampling of chemical space. Nature Reviews Drug Discovery, 16(2):131–147.

GRABER, M. L. The incidence of diagnostic error in medicine. BMJ quality & safety, BMJ Publishing Group Ltd, v. 22, n. Suppl 2, p. ii21–ii27, 2013.

Greenwell, B. M. (2017). pdp: An r package for constructing partial dependence plots. R J., 9:421.

Greenwell, B. M. and Boehmke, B. C. (2020). Variable importance plots - an introduction to the vip package. R J., 12:343.

GULSHAN, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. jama, American Medical Association, v. 316, n. 22, p. 2402–2410, 2016.

Guyon, I. and Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of machine learning research, 3(Mar):1157–1182.

HABERLE, T. et al. The impact of nuance dax ambient listening ai documentation: a cohort study. Journal of the American Medical Informatics Association, Oxford University Press, v. 31, n. 4, p. 975–979, 2024.

Hakkoum, H., Idri, A., and Abnane, I. (2024). Global and local interpretability techniques of supervised machine learning black box models for numerical medical data. Eng. Appl. Artif. Intell., 131:107829.

Halperin, I., Ma, B., Wolfson, H., and Nussinov, R. (2002). Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins: Structure, Function, and Bioinformatics, 47(4):409–443.

HAN, J.; KAMBER, M.; PEI, J. Data Mining: Concepts and Techniques. 3. ed. Burlington, MA: Morgan Kaufmann, 2011. ISBN 9780123814791. Disponível em: [link].

Hannun, A. Y., Rajpurkar, P., Haghpanahi, M., Tison, G. H., Bourn, C., Turakhia, M. P., and Ng, A. Y. (2019). Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nature Medicine, 25(1):65–69.

Harrigan, C. F., Morgenshtern, G., Goldenberg, A., and Chevalier, F. (2022). Considerations for visualizing uncertainty in clinical machine learning models.

Hassija, V., Chamola, V., Mahapatra, A., Singal, A., Goel, D., Huang, K., Scardapane, S., Spinelli, I., Mahmud, M., and Hussain, A. (2023). Interpreting blackbox models: A review on explainable artificial intelligence. Cognitive Computation, 16:45–74.

Hastie, T. (2009). The elements of statistical learning: data mining, inference, and prediction.

Herasymenko, O. et al. (2025). Cache challenge #2: Targeting sars-cov-2 nsp13. Journal of Chemical Information and Modeling.

Hernán, M. A., & Robins, J. M. Causal Inference: What If. Chapman & Hall/CRC, 2025. [link] [link]

Hettikankanamage, N. D., Shafiabady, N., Chatteur, F., Wu, R. M. X., Din, F. U., and Zhou, J. (2025). explainable artificial intelligence (xai): A systematic review for unveiling the black box models and their relevance to biomedical imaging and sensing. Sensors (Basel, Switzerland), 25.

Homola, J. (2008). Surface plasmon resonance sensors for detection of chemical and biological species. Chemical Reviews, 108(2):462–493.

Hooker, G., Mentch, L., and Zhou, S. (2019). Unrestricted permutation forces extrapolation: variable importance requires at least one more model, or there is no free variable importance. Statistics and Computing, 31.

Hooshyar, D. and Yang, Y. (2024). Problems with shap and lime in interpretable ai for education: A comparative study of post-hoc explanations and neural-symbolic rule extraction. IEEE Access, 12:137472–137490.

Hripcsak G, Duke JD, Shah NH, Reich CG, Huser V, Schuemie MJ, Suchard MA, Park RW, Wong IC, Rijnbeek PR, van der Lei J, Pratt N, Norén GN, Li YC, Stang PE, Madigan D, Ryan PB. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers. Stud Health Technol Inform. 2015;216:574-8. PMID: 26262116; PMCID: PMC4815923.

Hu, C., Gao, C., Li, T., Liu, C., and Peng, Z. (2024a). Explainable artificial intelligence model for mortality risk prediction in the intensive care unit: a derivation and validation study. Postgraduate medical journal, 100(1182):219–227.

Hu, C., Li, L., ping Huang, W., Wu, T., Xu, Q., Liu, J., and Hu, B. (2022). Interpretable machine learning for early prediction of prognosis in sepsis: A discovery and validation study. Infectious Diseases and Therapy, 11:1117 – 1132.

Hu, X., Zhu, M., Feng, Z., and Stanković, L. (2024b). Manifold-based shapley explanations for high dimensional correlated features. Neural networks : the official journal of the International Neural Network Society, 180:106634.

Huang, X. and ao Marques-Silva, J. (2024). On the failings of shapley values for explainability. Int. J. Approx. Reason., 171:109112.

Huser V, Kahn MG, Brown JS, Gouripeddi R. Methods for examining data quality in healthcare integrated data repositories. Pac Symp Biocomput. 2018;23:628-633. PMID: 29218922.

Hutchison, M. P. C. V. and Oliveira, N. A. (2025) “Integração da Inteligência Artificial na educação médica: desenvolvimento de um modelo baseado em GPT para o ensino de anamnese e documentação de prontuários médicos”, Revista Caderno Pedagógico, v. 22, n. 9, p. 1-15.

INCA (2022). Estimativa 2023: incidência de câncer no brasil. Technical report, Instituto Nacional de Câncer José Alencar Gomes da Silva, Rio de Janeiro.

Jeevan, H. R. (2023) “The Evolution of Natural Language Processing”, Medium, [link], October.

Jiang, H., Wang, J., Cong, W., Huang, Y., Ramezani, M., Sarma, A., Dokholyan, N. V., Mahdavi, M., and Kandemir, M. T. (2022). Predicting protein–ligand docking structure with graph neural network. Journal of chemical information and modeling, 62(12):2923–2932.

Jiménez, J., Škalič, M., Martínez-Rosell, G., and De Fabritiis, G. (2018). k_deep: Protein–ligand absolute binding affinity prediction via 3d-convolutional neural networks. Journal of Chemical Information and Modeling, 58(2):287–296.

Joachim, K., Sparks, O., Perrotta, A., Lin, A., Gettleman, B., Hamad, C., Jeong, S., Dingle, E., Stavrakis, A., and Christ, A. B. (2026). Evaluating the methodological suitability of partial dependence plots and shapley additive explanations for population-level interpretation of machine learning models in total joint arthroplasty. Arthroplasty, 8.

Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., Tunyasuvunakool, K., Bates, R., Žídek, A., Potapenko, A., et al. (2021). Highly accurate protein structure prediction with alphafold. Nature, 596:583–589.

Jurafsky, D. and Martin, J. H. (2026) “Speech and Language Processing”, 3rd ed. draft, Stanford University.

Kadukova, M., Machado, K. d. S., Chacón, P., and Grudinin, S. (2021). Korp-pl: a coarse-grained knowledge-based scoring function for protein–ligand interactions. Bioinformatics, 37(7):943–950.

Kahn MG, Callahan TJ, Barnard J, Bauck AE, Brown J, Davidson BN, Estiri H, Goerg C, Holve E, Johnson SG, Liaw ST, Hamilton-Lopez M, Meeker D, Ong TC, Ryan P, Shang N, Weiskopf NG, Weng C, Zozus MN, Schilling L. A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data. EGEMS (Wash DC). 2016 Sep 11;4(1):1244. DOI: 10.13063/2327-9214.1244. PMID: 27713905; PMCID: PMC5051581.

Kamradt, G. (2024) “5 Levels of Text Splitting”, FullStackRetrieval Tutorials, [link], April.

Karunanayake, N. (2025) “Next-generation agentic AI for transforming healthcare”, Informatics and Health, v. 2, p. 73-83. DOI: 10.1016/j.infoh.2025.03.001.

Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., and Liu, T.-Y. (2017). Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems, 30.

Kelodjou, G., Rozé, L., Masson, V., Galárraga, L., Gaudel, R., Tchuente, M., and Termier, A. (2024). Shaping up shap: enhancing stability through layer-wise neighbor selection. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 38, pages 13094–13103.

Kernohan, K. D. and Boycott, K. M. (2024). The expanding diagnostic toolbox for rare genetic diseases. Nature Reviews Genetics, 25(6):401–415.

Khan, M. M., Shah, N., Shaikh, N., Thabet, A., Alrabayah, T., and belkhair, S. (2024). Towards secure and trusted ai in healthcare: A systematic review of emerging innovations and ethical challenges. International journal of medical informatics, 195:105780.

Kheder, W., Leblouba, M., Rego, R., and Hamdoon, Z. (2025). Multicentre validation and clinical interpretation of an explainable gradient-boosting model for dental-implant survival/failure prediction. Journal of dentistry, page 106166.

Kiseleva, A., Kotzinos, D., and Hert, P. (2022). Transparency of ai in healthcare as a multilayered system of accountabilities: Between legal requirements and technical limitations. Frontiers in Artificial Intelligence, 5.

Krishna, R., Wang, J., Ahern, W., Sturmfels, P., Venkatesh, P., Kalvet, I., Lee, G. R., Morey-Burrows, F. S., Anishchenko, I., Humphreys, I. R., et al. (2024). Generalized biomolecular modeling and design with rosettafold all-atom. Science, 384:eadl2528.

Kryshtafovych, A., Schwede, T., Topf, M., Fidelis, K., and Moult, J. (2021). Critical assessment of methods of protein structure prediction (casp)—round xiv. Proteins: Structure, Function, and Bioinformatics, 89(12):1607–1617.

KRZYSZCZYK, P. et al. The growing role of precision and personalized medicine for cancer treatment. Technology, World Scientific, v. 6, n. 03n04, p. 79–100, 2018.

Kuntz, I. D. (1992). Structure-based strategies for drug design. Science, 257:1078–1082.

Landrum, G. (2006). Rdkit: Open-source cheminformatics. 2006. Google Scholar.

Landrum, G. (2013). Rdkit: Open-source cheminformatics software. Accessed: 2026-05-11.

LangChain. (2026) “Retrieval”, LangChain Documentation, [link], April.

LeCun, Y. (1989). Generalization and network design strategies. Technical Report CRG-TR-89-4, University of Toronto.

LECUN, Y.; BENGIO, Y.; HINTON, G. Deep learning. Nature, Nature Publishing Group, v. 521, n. 7553, p. 436–444, 2015. Disponível em: [link].

Lengauer, T. and Rarey, M. (1996). Computational methods for biomolecular docking. Current Opinion in Structural Biology, 6(3):402–406.

Lewis, P. et al. (2020) “Retrieval-augmented generation for knowledge-intensive NLP tasks”, Advances in Neural Information Processing Systems (NeurIPS), v. 33, p. 9459–9474.

LEWIS, P. et al. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, v. 33, p. 9459–9474, 2020. Disponível em: [link].

Li, F., Ackloo, S., Arrowsmith, C. H., Ban, F., Barden, C. J., Beck, H., Beránek, J., Berenger, F., Bolotokova, A., Bret, G., et al. (2024). Cache challenge# 1: targeting the wdr domain of lrrk2, a parkinson’s disease associated protein. Journal of Chemical Information and Modeling, 64(22):8521–8536.

Li, J. et al. (2025) “Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents”, arXiv preprint arXiv:2405.02957v3.

Li, J., Fu, A., and Zhang, L. (2019). An overview of scoring functions used for protein–ligand interactions in molecular docking. Interdisciplinary Sciences: Computational Life Sciences, 11(2):320–328.

Liang, Y., Feng, S., Liu, Q., Kuang, H., Liu, J., Liao, L., Du, Y., and Wang, J. (2023). Exploring contextual relationships for cervical abnormal cell detection. IEEE Journal of Biomedical and Health Informatics, 27(8):4086–4097.

Liang, Y., Tang, Z., Yan, M., Chen, J., Liu, Q., and Xiang, Y. (2021). Comparison detector for cervical cell/clumps detection in the limited data scenario. Neurocomputing, 437:195–205.

Lima, T. A., Bezerra, I. C., Rocha, B. C. G., Viana, J. R., and Silva, C. R. (2016). Use of docking approaches to predict the affinity and orientation between molecules: a review. Biophysical Reviews, 8(2):157–165.

Lin, Q., Zhao, W., Zhang, H., Chen, W., Lian, S., Ruan, Q., Qu, Z., Lin, Y., Chai, D., and Lin, X. (2025). Predicting the risk of heart failure after acute myocardial infarction using an interpretable machine learning model. Frontiers in Cardiovascular Medicine, 12.

LINZER, M. et al. A cluster randomized trial of interventions to improve work conditions and clinician burnout in primary care: results from the healthy work place (hwp) study. Journal of general internal medicine, Springer, v. 30, n. 8, p. 1105–1111, 2015.

Lip, G. Y., Nieuwlaat, R., Pisters, R., Lane, D. A., and Crijns, H. J. (2010). Refining clinical risk stratification for predicting stroke and thromboembolism in atrial fibrillation using a novel risk factor-based approach: the euro heart survey on atrial fibrillation. Chest, 137(2):263–272.

LIPPMANN, R. P. An introduction to computing with neural nets. ACM SIGARCH Computer Architecture News, ACM New York, NY, USA, v. 16, n. 1, p. 7–25, 1988.

Lipton, Z. C. (2018). The mythos of model interpretability. Queue, 16(3):31–57.

Liu, M., Ning, Y., Yuan, H., Ong, M. E. H., and Liu, N. (2022). Balanced background and explanation data are needed in explaining deep learning models with shap: An empirical study on clinical decision making.

Liu, T., Lin, Y., Wen, X., Jorissen, R. N., and Gilson, M. K. (2007). Bindingdb: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Research, 35(Database):D198–D201.

Liu, Z., Su, M., Han, L., Liu, J., Yang, Q., Li, Y., and Wang, R. (2017). Forging the basis for developing protein–ligand interaction scoring functions. Accounts of chemical research, 50(2):302–309.

Lundberg, S. and Lee, S.-I. (2017). A unified approach to interpreting model predictions.

Lundberg, S. M., Erion, G. G., and Lee, S.-I. (2019). Consistent individualized feature attribution for tree ensembles.

Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1):56–67.

Luo, H., Xiang, C., Zeng, L., Li, S., Mei, X., Xiong, L., Liu, Y., Wen, C., Cui, Y., Du, L., Zhou, Y., Wang, K., Li, L., Liu, Z., Wu, Q., Pu, J., and Yue, R. (2024). Shap based predictive modeling for 1 year all-cause readmission risk in elderly heart failure patients: feature selection and model interpretation. Scientific Reports, 14.

Lybrand, T. P. (1995). Ligand–protein docking. Current Opinion in Structural Biology, 5(2):224–228.

Machado, K. S., Winck, A. T., Ruiz, D. D., and de Souza, O. N. (2010). Mining flexible-receptor docking experiments to select promising protein receptor snapshots. BMC genomics, 11(5):1–13.

Madigan D, Ryan PB, Schuemie M. Does design matter? Systematic evaluation of the impact of analytical choices on effect estimates in observational studies. Ther Adv Drug Saf. 2013 Apr;4(2):53-62. DOI: 10.1177/2042098613477445. PMID: 25083251; PMCID: PMC4110833.

Mahmoudi, E., Kamdar, N., Kim, N., Gonzales, G., Singh, K., and Waljee, A. (2020). Use of electronic medical records in development and validation of risk prediction models of hospital readmission: systematic review. The BMJ, 369.

Markus, A. F., Kors, J. A., and Rijnbeek, P. R. (2021). The role of explainability in creating trustworthy artificial intelligence for health care: a comprehensive survey of the terminology, design choices, and evaluation strategies. Journal of biomedical informatics, 113:103655.

MARSHALL, I. J.; WALLACE, B. C. Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Systematic reviews, Springer, v. 8, n. 1, p. 163, 2019.

Masters, M. R., Mahmoud, A. H., Wei, Y., and Lill, M. A. (2023). Deep learning model for efficient protein–ligand docking with implicit side-chain flexibility. Journal of Chemical Information and Modeling, 63(6):1695–1707.

McCammon, J. A. and Harvey, S. C. (1987). Dynamics of Proteins and Nucleic Acids. Cambridge University Press.

McDonald CJ, Humphreys BL. The U.S. National Library of Medicine and standards for electronic health records: One thing led to another. Inf Serv Use. 2022 May 10;42(1):81-94. DOI: 10.3233/ISU-210142. PMID: 35600128; PMCID: PMC9108563.

McMahan, B., Moore, E., Ramage, D., Hampson, S., and Arcas, B. A. y. (2017). Communication-efficient learning of deep networks from decentralized data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), volume 54 of Proceedings of Machine Learning Research, pages 1273–1282. PMLR.

McNutt, A. T., Li, Y., Meli, R., Aggarwal, R., and Koes, D. R. (2025). Gnina 1.3: the next increment in molecular docking with deep learning. Journal of Cheminformatics, 17(1):28.

Meli, R., Morris, G. M., and Biggin, P. C. (2022). Scoring functions for protein-ligand binding affinity prediction using structure-based deep learning: a review. Frontiers in bioinformatics, 2:885983.

Meng, X. et al. (2011). Molecular docking: a powerful approach. Current Computer-Aided Drug Design, 7(2):146–157.

Mienye, I. D., Swart, T. G., Obaido, G., Jordan, M., and Ilono, P. (2025). Deep convolutional neural networks in medical image analysis: A review. Inf., 16:195.

MILLER, R. A.; JR, H. E. P.; MYERS, J. D. Internist-i, an experimental computer-based diagnostic consultant for general internal medicine. In: Computer-assisted medical decision making. [S.l.]: Springer, 1985. p. 139–158.

Miotto, R., Wang, F., Wang, S., Jiang, X., and Dudley, J. T. (2018). Deep learning for healthcare: review, opportunities and challenges. Briefings in Bioinformatics, 19(6):1236–1246.

Miquido. (2024) “What are guardrails in AI?”, [link], April.

MK, P.-O. The causal pathways linking health literacy to health outcomes. Am J Health Behav., v. 31, n. 1, p. S19–S26, 2007.

Molnar, C. (2020). Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. 3 edition.

Momenzadeh, A., Shamsa, A., and Meyer, J. G. (2022). Bias or biology? importance of model interpretation in machine learning studies from electronic health records. JAMIA Open, 5.

Morris, G. M., Huey, R., Lindstrom,W., Sanner, M. F., Belew, R. K., Goodsell, D. S., and Olson, A. J. (2009). Autodock4 and autodocktools4: Automated docking with selective receptor flexibility. Journal of computational chemistry, 30(16):2785–2791.

Muchiri, R. N. and van Breemen, R. B. (2020). Affinity selection–mass spectrometry for the discovery of pharmacologically active compounds from combinatorial libraries and natural products. Journal of Mass Spectrometry, 56(5).

Müller, H. et al. (2022) “Explainability and causability for artificial intelligence-supported medical image analysis in the context of the European In Vitro Diagnostic Regulation”, New Biotechnology, v. 70, p. 67-72.

MÜLLER, J. P. Architectures and applications of intelligent agents: A survey. The Knowledge Engineering Review, Cambridge University Press, v. 13, n. 4, p. 353–380, 1999.

Murphy, K. P. (2012). Machine learning: a probabilistic perspective. MIT press.

Murray, C. J. L., Ikuta, K. S., Sharara, F., Swetschinski, L., Robles Aguilar, J., Gray, A., Han, C., Bisignano, C., Rao, P., et al. (2022). Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. The Lancet, 399(10325):629–655.

Nascimento, J. R. (2024) “Exploração de técnicas de engenharia de prompt para aprimorar os resultados do uso de LLM no TCMRio”, Trabalho de Conclusão de Curso (Especialização em TI), Instituto Metrópole Digital, UFRN, Natal.

Nilakantan, R., Bauman, N., Dixon, J. S., and Venkataraghavan, R. (1987). Topological torsion: a new molecular descriptor for sar applications. comparison with other descriptors. Journal of Chemical Information and Computer Sciences, 27(2):82–85.

Nohara, Y., Matsumoto, K., Soejima, H., and Nakashima, N. (2021). Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Computer methods and programs in biomedicine, 214:106584.

Núcleo de Informação e Coordenação do Ponto BR (NIC.br). Pesquisa sobre o uso das tecnologias de informação e comunicação nos estabelecimentos de saúde brasileiros: TIC Saúde 2024. São Paulo, 2024. Entrevistas realizadas entre fevereiro e agosto de 2024 com 2.057 gestores e 2.021 profissionais de saúde em todo o território nacional. Disponível em: [link].

NVIDIA. (2024) “Programmable Guardrails: High-level flow”, NeMo Guardrails User Guide, v. 0.19.0, [link], April.

Obermeyer, Z., Powers, B., Vogeli, C., and Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464):447–453.

OBERMEYER, Z.; EMANUEL, E. J. Predicting the future—big data, machine learning, and clinical medicine. The New England journal of medicine, v. 375, n. 13, p. 1216, 2016.

OHDSI (OBSERVATIONAL HEALTH DATA SCIENCES AND INFORMATICS). Who We Are. 2026. Disponível em: [link]

OHDSI 2023 Plenary: Improving the reliability and scale of case validation Anna Ostropolets, Martijn Schuemie, Patrick Ryan. Apresentação: [link] video: [link]

OHDSI Keeper. Repositório Github: [link] Documentação: [link] Manual: [link]

Olsen, L. H. B. and Jullum, M. (2025). Improving the weighting strategy in kernelshap. In World Conference on Explainable Artificial Intelligence, pages 194–218. Springer.

Ostropolets A, Hripcsak G, Husain SA, Richter LR, Spotnitz M, Elhussein A, Ryan PB. Scalable and interpretable alternative to chart review for phenotype evaluation using standardized structured data from electronic health records. J Am Med Inform Assoc. 2023 Dec 22;31(1):119-129. DOI: 10.1093/jamia/ocad202. PMID: 37847668; PMCID: PMC10746303.

Overhage JM, Ryan PB, Reich CG, Hartzema AG, Stang PE. Validation of a common data model for active safety surveillance research. J Am Med Inform Assoc. 2012 Jan-Feb;19(1):54-60. DOI: 10.1136/amiajnl-2011-000376. Epub 2011 Oct 28. PMID: 22037893; PMCID: PMC3240764.

Pagadala, N. S., Syed, K., and Tuszynski, J. (2017). Software for molecular docking: a review. Biophysical Reviews, 9(2):91–102.

Park, H. et al. (2025) “Scoping review of nurse triage in primary care”, BMC Nursing, v. 24, n. 1104. DOI: 10.1186/s12912-025-03740-3.

Passaro, S., Corso, G., Wohlwend, J., Reveiz, M., Thaler, S., Somnath, V. R., Getz, N., Portnoi, T., Roy, J., Stark, H., et al. (2025). Boltz-2: Towards accurate and efficient binding affinity prediction. bioRxiv. Preprint.

Patel, A. M., Baxter, W., and Porat, T. (2024). Toward guidelines for designing holistic integrated information visualizations for time-critical contexts: Systematic review. Journal of Medical Internet Research, 26.

Patharkar, A., Cai, F., Al-Hindawi, F., and Wu, T. (2024). Predictive modeling of biomedical temporal data in healthcare applications: review and future directions. Frontiers in Physiology, 15.

Petsko, G. A. and Ringe, D. (2004). Protein Structure and Function. Primers in Biology. New Science Press; distributed by Oxford University Press.

Pinzi, L. and Rastelli, G. (2019). Molecular docking: shifting paradigms in drug discovery. International Journal of Molecular Sciences, 20(18):4331.

Ponce-Bobadilla, A. V., Schmitt, V., Maier, C., Mensing, S., and Stodtmann, S. (2024). Practical guide to shap analysis: Explaining supervised machine learning model predictions in drug development. Clinical and Translational Science, 17.

Porta, M. (Ed.). A Dictionary of Epidemiology. Oxford University Press, 2014. ISBN: 9780197663639

Prendin, F., Pavan, J., Cappon, G., Favero, S. D., Sparacino, G., and Facchinetti, A. (2023). The importance of interpreting machine learning models for blood glucose prediction in diabetes: an analysis using shap. Scientific Reports, 13.

Prudent, R., Lemoine, H., Walsh, J., and Roche, D. (2023). Affinity selection mass spectrometry speeding drug discovery. Drug Discovery Today, 28(11):103760.

Radford, A. et al. (2018) “Improving language understanding by generative pre-training”, OpenAI, [link].

Rajkomar, A., Oren, E., Chen, K., Dai, A. M., Hajaj, N., Hardt, M., Liu, P. J., Liu, X., Marcus, J., Sun, M., et al. (2018). Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine, 1(1):18.

Rasheed, K., Qayyum, A., Ghaly, M., Al-Fuqaha, A. I., Razi, A., and Qadir, J. (2021). Explainable, trustworthy, and ethical machine learning for healthcare: A survey. Computers in biology and medicine, 149:106043.

RAZAGHI, M. et al. Transforming clinical documentation with ambient artificial intelligence (ai) scribes: a narrative review of technology, impact, and implementation. Cardiovascular Diagnosis and Therapy, LWW, v. 16, n. 1, p. 11, 2026.

Rebedea, T. et al. (2023) “NeMo Guardrails: A toolkit for controllable and safe LLM applications with programmable rails”, arXiv preprint arXiv:2310.10501.

Reddy, S., Allan, S., Coghlan, S., and Cooper, P. (2020). A governance model for the application of ai in health care. Journal of the American Medical Informatics Association, 27(3):491–497.

REZENDE, S. O. Sistemas inteligentes: fundamentos e aplicações. [S.l.]: Editora Manole Ltda, 2003.

Ribeiro, M. T., Singh, S., and Guestrin, C. (2016). “Why should I trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1135–1144.

Rich, R. L. and Myszka, D. G. (2008). Survey of the year 2007 commercial optical biosensor literature. Journal of Molecular Recognition, 21(6):355–400.

Ricotta EE, Bustos Carrillo FA, Angelli-Nichols S, Barugahare J, Benton A, Carlson CJ, et al. Observational research in epidemic settings: a roadmap to reform. BMJ Global Health. 2025;10:e017981. DOI: 10.1136/bmjgh-2024-017981

Rogers, D. and Hahn, M. (2010). Extended-connectivity fingerprints. Journal of Chemical Information and Modeling, 50(5):742–754. PMID: 20426451.

Roshinta, T. A. and Gábor, S. (2024). A comparative study of lime and shap for enhancing trustworthiness and efficiency in explainable ai systems. 2024 IEEE International Conference on Computing (ICOCO), pages 134–139.

Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature machine intelligence, 1(5):206–215.

RUSSELL, S.; NORVIG, P.; INTELLIGENCE, A. A modern approach. Artificial Intelligence. Prentice-Hall, Egnlewood Cliffs, v. 25, n. 27, p. 79–80, 1995.

Salih, A. M. A., Raisi-Estabragh, Z., Galazzo, I., Radeva, P., Petersen, S. E., Lekadir, K., and Menegaz, G. (2023). A perspective on explainable artificial intelligence methods: Shap and lime. Advanced Intelligent Systems, 7.

Scaini, J. L. R., Camargo, A. D., Seus, V. R., von Groll, A., Werhli, A. V., da Silva, P. E. A., and dos Santos Machado, K. (2019). Molecular modelling and competitive inhibition of a mycobacterium tuberculosis multidrug-resistance efflux pump. Journal of Molecular Graphics and Modelling, 87:98–108.

Scantlebury, J. et al. (2020). Data set augmentation allows deep learning-based virtual screening to better generalize to unseen target classes. Journal of Chemical Information and Modeling, 60(8):3722–3730.

Scheffer, M. C. (2025). Demografia médica no brasil 2025. Acesso em: 12 jan. 2026.

Schuemie MJ, Hripcsak G, Ryan PB, Madigan D, Suchard MA. Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data. Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2571-2577. DOI: 10.1073/pnas.1708282114. PMID: 29531023; PMCID: PMC5856503.

Schuemie MJ, Hripcsak G, Ryan PB, Madigan D, Suchard MA. Robust empirical calibration of p-values using observational data. Stat Med. 2016 Sep 30;35(22):3883-8. DOI: 10.1002/sim.6977. PMID: 27592566; PMCID: PMC5108459.

Schuemie MJ, Ryan PB, DuMouchel W, Suchard MA, Madigan D. Interpreting observational studies: why empirical calibration is needed to correct p-values. Stat Med. 2014 Jan 30;33(2):209-18. DOI: 10.1002/sim.5925. Epub 2013 Jul 30. PMID: 23900808; PMCID: PMC4285234.

Schuemie MJ, Ryan PB, Hripcsak G, Madigan D, Suchard MA. Improving reproducibility by using high-throughput observational studies with empirical calibration. Philos Trans A Math Phys Eng Sci. 2018 Sep 13;376(2128):20170356. DOI: 10.1098/rsta.2017.0356. PMID: 30082302; PMCID: PMC6107542.

Schuemie, M.J., Ostropolets, A., Zhuk, A. et al. Standardized patient profile review using large language models for case adjudication in observational research. npj Digit. Med. 8, 18 (2025). DOI: 10.1038/s41746-025-01433-4 [link] Video: [link]

Schwaller, P., Laino, T., Gaudin, T., Bolgar, P., Hunter, C. A., Bekas, C., and Lee, A. A. (2019). Molecular transformer: a model for uncertaintycalibrated chemical reaction prediction. ACS central science, 5(9):1572–1583.

Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2020). Grad-cam: visual explanations from deep networks via gradient-based localization. International journal of computer vision, 128(2):336–359.

Senior, A.W., Evans, R., Jumper, J., Kirkpatrick, J., Sifre, L., Green, T., Qin, C., Žídek, A., Nelson, A.W. R., Bridgland, A., et al. (2020). Improved protein structure prediction using potentials from deep learning. Nature, 577:706–710.

SHAN, R.; SARKAR, S.; MARTIN, S. S. Digital health technology and mobile devices for the management of diabetes mellitus: state of the art. Diabetologia, Springer, v. 62, n. 6, p. 877–887, 2019.

Shapley, L. S. (1953). A value for n-person games. Contributions to the Theory of Games, 2(28):307–317. Reprinted in: Kuhn, H.W. (ed.) Classics in Game Theory, Princeton University Press, 1997.

Shen, C., Ding, J., Wang, Z., Cao, D., Ding, X., and Hou, T. (2020). From machine learning to deep learning: Advances in scoring functions for protein–ligand docking. Wiley Interdisciplinary Reviews: Computational Molecular Science, 10(1):e1429.

SHORTLIFFE, E. Computer-based medical consultations: MYCIN. [S.l.]: Elsevier, 2012. v. 2.

Singh, Y., Hathaway, Q. A., Keishing, V., Salehi, S., Wei, Y., Horvat, N., Vera-Garcia, D. V., Choudhary, A., Kh, A. M., Quaia, E., and Andersen, J. (2025). Beyond post hoc explanations: A comprehensive framework for accountable ai in medical imaging through transparency, interpretability, and explainability. Bioengineering, 12.

SINGHAL, K. et al. Large language models encode clinical knowledge. Nature, Nature Publishing Group UK London, v. 620, n. 7972, p. 172–180, 2023.

SINSKY, C. et al. Allocation of physician time in ambulatory practice: a time and motion study in 4 specialties. Annals of internal medicine, American College of Physicians, v. 165, n. 11, p. 753–760, 2016.

Škrinjar, P., Eberhardt, J., Studer, G., et al. (2026). Evaluating generalization in protein–ligand cofolding methods. Nature Structural & Molecular Biology.

SOMASHEKHAR, S. P. et al. Watson for oncology and breast cancer treatment recommendations: agreement with an expert multidisciplinary tumor board. Annals of Oncology, Elsevier, v. 29, n. 2, p. 418–423, 2018.

Stang PE, Ryan PB, Racoosin JA, et al. Advancing the science for active surveillance: rationale and design for the Observational Medical Outcomes Partnership. Annals of Internal Medicine. 2010 Nov;153(9):600-606. DOI: 10.7326/0003-4819-153-9-201011020-00010. PMID: 21041580.

Stenwig, E., Salvi, G., Rossi, P. S., and Skjærvold, N.-K. (2022). Comparative analysis of explainable machine learning prediction models for hospital mortality. BMC Medical Research Methodology, 22.

Stepniewska-Dziubinska, M. M., Zielenkiewicz, P., and Siedlecki, P. (2018). Development and evaluation of a deep learning model for protein–ligand binding affinity prediction. Bioinformatics, 34(21):3666–3674.

Sterrantino, A. Observational studies: practical tips for avoiding common statistical pitfalls. The Lancet Regional Health - Southeast Asia, 2024; 25 [link]

Stiglic, G., Kocbek, P., Fijacko, N., Zitnik, M., Verbert, K., and Cilar, L. (2020). Interpretability of machine learning-based prediction models in healthcare. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10.

Stogiannos, N., Malik, R., Kumar, A., Barnes, A., Pogose, M., Harvey, H., McEntee, M., and Malamateniou, C. (2023). Black box no more: a scoping review of ai governance frameworks to guide procurement and adoption of ai in medical imaging and radiotherapy in the uk. The British Journal of Radiology, 96.

STROBE Initiative. Strengthening the Reporting of Observational Studies in Epidemiology. [link]

Su, M., Yang, Q., Du, Y., Feng, G., Liu, Z., Li, Y., and Wang, R. (2018). Comparative assessment of scoring functions: the casf-2016 update. Journal of chemical information and modeling, 59(2):895–913.

Suamchaiyaphum, K. et al. (2024) “Triage accuracy of emergency nurses: An evidence-based review”, Journal of Emergency Nursing, v. 50, n. 1, p. 44-54. DOI: 10.1016/j.jen.2023.10.001.

SUTTON, R. T. et al. An overview of clinical decision support systems: benefits, risks, and strategies for success. NPJ digital medicine, Nature Publishing Group UK London, v. 3, n. 1, p. 17, 2020.

Svetnik, V., Liaw, A., Tong, C., Culberson, J. C., Sheridan, R. P., and Feuston, B. P. (2003). Random forest: a classification and regression tool for compound classification and qsar modeling. Journal of chemical information and computer sciences, 43(6):1947–1958.

Tamkin, A. et al. (2021) “Understanding the capabilities, limitations, and societal impact of large language models”, arXiv preprint arXiv:2102.02503.

Tan, Z., Tian, Y., and Li, J. (2023). Glime: General, stable and local lime explanation.

Teixeira, J. B. A., Rezende, M. T., Diniz, D. N., Carneiro, C. M., Luz, E. J. d. S., Souza, M. J. F., Ushizima, D. M., Medeiros, F. N. S. d., and Bianchi, A. G. C. (2023). Segmentation of cervical nuclei using convolutional neural network for conventional cytology. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 11(5):1876–1888.

Terra, D. C., Lisboa, A. C., Rezende, M. T., Carneiro, C. M., and Bianchi, A. G. C. (2023). Shape-based features investigation for preneoplastic lesions on cervical cancer diagnosis. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 4: VISAPP, pages 506–513. SCITEPRESS.

Tonekaboni, S., Joshi, S., McCradden, M. D., and Goldenberg, A. (2019). What clinicians want: contextualizing explainable machine learning for clinical end use. Proceedings of Machine Learning Research, 106:359–380. Machine Learning for Healthcare Conference.

Topol, E. J. (2019). High-performance medicine: the convergence of human and artificial intelligence. Nature Medicine, 25(1):44–56.

TOPOL, E. J. High-performance medicine: the convergence of human and artificial intelligence. Nature medicine, Nature Publishing Group US New York, v. 25, n. 1, p. 44–56, 2019. Citado 2 vezes nas páginas page.22 e page.2020.

Touvron, H. et al. (2023) “LLaMA: Open and Efficient Foundation Language Models”, arXiv preprint arXiv:2302.13971.

Tripos, S. (2005). Tripos mol2 file format.

Trott, O. and Olson, A. J. (2010). Autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry, 31(2):455–461.

Tun, H., Rahman, H., Naing, L., and Malik, O. A. (2024). Trust in artificial intelligence–based clinical decision support systems among health care workers: Systematic review. Journal of Medical Internet Research, 27.

Tung, T., Hasnaeen, S. M. N., and Zhao, X. (2025). Ethical and practical challenges of generative ai in healthcare and proposed solutions: a survey. Frontiers in Digital Health, 7.

U.S. Food and Drug Administration (2021). Artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD) action plan. Accessed: 2026.

Upadhyay, U., Gradisek, A., Iqbal, U., Dhar, E., Li, Y., and Syed-Abdul, S. (2023). Call for the responsible artificial intelligence in the healthcare. BMJ Health & Care Informatics, 30.

van Kolfschooten, H. and van Oirschot, J. (2024) “The EU Artificial Intelligence Act (2024): Implications for healthcare”, Health Policy, v. 149, p. 105152.

Varadi, M., Anyango, S., Deshpande, M., Nair, S., Natassia, C., Yordanova, G., Yuan, D., Stroe, O.,Wood, G., Laydon, A., et al. (2022). Alphafold protein structure database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Research, 50(D1):D439–D444.

Vaswani, A. et al. (2017) “Attention is all you need”, In: Advances in Neural Information Processing Systems, 30., Long Beach: NIPS, p. 5998-6008.

VASWANI, A. et al. Attention is all you need. In: Advances in Neural Information Processing Systems. Curran Associates, Inc., 2017. v. 30, p. 5998–6008. Disponível em: [link].

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., and Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.

Vimbi, V., Shaffi, N., and Mahmud, M. (2024). Interpreting artificial intelligence models: a systematic review on the application of lime and shap in alzheimer’s disease detection. Brain Informatics, 11.

Vincent, J.-L., Moreno, R., Takala, J., Willatts, S., De Mendonça, A., Bruining, H., Reinhart, C. K., Suter, P., and Thijs, L. G. (1996). The sofa (sepsis-related organ failure assessment) score to describe organ dysfunction/failure: On behalf of the working group on sepsis-related problems of the european society of intensive care medicine (see contributors to the project in the appendix). Intensive care medicine, 22(7):707–710.

Viswan, V., Shaffi, N., Mahmud, M., Subramanian, K., and Hajamohideen, F. (2023). Explainable artificial intelligence in alzheimer’s disease classification: A systematic review. Cognitive Computation, 16:1–44.

Wallach, I. and Heifets, A. (2018). Most ligand-based classification benchmarks reward memorization rather than generalization. Journal of chemical information and modeling, 58(5):916–932.

Wang, H., Liang, Q., Hancock, J. T., and Khoshgoftaar, T. (2024). Feature selection strategies: a comparative analysis of shap-value and importance-based methods. Journal of Big Data, 11:1–16.

Wang, R., Fang, X., Lu, Y., and Wang, S. (2004). The pdbbind database: Collection of binding affinities for protein-ligand complexes with known threedimensional structures. Journal of Medicinal Chemistry, 47(12):2977–2980.

Wang, X. et al. (2025). Enantioselective protein affinity selection mass spectrometry (e-asms). Nature Communications, 17(651).

Wei, J. et al. (2022) “Chain-of-thought prompting elicits reasoning in large language models”, Advances in Neural Information Processing Systems, v. 35, p. 24824-24837.

Weidinger, L. et al. (2021) “Ethical and social risks of harm from language models”, DeepMind Research Report, arXiv preprint arXiv:2112.04359.

Weininger, D. (1988). Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules. Journal of chemical information and computer sciences, 28(1):31–36.

Werhli, A. V., Lopes, P. P., Arrua, O. E., Aderhold, A., and Machado, K. d. S. (2025). Crrf-score-cumulative ranking random forest scoring function for free energy of binding prediction in protein-ligand docking. In Brazilian Conference on Intelligent Systems, pages 199–213. Springer.

Westbrook, J. D. and Fitzgerald, P. (2003). The pdb format, mmcif, and other data formats. Methods Biochem Anal, 44:161–179.

Wood, D., Papamarkou, T., Benatan, M., and Allmendinger, R. (2023). Model-agnostic variable importance for predictive uncertainty: an entropy-based approach. Data Mining and Knowledge Discovery, 38:4184 – 4216.

WOOLDRIDGE, M. An introduction to multiagent systems. [S.l.]: John wiley & sons, 2009.

working group, S. and risk collaboration, E. C. (2021). Score2 risk prediction algorithms: new models to estimate 10-year risk of cardiovascular disease in europe. European Heart Journal, 42(25):2439–2454.

Xin, X., Hooker, G., and Huang, F. (2025). Pitfalls in machine learning interpretability: Manipulating partial dependence plots to hide discrimination. Insurance: Mathematics and Economics, page 103135.

Xu, S., Feng, Q., Qiao, L., Wu, H., Shen, T., Cheng, Y., Zheng, S., and Sun, S. (2026). Benchmarking all-atom biomolecular structure prediction with foldbench. Nature Communications, 17:442.

Xu, Z. et al. (2025). A generative ai-discovered tnik inhibitor for idiopathic pulmonary fibrosis. Nature Medicine, 31:2602–2610.

Xu,W. (2024). Current status of computational approaches for small molecule drug discovery. Journal of Medicinal Chemistry, 67(21):18633–18636.

Yang, X., Chen, A., Pournejatian, N. M., Shin, H.-C., Smith, K. E., Parisien, C., Compas, C. B., Martin, C., Costa, A. B., Flores, M. G., Zhang, Y., Magoc, T., Harle, C., Lipori, G. P., Mitchell, D. A., Hogan, W., Shenkman, E., Bian, J., and Wu, Y. (2022). A large language model for electronic health records. NPJ Digital Medicine, 5.

YUAN, K.-C. et al. The development an artificial intelligence algorithm for early sepsis diagnosis in the intensive care unit. International journal of medical informatics, Elsevier, v. 141, p. 104176, 2020.

Yuan, K., Yoon, C. H., Gu, Q., Munby, H., Walker, A., Zhu, T., and Eyre, D. W. (2025). Transformers and large language models are efficient feature extractors for electronic health record studies. Communications Medicine, 5.

Zafar, M. R. and Khan, N. (2021). Deterministic local interpretable model-agnostic explanations for stable explainability. Mach. Learn. Knowl. Extr., 3:525–541.

Zaniboni, J. V. N. (2025) “Integrando Técnicas de Geração Aumentada por Recuperação e Grandes Modelos de Linguagem para Auxílio ao Diagnóstico Médico”, Trabalho de Conclusão de Curso (Sistemas de Informação), UFSC, Florianópolis.

Zeng, X., Chen, J., Zeng, X., Tang, X., and Peng, J. (2025). Integrating multiparametric mri radiomics and clinical models to assess sensitivity to neoadjuvant chemotherapy in breast cancer: A multicenter study. Journal of Applied Clinical Medical Physics, 26.

Zheng, Y., Koh, H. Y., Ju, J., Yang, M., May, L. T., Webb, G. I., Li, L., Pan, S., and Church, G. (2025). Large language models for drug discovery and development. Patterns, 6(10).

Short Courses of the SBCAS 2026

Authors

Keywords:

Synopsis

Chapters

Downloads

References

Downloads

Publication date

Categories

License

Details about the available publication format: Full Volume

ISBN-13 (15)

Language