Minh-Hao Van
AI/ML Engineer, Google
AI/ML Engineer
Hi, my name is Minh-Hao Van, and I am currently an AI/ML Engineer at Google. Before joining Google, I was a Postdoctoral Fellow in Computer Science at the University of Arkansas. I also earned my Ph.D. in Computer Science from the University of Arkansas, advised by Prof. Xintao Wu. My research interests lie at the intersection of responsible AI and AI for science, with a focus on trustworthy machine learning, large vision-language models, and foundation models for scientific discovery. In particular, I am interested in AI robustness, safety, and privacy-preserving learning, as well as resource-efficient adaptation of large models for scientific applications in areas such as materials science and biomedical engineering. I am also actively exploring autonomous AI frameworks and agentic workflows for materials discovery and property prediction. Please see my CV here.
Prior to the University of Arkansas, I earned my B.Eng degree in Computer Science with Honors from HCMC University of Technology (Bach Khoa University), VNUHCM, in 2019. During my undergraduate study, I worked on machine learning methods for predicting cryptocurrency price movements and studying the impact of social media news on price dynamics with Dr. Khuong Nguyen-An.
Feel free to contact me via email or LinkedIn if you are interested in my work.
news
| Apr 30, 2026 | Happy to share that our paper “Property Prediction of Stacked Bilayer Materials: A Multimodal Learning Approach” was accepted to the IJCAI 2026 main conference! |
|---|---|
| Mar 27, 2026 | Excited to share that our paper “Vision Language Models for Scientific Image Analysis: An Evaluation Highlighting Opportunities and Challenges” was accepted to npj Computational Materials! |
| Mar 1, 2026 | Starting from March 2026, Minh-Hao Van began a new chapter at Google as an AI/ML engineer! |
| Dec 1, 2025 | Cheerful news from IEEE BigData 2025: three of our papers were accepted! |
| Nov 1, 2025 | Glad to share that our paper “Influence-based approaches for tumor classification in noisy brain MRI with deep learning and vision-language models” was accepted to JDSA! |
| Oct 3, 2025 | Three papers were accepted at NeurIPS workshops |
| Jun 25, 2025 | Check out our new survey of AI for materials science, covering foundation models, LLM agents, datasets and tools to develop AI foundation models for materials science tasks A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools |
| Jan 22, 2025 | One paper Soft Prompting for Unlearning in Large Language Models is accepted to main conference at NAACL-2025. Congratulations! |
| Jan 15, 2025 | I will server as reviewers for WWW’25, PAKDD’25 and IJCNN’25. |
| Sep 24, 2024 | Minh-Hao was awarded the Reginald R. ‘Barney’ & Jameson A. Baxter and EECS Graduate Fellowship for the academic year 2024-2025. Congratulations! |
selected publications
- npjCMVision language models for scientific image analysis: an evaluation highlighting opportunities and challengesnpj Computational Materials, 2026
- ICLR’26Property Prediction of Stacked Bilayer Materials: A Multimodal Learning ApproachIn AI for Accelerated Materials Design-ICLR 2026, 2026
- NAACL’25Soft prompting for unlearning in large language models2025
- BigData’25Fair in-context learning via latent concept variablesIn 2025 IEEE International Conference on Big Data (BigData), 2025
- arXivDetecting and mitigating hateful content in multimodal memes with vision-language modelsarXiv preprint arXiv:2505.00150, 2025
- JDSAInfluence-based approaches for tumor classification in noisy brain MRI with deep learning and vision-language modelsInternational Journal of Data Science and Analytics, 2025
- arXivA survey of AI for materials science: foundation models, LLM agents, datasets, and toolsarXiv preprint arXiv:2506.20743, 2025
- arXivCross-Modal Attention Guided Unlearning in Vision-Language ModelsarXiv preprint arXiv:2510.07567, 2025
- BigData’25A Machine Learning Framework for Automated Computational Ethology Using Markerless Pose EstimationIn 2025 IEEE International Conference on Big Data (BigData), 2025
- arXivFine-Tuning Vision-Language Models for Multimodal Polymer Property PredictionarXiv preprint arXiv:2511.05577, 2025
- SSRNSelecting In-Context Learning Demonstrations Via Influence AnalysisAvailable at SSRN 5133763, 2025
- arXivIn-Context Learning Demonstration Selection via Influence AnalysisarXiv e-prints, 2024
- IJCNN’24Evaluating the impact of local differential privacy on utility loss via influence functionsIn 2024 International Joint Conference on Neural Networks (IJCNN), 2024
- PAKDD’24Robust influence-based training methods for noisy brain mriIn Pacific-Asia Conference on Knowledge Discovery and Data Mining, 2024
- BigData’24Beyond human vision: The role of large vision language models in microscope image analysisIn 2024 IEEE International Conference on Big Data (BigData), 2024
- CHASE’24On large visual language models for medical imaging analysis: An empirical studyIn 2024 IEEE/ACM conference on connected health: applications, systems and engineering technologies (CHASE), 2024
- ICDM’23Hint: Healthy influential-noise based training to defend against data poisoning attacksIn 2023 IEEE International Conference on Data Mining (ICDM), 2023
- arXivDetecting and correcting hate speech in multimodal memes with large visual language modelarXiv preprint arXiv:2311.06737, 2023
- BigData’22Defending evasion attacks via adversarially adaptive trainingIn 2022 IEEE International Conference on Big Data (Big Data), 2022
- DASFAA’22Poisoning attacks on fair machine learningIn International Conference on Database Systems for Advanced Applications, 2022
- AIIT’20Deepvix: Explaining long short-term memory network with high dimensional time series dataIn Proceedings of the 11th international conference on advances in information technology, 2020
- ACOMP’19Predicting Cryptocurrency Price Movements Based on Social MediaIn 2019 International Conference on Advanced Computing and Applications (ACOMP), 2019
- BigData’19Hackernets: Visualizing media conversations on internet of things, big data, and cybersecurityIn 2019 IEEE International Conference on Big Data (Big Data), 2019