Azmine Toushik Wasi | Machine Learning Researcher

🧑 Career Summary

I am a recent graduate in Industrial and Production Engineering from Shahjalal University of Science and Technology, with only my thesis defense remaining.

My core research focuses on exploring AI4Science, Biomedical AI, Generative AI, AI Agents and Reasoning, with an emphasis on structural biology, therapeutic, clinical, and molecular ML domains. I am also deeply interested in interdisciplinary research, connecting these domains with Graph Neural Networks (GNNs), Digital Twins and Human-Centered AI (Multilinguality, Fairness, and Reliability) and AI applications in industrial engineering. I intend to pursue a PhD in Spring/Fall 2026 to continue my research.

I collaborate with Prof. Alshehri (KSU) on Generative AI, health informatics, GNNs, and AI agents; and Prof. Chae (HYU) on GenAI, LLM-HCI, Biomolecular ML with GNNs. I also work with Riashat Islam (Microsoft Research) on molecular ML, GenAI, agents and reasoning; and with Prof. Min Xu (CMU) on biomolecules. I actively collaborate with researchers from Cohere Labs (formerly Cohere for AI), Harvard, and more. Completed HTGAA 2025 (MIT), focusing on protein engineering. At CIOL, I collaborate with Prof. AMM Mukaddes (SUST), Prof. Ahsan (OU) and Prof. Bappy (LSU) on GNNs, Agents and digital twins for industrial and medical applications.

My research has been published in prestigious venues such as ICLR, WWW, COLING, DASFAA, CSCW, and related workshops. I regularly review for major AI-ML conferences and am a Kaggle Grandmaster. I also have 3 years of experience in AI-driven product/content automation and product and project management. I’m a quick learner and highly adaptable, able to lead teams and projects. I enjoy exploring new topics, sharing knowledge, honing skills, experimenting and pushing my limits.

View my Curriculum Vitae →
Feel free to email me (azminetoushik dot wasi at gmail dot com) if you're interested in collaboration or discussing research. I'm open to new ideas and collaborations.

🔍 Research Interests

🧬 Exploring Computational Biology and Biomedical AI Applications
- Working on Computational Molecular Biology, Bioinformatics, and Computational Drug Discovery (CADGL). Exploring problems like molecular properties prediction, protein discovery, binder design and affinity, molecular interactions, structural biology, and healthcare optimization (ICLR'24).
- Applying Graph Neural Networks (GNNs) and Geometric Machine Learning in biomedical domains, including protein modeling, molecular graphs, and structure-function alignment problems. Experienced in Flow Matching, GFlowNets, Diffusion models, and energy-based/agentic modeling pipelines. Worked with de novo protein generation models and experienced with RL-inspired/energy-guided geometric/sequential structural biology modeling tools. Currently exploring Digital Twins, clinical reasoning, and Agentic LLMs for Clinical and Biomedical AI Applications.
🧠 Understanding and Applying Generative AI Models
- Focusing on Large Language Models (LLMs), Large Reasoning Models (LRMs), multi-modal alignment, reinforcement learning (RL), reward modeling, and uncertainty-aware reasoning through self-verification and self-correction.
- Developing agentic AI agents via Retrieval-Augmented Generation (RAG), Prompt Reward Models (PRMs), Long Chain-of-Thought (Long CoT), and planning-aware strategies for multilingual, and human-in-the-loop applications (medical, clinical and legal).
- Exploring how AI reasoning works with a strong emphasis on AI safety, interpretability, alignment, and governance in real-world deployments.
🧑‍💻 Interdisciplinary Research on Humans, AI, and Language
- My recent work develops trustworthy AI reasoning by integrating RAG, reward-based RL fine-tuning, Digital Twins, and structured reasoning to enhance model reliability. I'm working on multilinguality and reasoning in LLMs, focusing on AI4Good applications focusing on Computational Social Science, Cultural Analytics and Cognitive Modeling. Additionally, I've worked on accessibility, evaluation and improvement of human factors and ergonomics (ICML'24W and EMNLP'24W), religious/cultural values (CSCW'24, CHI'24W, COLING'25 - 1, 2), into AI systems, mainly Generative AI and LLMs (ICLR'25).
- Additionally, from technical side, I'm applying AI models to industrial engineering, including supply chain optimization (GNNs in SCM) and manufacturing. Interested in extending GNNs through Knowledge Graphs for applications across healthcare, HCI, and NLP (COLING'24, ACL'24W, EMNLP'24W).

📰 News and Updates

July 5, 2025: Two papers (LLM Evaluation and Agents) are submitted to EMNLP industry Track (with Cohere Labs and CIOL)!
July 3, 2025: Two papers on Generative AI for Health are accepted to IJCAI GENAI4HEALTH workshop!
July 1, 2025: One papers on Generative AI for Health are accepted to IEEE BHI 2025 Abstracts!
June 19, 2025: Three papers on LLM Evaluation, Application and AI Agents is accepted to CSCW 2025 Posters!
June 5, 2025: I reached the milestone of 100 citations on Google Scholar (before undergrad thesis defense!)!!
June 8, 2025: Joined Pi School of AI as a Fellow! Wotking on EVE project by European Space Agency!
June 1, 2025: My work on GFLowNets for better DDI systems in accepted to ICANN'25!
May 17, 2025: I led three research works and co-authored a D&B paper with Cohere Labs, all submitted to NeurIPS'25!
May 15, 2025: Presented multiple projects at Aya Expedition 2.0, including those I led: MM Clinical Understanding and MM Legal Assistant Agent—the latter successfully passed two stages of the Bangladesh Bar Council Exam! I was also part of several other projects selected for the final presentation.
May 14, 2025: Presented my HTGAA Final Individual Project on designing de novo Lignin-Degrading Enzymes for Biomass Valorization to the global community!
May 13, 2025: One solo author position paper advocating specialized LLMs instead of general-purpose LLMs for healthcare is accepted to CVPR 2025 Multimodal Foundation Models for Biomedicine Workshop!
May 3, 2025: Awarded the third place in the IISE QCRE 2025 Data Challenge!
May, 2025: Participated in Reasoning Datasets Competition by HuggingFace with multilevel-legal-reasoning dataset!
April 15, 2025: Joined AI4CHEMIA, KSU as a Visiting Researcher, under Prof. Alshehri. I will be working on AI4Science, GNNs, AI Agents and GenAI.
April, 2025: Kaleidoscope, my third project with Cohere for AI Community is now live! Along with 40+ researchers world-wide, we built a massively multilingual vision benchmark to evaluate LLMs using in-language exams.
March 31, 2025: As part of Aya Expedition 2.0 with Cohere for AI, I am leading two projects focused on multilingual clinical and legal understanding and reasoning. We were awarded $2,000 in API credits to support the development and experimentation of these initiatives. Along with these, I am also actively involved in several other projects, including MedAya, Multilingual Long CoT, Embodied Reasoning, and more.
March 30, 2025: One of my works on LLM evaluation is accepted to CHI 2025's Human-Centric LLM Evaluation Workshop. Two papers I mentored on AI governance are accepted to CHI 2025's Socio-technical AI Governance Workshop.
March 28, 2025: Two more shared task papers I mentored on clinical reasoning and emotion detection is accepted to NAACL and ACL workshops!
March 27, 2025: I participated in the ASAP Discovery x OpenADMET Antiviral Drug Discovery Challenge via Polaris. While my overall ranking in the competition wasn't at the very top, I managed to secure strong rankings in some setups. Read my LinkedIn post and technical report for details!
March 31, 2025: Two more shared task papers I mentored on clinical reasoning and emotion detection is accepted to NAACL and ACL workshops!
March 27, 2025: My work on Multi-lingual LLM Evaluation in accepted is accepted to CHI'24 HEAL (Human-centric LLM Evaluation and Auditing) workshop!
March 05, 2025: My work on Interpretable Biomolecular Design using Genomes has been accepted at the ICLR'25 ML for Genomics Explorations workshop!
March 05, 2025: One of my works on Agentic Safety has been accepted at the ICLR'25 Foundational Models in the Wild workshop!
February 27, 2025: One of my works on AI Agents Application has been accepted at the NAACL 2025 Workshop on Language Models for Underserved Communities!
February 27, 2025: Five out of the six submissions I actively advised—five from the CIOL Winter ML Bootcamp and one from HerWILL—have been accepted at the NAACL 2025 DravidianLanTech Workshop!
February 28, 2025: My work on AI Agents for Social Impact has been accepted at the AAAI 2025 Workshop on Social Impact of AI!
February 12, 2025: Joined How to Grow Almost Anything (HTGAA) 2025 by Community Biotechnology Initiative, MIT Media Lab as a Global Committed Listener!
February 11, 2025: INCLUDE received spotlight in ICLR'25 (Top 5.1%)!
February 8, 2025: My work on AI4DG in in Marginalized Communities has been accepted at the AAAI 2025 Workshop on Social Impact of AI!
January 25, 2025: Two papers (Gaussian Graph Regularization and Bias Handling with MLMs) are accepted to DASFAA'25!
January 23, 2025: One paper (INCLUDE, Cohere for AI Community Controbution) is accepted to ICLR'25 (A*-tier)!
January 20, 2025: One first author paper on LLM Evaluation is accepted to WWW'25 (TheWebConf, A*-tier)!
January 12, 2024: One project I mentored is accepted to AAAI 2025 PDLM Workshop!
December 26, 2024: Organizing CIOL Winter ML Bootcamp! I'm the lead instructor and coordinator in this program!
December, 2024: Received 1000+ Stars on GitHub!
November 04, 2024: One shared task paper accepted to CHiPSAL workshop at COLING'25!
October 15, 2024: One work is accepted for a oral (only 3 are selected!) presentation in Harms & Risks of AI in the Military workshop at Mila!
October 1, 2024: Two works I mentored have been accepted at the DigiTwin 2024 Conference.
September 20, 2024: One paper on Geometry-Aware Facial Expression Learning is accepted to ACCV'24!
September 19, 2024: One paper is accepted to EMNLP'24 NLP for Science workshop!
September 12, 2024: One paper on GNN-based Explainable Hate Speech Detection is accepted to EMNLP'24 NLP for Positive Impact workshop!
August 10, 2024: Three workshop papers from ACL'24 are published in ACL Anthology > [HRGraph], [SMM4H Task 3], [SMM4H Task 5].
JAugust-September, 2024: I'll be reviewing in EMNLP'25, CSCW'25, TEI'25, and more top tir conferences!
August, 2024: Joined Cohere for AI - Aya Expedition Projects as open source contributor!
July 17, 2024: One paper on Human-centric NLP (LLMs) and Human-AI collaborative spaces is accepted to CSCW'24 Posters.
July, 2024: Will be joining Computational Neuroscience Summer School by Neuromatch Academy in July 2024.
June 17, 2024: One paper is accepted to ICML'24 Workshops (Large Language Models and Cognition)!
June, 2024: Will be joining Climate Change AI Summer School by in June-Aug. 2024.
May 1, 2024: Awarded 1 Year ACM Membership for my service in IDC 2024, as a reviewer.
April 1-20, 2024: Joined the program committee of TextGraphs-17 and ClimateNLP workshops co-located with ACL 2024.
April - July, 2024: Will be serving reviewer at several ACL'24 and ICML'24 workshops!
March - April, 2024: Received ICLR Travel Award (DEI) as an early student researcher! But, failed to travel due to visa issues :(
March 25, 2024: One paper on LLM Auditing and Evaluation is accepted to CHI'24 HEAL (Human-centric LLM Evaluation and Auditing) workshop!
March, 2024: Served as a reviewer in ACL ARR (Feb 2024)!
March 17, 2024: Two papers on Dark sides of LLM-assisted Writing are accepted to CHI'24 In2Writing workshop!
February 20, 2024: One paper on Bangla Knowledge Graph is accepted to COLING'24.
February 16, 2024: Two papers are accepted (Invited to Present) to ICLR Tiny Papers Track.
January, 2024: One paper is submitted to IEEE T-CBB (Journal, IF 4.5)!
December, 2023 - January, 2024: Served as a reviewer in ICLR Tiny Papers Track!
December, 2023: Served as a reviewer in Complex & Intelligent Systems Journal (IF 5.8)!
October, 2023: Two papers are accepted in NeurIPS'23 Workshops!
August, 2023: Become a Kaggle Notebook Grandmaster (3rd Kaggle GM in Bangladesh).
May, 2023: Selected as a finalist of IISE QCRE Data Challenge 2023.
October, 2022: Joined DILab, as a research student.
March, 2022: Founded CIOL, bridging IPE and AI through research.

→ View my list of Rejections and Failures!

📄 Selected Publications

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Angelika Romanou, ... , Azmine Toushik Wasi, ... , Sara Hooker, Antoine Bosselut
ICLR'25 (Spotlight, Top 5.1%) ▪ [OpenReview] ▪ ▪ ▪
Gaussian Regularization in Neural Graph Learning
Azmine Toushik Wasi, Taki Hasan Rafi, Dong-Kyu Chae
DASFAA'25 (Full Paper, Short Oral) ▪
GReFEL: Geometry-Aware Reliable Facial Expression Learning under Bias and Imbalanced Data Distribution
Azmine Toushik Wasi*, Taki Hasan Rafi*, Raima Islam, Karlo Šerbetar, Dong-Kyu Chae (*Equal Contribution)
ACCV'24 ▪ [CVF] ▪ [LNCS] ▪ [arXiv] ▪
Neural Control System for Continuous Glucose Monitoring and Maintenance
Azmine Toushik Wasi
ICLR'24 Tiny Papers ▪ [OpenReview] ▪ [arXiv] ▪ [GitHub] ▪ ▪
When SMILES have Language: Drug Classification using Text Classification Methods on Drug SMILES Strings
Azmine Toushik Wasi, Karlo Serbetar, Raima Islam, Taki Hasan Rafi, Dong-Kyu Chae
ICLR'24 Tiny Papers ▪ [OpenReview] ▪ [arXiv] ▪ [GitHub] ▪
CADGL: Context-Aware Deep Graph Learning for Predicting Drug-Drug Interactions
Azmine Toushik Wasi, Taki Hasan Rafi, Raima Islam, Serbetar Karlo, Dong-Kyu Chae
▪ [arXiv] ▪ ▪
Dialectal Bias in Bengali: An Evaluation of Multilingual Large Language Models Across Cultural Variations
Azmine Toushik Wasi, Raima Islam, Mst Rafia Islam, Farig Sadeque, Taki Hasan Rafi, Dong-Kyu Chae
WWW'25 (TheWebConf) (Short Paper) ▪ ▪ ▪
Graph Neural Networks in Supply Chain Analytics and Optimization: Concepts, Perspectives, Dataset and Benchmarks
Azmine Toushik Wasi, MD Shafikul Islam Sohan, Adipto Raihan Akib, Mahathik Muhammad Bappy
▪ [arXiv] ▪ [GitHub] ▪ ▪ ▪

View All Publications and Ongoing Works →

🧑‍🔬 Research Experiences

Visiting Researcher, AI4CHEMIA Research Group, King Saud University | Apr, 2025 - Present
Fellow (School of AI), Pi School | Jun, 2025 - Aug, 2025
Global Committed Listener, How to Grow Almost Anything (HTGAA) 2025, Community Biotechnology Initiative, MIT Media Lab | Feb, 2025 - May, 2025
Participant, Brown University Dept. of Physics presents the AI Winter School 2025 | Jan, 2025
Student Research Assistant, Mila Quebec AI Institute | May, 2024 - March, 2025
Community Researcher, Cohere Labs (formerly, Cohere for AI) | Aug, 2024 - Present
Researcher, Computational Intelligence and Operations Laboratory, Shahjalal University of Science and Technology | March, 2021 - March, 2025
Research Intern, Xu Lab, Carnegie Mellon University | Jul, 2024 - December, 2024
Visiting Researcher, Data Intelligence Lab, Hanyang University | Nov, 2023 - July, 2024
Computational Neuroscience Summer School (Research Student), Neuromatch Academy | Jul, 2024
CCAI Summer School (Research Student), Climate Change AI | Jun - Aug, 2024

📜 Selected Projects

CIOL Presents Winter ML Bootcamp
I am the lead organizer and instructor of the ML bootcamp, with more than 50 participants. I prepared materials and took live sessions on EDA, Tabular Data Modeling, Hyperparameter Tuning, ANNs - Deep Learning, NLP, Computer Vision, LLM Agents and mentored 6 research teams in the bootcamp.
[Website] ▪ [GitHub] (Check Session Notebboks) ▪ [Youtube] (Check Recordings)
AyaFestPe: A multi-lingual and multi-cultural festival exploration guide
The pipeline starts by collecting festival data from HF and translating it with AyaExpanse for multilingual access. We then gather user queries, use M-RAG to embed data and queries, rerank results, and combine relevant context with the query to generate output.
[GitHub] ▪ [Colab] (using Cohere API) ▪ [Kaggle] (Using released model weights, with quantization)
🏫 Online ML University: Free AI resource collection for everyone! Currently ~150⭐ in GitHub!
📑 Paper Organization and Collection
- ICML 2024 Graph Paper Collection : List of all graph and/or GNN papers accepted at ICML'24. Currently ~160⭐ in GitHub!
- ICLR 2024 Graph Paper Collection : List of all graph papers accepted at ICLR'24. Currently ~70⭐ in GitHub!
- ICLR 2024 LLM Paper Collection : List of all LLM papers accepted at ICLR'24. Currently ~35⭐ in GitHub!

👩‍💻 Technical Skills

Programming Languages: Python (Advanced), C, C++, MATLAB, R, SQL
DS & ML Tools (Python): NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, TensorFlow, PyTorch, PyCaret, LangChain, VLLM, Pydantic
Data Science Techniques: EDA, Experimental Design, Hypothesis Testing, Sampling, Statistical Inference, Data-driven Decision Making
Machine Learning Techniques: Classical ML, Deep Learning, NLP, Graph Neural Networks (GNNs), GFlowNets, Flow Matching, Diffusion Models, Reinforcement Learning, Reasoning in LLMs, RAG, Self-Verification, Uncertainty Estimation, Agentic Decision-Making, Reward-Based Fine-Tuning, Computer Vision
Biomedical AI & Clinical Applications: Molecular Property Prediction, Molecular Interaction Analysis, Binder and De Novo Protein Design, GNNs for Molecules, Energy-Guided Modeling, Flow-based and Diffusion-based Generative Models, Agentic LLMs, Knowledge Graphs, Drug Discovery, Genomics
Data Analysis & Visualization: MS Excel, Power BI, Tableau, SAS
Automation & Productivity: MS Word, PowerPoint, Excel Automation; Google Sheets Scripting; Python-based Automation; Adobe Photoshop and Illustrator
Human-Computer Interaction (HCI): LLM Customization, Survey Design, Data Collection & Analysis, UI/UX Frameworks
Other Tools & Skills: GitHub, VS Code, Azure, AnyScale, Replit, Colab, Kaggle, Parallel & Distributed Computing, Product & Project Management, Strategic Planning

💼 Academic Services

Conference or Journal Reviewer
- AI/ML: ICLR, NeurIPS
- NLP: ACL ARR Feb (ACL, EMNLP, NAACL, COLING) [View in official reviewer list]; COLING.
- HCI/HAI: CHI, CSCW, UbiComp/ISW, TEI, ICM, IDC.
- Journals: Complex & Intelligent Systems (Springer Nature).
- Total conference or journal paper reviewed: 80
Workshop Program Committee and Reviewer
- NeurIPS'25: AI4Science, AI4Materials, Scaling Environments for Agents, Structured Probabilistic Inference & Generative Modeling, MusiML
- ICLR'25: AI4MaterialsDiscovery Re-Align, LMRL
- NAACL'25: DravidianLanTech, CLPsych, In2Writing
- AAAI'25: Social Impact of AI,
- CHI'25: STAIG, HEAL
- ICML'25: World Models, MusIML
- NeurIPS'24: Behavioral ML, AI4MaterialsDiscovery, EvalEval, Military AI, MusIML
- EMNLP'24: NLP for Positive Impact, Multulingual Represent Learning
- ICML'24: AI4Science, LLMs & Cognition
- ACL'24: TextGraphs-17, ClimateNLP, SMM4H, Language + Molecules, GenderBiasNLP, WASSA.
- MICCAI'24: Fairness in Medical Imaging, GRaphs in biomedicAl Image anaLysis.
- Total top conference workshop paper reviewed: 84
Organizer
- NeurIPS'24, ICML'24, NeurIPS'25 Muslims in ML Affinity Workshop