I am a recent graduate in Industrial and Production Engineering from Shahjalal University of Science and Technology, with only my thesis defense remaining.
My core research focuses on exploring AI4Science, Biomedical AI, Generative AI, AI agents, AI Agents and Reasoning, with an emphasis on structural biology, therapeutic, clinical, and molecular ML domains.
I am also deeply interested in interdisciplinary research, connecting these domains with Graph Neural Networks (GNNs), Digital Twins and Human-Centered AI (Multilinguality, Fairness, and Reliability) and AI applications in industrial engineering.
I intend to pursue a PhD in Spring/Fall 2026 to continue my research.
I collaborate with Prof. Alshehri (KSU) on Generative AI, health informatics, GNNs, and AI agents; and
Prof. Chae (HYU) on GenAI, LLM-HCI, Biomolecular ML with GNNs.
I also work with Riashat Islam (Microsoft Research) on molecular ML, GenAI, agents and reasoning; and
with Prof. Min Xu (CMU) on biomolecules.
I actively collaborate with researchers from Cohere Labs (formerly Cohere for AI), Harvard, and more.
Completed HTGAA 2025 (MIT), focusing on protein engineering.
At CIOL, I collaborate with
Prof. AMM Mukaddes (SUST),
Prof. Ahsan (OU) and
Prof. Bappy (LSU) on GNNs, Agents and digital twins for industrial and medical applications.
My research has been published in prestigious venues such as ICLR, WWW, COLING, DASFAA, CSCW, and related workshops.
I regularly review for major AI-ML conferences and am a Kaggle Grandmaster. I also have 3 years of experience in AI-driven product/content automation and product and project management.
Iβm a quick learner and highly adaptable, able to lead teams and projects. I enjoy exploring new topics, sharing knowledge, honing skills, experimenting and pushing my limits.
View my Curriculum Vitae →
Feel free to email me (azminetoushik dot wasi at gmail dot com) if you're interested in collaboration or discussing research. I'm open to new ideas and collaborations.
π Research Interests
𧬠Exploring Computational Biology and Biomedical AI Applications
Working on Computational Molecular Biology, Bioinformatics, and
Computational Drug Discovery (CADGL).
Exploring problems like molecular properties prediction, protein discovery, binder design and affinity, molecular interactions, structural biology,
and healthcare optimization (ICLR'24).
Applying Graph Neural Networks (GNNs) and Geometric Machine Learning in biomedical domains, including protein modeling, molecular graphs, and structure-function alignment problems.
Experienced in Flow Matching, GFlowNets, Diffusion models, and energy-based/agentic modeling pipelines.
Worked with de novo protein generation models and experienced with RL-inspired/energy-guided geometric/sequential structural biology modeling tools.
Currently exploring Digital Twins, clinical reasoning, and Agentic LLMs for Clinical and Biomedical AI Applications.
π§ Understanding and Applying Generative AI Models
Focusing on Large Language Models (LLMs), Large Reasoning Models (LRMs), multi-modal alignment, reinforcement learning (RL), reward modeling, and uncertainty-aware reasoning through self-verification and self-correction.
Developing agentic AI agents via Retrieval-Augmented Generation (RAG), Prompt Reward Models (PRMs), Long Chain-of-Thought (Long CoT), and planning-aware strategies for multilingual, and human-in-the-loop applications (medical, clinical and legal).
Exploring how AI reasoning works with a strong emphasis on AI safety, interpretability, alignment, and governance in real-world deployments.
π§βπ» Interdisciplinary Research on Humans, AI, and Language
My recent work develops trustworthy AI reasoning by integrating RAG, reward-based RL fine-tuning, Digital Twins, and structured reasoning to enhance model reliability.
I'm working on multilinguality and reasoning in LLMs, focusing on AI4Good applications focusing on Computational Social Science, Cultural Analytics and Cognitive Modeling.
Additionally, I've worked on accessibility, evaluation and improvement of
human factors and ergonomics
(ICML'24W and EMNLP'24W),
religious/cultural values
(CSCW'24,
CHI'24W, COLING'25 -
1,
2),
into AI systems, mainly Generative AI and LLMs
(ICLR'25).
Additionally, from technical side, I'm applying AI models to industrial engineering, including supply chain optimization
(GNNs in SCM) and manufacturing.
Interested in extending GNNs through Knowledge Graphs for applications across healthcare, HCI, and NLP
(COLING'24,
ACL'24W,
EMNLP'24W).
π° News and Updates
June 19, 2025: Three papers on LLM Evaluation, Application and AI Agents is accepted to CSCW 2025 Posters!
June 5, 2025: I reached the milestone of 100 citations on Google Scholar (before undergrad thesis defense!)!!
June 1, 2025: My work on GFLowNets for better DDI systems in accepted to ICANN'25!
May 17, 2025: I led three research works and co-authored a D&B paper with Cohere Labs, all submitted to NeurIPS'25!
May 15, 2025: Presented multiple projects at Aya Expedition 2.0, including those I led: MM Clinical Understanding and MM Legal Assistant Agentβthe latter successfully passed two stages of the Bangladesh Bar Council Exam! I was also part of several other projects selected for the final presentation.
May 13, 2025: One solo author position paper advocating specialized LLMs instead of general-purpose LLMs for healthcare is accepted to CVPR 2025 Multimodal Foundation Models for Biomedicine Workshop!
May 3, 2025: Awarded the third place in the IISE QCRE 2025 Data Challenge!
April 15, 2025: Joined AI4CHEMIA, KSU as a Visiting Researcher, under Prof. Alshehri. I will be working on AI4Science, GNNs, AI Agents and GenAI.
April, 2025: Kaleidoscope, my third project with Cohere for AI Community is now live! Along with 40+ researchers world-wide, we built a massively multilingual vision benchmark to evaluate LLMs using in-language exams.
March 31, 2025: As part of Aya Expedition 2.0 with Cohere for AI, I am leading two projects focused on multilingual clinical and legal understanding and reasoning. We were awarded $2,000 in API credits to support the development and experimentation of these initiatives. Along with these, I am also actively involved in several other projects, including MedAya, Multilingual Long CoT, Embodied Reasoning, and more.
March 30, 2025: One of my works on LLM evaluation is accepted to CHI 2025's Human-Centric LLM Evaluation Workshop. Two papers I mentored on AI governance are accepted to CHI 2025's Socio-technical AI Governance Workshop.
March 28, 2025: Two more shared task papers I mentored on clinical reasoning and emotion detection is accepted to NAACL and ACL workshops!
March 31, 2025: Two more shared task papers I mentored on clinical reasoning and emotion detection is accepted to NAACL and ACL workshops!
March 27, 2025: My work on Multi-lingual LLM Evaluation in accepted is accepted to CHI'24 HEAL (Human-centric LLM Evaluation and Auditing) workshop!
March 05, 2025: My work on Interpretable Biomolecular Design using Genomes has been accepted at the ICLR'25 ML for Genomics Explorations workshop!
March 05, 2025: One of my works on Agentic Safety has been accepted at the ICLR'25 Foundational Models in the Wild workshop!
February 27, 2025: One of my works on AI Agents Application has been accepted at the NAACL 2025 Workshop on Language Models for Underserved Communities!
February 27, 2025: Five out of the six submissions I actively advisedβfive from the CIOL Winter ML Bootcamp and one from HerWILLβhave been accepted at the NAACL 2025 DravidianLanTech Workshop!
February 28, 2025: My work on AI Agents for Social Impact has been accepted at the AAAI 2025 Workshop on Social Impact of AI!
CIOL Presents Winter ML Bootcamp
I am the lead organizer and instructor of the ML bootcamp, with more than 50 participants. I prepared materials and took live sessions on EDA, Tabular Data Modeling, Hyperparameter Tuning, ANNs - Deep Learning, NLP, Computer Vision, LLM Agents and mentored 6 research teams in the bootcamp.
[Website] βͺ
[GitHub] (Check Session Notebboks) βͺ
[Youtube] (Check Recordings)
AyaFestPe: A multi-lingual and multi-cultural festival exploration guide
The pipeline starts by collecting festival data from HF and translating it with AyaExpanse for multilingual access. We then gather user queries, use M-RAG to embed data and queries, rerank results, and combine relevant context with the query to generate output.
[GitHub] βͺ
[Colab] (using Cohere API) βͺ
[Kaggle] (Using released model weights, with quantization)
Biomedical AI & Clinical Applications: Molecular Property Prediction, Molecular Interaction Analysis, Binder and De Novo Protein Design, GNNs for Molecules, Energy-Guided Modeling, Flow-based and Diffusion-based Generative Models, Agentic LLMs, Knowledge Graphs, Drug Discovery, Genomics
Data Analysis & Visualization: MS Excel, Power BI, Tableau, SAS
Automation & Productivity: MS Word, PowerPoint, Excel Automation; Google Sheets Scripting; Python-based Automation; Adobe Photoshop and Illustrator