Protein Language Model

About Me

Aaryesh Deshpande

M.S. Bioinformatics Graduate β€’ Seeking PhD & Research Roles in BioML

Georgia Institute of Technology

I am a bioinformatics researcher working at the intersection of machine learning and computational biology. My work focuses on geometric, topological, and physics-informed machine learning methods, leveraging protein language models and generative modeling for biomolecular dynamics and drug discovery.

I'm interested in designing practical computational tools and models that bridge machine learning with protein dynamics. Outside research, you'll usually find me experimenting with filmmaking, animation, and motion design, because storytelling, whether in science or cinema, is half the craft.

Based in Atlanta, GA
Actively seeking PhD & Research roles in BioML

Education & Experience

aaryesh@Education-&-Research:~/timeline
aaryesh@research:~/timeline (main) $β–‹
gt24msb HEAD β†’ main edu
M.S. Bioinformatics
Georgia Institute of Technology Β· Atlanta, GA
ML Β· ML for Graphs Β· Deep RL Β· Bioinformatics Algorithms Β· Complex Systems
Aug 2024-May 2026
gra25kbl research
Graduate Research Assistant
Kamerlin Lab (Prof. Shina Kamerlin Lynn) Β· School of Chemistry & Biochemistry, Georgia Tech
Nov 2024-Dec 2025 Β· Atlanta, GA
gta26cse teaching
Graduate Teaching Assistant
CSE 8803: Computational Methods for Complex Systems Β· College of Computing, Georgia Tech
Jan 2026-May 2026 Β· Atlanta, GA
uoh23ar intern
Research Intern
School of Life Sciences Β· University of Hyderabad
May 2023-Jul 2023 Β· Hyderabad, India
git20bt edu
B.Tech Biotechnology
GITAM University Β· Visakhapatnam, India
Bioinformatics Β· Molecular Modeling & Drug Design Β· Proteomics Β· Cell Biology
Dec 2020-Apr 2024
0000000 (root commit) Β· origin/main

Research Interests

RESEARCH PROCESSOR CLICK BOOT TO START
PWR RESEARCH PROCESSOR Β· REV 3.0 Β· OPTIMIZED LAYOUT Β· 2026 U1 ML for Comp Bio U2 Protein LMs U3 Comp Biophysics U4 Graph NNs U5 Physics-Inf ML U6 BioInfo Software
RESEARCH PROCESSOR
SIG 0000 LVL 01 ♥♥♥
Press PLAY, then click chips the moment they flash orange You miss three surges and the board fail!. Good luck!

Projects

Transformer-Based Generation of Novel Drug-like Molecules

July 2023 - Dec 2023

A fine-tuned Transformer model mapping gene expression perturbations to molecular SMILES for targeted cancer drug design; achieved 23% novel scaffold generation with high pathway alignment and QED >0.75.

Transformers GEO Drug Design Generative Model

Allosteric Dynamics in Bacterial Phosphofructokinase

Nov 2024 - May 2025

Built a CNN+Bi-directional GRU classifier on GaMD trajectories to classify protein conformational states. Analyzed inter-subunit pathways using dynamic cross-correlation and estimated transition kinetics with corrected state-transition matrices.

Molecular Dynamics CNN + GRU Allostery

Protein Language Model Typicality Landscapes

June 2025 - Dec 2025

Developed a pseudo-log-likelihood typicality framework for protein language models (ESM2, ProtBERT) combined with TwoNN intrinsic dimensionality to audit evolutionary coverage and detect distributional mismatches in embedding representations.

Protein Language Models ESM2 Intrinsic Dimensionality

State-Conditioned Fusion Hypergraph for PPARΞ³ Ligand Toxicity

Sept 2025 - Feb 2026

A dual-encoder DTI model with state-conditioned hypergraph fusion. The system encodes ligands via pharmacophore-aware hypergraphs and protein pockets via state-specific residue graphs (agonist vs antagonist), fusing representations through cross-attention for ligand toxicity and functional state classification.

Hypergraphs DTI Model Cross-Attention Toxicity Prediction Pharmacophore

Papers & Writing

Learning Biomolecular Motion: The Physics-Informed Machine Learning Paradigm

arXiv:2511.06585 β€’ November 2025

A comprehensive review of physics-informed machine learning approaches for modeling and predicting biomolecular dynamics, bridging traditional molecular simulations with modern deep learning techniques.

Review Paper Physics-Informed ML Biomolecular Dynamics

Notes

Some study & quick-reference notes I made as a Teaching Assistant at GT.

Tools & Software

Active
esm-embed banner

esm-embed

Fast, multi-layer protein language model embedding extractor for ESM-2 and ESM-C. Supports mean-pooled residue embeddings, Flash Attention 2, bfloat16 precision, and SLURM array jobs for large-scale representation learning.

Python ESM-2 ESM-C Flash Attention SLURM
View on GitHub
Experimental
ensemble-pocket-finder architecture diagram

ensemble-pocket-finder

Proof-of-concept pipeline for detecting cryptic drug-binding pockets on proteins using conformational ensembles sampled via BioEMU. Ranks pockets by druggability across transient states using a graph attention network, aiming to capture binding sites invisible to single-structure methods.

Python BioEMU Graph Attention Torch-Geometric Pocket Detection
View on GitHub
Legacy

Swiss Model Batch Processor

Python automation tool for batch submitting protein sequences to SwissModel using a single structural template. Features parallel multithreaded processing, built-in rate-limit handling, and both CLI and GUI modes.

Python PyQt SwissModel Multithreading
View on GitHub
GU Drug Pro Toolkit splash screen

GU Drug Pro Toolkit

Comprehensive drug discovery suite with ADMET prediction, molecular visualization, physicochemical property calculation, druglikeness evaluation, and integration with ChemSpider and PubChem databases.

Python RDKit ADMET Drug Discovery GUI
View on GitHub

Skills & Technologies

Programming

Python Bash SQL C MATLAB JavaScript HTML/CSS

Software & Tools

AMBER AutoDock Vina PyMOL SLURM Git Docker Nextflow SvelteKit PyQt

Libraries & Frameworks

PyTorch Torch-Geometric JAX HuggingFace Transformers ESM & ProtBERT Scikit-learn XGBoost NumPy / SciPy Biopython RDkit DeepChem MDAnalysis

Get in Touch

Let's Connect

I'm open to collaborations, research opportunities, and discussions about bioinformatics and machine learning. Whether you have a project in mind or just want to chat about computational biology, feel free to reach out!