research that ships and research that scales.

My work spans two threads: building the datasets and benchmarks that make rigorous AI evaluation possible, and developing the core AIML methods that push the frontier. Below is a curated selection across both.

Google Scholar — 3,347+ citations ↗ LinkedIn ↗ ← darvinyi.com

Part I

Datasets, Benchmarks & Evaluations

Before you can evaluate AI in the real world, you need the right data. These papers each introduce a new dataset, challenge, or evaluation framework — most have become standard references in their fields.

Part II

Foundational AIML Contributions

Core methods work: pulse sequence integration for MRI, federated learning without data sharing, GAN-based augmentation, robust training under noisy labels, and distributed deep learning for multi-institutional AI.