I’m an Assistant Professor in the Manning College of Information and Computer Sciences at the University of Massachusetts, Amherst, and a visiting researcher at the Cornell Lab of Ornithology. My research lies at the intersection of computer vision and machine learning, with an emphasis on crafting real-world machine learning systems that integrate human expertise, state-of-the-art machine learning methodologies, and large-scale datasets. Merlin Sound ID is my latest contribution in this space, following the success of Seek, the iNaturalist computer vision system, and Merlin Photo ID. I completed my PhD at Caltech in 2019, advised by Pietro Perona. My thesis work focused on efficient dataset collection through human-in-the-loop systems, and fine-grained visual categorization. I completed my BS and MS at UCSD where I was advised by Serge Belongie. Most of my research work falls under the broad research agenda of Visipedia.

News

(6-16-2025) 1 paper accepted to ICML 2025
(6-16-2025) I spoke at 3 workshops during CVPR 2025: CV4Animals, CV4Science, and VPLOW
(9-26-2024) 3 papers accepted to NeurIPS 2024
(5-8-2024) Gave a talk to the Athol Bird and Nature Club
(4-30-2024) Merlin Sound ID Spring 2024 model update: 1384 total species!
(3-12-2024) Spoke on LAist, LA’s largest NPR station
(3-6-2024) Our work on fine-tuning CLIP was accepted to CVPR.
(2-20-2024) Gave a talk to the Hampshire Bird Club
(2-6-2024) Gave a lecture in Kate Jones’ AI for the Environment class
(12-15-2023) I was a panelist at the Computational Sustainability workshop at NeurIPS
(12-1-2023) Congrats to Justin Kay for recieving an honorable mention at NECV for our domain adaption work
(11-15-2023) Gave a talk at MBARI
(10-23-2023) Gave a talk at the United Nations platform “AI for Good”
(10-23-2023) Merlin Sound ID Fall 2023 model update: 1220 total species!
(9-21-2023) Excited to announce the iNaturlalist GeoModel

Use My Research

The following apps are all free and accessible on both iPhone and Android. I completed the R&D for the machine learning components that power these apps. In the case of Merlin Sound ID I also did the engineering work for deploying the model efficiently on iOS and Android.

Merlin Sound ID

Turn on your phone’s microphone and recognize bird vocalizations in real time. This feature is part of the Merlin Bird ID app.

Merlin Sound ID demo gif

Seek

Turn on your phone’s camera, point it at wildlife, and get real time classification results.

Seek demo gif

iNaturalist

Submit observations of wildlife and get identification assistance from a computer vision system as well as a global community of wildlife enthusiasts.

Merlin Photo ID

Identify birds in photographs. This feature is part of the Merlin Bird ID app.

Industry Consulting

I am available for industry consulting. However, my current schedule may prevent me from handling all consulting requests, so I apologize in advance if I do not respond. My expertise covers all aspects of a machine learning system: data collection, data annotation, metric specification, model research and development, evaluation, and deployment. My prior project experience covers image, audio, video, and geospatial modalities.

Grant Van Horn