Research

Home / Research / Photography / Coffee

This is an overview of my research and projects. I'm interested in computer vision and robotics, with a particular focus on neural rendering and generative models for data. To learn more about my background, including relevant context for my publications, see more details here.

Research R

My research interests lie at the intersection of computer vision, computer graphics, and robotics. I'm particularly interested in neural rendering and 3D generation, especially applied to robotic systems and autonomous driving. I've also worked on computational photography.

LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding

Julian Ost*, Andrea Ramazzina*, Amogh Joshi*, Maximilian Bömer, Mario Bijelic, Felix Heide
AAAI, 2026
project page / publication / arXiv

We present LSD-3D, a method for generating 3D driving scenes with coherent 3D geometry and photorealistic, high-fidelity texture.

Neural Light Spheres for Implicit Image Stitching and View Synthesis

Ilya Chugunov, Amogh Joshi, Kiran Murthy, Francois Bleibel, Felix Heide
SIGGRAPH Asia, 2024
project page / publication / arXiv

We design a spherical neural light field model for implicit panoramic image stitching and re-rendering, capable of handling depth parallax, view-dependent lighting, and scene motion.

Other Research 6

I've conducted research in a number of adjacent and applied fields while discovering where my interests lie. I've worked particularly in agricultural computer vision and agrobotics, computational social science, and computational cognitive science. For more information, see my extended bio.

	iNatAg: Multi-Class Classification Models Enabled by a Large-Scale Benchmark Dataset with 4.7M Images of 2,959 Crop and Weed Species Naitik Jain, Amogh Joshi, Mason Earles CVPR Vision for Agriculture, 2025 publication / arXiv We introduce iNatAg, a 4.7M-image dataset of 2,959 crop and weed species - one of the world's largest for agriculture - and benchmark models achieving state-of-the-art classification performance.
	Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem Declan Campbell, Sunayana Rane, Tyler Giallanza, Nicolò De Sabbata, Kia Ghods, Amogh Joshi, Alexander Ku, Steven M. Frankland, Thomas L. Griffiths, Jonathan D. Cohen, Taylor W. Webb NeurIPS, 2024 publication / arXiv We identify that state-of-the-art VLMs fail at basic multi-object reasoning due to the binding problem, which limits simultaneous entity representation - similar to human brain processing.
	Examining Similar and Ideologically Correlated Imagery in Online Political Communication Amogh Joshi, Cody Buntain ICWSM, 2024 publication / arXiv We investigate how US national politicians' use of various visual media on Twitter reflects their political positions, identifying limitations in standard image characterization methods.
	An Open Source Simulation Toolbox for Annotation of Images and Point Clouds in Agricultural Scenarios Dario Guevara, Amogh Joshi, Pranav Raja, Elisabeth Forrestel, Brian Bailey, Mason Earles ISVC, 2023 publication We present an open-source simulation toolbox designed for the easy generation of synthetic labeled data for both RGB imagery and point cloud information, applicable to a wide array of cultivars.
	Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models Amogh Joshi, Dario Guevara, Mason Earles Plant Phenomics, 2023 publication / arXiv We present methods for enhancing data efficiency in agricultural computer vision, which improves performance and reduces training time, and introduce a novel set of model benchmarks.
	Exploiting the Right: Inferring Ideological Alignment in Online Influence Campaigns Using Shared Images Amogh Joshi, Cody Buntain ICWSM, 2022 publication / arXiv / press We develop models to analyze the ideological presentation of foreign Twitter accounts based on shared images, revealing inconsistencies in ideological positions across different content types.

Projects 1 2

The following are major projects which I have been involved in or developed myself.

AgML: An Open-Source Library for Agricultural Machine Learning

AI Institute for Next Generation Food Systems
project / info

Since its inception, I have led the development of AgML. We have aggregated the world's largest collection of agricultural deep learning datasets, produced benchmarks and pretrained weights for state-of-the-art models, and developed a suite of tools for data preprocessing, model training, and deployment in an easy-to-use API.

Research

Research R

Other Research 6

Projects 1 2

Extended Biography E