Anisha Gunjal

I am a Visiting Scholar at Cognitive Learning for Vision and Robotics Lab (CLVR) at University of Southern California, advised by Prof. Joseph Lim.

Previously, I was the Lead Machine Learning Engineer of Documents AI at HyperVerge, Inc. My work involved using Computer Vision and Natural Language Processing for robust and scalable Information Extraction from real-world documents belonging to several countries.

I did my undergrad in Computer Science at Pune Institute of Computer Technology (University of Pune). Around this time, I interned at ITC InfoTech working on Computer Vision applications in Face AI.

Beyond the research world, I love traveling, salsa dancing and capturing the adventures of my dogs!

Email  /  CV  /  Scholar  /  Twitter  /  Github  /  LinkedIn  /  Blog

profile photo
Jan 2022 Task-Induced Representation Learning accepted at ICLR 2022
Dec 2021 Poster Presentation at Deep RL Workshop, NeurIPS 2021
Aug 2021 Participant at 5th Summer School on Artificial Intelligence, IIIT Hyderabad
Jul 2021 Volunteer at WiML Workshop @ ICML 2021
Jun 2021 Joined CLVR Lab at USC as a Visiting Researcher
Sep 2020 Published a blogpost The ECCV Experience
Jul 2020 Reviewer at JupyterCon2020
Jul 2020 Published blogpost Document Visual Question Answering
Jun 2020 4th Position in DocVQA Challenge at CVPR 2020

My research interest is to build machines that are able to perceive and act on multimodal information in their environments. I am interested in the intersection of the fields of Computer Vision, Natural Language Processing and Reinforcement Learning. I want to focus on learning representations from various modalities such as vision & language to enable agents to learn new tasks efficiently.

Task-Induced Representation Learning
Jun Yamada, Karl Pertsch, Anisha Gunjal, Joseph Lim
ICLR 2022
project page / paper

Investigation of using task information for learning representations for RL in visually complex scenes.

Document Visual Question Answering
Anisha Gunjal, Vipul Gupta, Moinak Bhattacharya, Digvijay Singh
CVPR 2020, Leaderboard Rank: 4
code / blog / workshop

Joint modeling of text and layout information using transformers for Visual Question Answering on unstructured documents.

Diabetic Retinopathy Grading using Deep Siamese Network
Anisha Gunjal
ICML 2018, Poster Presentation
paper / workshop

Contrastive Learning on retinal images for determining the stage of Diabetic Retinopathy disease progression on a small sized medical image dataset.

  • Document Visual Question Answering
  • The ECCV Experience
  • Listicles
  • Awesome Resource List for NeuroSymbolic Visual Reasoning

  • Clone this to overcome good website design FOMO!