Georgia Gkioxari
I am an Assistant Professor of Computing + Mathematical Sciences at Caltech and a William H. Hurt Scholar. I am also a visiting researcher at Meta AI in the Embodied AI team. From 2016 to 2022, I was a research scientist at Meta's FAIR team. I received my PhD from UC Berkeley, where I was advised by Jitendra Malik. I received my bachelor's degree in ECE from NTUA in Athens, Greece, where I worked with Petros Maragos. I am the recipient of the PAMI Young Researcher Award (2021). My teammates and I received the PAMI Mark Everingham Award (2021) for the Detectron Library Suite. I was named one of 30 influential women advancing AI in 2019 by ReWork and was nominated for the Women in AI Awards in 2020 by VentureBeat. Read more about me and my work in this Q&A.
Caltech students (undergrads and grads): If you are at Caltech and wish to work with me, please read the information in this doc.
Prospective postdocs: If you are interested in a postdoc position and want to conduct research in computer vision, 3D understanding and visual perception, please contact me directly with your CV and a short research statement.
Prospective PhD students: I am looking for Ph.D. students to join my group. If you are interested in my group, please apply directly to the CMS department and mention my name in your statement of purpose. There is no need to email me.
Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection
arxiv / project page / code
@inproceedings{xie2023pixel, title={Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection}, author={Xie, Yiming and Jiang, Huaizu and Gkioxari, Georgia and Straub, Julian}, booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision}, pages={18370--18380}, year={2023} }
Multiview Compressive Coding for 3D Reconstruction
arxiv / project page / code
@article{wu2023multiview, author = {Wu, Chao-Yuan and Johnson, Justin and Malik, Jitendra and Feichtenhofer, Christoph and Gkioxari, Georgia}, title = {Multiview Compressive Coding for 3{D} Reconstruction}, journal = {CVPR}, year = {2023}, }
Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild
arxiv / project page / code
@article{brazil2022omni3d, title={{Omni3D}: A Large Benchmark and Model for {3D} Object Detection in the Wild}, author={Garrick Brazil and Abhinav Kumar and Julian Straub and Nikhila Ravi and Justin Johnson and Georgia Gkioxari}, journal={CVPR}, year={2023} }
Learning 3D Object Shape and Layout without 3D Supervision
arxiv / project page / video
@article{usl2022, title={Learning 3D Object Shape and Layout without 3D Supervision}, author={Georgia Gkioxari and Nikhila Ravi and Justin Johnson}, journal={CVPR}, year={2022} }
Differentiable Stereopsis: Meshes from multiple views using differentiable rendering
arxiv / project page / code
@article{goel2022ds, title={Differentiable Stereopsis: Meshes from multiple views using differentiable rendering}, author={Shubham Goel and Georgia Gkioxari and Jitendra Malik}, journal={CVPR}, year={2022} }
Recognizing Scenes from Novel Viewpoints
arxiv / project page / code
@article{qian2021viewseg, title={Recognizing Scenes from Novel Viewpoints}, author={Shengyi Qian and Alexander Kirillov and Nikhila Ravi and Devendra Singh Chaplot and Justin Johnson and David Fouhey and Georgia Gkioxari}, journal={arXiv preprint arXiv:2112.01520}, year={2021} }
Accelerating 3D Deep Learning with PyTorch3D
arxiv / code / project page
@article{ravi2020accelerating, title={Accelerating 3D Deep Learning with PyTorch3D}, author={Ravi, Nikhila and Reizenstein, Jeremy and Novotny, David and Gordon, Taylor and Lo, Wan-Yen and Johnson, Justin and Gkioxari, Georgia}, journal={arXiv preprint arXiv:2007.08501}, year={2020} }
3D Shape Reconstruction from Vision and Touch
SynSin: End-to-end View Synthesis from a Single Image
arxiv / code / project page
@inproceedings{synsin, Title={{SynSin}: {E}nd-to-end View Synthesis from a Single Image}, Author={Olivia Wiles and Georgia Gkioxari and Richard Szeliski and Justin Johnson}, Booktitle={CVPR}, Year={2020}}
Mesh R-CNN
arxiv / code / project page
@inproceedings{meshrcnn, Title={Mesh R-CNN}, Author={Georgia Gkioxari and Jitendra Malik and Justin Johnson}, Booktitle={ICCV}, Year={2019}}
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
arxiv / project page
@inproceedings{wijmans2019, Title={Embodied Question Answering in Photorealistic Environments with Point Cloud Perception}, Author={Erik Wijmans and Samyak Datta and Oleksandr Maksymets and Georgia Gkioxari and Stefan Lee and Irfan Essa and Devi Parikh and Dhruv Batra}, Booktitle={CVPR}, Year={2019}}
Multi-Target Embodied Question Answering
arxiv / project page
@inproceedings{mteqa, Title={Multi-Target Embodied Question Answering}, Author={Licheng Yu and Xinlei Chen and Georgia Gkioxari and Mohit Bansal and Tamara Berg and Dhruv Batra}, Booktitle={CVPR}, Year={2019}}
Neural Modular Control for Embodied Question Answering
arxiv / project page
@inproceedings{nmc, Title={{N}eural {M}odular {C}ontrol for {E}mbodied {Q}uestion {A}nswering}, Author={Abhishek Das and Georgia Gkioxari and Stefan Lee and Devi Parikh and Dhruv Batra}, Booktitle={CoRL}, Year={2018}}
Building Generalizable Agents with a Realistic And Rich 3D Environment
Detecting and Recognizing Human-Object Interactions
arxiv / project page
@inproceedings{gkioxari2017interactnet, Author = {Georgia Gkioxari and Ross Girshick and Piotr Doll\'{a}r and Kaiming He}, Title = {Detecting and Recognizing Human-Object Interactions}, Booktitle = {CVPR}, Year = {2018}}
Embodied Question Answering
arxiv / project page / code
@inproceedings{embodiedqa, Title={{E}mbodied {Q}uestion {A}nswering}, Author={Abhishek Das and Samyak Datta and Georgia Gkioxari and Stefan Lee and Devi Parikh and Dhruv Batra}, Booktitle={CVPR}, Year={2018}}
Detect-and-Track: Efficient Pose Estimation in Videos
Data Distillation: Towards Omni-Supervised Learning
Mask R-CNN
Learn2Smile: Learning Non-verbal Interaction through Observation
@inproceedings{learn2smile2017, Author = {Will Feng and Anitha Kannan and Georgia Gkioxari and Larry Zitnick}, Title = {Learn2Smile: Learning Non-verbal Interaction through Observation}, Booktitle = {IROS}, Year = {2017}}
Chained Predictions Using Convolutional Neural Networks
arxiv / project page
@inproceedings{chain16, Author = {G. Gkioxari and A. Toshev and N. Jaitly}, Title = {Chained Predictions Using Convolutional Neural Networks}, Booktitle = {ECCV}, Year = {2016}}
Contextual Action Recognition with R*CNN
Actions and Attributes from Wholes and Parts
Finding Action Tubes
project page / arxiv / code / negative results / Sports Benchmark
@inproceedings{actiontubes, Author = {G. Gkioxari and J. Malik}, Title = {Finding Action Tubes}, Booktitle = {CVPR}, Year = {2015}}
R-CNNs for Pose Estimation and Action Detection
project page / arxiv
@article{poseactionrcnn, Author = {G. Gkioxari and B. Hariharan and R. Girshick and J. Malik}, Title = {R-CNNs for Pose Estimation and Action Detection}, ArchivePrefix = {arXiv}, Eprint = {1406.5212}, PrimaryClass = {cs.CV}, Year = {2014}}
Using k-poselets for detecting people and localizing their keypoints
project page / code / github / spotlight
@inproceedings{kposelets, Author = {G. Gkioxari and B. Hariharan and R. Girshick and J. Malik}, Title = {Using k-poselets for detecting people and localizing their keypoints}, Booktitle = {CVPR}, Year = {2014}}
Articulated Pose Estimation using Discriminative Armlet Classifiers