许可证:CC BY 4.0
arXiv:2404.06971v1 [cs.CV] 2024 年 4 月 10 日
引文

C. Zhou、G. AlRegib、A. Parchami 和 K. Singh,“TrajPRed:基于区域关系学习的轨迹预测”,IEEE 智能交通系统汇刊 (T-ITS),三月。 2024 年 04 月。

DOI
审查

录取日期:3月 2024年04月

发表日期:4月
2024 年 08 月

代码
围兜

@article{Zhou_2024,

title={TrajPRed:基于区域的关系学习的轨迹预测},

author={Zhou,Chen 和 AlRegib,Ghassan 和 Parchami,Armin 和 Singh,Kunjan},

journal={IEEE 智能交通系统汇刊}、

publisher={电气与电子工程师协会 (IEEE)}、

DOI={10.1109/tits.2024.3381843}、

url ={http://dx.doi.org/10.1109/tits.2024.3381843}、

ISSN={1558-0016}、

年份={2024}、

pages={1– 10}

}

关键词

关系建模、随机预测、轨迹预测、行为预测

接触

TrajPRed:基于区域的关系学习的轨迹预测

Chen Zhou, Ghassan AlRegib,
Armin Parchami, and Kunjan Singh
索引术语:
关系建模、随机预测、轨迹预测、行为预测。

致谢

作者要感谢佐治亚理工学院智能视觉工程与科学 Omni 实验室 (OLIVES) 的成员以及审稿人的反馈。 这项工作由福特-佐治亚州技术联盟项目资助。

参考

  • [1] Ghassan AlRegib and Mohit Prabhushankar, “Explanatory paradigms in neural networks: Towards relevant and contextual explanations,” IEEE Signal Processing Magazine, vol. 39, no. 4, pp. 59–72, 2022.
  • [2] Jinsol Lee and Ghassan AlRegib, “Open-set recognition with gradient-based representations,” in 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 2021, pp. 469–473.
  • [3] Gukyeong Kwon and Ghassan Al Regib, “A gating model for bias calibration in generalized zero-shot learning,” IEEE Transactions on Image Processing, 2022.
  • [4] Charles Lehman, Dogancan Temel, and Ghassan AlRegib, “On the structures of representation for the robustness of semantic segmentation to input corruption,” in 2020 IEEE International Conference on Image Processing (ICIP). IEEE, 2020, pp. 3239–3243.
  • [5] Ryan Benkert, Oluwaseun Joseph Aribido, and Ghassan AlRegib, “Example forgetting: A novel approach to explain and interpret deep neural networks in seismic interpretation,” IEEE Transactions on Geoscience and Remote Sensing, 2022.
  • [6] Ahmad Mustafa, Motaz Alfarraj, and Ghassan AlRegib, “Joint learning for spatial context-based seismic inversion of multiple data sets for improved generalizability and robustness,” Geophysics, vol. 86, no. 4, pp. O37–O48, 2021.
  • [7] Kiran Kokilepersaud, Mohit Prabhushankar, and Ghassan AlRegib, “Volumetric supervised contrastive learning for seismic semantic segmentation,” in Second International Meeting for Applied Geoscience & Energy. Society of Exploration Geophysicists and American Association of Petroleum …, 2022, pp. 1699–1703.
  • [8] Yash-yee Logan, Ryan Benkert, Ahmad Mustafa, Gukyeong Kwon, and Ghassan AlRegib, “Patient aware active learning for fine-grained oct classification,” arXiv preprint arXiv:2206.11485, 2022.
  • [9] Yash-yee Logan, Kiran Kokilepersaud, Gukyeong Kwon, Ghassan AlRegib, Charles Wykoff, and Hannah Yu, “Multi-modal learning using physicians diagnostics for optical coherence tomography classification,” in 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022, pp. 1–5.
  • [10] Junwei Liang, Lu Jiang, Juan Carlos Niebles, Alexander G Hauptmann, and Li Fei-Fei, “Peeking into the future: Predicting future person activities and locations in videos,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 5725–5734.
  • [11] Yuexin Ma, Xinge Zhu, Xinjing Cheng, Ruigang Yang, Jiming Liu, and Dinesh Manocha, “Autotrajectory: Label-free trajectory extraction and prediction from videos using dynamic points,” in European Conference on Computer Vision. Springer, 2020, pp. 646–662.
  • [12] Panna Felsen, Pulkit Agrawal, and Jitendra Malik, “What will happen next? forecasting player moves in sports videos,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 3342–3351.
  • [13] Mengshi Qi, Jie Qin, Yu Wu, and Yi Yang, “Imitative non-autoregressive modeling for trajectory forecasting and imputation,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 12736–12745.
  • [14] Chao Cao, Peter Trautman, and Soshi Iba, “Dynamic channel: A planning framework for crowd navigation,” in 2019 International Conference on Robotics and Automation (ICRA). IEEE, 2019, pp. 5551–5557.
  • [15] Ronja Möller, Antonino Furnari, Sebastiano Battiato, Aki Härmä, and Giovanni Maria Farinella, “A survey on human-aware robot navigation,” Robotics and Autonomous Systems, vol. 145, pp. 103837, 2021.
  • [16] Lucy A Suchman, Plans and situated actions: The problem of human-machine communication, Cambridge university press, 1987.
  • [17] Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, and Silvio Savarese, “Social lstm: Human trajectory prediction in crowded spaces,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 961–971.
  • [18] Tim Salzmann, Boris Ivanovic, Punarjay Chakravarty, and Marco Pavone, “Trajectron++: Dynamically-feasible trajectory forecasting with heterogeneous data,” in European Conference on Computer Vision. Springer, 2020, pp. 683–700.
  • [19] Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, and Alexandre Alahi, “Social gan: Socially acceptable trajectories with generative adversarial networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 2255–2264.
  • [20] Karttikeya Mangalam, Harshayu Girase, Shreyas Agarwal, Kuan-Hui Lee, Ehsan Adeli, Jitendra Malik, and Adrien Gaidon, “It is not the journey but the destination: Endpoint conditioned trajectory prediction,” in European Conference on Computer Vision. Springer, 2020, pp. 759–776.
  • [21] Andrey Rudenko, Luigi Palmieri, Michael Herman, Kris M Kitani, Dariu M Gavrila, and Kai O Arras, “Human motion trajectory prediction: A survey,” The International Journal of Robotics Research, vol. 39, no. 8, pp. 895–935, 2020.
  • [22] Christoph G Keller and Dariu M Gavrila, “Will the pedestrian cross? a study on pedestrian path prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 15, no. 2, pp. 494–506, 2013.
  • [23] Michael Goldhammer, Konrad Doll, Ulrich Brunsmann, André Gensler, and Bernhard Sick, “Pedestrian’s trajectory forecast in public traffic with artificial neural networks,” in 2014 22nd international conference on pattern recognition. IEEE, 2014, pp. 4110–4115.
  • [24] Dirk Helbing and Peter Molnar, “Social force model for pedestrian dynamics,” Physical review E, vol. 51, no. 5, pp. 4282, 1995.
  • [25] Ramin Mehran, Alexis Oyama, and Mubarak Shah, “Abnormal crowd behavior detection using social force model,” in 2009 IEEE conference on computer vision and pattern recognition. IEEE, 2009, pp. 935–942.
  • [26] Kota Yamaguchi, Alexander C Berg, Luis E Ortiz, and Tamara L Berg, “Who are you with and where are you going?,” in CVPR 2011. IEEE, 2011, pp. 1345–1352.
  • [27] Alexandre Alahi, Vignesh Ramanathan, and Li Fei-Fei, “Socially-aware large-scale crowd forecasting,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 2203–2210.
  • [28] Federico Bartoli, Giuseppe Lisanti, Lamberto Ballan, and Alberto Del Bimbo, “Context-aware trajectory prediction,” in 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, 2018, pp. 1941–1946.
  • [29] Huynh Manh and Gita Alaghband, “Scene-lstm: A model for human trajectory prediction,” arXiv preprint arXiv:1808.04018, 2018.
  • [30] Anirudh Vemula, Katharina Muelling, and Jean Oh, “Social attention: Modeling attention in human crowds,” in 2018 IEEE international Conference on Robotics and Automation (ICRA). IEEE, 2018, pp. 4601–4607.
  • [31] Junwei Liang, Lu Jiang, Kevin Murphy, Ting Yu, and Alexander Hauptmann, “The garden of forking paths: Towards multi-future trajectory prediction,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 10508–10518.
  • [32] Jiachen Li, Hengbo Ma, Zhihao Zhang, Jinning Li, and Masayoshi Tomizuka, “Spatio-temporal graph dual-attention network for multi-agent prediction and tracking,” IEEE Transactions on Intelligent Transportation Systems, 2021.
  • [33] Jiachen Li, Hengbo Ma, and Masayoshi Tomizuka, “Conditional generative neural system for probabilistic trajectory prediction,” in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 6150–6156.
  • [34] Diederik P Kingma and Max Welling, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114, 2013.
  • [35] Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio, “Generative adversarial nets,” in Advances in Neural Information Processing Systems, 2014, vol. 27.
  • [36] Amir Sadeghian, Vineet Kosaraju, Ali Sadeghian, Noriaki Hirose, Hamid Rezatofighi, and Silvio Savarese, “Sophie: An attentive gan for predicting paths compliant to social and physical constraints,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 1349–1358.
  • [37] Boris Ivanovic and Marco Pavone, “The trajectron: Probabilistic multi-agent trajectory modeling with dynamic spatiotemporal graphs,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 2375–2384.
  • [38] Stefano Pellegrini, Andreas Ess, Konrad Schindler, and Luc Van Gool, “You’ll never walk alone: Modeling social behavior for multi-target tracking,” in 2009 IEEE 12th international conference on computer vision. IEEE, 2009, pp. 261–268.
  • [39] Alon Lerner, Yiorgos Chrysanthou, and Dani Lischinski, “Crowds by example,” in Computer graphics forum. Wiley Online Library, 2007, vol. 26, pp. 655–664.
  • [40] Alexandre Robicquet, Amir Sadeghian, Alexandre Alahi, and Silvio Savarese, “Learning social etiquette: Human trajectory understanding in crowded scenes,” in Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14. Springer, 2016, pp. 549–565.
  • [41] Yuchao Su, Jie Du, Yuanman Li, Xia Li, Rongqin Liang, Zhongyun Hua, and Jiantao Zhou, “Trajectory forecasting based on prior-aware directed graph convolutional neural network,” IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 9, pp. 16773–16785, 2022.
  • [42] Beihao Xia, Conghao Wong, Qinmu Peng, Wei Yuan, and Xinge You, “Cscnet: Contextual semantic consistency network for trajectory prediction in crowded spaces,” Pattern Recognition, vol. 126, pp. 108552, 2022.
  • [43] Luca Anthony Thiede and Pratik Prabhanjan Brahma, “Analyzing the variety loss in the context of probabilistic trajectory prediction,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 9954–9963.
  • [44] Zhiyuan Li and Sanjeev Arora, “An exponential learning rate schedule for deep learning,” arXiv preprint arXiv:1910.07454, 2019.
[Uncaptioned image] Chen Zhou (Student Member, IEEE) received the B.E. degree from the University of Science and Technology Beijing and the M.S. degree from the Georgia Institute of Technology, where he is currently pursuing the Ph.D. degree with the Omni Lab for Intelligent Visual Engineering and Science (OLIVES). He has been working in the fields of machine learning and image and video processing. His research interests include trajectory prediction and learning from label disagreement.
[Uncaptioned image] Ghassan AlRegib (Fellow, IEEE) is currently the John and Marilu McCarty Chair Professor in the School of Electrical and Computer Engineering at the Georgia Institute of Technology. His research group, the Omni Lab for Intelligent Visual Engineering and Science (OLIVES), works on robust and interpretable machine learning algorithms, uncertainty and trust, and human-in-the-loop algorithms. The group has demonstrated their work on a wide range of applications, such as autonomous systems, medical imaging, and subsurface imaging. He was a recipient of the ECE Outstanding Junior Faculty Member Award in 2008, and the 2017 Denning Faculty Award for Global Engagement. He and his students received the Beat Paper Award in ICIP 2019. He has participated in several service activities within the IEEE and served on the editorial boards of several journal publications. He served as the Technical Program Co-Chair for ICIP 2020 and ICIP 2024. He served on the editorial boards of IEEE Transactions on Image Processing from 2019 to 2022, and the Elsevier Journal Signal Processing: Image Communications from 2014 to 2022. He served as an Area Editor for Columns and Forums in IEEE Signal Processing Magazine from 2009 to 2012. He led a team that organized the inaugural IEEE VIP Cup in 2017, and the IEEE VIP Cup in 2023. He has been a witness expert in several patent infringement cases.
[Uncaptioned image] Armin Parchami received the B.E. degree in Software Engineering and the M.Sc. degree in Artificial Intelligence from Bu-Ali Sina University and the Ph.D. degree in Computer Science from UTA in 2017. His dissertation focused on single-shot face recognition using deep learning algorithms for security applications. Previously, he contributed significantly to the field of autonomous vehicles during his tenure at Ford, where he held the position of Director of Perception, leading the development of perception algorithms for L2+ autonomy. Recently, he transitioned to Snorkel AI as a Principal ML Research Scientist, focusing on programmatic labeling solutions for computer vision tasks, continuing to drive innovation in machine learning and artificial intelligence.
[Uncaptioned image] Kunjan Singh received the Bachelor’s degree in Computer Engineering from the University of Michigan and the Master’s degree in Computer Science with a specialization in Machine Learning from Georgia Tech in 2023. During his time at Ford, he has made valuable contributions to the development of computer vision solutions for autonomous vehicles, smart infrastructure technology, and ADAS.