Bio
Sanjiban works as a Software Engineer with the Data Engineering team at Voltron Data. His work primarily focuses on the development of open-source projects such as Apache Arrow, Substrait, and Velox by Meta. He co-created Substrait Fiddle, which is an online tool to prototype, visualize and share data relational queries based on the substrait specification. As a part of Voltron Data, he collaborated with the Meta open-source team for developing PyVelox, particularly implementing the support for Arrow-Velox conversion, complex data types, etc.
Sanjiban has been working in the open-source data science and engineering domain since his junior year of college in 2021. He was accepted to participate in Google Summer of Code 2021 for CERN-HSF and thus worked on developing storage functionalities for deep learning models. A year later, he was selected to participate in the CERN Summer Student Program in Geneva, Switzerland, and worked on enhancing TMVA SOFIE: which is a fast machine learning inference engine by CERN. In SOFIE, he was particularly involved in the development of the Keras and PyTorch Parser, machine learning operators based on ONNX standards, Graph Neural Networks support, etc. Moreover, he volunteered as a Mentor for the contributors of Google Summer of Code 2022, and again in 2023, and the CERN Summer Students of 2023 working on CERN’s ROOT Data Analysis Project.
Sanjiban finds hackathon and ideation events very interesting, and has participated in many of them in different levels. Previously, he has worked with various startups as well as corporations, thus gaining industrial experience. During college, he acted as the Vice Chair, and then the Chair of the ACM Student Chapter of IIIT Bhubaneswar. He also acted as the ML Head of various student technical societies.
His work on CERN’s TMVA SOFIE Machine Learning Inference Engine has been published/presented as follows:
- Moneta L., Sengupta S., Hamdan A. “New developments of TMVA/SOFIE: Code Generation and Fast Inference for Graph Neural Networks”. Oral Presentation at 26th International Conference on Computing in High Energy & Nuclear Physics; May 2023; Virginia, USA
- Sitong An, Sanjiban Sengupta et al. C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA. 2023 Journal of Physics: Conference Series 2438 012013
- An S., Moneta L., Sengupta S., Hamdan A. Shah N., Shende H., Mittal S., Zapata O. “ROOT Machine Learning Ecosystem for Data Analysis”. Poster presented at 21st International Workshop on Advanced Computing and Analysis Techniques in Physics Research; October 2022; Bari, Italy.
- An S., Moneta L., Sengupta S., Hamdan A., Sossai F., Saxena A. “SOFIE: C++ Code Generation from ROOT/TMVA for Fast Deep Learning Inference”. Poster presented at 20th International Workshop on Advanced Computing and Analysis Techniques in Physics Research; November 2021; Daejeon, South Korea.
- Sengupta S. “TMVA SOFIE: Enhancing the Machine Learning Inference Engine”. A report published for the CERN Summer Student Program; December 2022; Geneva, Switzerland.