Senior Research Scientist @ Dataminr
135 Madison Avenue, Fl 10
New York, NY 10016
About Dataminr:
Dataminr Blog | AI for Good | Twitter | LinkedIn | Instagram
Forbes AI 50 | Deloitte Fast 500 | Forbes Cloud 100 | AI Breakthrough
Senior Research Scientist @ Dataminr
135 Madison Avenue, Fl 10
New York, NY 10016
About Dataminr:
Dataminr Blog | AI for Good | Twitter | LinkedIn | Instagram
Forbes AI 50 | Deloitte Fast 500 | Forbes Cloud 100 | AI Breakthrough
Dataminr
I am a Senior Research Scientist at Dataminr (Jan 2022-present). In Dataminr, I lead the AI-Entities team that contribute to building and scaling novel algorithms to perform information retrieval on 500M messages per day (both English and multilingual messages). During my time at Dataminr, I have deployed as a service ranges from small LMs (Bert, Roberta, XLMR) to LLMs (LLaMA, Mistral) with varying costs. For some of the models I have converted them to Neuron from Pytorch for scaling their inference. These are some of the projects I have worked on.
Named Entity Recognition - extracts named entities such as organizations, persons, locations from multilingual texts via smaller encoders like miniLMv2, xtremedistil, encoders are pretrained for domain adaptation
Entity Tagging/Linking - tags named entities to canonical entities in Dataminr knowledge graphs using BM25 and GBR
Entity-Alert Affinity - further verifies if tagged entities are correct or not using an expensive fine-tuned LLMs such as LLAMA 3.1 8B
Entity Alignment - search via a FAISS indexer based on a fine-tuned Sentence Transformer (paraphrase-MiniLM-L3-v2) if ingested entities exists in Dataminr KG (300K entities) or external KGs such as CapitalIQ (11M entities) and Wiki (6M entities), aligns via GBR to find the closest possible match
Company Negative Sentiment - summarizes anomalous trends by extracting relevant messages using CRP (online topic clustering) and fined tuned LLM such as Mistral 7B.
Chatter - detect third party eye withess events from Twitter multilingual messages via a finetuned twitter-XLMR model
Misinformation - detect misinformation from COVID and US election related multilingual messages via a finetuned multilingual distilbert model
IBM Research
Previously I was a Research Staff Member and a Master Inventor (2021-2024) at IBM T. J. Watson Research Labs, NY (Aug 2016-present) and work in the ai-hybrid cloud team where I was leading or co-leading the following projects: Tackle Containerization Advisory (ACA) (link) and Transforming Monoliths to Microservices (Mono2Micro)(link)
I have published papers in conferences such as ESEC/FSE, ASE, IJCAI, COLING, ECAI, CLOUD, ICWS, ICSOC, SocInfo, SCC, RE, and in journals such as JAAMAS, PLOS ONE, IEEE TCSS, IEEE Internet Computing, KER. I have received one best paper award in ICSOC, 2015 and one best demo paper award in ICSOC, 2019. I am serving or have served as a program committee member for the following conferences: EACL, WWW, ECIR, EMNLP, NAACL, ACL, AAAI, COLING, CIKM, IJCAI, AAMAS, ECML-PKDD, CLOUD, ICDCS, ICWS, ICSOC, SCC, BPM, PST, and PRIMA.
I have filed around 54 patents and out of them 50 are granted.
I graduated from North Carolina State University, Raleigh, US where I completed my MS (Fall, 2013) and Ph.D. (Summer, 2016) in Computer Science under the guidance of Prof. Munindar P. Singh. I completed my BTech in Computer Science and Engineering from the College of Engineering and Technology, Bhubaneswar, India (Summer 2008).
MS Thesis : Muon: Designing Multiagent Communication Protocols from Interaction Scenarios
Phd Thesis : Understanding Human Communication to Estimate Trust, Hierarchy, and Performance
During my PhD, I interned at
HP Labs, Palo Alto, CA (Mentors: Hamid R. Motahari Nezhad , Claudio Bartolini)
US Army Research Labs, Aberdeen, MD (Mentors: Norbou Buchler, Arwen Decostanza)[predoctoral ORISE Fellow]
I have mentored/mentoring the following Summer interns at Dataminr.
Suman Dowlagar, Fall 2022 (PhD Candidate, IIIT Hyderbad)
I have mentored/mentoring the following Summer interns at IBM.
Tishauna Wilson, Summer 2021 (MS Candidate, Virginia Tech)
Hiwot Tadasse, Summer 2021 (Senior, Benedict College)
Joymallya Chakraborty, Summer 2020 (PhD Candidate, NC State)[joined Amazon post graduation]
Haan Johng, Summer 2019 (PhD Candidate, UT Dallas)
Hoang Ho, Summer 2019 (PhD Candidate, UMASS)[joined Apple post graduation]
Daniel Gordon, Summer 2019 (Sophomore, University of the West Indies, Trinidad & Tobago)
Tarek Sakakini, Summer 2018 (PhD Candidate, UIUC) [joined Amazon post graduation]
Liana Lin, Summer 2017 (PhD Candidate, NC State)[joined IBM T.J. Watson post graduation, Now in LinkedIn]
I have mentored/co-advised following MS or PhD students
Rahul Yedida, mentoring with Tim Menzies, 2021-ongoing (PhD Candidate, NC State)
Suma Kasa, co-advisor with Munindar Singh, 2020-2021 (MS Thesis, NC State) [joined Amazon post graduation]
Parth Diwanji, mentored with Munindar Singh, 2020-2021 (MS, NC State) [joined Google post graduation]
Arvind Kumar, mentored with Munindar Singh, 2020-2021 (MS, NC State) [joined Facebook post graduation]
News**
Oct, 2025: Invited PC Member, NeurIPS 2025 Workshop NORA, 2025
Oct, 2025: Invited PC Member, EACL [Industry Track], 2025
Oct, 2025: Invited PC Member, WWW [User Modeling, Personalization and Recommendation Track], 2025
Sept, 2025: Invited PC Member ECIR, 2025
July, 2025: Invited PC Member, EMNLP [Industry Track], 2025
May, 2025, Invited PC Member, CIKM [Full and Short Paper Track], 2025
May, 2025: Invited Area Chair, ACL May, 2025
Feb, 2025: Invited PC Member, IJCAI [AI4Tech Track], 2025
Feb, 2025: Invited Emergency Area Chair, ACL Feb, 2025
Oct, 2024: Invited PC Member, NAACL [Industry Track], 2025
Jun, 2024: Invited PC Member, EMNLP [Industry Track], 2024
Dec, 2023: Invited PC Member, IJCAI, 2024
Dec, 2023: Invited PC Member, NAACL [Industry Track], 2024
June, 2023: Invited PC Member, CIKM, 2023
Mar, 2023: Invited ACL Industry Track, 2023
Dec, 2022: Invited PC Member, IJCAI, 2023
Nov, 2022: Rising Star Award from NC State Computer Science Department
Feb, 2022: Our AST paper on CrawLabel got accepted
Jan, 2022: Joined Dataminr as a senior research scientist
Dec, 2021: Open Source Recognition Program (Kubernetes Stack - Significant Contributors)
Dec, 2021: Received Outstanding Technical Achievement Award
Oct, 2021: Selected as 2021 Class of Master Inventors
Sept, 2021: Invited PC Member, AAAI, 2022
Aug, 2021: Released the open-source version of ACA (link)
Aug, 2021: Paper on the impact of hyper-parameter tuning on refactoring algorithms accepted to ASE
July, 2021: Tutorial on application modernization and refactoring accepted to ASE
July, 2021: Change Action Discovery and ACA papers got accepted to IEEE Cloud
July, 2021: Mono2Micro paper got accepted to ESEC/FSE industry track
Jun, 2021: Mono2Micro is shortlisted as a finalist in CogX awards
May, 2021: Received Outstanding Technical Achievement Award
Apr, 2021: Invited PC Member, CIKM, 2021
Mar, 2021: Invited Industry Track Co-Chair ICSOC (icsoc-2021)
Jan, 2021: Mono2Micro got GAed (link)
Dec, 2020: Received Research Accomplishment
Nov, 2020: Accepted DeveloperWeek Talk on ACA
Oct, 2020: Received Twelfth Plateau Invention Award
Oct, 2020: COLING short research paper got accepted
Sept, 2020: Invited PC Member, AAAI-21 Demonstrations
Aug, 2020: Received High Value Patent Awards
Aug, 2020: ACA got GAed
Aug, 2020: FSE demo paper got accepted
Mar, 2020: Received Eleventh Plateau Invention Award
Mar, 2020: Received Outstanding Technical Achievement Award