NLP Machine Learning Society
Hello! In May 2021, I graduated with a master's degree in Computer and Information Science at the University of Pennsylvania. During grad school, I worked as a research assistant and was advised by Dr. Ani Nenkova. In 2019, I completed my undergraduate degree in computer science from BITS Pilani, India. My research interests are in natural language processing, computational social science and ethical AI. Of late, I have grown an interest in bias reduction and interpretability of NLP models. To know more about me, please check out my resume and feel free to reach out to me on email or LinkedIn!
Below is a brief summary of my research and academic projects. They cover areas in natural language processing, machine learning, deep learning and distributed systems. For access to any of the code repositories, please reach out to me over email!
Developed a mini-search engine for a wide range of domains with a distributed crawler, indexer and PageRanker. The crawler functionality was inspired from the Mercator-style crawler and the core functionality of the indexer and PageRanker was implemented using MapReduce (Hadoop).
[Read More] [Report]
Developed a small cloud platform with user accounts, email and file storage. Frontend and backend servers employ core principles of distributed systems such as consistency, replication, fault tolerance and recovery.
[Read More] [Report]
Designed a method for the explainability of Multiple-Choice QA, where a trained SOTA model is probed with different subsets of the surrounding context sentences, in order to determine which sentences are most important in deciding the correct answer. We found that the model performs well even when a smaller subset of context sentences are provided to it.
[Read More] [Report]
Designed a deep learning architecture for visual question-answering using GRU and a pre-trained ResNet. As an enhancement, we included a module that captures Faster-rCNN features of object positions in the image. This gave us a 5% increase in accuracy for the task.
[Read More] [Report]
Extracted the flow features of network traffic using Python libraries to read packet capture files. Used the flow features and implemented two approaches in Java: (i) a machine learning method to detect SSH dictionary attacks and (ii) a method to identify a compromise by matching it to a possible action generally taken during a system compromise
Advised by: Dr. Gokul Kannan (BITS Pilani)
[Code A][Code B]
Used a variety of class-balancing methods on the software aging datasets and applied preprocessing techniques to select the significant and uncorrelated features for further analysis. We then
applied machine learning classification models on the data using Python (Scikit-learn) and compared their results using accuracy and Area Under Curve metrics
Advised by: Dr. N Bhanumurthy (BITS Pilani)
[Code]
Developed a "virtual keyboard" desktop application for the Leap Motion controller especially for stroke victims who have difficulty in typing on traditional keyboards. Used the Leap Motion SDK for Python
Advised by: Dr. Tathagata Ray (BITS Pilani)
[Code]
January 2021 - May 2021
· Course Material
· Prepared teaching materials and homework for Recurrent Neural Networks and NLP topics using PyTorch on Google Colab
· Mentored a pod of 9 students for 5 hours/week with deep learning concepts, homework and the final project
January 2020 - May 2020
· Course Website
· Read and discussed research papers throughout the semester with the goal of understanding theories of reasoning in the context of natural language understanding
· Presented two papers to the class and facilitated discussion
· The first paper was What Can Neural Networks Reason About?, and my slides can be found here
· The second paper was Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering, and my slides can be found here
October 2019 - December 2020
· Demonstrated that the popular toxicity detection tool (Jigsaw’s Perspective API) is unable to relatively rank incivility (hostility/agitation/quarrelsomeness/rudeness) among three American news shows, in a manner similar to humans
· Deduced that erroneous Perspective scores are spuriously correlated with presence of non-offensive ‘error’ words
· Curated a dataset of video clips and transcript segments of American news shows with annotations of incivility
· Advised by: Dr. Ani Nenkova
September 2019 - May 2020
· Course Website
· Facilitated and monitored project team discussions within class, and graded technical updates and progress of 140 students
· Managed scheduling of monthly consultations of 36 student teams with relevant faculty members
January 2019 - June 2019
· Developed a novel word-embedding-based trend detection algorithm for time-stamped automobile consumer complaints
· Demonstrated proof-of-concept through quantitative comparisons with a popular topic modelling algorithm (online-LDA)
· Advised by: Dr. Rajesh N. Rao and Rishabh Gupta
August 2018 - December 2018
· Prepared tutorial material and assignment questions for six core topics (data types, regular expressions, BNF, parameter passing & scope, logic programming, functional programming)
· Invigilated and prepared solutions for two mid-semester quiz evaluation components for a class of 160 students
May 2018 - July 2018
· Studied works in the domain of text summarization and measurement of text coherence
· Built a web-based user-interface tool using HTML, CSS, JavaScript and JSP for text summarization
· Developed a neural model to measure coherence for multi-document summarization
· Advised by: Dr. Vasudeva Varma and Dr. Litton J. Kurisinkel
May 2017 - July 2017
· Built an Alexa Skill (for Amazon Echo) as an in-house voice-based assistant tool for managing the on-going projects in the company
· Used node.js and AWS Lambda for development
· Learnt basics of Natural Language Processing
· Advised by: N S Nagaraja
August 2019 - May 2021
Relevant Courses: Machine Learning, Deep Learning, Computational Linguistics, Software Systems, Internet & Web Systems, Reasoning for Natural Langugage Understanding, Analysis of Algorithms, Big Data Analytics
Master's Thesis: A Data-Driven Error Analysis of Toxicity Detection Models
August 2015 - July 2019
2021
2020
· Cornell, Maryland, Max Planck Pre-doctoral Research School
· Selected for the competitive program, aimed at fostering academic interactions between pre-doctoral students and faculty
· Attended lectures in databases and data analysis, distributed systems, security and privacy, Internet measurement and network architecture, large-scale machine learning, and theory of deep learning
2015-2019
· Awarded by BITS Pilani
· Received this merit-based scholarship consistently for all eight semesters of the undergraduate degree
2015
· Awarded by: Government of India
· Felicitated by Mrs. Smriti Irani, the Union Minister for Human Resource Development, Government of India, for the excellent performance in the 12th board exams.
2011
· Awarded by: Government of India
· National Talent Search Exam, a nation-wide scholarship aimed at identifying and recognizing school students with high intellect and academic talent
· Attended the Nurtuance Programme organized at NIT Surathkal with the goal of understanding prospects of higher studies and careers in STEM
Seattle Area