Hi, I'm Rakshitha Ailneni.

A
Self-driven, passionate and highly-motivated PhD student with love for Natural language Processing and Deep Learning.

About

I am a PhD student at the University of Texas, Dallas advised by Dr. Sanda Harabagiu. My research interests are in the fields of Natural Language Processing and Deep Learning. To be specific, I'm interested in Multimodal Prompting, Reasoning and AI for social good.I always strive to bring 100% to the work I do.

  • Languages: Python, JavaScript, C, HTML/CSS
  • Libraries: NumPy, Pandas, OpenCV
  • Frameworks:Keras, TensorFlow, PyTorch
  • Tools & Technologies: Git, Docker, AWS, JIRA

Quote I try to live by: 'I have loved the stars too fondly to be fearful of the night.'

Experience

Data Scientist
  • Worked with wide variety of customers on AI/ML projects in US, Latin America, Nigeria, and India.
  • Developed models for ESP failure prediction. For early detection, auto-regression (with LSTM) is used with inputs such as intake temperature, discharge pressure, motor temperature, and current values at previous timesteps to predict the value of current in the future. In the case of imminent failure detection, unsupervised learning approach (Isolation forest) is leveraged to detect anomaly points in a day.
  • Implemented MultiRocket and neural fuzzy logic frameworks for time series classification.
  • Tools: Python, AWS, Keras
August 2021 - July 2023 | Bangalore, India
NLP Developer
  • Working to create a knowledge graph (ClimateKB) containing causes and effects of climate change.
  • Causality detection is done using a BERT model which is fine-tuned on SemEval datasets.
  • Base entity mentions and the casual relationship between entities are extracted from cause-effect sentences.
  • Automatic knowledge base completion techniques leveraging the knowledge present in large language models (like in COMET) are explored.
  • Tools: Python, Protege, Pytorch
July 2022 - Nov 2022 | Remote Work

Projects & Publications

mami
Multimedia Misogyny Identification

Transformer-based models for Meme Classification

Accomplishments
  • Women are frequently targeted online with hate speech and misogyny using tweets, memes, and other forms of communication. Our paper describes the system for Task 5 of SemEval-2022: Multimedia Automatic Misogyny Identification (MAMI).
  • We participated in both the sub-tasks, where we used transformer-based architecture to combine features of images and text. We explore models with multi-modal pre-training (VisualBERT) and text-based pre-training (MMBT) while drawing comparative results.
  • The official evaluation ranked our system 3rd out of 83 teams on the binary classification task (Sub-task A) with an F1 score of 0.761, and 7th out of 48 teams on the multi-label classification task (Sub-task B) with an F1 score of 0.705.
Screenshot of  web app
PCL Detection

Ensemble of Tuned Transformer-based Models for PCL Detection

Accomplishments
  • Patronizing behavior is a subtle form of bullying and when directed towards vulnerable communities, it can arise inequalities. Our paper describes the system for Task 4 of SemEval-2022: Patronizing and Condescending Language Detection (PCL).
  • We participated in both the sub-tasks and conducted extensive experiments to analyze the effects of data augmentation and loss functions used, to tackle the problem of class imbalance. We explore whether large transformer-based models can capture the intricacies associated with PCL detection.
  • Our solution consists of an ensemble of the RoBERTa model which is further trained on external data and other language models such as XLNeT, Ernie-2.0, and BERT. We also present the results of several problem transformation techniques such as Classifier Chains, Label Powerset, and Binary relevance for multi-label classification.
NLP Contribution Graph
NLP Contribution Graph

Construction of knowledge graph with contributions from NLP research papers.

Accomplishments
  • Tools:Python, Pandas, Spacy, Keras
  • Classify sentences from a research paper into contribution and non-contribution categories with the help of a LSTM model.
  • Scientific entities and relational cue phrases extraction.
  • Triple formation with the help of OpenIE framework.
  • Classification of triples into various information units such as model, baseline, dataset, etc.
CGAN
Conditional image generation with GANs

Generative Adversarial Networks for the generation of Microscopic data

Accomplishments
  • Tools:Python, Keras, OpenCV, ImageJ
  • Uses image translation networks like Pix2Pix to generate label-specific microscopic images.
  • Dataset has a binary mask and the corresponding textured image for each category.
  • Generation of 3D images from 2D networks by using indiviudal z-slices of an image.
  • Modification of StarGAN framework to improve the textural range of fake images,
Screenshot of web app
DRDO SASE’s UAV Fleet Challenge

DRDO SASE’s UAV Fleet Challenge

Accomplishments
  • Tools:Python, QGround Control, Pixhawk flight controller
  • Detection of a green colored box located on a 40 x 30 field with the help of a swarm of drones in minimum time.
  • For object detection, a Yolo V3 model is deployed.
cnn model
A prototype of a self-driving car

A prototype of self-driving car using CNNs and Image processing techniques.

Accomplishments
  • In the image processing approach, the real-time image is processed and image processing techniques such as thresholding, centroid detection, etc. are applied to detect the lane and predict trajectory.
  • In Learning based approach, using open source machine learning libraries, the system is trained on a manually curated dataset, so that it is capable of identifying and extracting lanes in real-time from an unseen environment.
  • The output of the above two approaches is converted into steering angle which is given to a RC car to follow a lane in real-time.
salient detection
Frequency-tuned Salient Region Detection

A framework to find the saliency map of an image.

Accomplishments
  • The saliency map of an image is extracted by calculating the L2 norm between mean value pixel in LAB format and the Gaussian filtered image.
  • Comparison of fixed and adaptive thresholds applied on the mean shift segmented image.

Skills

Languages

Python
HTML5
CSS3
C
JavaScript
Shell Scripting

Libraries

NumPy
Pandas
OpenCV
scikit-learn
matplotlib

Frameworks

Keras
TensorFlow
PyTorch

Other

Git
AWS
MATLAB

Education

University of Texas, Dallas

Richardson, Texas

Degree: PhD in Computer Science
CGPA: 3.77/4

    Relevant Coursework:

    • Operating Systems
    • Database Design
    • Information Retrieval
    • Data Structures
    • Discrete Structures

IIT Gandhinagar

Gujarat, India

Degree: Master of Technology in Electrical & Electronics Engineering
CGPA: 9.0/10

    Relevant Coursework:

    • Natural Language Processing
    • Computer Vision
    • Design of Experiments
    • Mathematical Methods in Engineering
    • Advanced Numerical Methods in Engineering

Visvesvaraya National Institute Of Technology

Maharastra, India

Degree: Bachelor of Technology in Electrical & Electronics Engineering
CGPA: 7.79/10

    Relevant Coursework:

    • Data Structures and Algorithms
    • Calculus
    • Linear Algebra
    • Industrial Automation

Contact