Hi, I'm Phaneesha

MS DS @ University at Buffalo,SUNY

Paddling through the ocean that is Machine Learning.

Contact Me

About Me

My Introduction

Data Scientist Aspirant with a Master's Degree Seeking Full-Time Position to Utilize Skills in Machine Learning, Python, and Data Analysis to Drive Business Decisions.

>10 Data Science Projects
Completed
5 Articles
Written
2 Published
Papers

Skills

My Technical Level

Development

All About the Core

Python

90%

Java

80%

PySpark

75%

R

70%

C++

40%

JavaScript

70%

Android

85%

MS Excel

70%

Photoshop

70%

Indesign

90%

Frameworks

Everyone Needs Support

NumPy

80%

pandas

90%

matplotlib

70%

scikit-learn

85%

Spark MLlib

70%

Pytorch

85%

Deep Graph Library

55%

OpenCV

65%

Pillow

65%

NLTK

60%

streamlit

80%

seaborn

70%

Flask

40%

Machine Learning

Theory, theory!

Linear and Logistic Regression

95%

Decision Trees

95%

Ensemble Models

90%

Clustering

65%

Convolutional Neural Networks

80%

Graph Neural Networks

60%

Recommender Systems

75%

Natural Language Processing

65%

Exploratory Data Analysis

90%

Multi-modal Learning

70%

Time Series

55%

Cloud and Engineering

Fly Fast & High!

AWS Sagemaker

65%

AWS EMR

75%

AWS Lambda

70%

Big Query

40%

Docker

60%

Apache Airflow

40%

Kafka

40%

Databases and Viz

Wow! Factor

MySQL

85%

AWS Redshift

75%

Amazon RDS

70%

Tableau

50%

Power BI

50%

Looker

60%

Qualification

My Personal Journey
Education
Work

Master's of Professional Studies in Data Sciences and Applications

SUNY, University at Buffalo, NY, USA
2021-2023

Extension - M.Tech in Advanced Manufacturing Systems

Jawaharlal Nehru Technological University, Hyderabad, India
2019-2021

B.Tech in Mechanical Engineering

Jawaharlal Nehru Technological University,Hyderabad, India
2015-2019

Higher Secondary in Science

Sri Chaitanya Junior College, Hyderabad, India
2013-2015

Secondary

St.Arnold's High School, Hyderabad, India
2008-2013

Software Engineer

Illinois Secretary of State
October 2023 - Present
What I did here

  • • Designed and implemented interactive dashboards using Power BI, SSRS, and related technologies, translating complex data into actionable insights for informed decision-making.

  • • Contributed significantly to the creation and optimization of web applications for online driver services, ensuring seamless functionality and user satisfaction, employing technologies like HTML, CSS, and JavaScript to create user-friendly and responsive interfaces.

  • • Collaborated closely with teams using technologies like Java, Python, and SQL Server Reporting Services (SSRS) to align software projects with strategic goals, emphasizing effective cross-functional communication and collaboration.

Data Scientist - Team Lead

Community Dreams Foundation
April 2023 - October 2023
What I did here

  • • Led a team of data research analysts in conducting energy-related data assessments and analysis..

  • • Gathered, analyzed, and interpreted large datasets on energy consumption, renewable energy sources, and environmental impact.

  • • Utilized statistical models and data visualization techniques to identify trends, patterns, and insights.

Data Scientist Intern

Marvel Technology Solutions
June 2022 - August 2022
What I did here

  • • Developed a robust recommendation system for an e-commerce company using Python and its associated data science libraries, including Pandas, NumPy, and Scikit-learn.

  • • Utilized advanced recommendation algorithms such as collaborative filtering, content-based filtering, and hybrid methods to deliver personalized product recommendations to millions of customers and used Tableau for data visualization and reporting to make insights easily understandable for stakeholders.

  • • Leveraged cutting-edge techniques such as dimensionality reduction and hyper parameter tuning for data preprocessing, feature engineering, and model selection, and utilized Apache Spark and Tensor Flow for distributed computing and deep learning to handle massive amounts of data.

Data Scientist

Sarag Systems
May 2019 - July 2021
What I did here

  • Developed a content-based recommendation engine for the Whitecoats application, a widely used tool among healthcare professionals, aimed at suggesting relevant articles.

  • Conducted in-depth data analysis using nltk and spacy packages, exploring millions of data records.

  • Generated weighted embeddings using TFIDF scores and PubMed embeddings.

Research Intern

Indian Institute of Technology,Hyderabad.
Feb 2017 - November 2019
What I did here

  • Collaborated with a PhD Scholar to design and fabricate a Passive Dynamic Walker using Solid Works and Laser beam machine, respectively, culminating in a successful analysis and optimization of the design, resulting in a 20% increase in efficiency.

  • Conducted simulation of the walker's behavior on a slope with an inclination of 3 degrees using MATLAB, leading to successful analysis and optimization of the design.

  • Presented the project at the Connaisance Conference, where the improved efficiency and innovative design were showcased to a diverse audience of over 50 professionals.

Mathematics Tutor and Academic Mentor

Self-Employeed
June 2013 - July 2021
What I did here

  • Provided mathematics tutoring to high school and college students, focusing on strengthening foundational knowledge and problem-solving skills.

  • Guided students in preparing for the competitive EAMCET and JEE exams, offering targeted instruction and exam strategies.

  • Facilitated home schooling for primary to high school level students during the pandemic, creating customized curriculum and utilizing innovative teaching methods.

Subject Matter Expert

Chegg Inc
June 2020 - Apr 2021
What I did here

  • Worked as Calculus Subject Matter Expert (Mathematics)

Portfolio

My Projects

OptiRoute: Dynamic TSP Solver

Efficient Algorithmic Solutions and Automated Coordinate Retrieval

  • Leveraged Nearest Neighbors algorithm for efficient TSP computation.

  • Integrated Nominatim API for automated city coordinate retrieval.

  • Employed GIF-based animated tour generation for interactive route visualization.

  • Tech Stack

    Folium Folium Folium

    Research Papers Referred

    View Code View Report

    Grid-based Game

    Exploring Hyperparameters and Performance Analysis

  • Implemented SARSA algorithm for training an agent in a gaming environment.

  • Conducted experiments with various hyperparameters to analyze their impact on agent performance.

  • Evaluated the effectiveness of different hyperparameter configurations to optimize agent learning in the gaming environment

  • Tech Stack

    Folium Folium

    Research Papers Referred

    View Code
  • Object Localization with PyTorch

    Building a Model for Precise Object Localization in Images

  • Developed a deep learning model using PyTorch for object localization, enabling precise identification and localization of objects within images.

  • Implemented advanced techniques such as data augmentation, transfer learning with pre-trained models, and custom dataset creation to enhance model performance and accuracy.

  • Successfully trained and evaluated the model, achieving significant improvements in object localization accuracy compared to traditional computer vision techniques.

  • Tech Stack

    Folium

    Research Papers Referred

    View Code

    RoadNet

    Efficient Road Segmentation using UNet and EfficientNet-B0

  • Developed a deep learning-based road segmentation model using UNet architecture and EfficientNet-B0 encoder.

  • Applied extensive data augmentation techniques to improve model performance and generalization.

  • Achieved high accuracy in road segmentation on a large-scale dataset, demonstrating the potential for real-world road detection applications.

  • Tech Stack

    Folium

    Research Papers Referred

    View Code

    NYC Collision Analysis Web Application

    Extracting, Analyzing, and Visualizing Data for Safer Roads

  • Built a cutting-edge web application on NYC Collision Analysis using Python, Streamlit, PyDeck, GCP, BigQuery, SQL, and Looker Studio.

  • Extracted and processed extensive collision data from NYC Open Data, storing it in GCP BigQuery for advanced analysis.

  • Created interactive visualizations and a user-friendly interface to explore collision data, with filtering and mapping capabilities.<

  • Tech Stack

    Folium Folium Folium Folium
    View Report

    Optimizing SVMs for Purchase Prediction

    Evaluating Linear, Radial, and Polynomial Kernels with Tuned Parameters

  • Trained Support Vector Machines (SVMs) with linear, radial, and polynomial kernels on the OJ dataset.

  • Evaluated training and test error rates for each SVM model and identified optimal cost values.

  • Tech Stack

    Folium Folium
    View Code

    Cleveland Heart Disease Diagnosis

    Evaluating Neural Network, CART, and Random Forest

  • Utilized machine learning algorithms, including Neural Network, CART, and Random Forest, to analyze the Cleveland heart disease dataset.

  • Compared the performance of the models based on metrics such as mean squared error, accuracy, and error rate.

  • Found that Random Forest achieved the highest accuracy (83.14607%), followed by the Neural Network (77.72006%), and CART model.

  • Tech Stack

    Folium Folium Folium
    View Code

    Predicting Car Mileage

    Exploring LDA, QDA, Logistic Regression, and KNN

  • Created a binary variable, mpg01, based on car mileage to facilitate prediction using the Auto dataset.

  • Explored the association between mpg01 and other features through scatterplots, boxplots, and correlation analysis.

  • Applied different classification models such as LDA, QDA, Logistic Regression, and KNN to predict mpg01 and calculated their respective test error rates.

  • Tech Stack

    Folium Folium Folium Folium Folium
    View Code

    Boston Data Analysis

    Using AIC, BIC, and Cross-Validation Techniques

  • Loaded the required libraries for the assignment, including ISLR2, ISLR, bootstrap, boot, leaps, klaR, class, GGally, corrplot, caret, and rpart.

  • Performed best subset regression analysis on the Boston data from the ISLR2 package, using AIC, BIC, five-fold cross-validation, and ten-fold cross-validation for model selection.

  • Observed different results for the selected models using AIC, BIC, five-fold cross-validation, and ten-fold cross-validation.

  • Tech Stack

    Folium Folium Folium Folium
    View Code

    Chicago: Crime Data Analysis

    Google BigQuery

  • Utilized Google Cloud Platform's BigQuery to analyze Chicago crime data.

  • Explored crime patterns and trends in Chicago using various queries and data analysis techniques.

  • Gained insights into the types of crimes, their frequency, geographical distribution, and temporal variations to inform decision-making and resource allocation for crime prevention efforts.

  • Tech Stack

    Folium Folium Folium
    View Code

    Autism Detection

    Random Forest, and Boosting Algorithms

  • Performed preprocessing and feature engineering on genome data by removing irrelevant features, normalizing the data, and selecting relevant genetic markers.

  • Trained a random forest model to detect autism, leveraging its ability to handle complex datasets like genome data.

  • Explored and compared the performance of boosting algorithms (GBM, Adaboost, XGBoost) to improve the random forest model's baseline performance.

  • Tech Stack

    Folium Folium Folium Folium
    View Code

    Blog

    My Technical Articles

    Unleashing the Power of the Sum of Spiral Diagonal Formula

    "From Space Science to Finance and Genetics" explores the versatile applications of the sum of spiral diagonal formula in astronomy, finance, and genetics. Researchers aim to harness its potential for innovation and advancements in these fields.

    Read it!

    The Art of Hiding Messages: Exploring the Fascinating World of Steganography

    This project explores various techniques and tools used in steganography, highlighting its applications in data security, covert communication, and digital forensics. By uncovering the secrets of steganography, researchers aim to raise awareness about its potential risks and promote discussions on encryption and privacy in the digital age.

    Read it!

    Traversing the City of Dreams: Mapping the Perfect Tour using Python and Folium

    "Using the nearest neighbors algorithm and folium mapping, we solved the Traveling Salesman Problem and visualized the optimal tour on a geographical map."

    Read it!

    Did you know how ancient people sent secret messages

    "Caesar Cipher: Ancient Encryption Unveiled" explores the secrets of a historic encryption technique used by Julius Caesar. Discover the workings of this cipher and its relevance in modern cryptography.

    Read it!

    Chicken Biryani

    From My Kitchen to Yours: Making Mouthwatering Chicken Biryani at Home

    Read it!

    A Story of Hidden Messages

    Explore the fascinating world of encoding and decoding and discover how it is used to send hidden messages.

    Read it!

    Contact Me

    Get in Touch

    Call Me

    +1 (716) 299 8640

    Location

    Buffalo, NY, USA