Poornachandraprasad Kongara

Boston, MA, USA | 6173723096 | ckongarac@gmail.com

Results-driven professional with a Master's degree in Management Information Systems from Northeastern University and a Bachelor's degree in Computer Science from Mumbai University. Adept in leveraging a comprehensive skill set encompassing programming languages (Python, C, R), databases (Oracle SQL, MySQL, NoSQL), and data analysis and visualization tools (Power BI, Tableau, TensorFlow). Proven track record in software development, data engineering, and implementing machine learning models. Skilled in ETL processes, data warehousing, and cloud technologies (AWS, Azure, GCP). Recognized for optimizing workflows, enhancing website performance, and collaborating with cross-functional teams to scale operations. Extensive experience in diverse projects, including predictive modeling, system enhancements, and data-driven decision-making. Committed to delivering innovative solutions in data science and analytics.

Experience

Data Analyst

CGI | Houston, USA

• Engineered automated expense reporting and reconciliation pipelines using Python and SQL, streamlining financial workflows across 6+ consular departments and reducing manual processing time by 40%.
• Built interactive Tableau dashboards analyzing $2.3M+ in consular expenditures by integrating SQL and Excel data sources, enabling leadership to make fiscal decisions 3x faster.
• Performed end-to-end data cleaning, validation, and preprocessing on 50,000+ financial records using Python (Pandas) and Excel, reducing inconsistencies by 35% and improving reporting accuracy to 98%+.

May 2025 – Present

Data Analyst

VERTEX PHARMACEUTICALS | Boston, USA

• Streamlined cellular assay data analysis using Databricks, implementing k-means clustering and logistic regression, reducing processing time by 20% and delivering actionable insights faster.
• Led A/B testing projects to optimize customer engagement strategies, resulting in a 5% increase in key metrics such as click-through rate, conversion rate, and user engagement.
• Analyzed Flow, Nucleic acid, and T1D assay data using Python and SQL, conducting exploratory data analysis (EDA) to improve scientist comprehension by 40%, enabling more effective decision-making.
• Developed dynamic Power BI dashboards and integrated 4 data sources with SQLAlchemy, enabling real-time tracking of 15+ KPIs and improving decision-making efficiency for 10+ stakeholders.
• Designed and automated ETL pipelines using Python and AWS Redshift, processing 8M+ records and saving 100+ hours of manual work monthly, boosting operational efficiency by 50%.
• Optimized SQL workflows for analytics dashboards, reducing cloud storage costs by $10K annually and enhancing reporting efficiency by 30%.

January – June 2024

Operations Analyst

Northeastern University | United States

• Conducted exploratory data analysis (EDA) on search filter usage across two distinct search experiences using SQL and Excel, optimizing filter functionality, reducing site latency by 10%, and improving overall user experience.
• Optimized SQL queries to enhance the efficiency of Looker Studio dashboards, processing 25+ reports more efficiently, improving performance, and reducing server resource usage and cloud storage costs by $10,000 annually.
• Streamlined dashboard workflows in Looker Studio by automating data retrieval and integration processes, improving data accessibility and reducing manual efforts by 30%.
• Leveraged Excel tools such as Pivot Tables and VLOOKUP to analyze large datasets, enabling actionable insights and improving reporting accuracy across key business metrics.

January 2023 – December 2023

Graduate Teaching Assistant

Northeastern University | Boston, Massachusetts, United States

• Delivered lectures on "INFO 7374 - Applied Machine Learning using Python in Finance", simplifying complex concepts in machine learning, quantitative finance, and Python programming for enhanced student comprehension.
• Guided hands-on projects in topics such as statistical analysis, machine learning algorithms, and data analytics, equipping students with essential skills for practical applications in finance and data analysis.
• Facilitated in-depth discussions on advanced topics, including predictive modeling, financial forecasting, and Python-based data processing, fostering critical thinking and problem-solving among students.
• Mentored students in applying machine learning techniques to real-world finance scenarios, enhancing their technical proficiency and career readiness in the data-driven finance industry.

Skills: Python (Programming Language) · Machine Learning · Data Analytics · Financial Analysis · Predictive Modeling · Data Processing · Problem Solving in Finance · Applied Mathematics · Financial Forecasting · Statistical Data Analysis

September 2023 – December 2023

Data Engineer

Iksula Services Pvt Ltd | Mumbai, India

• Analyzed eBay Global Shipping data using Python, SQL, and PySpark-driven ETL solutions, delivering data-driven insights that improved operational processes and contributed $100K in additional monthly revenue.
• Performed A/B test analysis on 10+ KPIs, including add-to-cart rate and conversion rate, using Google Analytics, uncovering insights that drove a 4% increase in sales conversions and enhanced customer engagement.
• Enhanced data retrieval processes with advanced SQL techniques such as joins, CTEs, and window functions, streamlining data preparation for 50+ reports and saving 10+ hours per week in manual processing.
• Visualized insights using Tableau, creating dashboards that effectively communicated key trends and operational metrics to stakeholders for data-driven decision-making.
• Optimized data workflows with Airflow, automating report generation and reducing processing time by 30 minutes per report, significantly improving operational efficiency.
• Implemented scalable ETL pipelines using PySpark, ensuring efficient data processing and improving data quality across eBay’s global shipping datasets.
• Collaborated with cross-functional teams to analyze data using SQL views, presenting findings that informed strategic decisions and enhanced operational performance.

November 2020 – April 2022

Data Engineer Intern

Iksula Services Pvt Ltd | Mumbai, India

• Analyzed 10,000+ shipments by major carriers like UPS etc. using SQL queries to provide insights into sales performance.
• Utilized Power BI and Tableau to analyze data on active users, purchases resulting in 5% improvement in the system model.
• Collaborated with a team of 10 shipping experts in a fast-paced environment to implement changes based on the insights gained using the analyzed data, resulting in a 20% increase in shipping efficiency and an impressive 15% decrease in shipping costs.

May 2020 – October 2020

Education

Northeastern University, Boston, MA

Master of Science in Information Systems
  • Data Science

Relevant Coursework: Application Engineering & Development, Database Management & Data Design, Data Science Engineering Methods and Tools, Applied Machine Learning and Python in Finance

Expected May 2024

University of Mumbai

Bachelor of Engineering
  • Computer Engineering

Relevant Coursework: Sequence of Applied Mathematics, Discrete Mathematics, Object Oriented Programming, Data Structures, RDBMS, Machine Learning, Natural Language Processing, Python, Theory of Computer Science and Communications Skills.

CGPI: 8.05/10

August 2016 - August 2020

Projects

Hair Loss Prediction and Data Analysis

(Python, SQL, Web Scraping, Pandas, Tweepy, Excel, Matplotlib)

• Developed and optimized a machine learning model, achieving an impressive 89.9% accuracy for hair loss prediction.
• Streamlined data science lifecycle by reducing data processing time by 10% through effective data preparation techniques.
• Emphasized hyperparameter tuning, yielding a 15% query efficiency boost and model refinement for superior performance.
• Demonstrated expertise in model interpretation techniques like SHAP, LIME, and PDP, enhancing transparency, trust, and actionable insights, driving a 10% improvement in query efficiency.

January 2023 - May 2023

Twitter ETL Pipeline with Apache Airflow

Project GitHub Repository
(Python, Apache Airflow, Tweepy, Pandas, AWS EC2, AWS S3, S3FS)

● Automated ETL Process: Developed an automated ETL pipeline to extract tweets from a specific Twitter account, transform the data, and load it into AWS S3 using Airflow for scheduling.
● Data Extraction and Transformation: Utilized Tweepy to interact with the Twitter API, fetching the latest 200 tweets. Cleaned and transformed the raw data using Pandas to create a structured DataFrame that includes tweet text, retweets, favorites, and timestamps.
● Data Loading and Cloud Integration: Implemented S3FS to store transformed data in an AWS S3 bucket for long-term storage and analysis. Deployed the solution on an AWS EC2 instance to manage the workflow execution efficiently.
● Scheduling and Automation: Scheduled the ETL process using Airflow DAGs to run daily, ensuring continuous and up-to-date data retrieval from Twitter.

October 2024 - Present

Northeastern University Events Management System

(Python, SQL, Web Scraping, Pandas, Tweepy, Excel, Matplotlib)

● Database Design and Implementation: Created a database structure with an ERD including tables for events, clubs, organizers, and tweets. Developed use cases and relational algebra queries to extract and manipulate the data.
● Data Gathering and Quality Assurance: Gathered data about events by web scraping data from Twitter and Northeastern websites. The web scraped data was then inserted into an Excel sheet, and data cleaning and munging was performed using popular data analysis libraries like Pandas, tweepy, and pymysql. Ensured data quality by auditing the data for accuracy, validity, completeness, and consistency before inserting it into the SQL database using SQL files.
● Data Visualization and Optimization: Implemented use cases that represent different ways that the database might be queried and manipulated to extract useful information. Created data visualizations using Pandas and Matplotlib to help users easily understand and interpret the data. Normalized data to 3NF and created SQL views for optimized data access.

September 2022 - December 2022

Automatic Lip Reading: Classification of Words and Phrases using Convolutional Neural Network

(Python, OpenCV, Convolutional Neural Network, Matplotlib, Classification, Pandas, NumPy)

● Data Pre-processing: In this stage, the haar cascade model and the Python package dlib were used, together with Open CV, to acquire all the points of the facial structure. These points were then used to crop the speaker's face. Concatenated the cropped frames into a single image to get all the faces of speakers expressing words and phrases in a single frame, then resized to boost scalability by 10%
● Model Implementation: Designed a CNN model with four convolutional layers and two fully connected layers. The two completely linked layers employed a soft max activation function layer to generate the probabilities of each word and phrase, with the highest probability being chosen
● Result And Analysis: Understanding the errors by using accuracy as the primary criteria and the confusion matrix. Highest training accuracy was 99.26%, while maximum testing accuracy was 80.44%. The model was trained for 50 epochs at 0.0001 learning rate, increasing accuracy from 10% to 80.44%

August 2019 - May 2020

Wine Quality Detection using Machine Learning

(Python, Matplotlib, Pandas, NumPy)

● Modeling: • Created a multi-class classification model using Artificial Neural Networks in Python to rate the quality of red and white wines on a scale of 1 to 10, with categorical cross-entropy as the loss function.
● Testing and Training: • Developed a neural network to rate wine quality from 1-10 using 1600 samples, achieved a 36% validation loss reduction, and employed Matplotlib, Pandas, and NumPy for data handling and visualization.

June 2019 - Dec 2019

Translator using Natural Language Processing

((Python, Tkinter, NLTK)

● GUI Implementation: Tkinter was used to create the GUI for translating words and phrases from English to Russian
● Modeling: Generalized tokenization for parsing the data into small chunks in which meaning can be assigned easily, stemming to remove the affixes and removal of stop words along with POS Tagging using natural language processing

Jan 2019 - May 2019

Skills

Programming & Technologies
  • Python, C, R, ETL, PySpark, Airflow, Databricks, HTML, CSS

Databases
  • PostgreSQL, MS SQL Server, MySQL

Data Analysis and Visualization
  • Power BI, Tableau, Exploratory Data Analysis, NumPy, Matplotlib, Pandas, Seaborn

Machine Learning
  • CNN, Hypothesis Testing, GenAI, Scikit-Learn, Keras, Regression, Classification, Statistical Analysis

Cloud and Big Data
  • AWS, Azure, Shell Scripting, Bitbucket, Confluence, CI/CD, Restful APIs, Docker

Tools and Productivity
  • Excel, PowerPoint, Agile, Pivot Tables, Linux, Jira, Git, Postman, Analytical, Detail Oriented

Publications

  • Automatic Lip Reading: Classification of Words and Phrases using Convolutional Neural Network Language based upon the classification of 10 words and 10 phrases with the help of deep learning and image processing in association with International Research Journal of Engineering and Technology(IRJET).

    Published Paper Link

Leadership Experience

  • ● Lead Developer – eBay Global Shipping Project, Podar International
    ● Member – IEEE RGIT
    ● Asst. Secretary – Sports Committee RGIT