Sentiment Classification Using Embeddings

This project implements an embedding-based sentiment classification system that classifies Twitter tweets into Positive, Negative, or Neutral sentiments using Gemini text embeddings and machine learning.

📌 Problem Statement

Social media platforms generate millions of posts daily, making manual sentiment analysis impractical. Understanding public sentiment helps brands, governments, and organizations make informed decisions.

🎯 Objective

The goal of this project is to build a sentiment classifier using:

Text preprocessing and cleaning
Semantic embeddings generated using Gemini
A machine learning classification model

📊 Dataset

Dataset: Twitter Tweets Sentiment Dataset
Size: ~27,000 tweets
Columns: textID, text, selected_text, sentiment
Sentiment Labels: Positive, Negative, Neutral

@Dataset Link: https://www.kaggle.com/datasets/yasserh/twitter-tweets-sentiment-dataset

🛠️ Technologies Used

Python
Pandas, NumPy
NLTK (text preprocessing)
Google Gemini Embeddings (text-embedding-004)
Scikit-learn (Logistic Regression)
Matplotlib, Seaborn
WordCloud
VS Code (Jupyter Notebook)

🔄 Project Workflow

Exploratory Data Analysis (EDA)
Text preprocessing and cleaning
Word cloud visualization
Embedding generation using Gemini
Model training using Logistic Regression
Model evaluation using classification metrics
Custom tweet sentiment prediction

📈 Results

The model successfully classifies tweets into positive, negative, and neutral categories.
Semantic embeddings capture contextual meaning effectively.
Custom user-defined tweets were accurately classified.

🧪 Sample Predictions

-"I absolutely love this new phone!" → Positive -"This service is horrible and frustrating" → Negative -"The event happened yesterday" → Neutral

🚀 How to Run the Project

Clone the repository:

git clone https://github.com/coderShreyIn/Sentiment_Classification_Using_Embeddings.git

Install required dependencies:

pip install -r requirements.txt

Add your Gemini API key in the code:

api_key = "WRITE_YOUR_API_KEY_HERE"

Open and run the notebook in VS Code or Jupyter Notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
project_Sentiment_Classification_Using_Embeddings.ipynb		project_Sentiment_Classification_Using_Embeddings.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Classification Using Embeddings

📌 Problem Statement

🎯 Objective

📊 Dataset

🛠️ Technologies Used

🔄 Project Workflow

📈 Results

🧪 Sample Predictions

🚀 How to Run the Project

About

Uh oh!

Releases

Packages

Languages

License

coderShreyIn/Sentiment_Classification_Using_Embeddings

Folders and files

Latest commit

History

Repository files navigation

Sentiment Classification Using Embeddings

📌 Problem Statement

🎯 Objective

📊 Dataset

🛠️ Technologies Used

🔄 Project Workflow

📈 Results

🧪 Sample Predictions

🚀 How to Run the Project

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages