  • Guildford
ritwik4m/README.md

Hi there, I'm Ritwik 👋


👨‍💻 About Me

I'm a Data Science professional with an MSc in Data Science (Distinction) from the University of Surrey. I bring over 5 years of QA Engineering and data validation experience, combined with modern data science expertise in:

  • 🧠 Machine Learning
  • 🐍 Python
  • 🧮 SQL / NoSQL
  • 📊 Data Visualization
  • 📚 NLP & Deep Learning
  • ☁️ Cloud Computing

I'm driven to build reliable, scalable, and data-driven solutions that solve real-world problems and deliver measurable business value.


🚀 Projects

Biomedical abbreviation & long-form detection using CRF, BiLSTM, and RoBERTa
Achieved F1 ≈ 0.86 with RoBERTa + LION. Explored distillation and pruning to address real-time inference constraints.

📈 HR Data Salary Prediction

Modeled expected salary (CTC) based on candidate profile data
Achieved R² ≈ 0.98 (with prior salary), R² ≈ 0.90 (without).
Applied regression, feature engineering, and fairness evaluation.


⚙️ Tools & Technologies

Python, R, SQL, scikit-learn, NLP, Git, Docker, AWS, Google Cloud, Microservices, Tableau, KNIME, Power BI, OpenAI, Transformers, Colab, Jupyter, GitHub


📊 GitHub Stats

GitHub Stats Top Langs


📫 Connect with Me

LinkedIn Email


🧠 “Data is the new oil, but insight is the spark that makes it valuable.”

Popular repositories

  1. NLP-lab-2025 (Public)

    Jupyter Notebook

  2. NER-Token-Classification (Public)

    Token classification for biomedical abbreviation and long-form detection (PLOD-CW-25 dataset)

    Jupyter Notebook 1

  3. salary-prediction-delta (Public)

    ML model to predict fair salaries for Delta Ltd applicants

    Jupyter Notebook

  4. ritwik4m (Public)

  5. ml-algorithms-portfolio (Public)

    Comprehensive ML portfolio covering regression, classification, clustering, ILP, and reinforcement learning, built during MSc Data Science at University of Surrey.

  6. youtube-video-summarizer-qa (Public)

    A Python pipeline to process YouTube video transcripts: restoring punctuation, summarizing in chunks, and answering questions via OpenAI GPT-3.5.

    Jupyter Notebook