dineshpalli/README.md

Hello, I am Dinesh Palli. 👋

I am a Data Engineer | Python Developer | Published Computational Biologist.

Glad to see you here!

📜 About

I am a data science professional with 5+ years of experience in machine learning, business intelligence, and automation. I thrive on turning complex data into actionable insights and compelling stories that drive business growth. My work often involves Python, SQL, and PyTorch, and I'm a passionate open-source contributor with tools recognized with over 300 stars on GitHub.

  • Proven track record in business environments, securing 4 partnerships and contributing to a 7.4% increase in market reach.

  • Experienced in biomedical AI and computational biology, including developing visualization tools for datasets with >250k rows.

  • Skilled in building end-to-end data pipelines, optimizing processes, and creating impactful KPI dashboards.


📑 Publications

SelfAdapt: Unsupervised Domain Adaptation of Cell Segmentation Models (IEEE, 2025)

Co-authored a publication on a novel, label-free method for adapting deep learning models to new data domains without supervision. The method achieved a relative performance improvement of up to 29.6% in AP0.5 over the baseline Cellpose model on the LiveCell dataset.


🚀 Recent Projects & Highlights

Here are some of the projects I've been working on recently:

  • ☁️ End-to-End Azure Data Pipeline: Designed and built a complete data engineering project, migrating data from an on-premises SQL Server to an Azure-based medallion architecture using Azure Data Factory, Databricks (PySpark), and Synapse Analytics, with insights delivered via Power BI. Check out the repo!

  • Cloud Resume Challenge: Deployed a full-stack, serverless web application on Microsoft Azure featuring a CI/CD pipeline with GitHub Actions, Infrastructure as Code with Terraform, a Python-based API, and a NoSQL database. Visit the website or see the code.

  • 💻 Advanced SQL Challenges: Completed both the LeetCode Top SQL 50 and StrataScratch Advanced SQL 25 challenges, sharpening my skills in window functions, complex joins, and query optimization.


Work Experience

Data Engineer | Neue Pressegesellschaft (Oct 2025 – Present)

  • I'm currently a Data Engineer at NPG Digital, where I build and manage our data ecosystem. My role spans the entire data lifecycle, from designing and building robust ETL pipelines to conducting deep-dive analytics and applying data science methodologies.

Sales Manager | seedalive GmbH (Aug 2025 – Oct 2025)

  • Led international business development initiatives in India, establishing strategic partnerships and expanding market presence through targeted client acquisition campaigns

  • Conducted comprehensive market analysis of India's seed industry, identifying high-potential strategic partners and compiling critical market intelligence on sector trends, competitive landscape, and growth opportunities

  • Developed and deployed interactive Power BI dashboards to visualize market research findings, enabling data-driven decision making and strategic planning for Indian market penetration

  • Architected and deployed server infrastructure solutions, ensuring seamless technical implementation and optimal system performance for client operations

  • Delivered comprehensive training programs to client personnel, including students and staff members, enabling successful execution of experimental protocols and enhancing operational capabilities

  • Led end-to-end project management of international business trips and corporate events, coordinating logistics, stakeholder engagement, and strategic planning to maximize ROI and client satisfaction

Barista & Store Operations | Copenhagen Coffee Lab (May 2025 – Jul 2025)

  • Handled all-round store duties, including expert preparation of a wide variety of coffees (Yes! Of course with latte art! ☕🎨), customer service, billing, point-of-sale (POS) operation, and inventory management

  • Ensured smooth daily operations, from store opening and closing to maintaining cleanliness, and coordinated food storage and supply logistics across multiple branches

Business Intelligence Working Student | Olympus Europa EMEA (Feb 2024 – Feb 2025)

  • Reduced processing time by 37.48% by digitizing operational processes with Microsoft Power Apps

  • Engineered an end-to-end data pipeline in Python for ETL, automated regulatory reporting, and database management

  • Automated KPI dashboards using Power BI, data modeling, and statistical analysis

Machine Learning Working Student | Charité – University Medicine (Dec 2022 – Feb 2024)

  • Engineered and published a machine learning model for cell segmentation

  • Developed a visualization tool for large-scale single-cell datasets, improving data comprehension

  • Automated code quality checks and enforced coding standards to enhance codebase stability

Biomedical AI – Master Thesis | Helmholtz AI (Feb 2023 – Aug 2023)

  • Increased clustering purity of high-dimensional cytometry data by 26.31% by implementing Neighborhood Component Analysis (NCA)

  • Developed a comprehensive analysis pipeline for high-dimensional biological image data, enabling visualization across datasets with over 22,000 labeled images

Open-Source Developer | Helmholtz AI (2021 – 2022)

  • Built ETL functions processing 250k+ rows, adopted by 1000+ biologists

I am part of the open-source community, where I contributed to the development of Squidpy and the Squidpy notebooks, tools for the analysis and visualization of spatial molecular data. I also contributed to the development of PolarityJam, a tool designed for extracting, analyzing, and visualizing cellular data from images. I have experience using PyTorch, NumPy, Pandas, Linux, and HPC.

I tutored the courses "Machine Learning in Image Analysis" at the Hasso Plattner Institute for 72 students and "Introduction to Single-Cell RNA Sequencing Data Analysis with ScanPy" at Helmholtz AI for 28 students.


⚽ Hobbies & Interests

When I'm not busy being a full-time nerd, I'm probably off on a different kind of adventure. You can find me backpacking through the wilderness, trying out new gourmet recipes in the kitchen, or devouring the pages of a good book. I also enjoy staying active through working out, badminton, skating, and swimming. I love learning new things and believe in being a jack-of-all-trades, always curious and ready for the next challenge.

I may not have any superpowers, but I pride myself on being a kind and honest person. It's my own personal version of a cape and tights (and definitely easier to wash 😅). I believe nothing is as good if you don't share it. Keep smiling!


💼 Technical Skills

Python, R, JavaScript, MySQL, Microsoft SQL Server, NumPy, Pandas

scikit-learn, PyTorch, TensorFlow, Hugging Face

Matplotlib, Power BI, Tableau

AWS, Microsoft Azure, Azure DevOps, Azure Functions, Databricks, Terraform, Docker

GitHub Actions, Bitbucket Pipelines, Codecov, ETL

Jupyter Notebook, Google Colab, PyCharm, Visual Studio Code, Sublime Text

Git, GitHub, Bitbucket

Notion, Sphinx, ReadMe, Read the Docs, Markdown, JSON, YAML

Adobe Photoshop, Affinity Designer, Affinity Photo, Affinity Publisher, Canva

macOS, Debian, Ubuntu, Windows, Apple Silicon, Microsoft, Microsoft Office, Microsoft Excel, Microsoft Word

ChatGPT, GitHub Copilot

LinkedIn, LeetCode, Quora, Reddit


📈 My GitHub Stats

Dinesh's GitHub stats


🀝 Connect With Me

  • DineshPalli | LinkedIn

  • Dinesh_Palli | LeetCode

  • Dinesh_Palli | Xing

  • Dinesh_Palli | X

  • Dinesh Palli | Instagram

  • Dinesh Palli | Goodreads

  • Dinesh Palli | Email

Popular repositories

  1. terraform-zero-to-hero (Public)

    Forked from iam-veeramalla/terraform-zero-to-hero

    Master Terraform in 7 days using this Zero to Hero course.

    HCL 5 11

  2. rg-data-engineering-project (Public)

    Forked from lukejbyrne/rg-data-engineering-project

    Jupyter Notebook 1

  3. GameteBinning (Public)

    Forked from schneebergerlab/GameteBinning

    C++

  4. scanpy (Public)

    Forked from scverse/scanpy

    Single-cell analysis in Python. Scales to >1M cells.

    Python

  5. squidpy_notebooks (Public)

    Forked from scverse/squidpy_notebooks

    Tutorials for Squidpy

    Python

  6. squidpy_reproducibility (Public)

    Forked from theislab/squidpy_reproducibility

    Jupyter Notebook