Skip to content
View Gabya06's full-sized avatar

Highlights

  • Pro

Block or report Gabya06

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Gabya06/README.md

πŸ‘‹ Hi there, I'm Gaby.

I’m a Data Scientist with expertise in machine learning, NLP, multimodal models, and data analytics. I specialize in building scalable ML models, optimizing data pipelines, and deploying models in production environments.

πŸ” What I Do

  • Machine Learning & NLP – Developing and fine-tuning models like RoBERTa, CLIP, and T5 for classification, sentiment analysis, and text generation.
  • Data Analysis & Processing – Querying and analyzing data using SQL (Athena, Redshift), Pandas, and PySpark.
  • Model Deployment & MLOps – Working with AWS SageMaker, CI/CD pipelines, and monitoring ML models for performance optimization.
  • Forecasting – Using N-BEATS for multi-series forecasting to predict future trends across multiple time series.
  • Visualization & Insights – Building Streamlit applications, dashboards, and reports to communicate findings effectively.

πŸ“š Projects

Explore my portfolio to see projects covering:

  • T5 & BART text summarization models for automated summarization tasks.
  • Sentiment analysis on Reddit and Glassdoor reviews using NLP models.
  • Data pull via PRAW API for social media data collection.
  • Streamlit app tutorial for building interactive data visualizations.
  • Classification models for genre predictions based on text features.
  • Ensemble methods for price predictions using multiple machine learning models.

πŸ› οΈ Tools & Technologies

  • Languages: Python, SQL
  • ML Frameworks: PyTorch, TensorFlow, Hugging Face
  • Databases: Amazon Redshift, AWS Athena, PostgreSQL
  • Cloud & MLOps: AWS SageMaker, Docker, CI/CD
  • Visualization: Streamlit, Tableau, Looker Studio

✨ Fun Facts

  • πŸ‘©β€πŸ’» When I'm not diving into data science, I teach seminars part-time, sharing knowledge and helping others grow in their careers.
  • πŸ‹οΈβ€β™€οΈ I’m a CrossFit enthusiast - it's challenging but definitely fun! I enjoy pushing myself and seeing progress along the way!
  • 🐾 I’m also a proud pet parent to two cats who keep me on my toes (and provide plenty of cuddles 🐱).

πŸ“« Connect with Me

Let’s connect! Check out my blogposts and projects. Feel free to reach out if you want to talk fitness, cats, or anything tech-related!

Pinned Loading

  1. gabya06.github.io Public

    This repo serves as a hub for my current and past data science projects.

    Jupyter Notebook

  2. sentiment_analysis_glassdoor Public

    Sentiment Analysis on glassdoor.com data science jobs

    Jupyter Notebook 1

  3. RentWatchAI Public

    Jupyter Notebook

  4. defective_products Public

    Streamlit analysis for defective products

    Python

  5. nlp_genres Public

    Jupyter Notebook