Skip to content
View CarlitosCarreras's full-sized avatar

Block or report CarlitosCarreras

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
CarlitosCarreras/README.md

Carlos F. Carreras De León

📍 Spain
💼 LinkedIn
📧 carlitoscdl@gmail.com
🐙 GitHub

Data Scientist | Data Analyst | Computational Social Scientist | Economist

I am a Data Scientist and Economist with a Master's degree in Social Data Science from the University of Granada (Spain). I specialize in transforming complex social, economic, and behavioral data into actionable insights through machine learning, statistical modeling, natural language processing (NLP), social network analysis, and data visualization.

My work combines quantitative rigor with real-world problem solving. I enjoy building data-driven solutions that help organizations understand people, markets, technologies, and public policies.

I am currently interested in opportunities related to:

  • Data Science
  • Data Analytics
  • Business Intelligence
  • Machine Learning
  • Computational Social Science
  • Social Research & Analytics
  • Marketing Analytics
  • E-commerce Analytics
  • Tourism Analytics
  • Sustainability & ESG Analytics
  • Urban and Public Services Analytics
  • Public Policy Evaluation
  • Responsible AI & Data Ethics

What I Do

I use data to answer questions such as:

  • What drives customer satisfaction?
  • How do people interact with technology?
  • Which factors explain social and economic inequalities?
  • How can organizations extract insights from large volumes of text data?
  • Can machine learning improve decision-making in business and public policy?
  • How do online communities form, interact, and influence each other?

My approach combines:

  • Statistical inference
  • Predictive modeling
  • Machine learning
  • Natural language processing
  • Social network analysis
  • Survey data analysis
  • Data storytelling

Featured Projects

Digital Capital Gap in Andalusia

Analysis of the determinants of digital capital using official microdata from the Spanish National Statistics Institute (INE).

Skills: Logistic Regression, Marginal Effects, Odds Ratios, Survey Data Analysis, Statistical Modeling.


Global Happiness Prediction

Machine learning models predicting subjective well-being using data from the World Values Survey across multiple countries.

Skills: Random Forest, XGBoost, Cross-Validation, Feature Engineering, Predictive Analytics.


Brexit Immigration Discourse and Toxicity Analysis

Large-scale analysis of YouTube comments related to immigration during the Brexit referendum campaign.

Skills: NLP, Semantic Embeddings, Toxicity Detection, Political Communication Analysis, Computational Social Science.


Social Media Sentiment and Network Analysis

Analysis of social media conversations through sentiment analysis and network analytics to identify influential actors, community structures, and patterns of interaction within online discussions.

Skills: Social Network Analysis, Graph Analytics, Community Detection, Sentiment Analysis, Social Media Analytics, Data Visualization.


Tourist Satisfaction Analysis in Seville Using NLP

Analysis of online reviews from TripAdvisor and Civitatis to estimate tourist satisfaction and identify key drivers of visitor experiences.

Skills: NLP, Sentiment Analysis, Web Scraping, Transformer Models, Tourism Analytics.


Technical Skills

Programming

  • Python
  • R
  • SQL

Data Science & Machine Learning

  • Scikit-learn
  • XGBoost
  • Random Forest
  • Logistic Regression
  • Statistical Modeling
  • Predictive Analytics

Natural Language Processing

  • Transformers
  • Hugging Face
  • BERTopic
  • Sentiment Analysis
  • Topic Modeling
  • Text Mining
  • Semantic Embeddings

Network Analysis

  • Social Network Analysis (SNA)
  • Graph Analytics
  • Community Detection
  • Network Visualization
  • Gephi

Data Analysis & Visualization

  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Plotly
  • ggplot2
  • Power BI

Research & Analytics

  • Survey Data Analysis
  • Quantitative Social Research
  • Policy Evaluation
  • Computational Social Science
  • Applied Statistics

Professional Interests

I am particularly motivated by projects involving:

  • Technology and digital transformation
  • Consumer behavior and marketing analytics
  • E-commerce and customer intelligence
  • Social and economic research
  • Tourism intelligence
  • Sustainability and ESG metrics
  • Smart cities and urban services
  • Public policy and social innovation
  • Responsible AI and data ethics

Education

MSc in Social Data Science
University of Granada, Spain

BSc in Economics Instituto Tecnológico de Santo Domingo, Dominican Republic


Let's Connect

I am always interested in collaborating on data-driven projects and exploring opportunities where data can generate measurable impact for organizations, communities, and society.

Feel free to explore my repositories and connect with me.

Pinned Loading

  1. brexit-youtube-immigration-toxicity-analysis brexit-youtube-immigration-toxicity-analysis Public

    Natural language processing project combining semantic embeddings and toxicity detection to analyze Brexit-related immigration discourse.

    Jupyter Notebook 1

  2. digital-capital-gap-andalusia-logistic-regression digital-capital-gap-andalusia-logistic-regression Public

    Logistic regression analysis of the digital capital gap in Andalusia using official microdata from Spain's ICT Household Survey (INE 2025).

    TeX 1

  3. global-happiness-prediction-random-forest global-happiness-prediction-random-forest Public

    Predicting happiness across 57 countries using World Values Survey data and Random Forest.

    Jupyter Notebook 1

  4. social-media-sentiment-network-analysis social-media-sentiment-network-analysis Public

    Sentiment analysis and semantic co-occurrence network analysis of LGBT-related tweets using VADER and Gephi.

    Jupyter Notebook 1

  5. tourist-satisfaction-seville-nlp tourist-satisfaction-seville-nlp Public

    Tourist satisfaction analysis in Seville using NLP, sentiment analysis, web scraping and transformer-based language models on TripAdvisor and Civitatis reviews.

    Jupyter Notebook 1