👨‍💻
Working from home
  • Fortaleza - CE

Icar0S/README.md

Ícaro Santos

Master Software Quality Engineer

Building reliable Big Data systems through better testing and data quality:

  • 🎓 M.Sc. candidate in Computer Science at State University of Ceará (UECE), researching software testing for Big Data with a focus on data quality, LLMs, and RAG (since 2024)
  • 🧪 Research lines:
    • Systematic Literature Review + extensive snowballing (screened 4,700+ titles/abstracts) on Big Data testing tools, methods, and frameworks
    • Mining online repositories (Medium, LinkedIn, DEV, Stack Overflow) to uncover emerging practices using topic modeling (LDA), keyword extraction, K-means, and DBSCAN
  • 🛠️ Building an LLM- and RAG-assisted tool to help testers improve coverage and ensure data integrity in Big Data systems (natural-language test specs, data-quality checks, and more)
  • 📄 Publications and activities: SLR paper published; experience report + expert survey with a Big Data testing checklist (short paper + extended LNBIP); poster and abstract accepted at the 5th LATAM School
  • 👨‍💻 B.Sc. in Computer Engineering from IFCE (2017–2023)
  • 🔎 Interests: Big Data, Software Testing, Data Quality, Mining Software Repositories, ML for SE, LLMs, RAG.

Always open to collaboration on data-quality testing, empirical studies, and tooling for large-scale systems.


Connect with me:

  • LinkedIn: https://www.linkedin.com/in/icaro-santos-oliveira/
  • Instagram: https://instagram.com/icaro.olivra

💻 Skills and Technologies

🚀⚙️ Frameworks:

🤖🧪 QA Testing:

👩‍💻 Languages:

🗃️ Database:

📊 Software Metrics and Analytics:

☁️⚡ Cloud & CI:
GitLab CI

🪄📚 Styles & Library:

🔬 Linters:

👨‍💻 Office:

🖌️ Design:

📚 Education:

📈 Statistics

GitHub Stats

Pinned

  1. VozaoTV Public

    Java 1

  2. Testing-big-data-Ten-years-of-Mining-Repositories Public

    All scripts and data related to this research will be made publicly available.

    Jupyter Notebook 1

  3. migration-on-premise-cloud-mapping Public

    Forked from great-ufc/migration-on-premise-cloud-mapping

    In this work, we conducted a systematic literature mapping to summarize the knowledge regarding the migration of legacy systems to the cloud.

    HTML

  4. UECE-Computer-Science/maven-testscope-cve-analysis-mining-categorization-paper Public

    Cloud Infrastructure for Mining Challenge 25 Analytics

    Jupyter Notebook 1

  5. SmartDataTest Public

    SmartDataTest is an innovative tool designed to automate data quality testing for Big Data systems by combining Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG).

    Python 2