Skip to content
@Data-Centric-AI-Community

Data-Centric AI Community

The Data-Centric AI Community is the home of all things data. Help us achieve high-quality data for data science!

   

Welcome to the Data-Centric AI Community

We're a group of data science enthusiasts committed to developing AI and ML applications with a focus on data quality! We get together to learn, discuss, and collaborate on topics such as Data Quality, Data Profiling, and Synthetic Data, closing the gap between data understanding and improvement.

Start learning data science with beginner-friendly projects! 💻

  • 🐍 awesome-python-for-data-science: The roadmap to learn data science in 2023!
  • 📦 awesome-data-centric-ai: Stay up to data with the latest resources on Data-Centric AI
  • 👾 Join us on our Discord Server to meet and learn from other data enthusiasts!

You can follow our updates on the website blog or on Medium and subscribe to our newsletter to stay up to date with our events, data initiatives, and to receive monthly tricks and tips on how to achieve actionable data to feed your machine learning models.

📧 Feel free to get in touch! You can reach out to us at the Discord server. And good news, if you have questions about projects such as ydata-profiling or ydata-synthetic, you can find the right persons to help you in our community! 🥳

Popular repositories Loading

  1. ydata-profiling ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Python 13.4k 1.8k

  2. ydata-synthetic ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Jupyter Notebook 1.6k 257

  3. ydata-quality ydata-quality Public

    Data Quality assessment with one line of code

    Jupyter Notebook 454 56

  4. awesome-data-centric-ai awesome-data-centric-ai Public

    Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖

    Jupyter Notebook 345 47

  5. awesome-python-for-data-science awesome-python-for-data-science Public

    A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into D…

    Jupyter Notebook 91 20

  6. nist-crc-2023 nist-crc-2023 Public

    NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!

    Jupyter Notebook 27 2

Repositories

Showing 10 of 11 repositories
  • ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Data-Centric-AI-Community/ydata-profiling’s past year of commit activity
    Python 13,409 MIT 1,763 260 (39 issues need help) 36 Updated Mar 3, 2026
  • ydata-quality Public

    Data Quality assessment with one line of code

    Data-Centric-AI-Community/ydata-quality’s past year of commit activity
    Jupyter Notebook 454 MIT 56 19 (6 issues need help) 16 Updated Mar 2, 2026
  • ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Data-Centric-AI-Community/ydata-synthetic’s past year of commit activity
    Jupyter Notebook 1,613 MIT 257 45 19 Updated Mar 2, 2026
  • syntheticdata.community Public

    Synthetic Data Community Page

    Data-Centric-AI-Community/syntheticdata.community’s past year of commit activity
    SCSS 0 0 0 3 Updated Feb 19, 2026
  • awesome-data-centric-ai Public

    Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖

    Data-Centric-AI-Community/awesome-data-centric-ai’s past year of commit activity
    Jupyter Notebook 345 MIT 47 2 0 Updated Feb 10, 2026
  • .github Public

    The Data-Centric AI Community is the home of all things data. Join us!

    Data-Centric-AI-Community/.github’s past year of commit activity
    2 0 0 0 Updated Aug 11, 2025
  • Data-Centric-AI-Community/datacentricai.community’s past year of commit activity
    HTML 0 0 0 3 Updated Aug 11, 2025
  • Data-Centric-AI-Community/ghost-pages-deploy’s past year of commit activity
    TypeScript 0 0 0 0 Updated Aug 11, 2025
  • awesome-python-for-data-science Public

    A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data Science! 📊

    Data-Centric-AI-Community/awesome-python-for-data-science’s past year of commit activity
    Jupyter Notebook 91 20 22 (22 issues need help) 4 Updated Jun 4, 2024
  • code-with-me Public

    Our epic Code With Me sessions: from introductory to advanced topics 🚀

    Data-Centric-AI-Community/code-with-me’s past year of commit activity
    Jupyter Notebook 6 0 0 0 Updated Dec 13, 2023

Top languages

Loading…