Explainable Artist Influence Graph
Rootify is an evidence-first music discovery system that maps artist influence relationships using real textual sources rather than similarity metrics or black-box recommendations.
Given an artist, Rootify returns:
- a ranked list of influencing artists
- verbatim evidence snippets supporting each claim
- citations pointing to the original source
If an influence cannot be supported by text, Rootify does not assert it.
Most music discovery systems focus on answering:
“Which artists sound similar?”
Rootify instead addresses:
“Which artists explicitly influenced this artist, and where is that stated?”
This framing prioritizes:
- interpretability
- auditability
- usefulness for understanding music history rather than surface-level similarity
Rootify operates as a multi-stage, evidence-preserving pipeline:
```
Wikipedia / Wikidata / YouTube
              ↓
     Normalized documents
              ↓
  Candidate artist extraction
              ↓
ML validation and direction resolution
              ↓
Evidence-backed influence claims
              ↓
  Ranked, explainable output
```
Each stage is explicit and independently inspectable.
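As a rough sketch, the stages above can be composed as plain functions. All names here (`normalize`, `extract_candidates`, `validate_claims`, `rank_claims`), the keyword heuristic, and the fixed confidence are illustrative assumptions, not Rootify's actual API:

```python
# Illustrative pipeline skeleton. Function names, the "influenced"
# keyword heuristic, and the fixed 0.9 confidence are placeholders
# standing in for Rootify's real extraction and ML stages.

def normalize(raw_docs):
    """Raw source records -> normalized documents."""
    return [{"source": d["source"], "text": d["text"].strip()} for d in raw_docs]

def extract_candidates(docs):
    """Normalized text -> candidate (source, sentence) pairs."""
    return [
        (d["source"], sent)
        for d in docs
        for sent in d["text"].split(". ")
        if "influenced" in sent.lower()
    ]

def validate_claims(candidates):
    """Stand-in for the ML stage: attach a confidence score."""
    return [{"source": s, "snippet": t, "confidence": 0.9} for s, t in candidates]

def rank_claims(claims):
    return sorted(claims, key=lambda c: c["confidence"], reverse=True)

raw = [{"source": "wikipedia",
        "text": "Nirvana was influenced by Pixies. The band formed in 1987."}]
claims = rank_claims(validate_claims(extract_candidates(normalize(raw))))
```

Because each stage returns plain data, any intermediate result can be dumped and inspected, which is what makes the pipeline auditable.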
Three source types feed the pipeline:

- Wikipedia — encyclopedic, third-person influence statements
- Wikidata — structured “influenced by” relations with high precision
- YouTube — interviews and first-person influence statements
All sources are normalized into a common representation before downstream processing.
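A minimal sketch of what such a common representation might look like; the class and field names are assumptions, not Rootify's actual schema:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class NormalizedDoc:
    # Field names are illustrative, not Rootify's actual schema.
    source: str   # "wikipedia", "wikidata", or "youtube"
    ref: str      # page/section for text sources, timestamp for video
    text: str     # body text ready for sentence segmentation

doc = NormalizedDoc(source="youtube", ref="t=812",
                    text="I grew up listening to Sly Stone.")
```

A single shape like this lets downstream stages ignore where a sentence came from while still carrying the reference needed for citation.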
Rootify does not construct abstract graph edges directly.
Instead, it stores sentence-level evidence claims, each annotated with:
- source reference (page, section, or timestamp)
- verbatim text snippet
- influence strength category
- ML-derived confidence score
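One claim might be stored roughly like this; the field names and the `"explicit"` strength category are assumptions for illustration:

```python
from dataclasses import dataclass

@dataclass
class EvidenceClaim:
    # Mirrors the four annotations above; names and the "explicit"
    # strength category are illustrative assumptions.
    source_ref: str      # page, section, or timestamp
    snippet: str         # verbatim text snippet
    strength: str        # influence strength category
    confidence: float    # ML-derived confidence, 0..1

claim = EvidenceClaim(
    source_ref="Nirvana (band) / Influences",
    snippet="Cobain frequently cited Pixies as a major influence.",
    strength="explicit",
    confidence=0.94,
)
```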
Influence strength is computed by aggregating evidence, not by counting mentions.
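One way to make that concrete is to combine per-claim confidences with a noisy-OR, so a single strong, explicit statement can outweigh several weak mentions; the combination rule here is an assumption, not Rootify's documented formula:

```python
# Noisy-OR aggregation: strength = 1 - product(1 - confidence_i).
# The rule is an illustrative assumption, not Rootify's formula.
def influence_strength(confidences):
    miss = 1.0
    for c in confidences:
        miss *= (1.0 - c)
    return 1.0 - miss

weak = influence_strength([0.3, 0.3, 0.3])  # three weak mentions
strong = influence_strength([0.95])         # one explicit statement
```

Under this rule three weak mentions aggregate to roughly 0.66, still below a single 0.95-confidence explicit statement, which is the behavior a raw mention count cannot provide.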
Machine learning is used to support influence reasoning, not replace it.
- A binary classifier filters out non-influence sentences and assigns a confidence score
- Probabilistic outputs directly inform scoring and ranking decisions
- No generative steps are used, and no influence claim is created without evidence
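A toy version of such a filter, using the scikit-learn stack the project lists; the training sentences, features, and model choice here are placeholders, not Rootify's actual classifier:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Placeholder training data; the real classifier, features, and
# labels are Rootify-internal. This only shows the shape of the
# filter-then-score step.
sentences = [
    "He cited Miles Davis as his biggest influence.",
    "Her sound owes a clear debt to Nina Simone.",
    "The band toured Europe in 1998.",
    "Their third album sold a million copies.",
]
labels = [1, 1, 0, 0]  # 1 = influence statement

clf = make_pipeline(TfidfVectorizer(), LogisticRegression())
clf.fit(sentences, labels)

# predict_proba yields the confidence that feeds scoring and ranking;
# a threshold (assumed 0.5 here) drops non-influence sentences.
proba = clf.predict_proba(["She cited Coltrane as an influence."])[0][1]
```

In the full pipeline, sentences that pass the filter then go through direction resolution, so the system records who influenced whom rather than a symmetric association.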
The production deployment consists of:

- FastAPI API service
- Redis-backed cache sidecar
- PostgreSQL database
- AWS Lambda → S3 artifact writer
- Azure VM (runtime-only)
Key infrastructure decisions:

- API never writes directly to S3
- API → Lambda → S3 (auditability, isolation)
- VM never builds images
- CI builds, VM runs
- No Docker registry required
- Everything is versioned and reproducible
Tech stack:

- Backend: FastAPI, async SQLAlchemy, Alembic
- Database: PostgreSQL
- ML / NLP: spaCy, sentence-transformers, scikit-learn
- Caching: Redis
- Infra: Docker, Docker Compose, GitHub Actions, Azure VM
- Artifacts: AWS Lambda → S3
Project highlights:

- Evidence-first graph construction
- Explicit, explainable ML components
- Multi-source validation
- Clear separation between extraction, validation, and ranking
- Designed to be defensible in technical interviews
Current status:

- Core pipelines:
  - Wikipedia: ✅
  - Wikidata: ✅
  - YouTube: ⏭️ planned
- Evidence + claim schema: ✅
- Two-stage ML validation: ✅
- Caching + regeneration logic: ✅
- CI/CD + production infra: ✅
- Frontend: ⏭️ planned