-
π Hi, Iβm @jinyunliao
-
π What I Specialize In
- Data integration & ELT β dbt, Databricks, AWS Glue, Airflow; automated testing and lineage.
- Realβtime analytics β Kafka, Flink, Spark Structured Streaming, Delta Lake for subβsecond scoring.
- Cost & performance optimization β S3 layout, partitioning, EMR Serverless, query tuning,AWS Cloud Solutions Architecture.
- Governance & observability β Great Expectations, Unity Catalog, monitoring, alerting, runbooks.
- BI & selfβservice β Tableau, Superset, Looker, QuickSight; training and handover.
-
π« How to reach me
-
Freelance
- Remote
Popular repositories Loading
-
-
SeabornVisualizations
SeabornVisualizations PublicAbout Seaborn relplot,scatterplot,lineplot,catplot,countplot,distplot,kdeplot,striplot,swarmplot,boxplot,violinplot,boxenplot,pointplot,barplot,joinplot,pairplot,lmplot,regplot
Jupyter Notebook
-
-
VeloxFlow
VeloxFlow PublicHigh-performance ETL pipeline for streaming large CSV files into PostgreSQL. Built with Python 3.9 and Pandas, it utilizes the native COPY FROM STDIN command for maximum throughput. Optimized for lβ¦
Jupyter Notebook
-
olist_ebiz
olist_ebiz Publicolist_ebiz is a modern data stack template for eβcommerce analytics. It delivers an endβtoβend pipeline: raw data extraction, warehouse loading , dbt modeling with star schemas, automated testing, β¦
Python
If the problem persists, check the GitHub status page or contact support.