Skip to content
View yasumorishima's full-sized avatar

Block or report yasumorishima

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yasumorishima/README.md

Yasunori Morishima (盛島康徳)

Manufacturing Engineer & Data Analyst with 18 years of experience, specializing in data analysis, open source contribution, and business automation. (製造業にて18年の経験を持つエンジニア。データ分析・OSS貢献・業務自動化を専門としています)

In Production: Minami Baseball OB — Full-stack alumni site with 10 active users (38 pages · 22 DB tables · 130 e2e tests · ¥0/mo running cost) | Docs

🌍 Realtime Open Data

Japan Geohazard Monitor Persian Gulf Ship Tracker
31 geophysical data sources → ML earthquake prediction (AUC 0.754, CSEP Molchan 0.981) + real-time monitoring dashboard AIS vessel tracking across the Persian Gulf & Gulf of Oman with land mask filtering

Real-time API / WebSocket → SQLite → FastAPI + Leaflet.js (dark theme)All projects

Japan Geohazard Monitor — Earthquake prediction research: 76 features from 25+ sources (USGS, Earthdata, INTERMAGNET, NMDB, NOAA, IOC), walk-forward HistGBT + stacking, BigQuery data platform (216K rows). Weekly automated CI pipeline on GitHub Actions.


⚾ Baseball Analytics

Prediction Systems

Project Description Demo
NPB Season Prediction Bayesian ensemble (Marcel 35% + Stan/Ridge 40% + ML 25%) + Monte Carlo team simulation + 24 foreign player individual projections Live
NPB 2021 Backtest Could Bayesian model predict Yakult & Orix last→champion? 25 foreign players with FanGraphs data Analysis
MLB Win Probability Engine 3-engine ensemble WP (Normal + Empirical + LightGBM) + Gemini AI commentary Live
Baseball MLOps Pipeline Statcast × GCP MLOps: 5-model ensemble + BQML, weekly auto-retrained on BigQuery + Cloud Run Live
MLB Data Pipeline Shared BigQuery data platform — FanGraphs + Savant + Statcast for all baseball analytics projects
Prediction accuracy & details
System Key Metric Articles
NPB 2026 8-yr backtest wOBA MAE .0498, 97% prob. of beating Marcel. 10K Monte Carlo sims JP / EN
NPB 2021 Backtest MAE 10.7W — Yakult & Orix last→champion driven by JP player breakouts, not foreign players Repo
MLB WP Engine 3-engine ensemble, 367K+ play states (2015–2024), inverse-Brier weighted + Isotonic calibration Live
Baseball MLOps Batter wOBA MAE .0287 (Marcel: .0326) / Pitcher xFIP MAE 0.483 (Marcel: 0.558) Live

Biomechanics

Baseball Skeleton Analysis — 3D skeleton visualization from Driveline OpenBiomechanics C3D data

Pitching Skeleton (3D C3D) Hitting Skeleton (3D C3D)

Trunk rotation range vs pitch speed: r=0.425 (strongest). Contributed bug fix PR #384 to ezc3d. Article (JP) / Article (EN)

Statcast Analysis

6 analyses covering Japanese MLB pitchers and Ohtani batting data.
All analyses (6)
Analysis Key Finding Article
Kikuchi Slider Revolution (2019-2025) SL 17%→37% after Astros trade Zenn / DEV.to / Kaggle
Senga Ghost Fork (2023-2025) FO whiff rate 58%→39%, decline pre-injury Zenn / DEV.to / Kaggle
Imanaga 2nd Year (2024-2025) 3-pitch concentration (97%), 1st TTO xwOBA .505 Zenn / DEV.to / Kaggle
Darvish Evolution (2021-2025) SL/ST halved, CU became putaway pitch Zenn / DEV.to / Kaggle
Ohtani Spray Chart spraychart() one-liner vs matplotlib manual Zenn
Ohtani Heatmap Stadium drawing + hit density heatmap Zenn

🌐 Open Source Contributions

(61 PRs / 31 Merged) across 26 repositories. See [oss-contributions](https://github.com/yasumorishima/oss-contributions) for full details.
PR highlights (click to expand)
Repository PR Description
dfinity/icp-js-core #1270 Improve Candid decode error messages
dfinity/icp-js-core #1277 Deduplicate parallel fetchSubnetKeys
dfinity/pic-js #235 Add fetchCanisterLogs() method
line/line-bot-mcp-server #369 Add get_follower_ids tool
pyomeca/ezc3d #384 Fix __eq__ early return bug
optuna/optuna Hyperparameter optimization framework
pandas-dev/pandas Data analysis library
jldbc/pybaseball #498-504 Bug fixes & documentation
team-mirai — Civic Tech OSS (21 PRs (11 Merged / 2 Open / 8 Closed))

Civic tech projects for political transparency & citizen participation in Japan. Next.js / TypeScript / Supabase / Vitest.

Repository Highlights
action-board 48 unit tests, RPC tests, breadcrumb nav, cache fix + 5 more
marumie Category filter total
post-checker Timezone fix
fact-checker X API investigation + 5 PRs

📊 Data & Competitions

Kaggle

Notebooks Expert | 🥉 14 Bronze Notebook Medals

Active: Deep Past (Akkadian→English) / BirdCLEF+ 2026 (Bird Sound Recognition)

Bronze Medal Notebooks (14)
Notebook Topic
savant-extras Defense & Pitching Quality Defense metrics & pitching quality (savant-extras)
MLB Statcast Spray Charts for WBC 2026 WBC 2026 spray + pitch zone charts (baseball-field-viz)
March Machine Learning Mania 2026 NCAA tournament prediction (LightGBM)
NFL Geometric Rules Baseline Physics-based rules, No ML, RMSE 2.921
CAFA 6 Baseline Protein function prediction (PyTorch MLP)

All 14 notebooks →

Kaggle Datasets

7 published MLB datasets
Dataset Description
🥈 MLB Bat Tracking Leaderboard (2024-2025) 452 batters, 19 swing metrics
🥈 WBC 2026 Scouting 306 players, 20 countries
Other datasets (4)
Dataset Description
Baseball Savant Leaderboards (2024-2025) 15 leaderboards, 2 seasons combined
Japanese MLB Players Statcast (2015-2025) 34 Japanese MLB players, 174k pitches+hits
MLB Pitcher Arsenal Evolution (2020-2025) 4,253 pitcher-seasons, 111 metrics
MLB Statcast + Bat Tracking (2024-2025) Combined Statcast + bat tracking data

DrivenData

DrivenData Competitions — Automated pipeline: GitHub Actions + GPU training + GPU→CPU fallback. Currently competing in On Top of Pasketti (Children's ASR, $120K prize, Wav2Vec2 CTC).


🌐 Web & Apps

Minami Baseball OB — Alumni Association Site

Full-stack web app for a high school baseball alumni association (126 files, ~14600 LOC). Technical Documentation

PC (Light) Game Results

5-tier RBAC (Middleware + RLS), automated member pipeline (Form → GAS → Actions → Supabase), 681 match records (1955–present)

Architecture & features
  • 5-tier RBAC (guest → admin): Next.js Middleware + Supabase RLS — authorization at route, row, and component level
  • Automated member pipeline: Google Form → Apps Script → GitHub Actions auto-PR → Supabase role sync. Personal names never touch Git
  • Custom CMS: 8 editor pages + 5 inline edit, soft delete (7-day trash + auto-purge), change history, audit logs
  • 681 match records (1955–present): cross-source verification, generation-based grouping, per-game photo management
  • UX: Unsaved warning, Web Share + LINE fallback, Calendar registration, ripple feedback, Suspense skeleton UI, weather forecast (Open-Meteo, 10 venues), automated game detection (2 sources → auto-PR)
  • Security: RLS on all 19 tables, server-only admin, CODEOWNERS, branch protection, secret scanning, cookie consent, 60-min session timeout

Next.js 15 / TypeScript 5.8 / Tailwind CSS 4 / Supabase (PostgreSQL + Auth + Storage) / Vercel / GitHub Actions / Google Apps Script / GA4

Dashboards & Mobile

App Description Link
MLB Bat Tracking Dashboard Leaderboard, Player Comparison, Team Lineup Builder. Powered by savant-extras Live
WBC 2026 Scouting Dashboard 30 Statcast apps across 19 countries. Zone heatmaps, spray charts, pitch movement Live
Daily Diary Flutter mobile app, 5 languages, offline-first, AdMob Google Play
WBC 2026 Scouting Dashboard details (30 apps)

30 Statcast scouting apps across 19 countries (batters + pitchers). Zone heatmaps, spray charts, pitch movement, LHP/RHP splits. Auto-fetched via GitHub Actions. → USA Batters / Japan Pitchers / All 30 apps


📦 PyPI Packages

6 packages (click to expand)
Package Description
savant-extras 17 Baseball Savant leaderboards + date range support. Complements pybaseball
baseball-field-viz Statcast coordinate transform + field drawing + spray charts + pitch zone charts
kaggle-notebook-deploy Deploy Kaggle Notebooks via git push + GitHub Actions
kaggle-wandb-sync Sync W&B offline runs from Kaggle to W&B cloud
signate-deploy SIGNATE competition workflow via GitHub Actions
signate-wandb-sync Record SIGNATE scores to W&B runs

🔬 Learning Projects

Project Description
ICP Learning Project Persistent counter dApp on Internet Computer (Motoko, dfx CLI)
OpenClaw Twitter Bot Raspberry Pi 5 + OpenClaw + Gemini API auto-tweet bot (stopped) — Article (JP)
Past Projects
Project Description
GAS Calendar Tool Batch calendar event registration with senior-friendly mobile UI
Dune Analytics On-chain data analysis — JPYC Stablecoin Dashboard
Archived Projects Selenium automation, business workflow tools, etc.

🛠️ Tech Stack

Category Technologies
Data Analysis & ML Python, pandas, scikit-learn, LightGBM, XGBoost, CatBoost, PyTorch, matplotlib, seaborn, DuckDB, W&B
Data Platform Google BigQuery (8 datasets, 96+ tables — baseball, geohazard, ship tracking), BigQuery ML, Cloud Run, Grafana
Data Sources Baseball Savant (Statcast), pybaseball, USGS, NASA Earthdata, AIS
Web & Dashboards Streamlit, Next.js, TypeScript, Supabase, Vercel, shadcn/ui
Mobile App Flutter, Dart, Hive, Google AdMob
Automation & DevOps GitHub Actions, Google Apps Script, VBA, Power Query
Tools Claude Code, Kaggle, Google Colab, Excel, Looker Studio
Manufacturing Statistical Quality Control, Process Engineering

📈 Career

  • 2024 - Present: Quality Management @ Marubun Corporation (丸文株式会社)
  • 2008 - 2024: Semiconductor Manufacturing Process Engineer (半導体製造プロセスエンジニア)

🏆 Patents

Stencil mask and manufacturing method thereof (ステンシルマスク及びその製造方法)

  • Patent No: 6307851 (特許第6307851号)
  • Role: Inventor (発明者)
  • Assignee: Toppan Printing Co., Ltd. (凸版印刷株式会社)
  • Link: Google Patents (JP6307851B2)

📫 Contact & Blog

Pinned Loading

  1. kaggle-datasets kaggle-datasets Public

    Baseball-themed Kaggle datasets generated with pybaseball

    Python

  2. mlb-statcast-visualization mlb-statcast-visualization Public

    MLB Statcast data visualization with pybaseball - 3 methods to draw baseball fields (spraychart, matplotlib, sportypy)

    Python

  3. oss-contributions oss-contributions Public

    My open source contributions tracker

    Python

  4. wbc-scouting wbc-scouting Public

    Python