Skip to content
View YutoTerashima's full-sized avatar

Block or report YutoTerashima

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
YutoTerashima/README.md

Yuto Terashima kendo banner

寺島悠斗 Yuto Terashima

AI Researcher · LLMs · Agents · Trustworthy AI · Japanese Kenshi

UC Berkeley Bachelor of Computer Science, 2021-2025 · Google AI Intern, 2025 · University of Melbourne MCS, 2026-present

一心不乱 / One mind, undisturbed.
Focused execution, rigorous evaluation, and trustworthy AI systems.
AI研究と剣道に通じる集中と修練。
以剑道的专注,打磨 AI 系统与评测。

About

I am an AI researcher interested in how language models behave when they become systems: agents that use tools, retrieve context, evaluate themselves, and operate inside real workflows. My work focuses on practical evaluation, agent reliability, multilingual safety, and transformer-based NLP systems.

  • Education: UC Berkeley, Computer Science undergraduate, 2021-2025; University of Melbourne, Master of Computer Science, 2026-present
  • Experience: Google AI, AI Intern, 2025
  • Research taste: careful benchmarks, readable systems, reproducible experiments
  • Practice: Japanese kenshi, guided by 一心不乱: disciplined attention under pressure

Research Focus

Area What I build
LLM evaluation Reproducible eval pipelines, rubric graders, model comparison reports
AI agents Trace analysis, tool-use evaluation, reliability and failure-mode studies
AI safety Multilingual safety tests, refusal/over-refusal analysis, risk taxonomies
Transformers Small model experiments, attention visualization, NLP training notes

Selected Work

Project Signal
hms-harmful-brain-activity-classification Kaggle HMS EEG classification Silver Medal solution archive, 123rd of 2,767 teams
agent-safety-eval-lab Flagship lab for agent traces, tool-call grading, and safety evaluation
mcp-tool-security-playground Tool-use security, permission policies, and prompt-injection threat modeling
rag-eval-observatory RAG observability with retrieval, faithfulness, and failure-case analysis
multilingual-llm-safety-bench English/Japanese/Chinese safety mini-benchmark and model behavior report
ISC-Bench-Reproduction Reproduction-oriented work around LLM safety and agent evaluation
Transformers-Projects Transformer/NLP project lab and experiments

Awards

Competition Result
HMS - Harmful Brain Activity Classification Kaggle Silver Medal, 123rd / 2,767 teams

Total GitHub stars

Portfolio Matrix

Repository Purpose
transformer-from-scratch-notes Attention, tiny tokenizer, and mini training-loop notes
llm-eval-cookbook Exact match, rubric, preference, pairwise, and JSON-schema eval recipes
agent-trace-viewer Lightweight HTML trace viewer for agent messages, tools, and failures
prompt-robustness-suite Prompt versioning, A/B testing, and failure clustering
open-model-benchmark-cards Structured benchmark-card generator for open models

Languages

English · 日本語 · 中文

Contact

Pinned Loading

  1. agent-safety-eval-lab agent-safety-eval-lab Public

    Agent trace and tool-use safety evaluation lab.

    Python 63 4

  2. rag-eval-observatory rag-eval-observatory Public

    RAG evaluation and observability lab.

    Python 11

  3. Transformers-Projects Transformers-Projects Public

    Chinese NLP Transformer project suite: BERT, T5, GLM, QA, NER, retrieval, summarization, and training artifact reports.

    Jupyter Notebook 19

  4. mcp-tool-security-playground mcp-tool-security-playground Public

    MCP-style tool-use security playground with permission policies.

    Python 11

  5. hms-harmful-brain-activity-classification hms-harmful-brain-activity-classification Public

    Kaggle Silver Medal solution archive for HMS harmful brain activity EEG classification.

    Jupyter Notebook 28

  6. multilingual-llm-safety-bench multilingual-llm-safety-bench Public

    English, Japanese, and Chinese LLM safety mini-benchmark.

    Python 21