Yuto Terashima YutoTerashima

寺島悠斗 Yuto Terashima

AI Researcher · LLMs · Agents · Trustworthy AI · Japanese Kenshi

UC Berkeley Bachelor of Computer Science, 2021-2025 · Google AI Intern, 2025 · University of Melbourne MCS, 2026-present

一心不乱 / One mind, undisturbed.
Focused execution, rigorous evaluation, and trustworthy AI systems.
AI研究と剣道に通じる集中と修練。
以剑道的专注，打磨 AI 系统与评测。

About

I am an AI researcher interested in how language models behave when they become systems: agents that use tools, retrieve context, evaluate themselves, and operate inside real workflows. My work focuses on practical evaluation, agent reliability, multilingual safety, and transformer-based NLP systems.

Education: UC Berkeley, Computer Science undergraduate, 2021-2025; University of Melbourne, Master of Computer Science, 2026-present
Experience: Google AI, AI Intern, 2025
Research taste: careful benchmarks, readable systems, reproducible experiments
Practice: Japanese kenshi, guided by 一心不乱: disciplined attention under pressure

Research Focus

Area	What I build
LLM evaluation	Reproducible eval pipelines, rubric graders, model comparison reports
AI agents	Trace analysis, tool-use evaluation, reliability and failure-mode studies
AI safety	Multilingual safety tests, refusal/over-refusal analysis, risk taxonomies
Transformers	Small model experiments, attention visualization, NLP training notes

Selected Work

Project	Signal
hms-harmful-brain-activity-classification	Kaggle HMS EEG classification Silver Medal solution archive, 123rd of 2,767 teams
agent-safety-eval-lab	Flagship lab for agent traces, tool-call grading, and safety evaluation
mcp-tool-security-playground	Tool-use security, permission policies, and prompt-injection threat modeling
rag-eval-observatory	RAG observability with retrieval, faithfulness, and failure-case analysis
multilingual-llm-safety-bench	English/Japanese/Chinese safety mini-benchmark and model behavior report
ISC-Bench-Reproduction	Reproduction-oriented work around LLM safety and agent evaluation
Transformers-Projects	Transformer/NLP project lab and experiments

Awards

Competition	Result
HMS - Harmful Brain Activity Classification	Kaggle Silver Medal, 123rd / 2,767 teams

Portfolio Matrix

Repository	Purpose
transformer-from-scratch-notes	Attention, tiny tokenizer, and mini training-loop notes
llm-eval-cookbook	Exact match, rubric, preference, pairwise, and JSON-schema eval recipes
agent-trace-viewer	Lightweight HTML trace viewer for agent messages, tools, and failures
prompt-robustness-suite	Prompt versioning, A/B testing, and failure clustering
open-model-benchmark-cards	Structured benchmark-card generator for open models

Languages

English · 日本語 · 中文

Contact

Email: yutoterashima4@gmail.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly