An intelligent study assistant system using the Furhat robot with real-time emotion detection and multimodal AI interaction powered by Google Gemini.
This project consists of three main components:
- Furhat Robot - Physical/virtual robot interface
- Vision System - Real-time emotion detection via webcam
- Main Interview Script - Orchestrates the interaction using Gemini AI
- Python 3.8+
- Furhat SDK (download from furhat.io)
- Google Gemini API key (from Google AI Studio)
- Webcam (for emotion detection)
- Model file: `vit_rafdb_4class.pth` (should be in the `visionSystem/` directory)
- Download and install the Furhat SDK from furhat.io
- Launch Furhat SDK application
- Start a virtual Furhat robot (or connect to a physical one)
- Note the IP address (default: `localhost` for a virtual robot)
- Open Furhat SDK
- Go to the Skills section
- Import the skill file: `skill/furhat-remote-api.skill`
- Enable the Remote API skill on your Furhat robot
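With the skill enabled, you can sanity-check the connection from Python using the `furhat-remote-api` client. A minimal sketch (the greeting text is illustrative):

```python
def check_furhat_connection(ip="localhost"):
    """Connect to a Furhat robot running the Remote API skill and say a test phrase.

    The import is inside the function so the sketch can be read without the
    furhat-remote-api package installed.
    """
    from furhat_remote_api import FurhatRemoteAPI

    furhat = FurhatRemoteAPI(ip)  # robot address: "localhost" for a virtual robot
    furhat.say(text="Remote API connection works.", blocking=True)
```

If this raises a connection error, the SDK is not running or the Remote API skill is not active.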
```bash
# Create a virtual environment
python3 -m venv venv

# Activate the virtual environment
# On macOS/Linux:
source venv/bin/activate
# On Windows:
# venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```

Create a `.env` file in the project root:

```
API_KEY="your_google_gemini_api_key_here"
```

Replace `your_google_gemini_api_key_here` with your actual Google Gemini API key.
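The scripts read this key at startup. A minimal stdlib-only sketch of how such a `.env` file can be loaded (the actual code may use the `python-dotenv` package instead):

```python
import os

def load_env(path=".env"):
    """Parse simple KEY="value" lines from a .env file into os.environ."""
    with open(path) as f:
        for line in f:
            line = line.strip()
            # Skip blanks, comments, and lines without an assignment
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            os.environ.setdefault(key.strip(), value.strip().strip('"'))

# usage:
# load_env()
# api_key = os.environ["API_KEY"]
```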
Ensure the emotion detection model is present:
```bash
ls visionSystem/vit_rafdb_4class.pth
```

If it is missing, you'll need to obtain or train the ViT model for 4-class emotion detection (angry, happy, neutral, sad).
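The exact architecture is defined in `emotion_api.py`. If you need to load the checkpoint yourself, a hedged sketch assuming a `timm` ViT fine-tuned to 4 classes might look like this (the model name and checkpoint format are assumptions, not confirmed by the project):

```python
def load_emotion_model(checkpoint="visionSystem/vit_rafdb_4class.pth"):
    """Load a 4-class ViT emotion classifier (angry, happy, neutral, sad).

    Imports are inside the function so the sketch is readable without
    torch/timm installed; the timm model name is an assumption.
    """
    import torch
    import timm

    model = timm.create_model("vit_base_patch16_224", num_classes=4)
    state = torch.load(checkpoint, map_location="cpu")
    model.load_state_dict(state)
    model.eval()  # inference mode
    return model
```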
- Launch the Furhat SDK application
- Start your virtual or physical Furhat robot
- Ensure the Remote API skill is active
Open a terminal and run:
```bash
# Activate the virtual environment if not already active
source venv/bin/activate

# Navigate to the visionSystem directory
cd visionSystem

# Start the emotion detection API
python emotion_api.py
```

The vision system will:
- Start on http://127.0.0.1:8000
- Access your webcam
- Continuously process emotions in the background
- Expose a `/mood` endpoint
You can test it by visiting: http://127.0.0.1:8000/mood
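Polling the endpoint programmatically can be as simple as the sketch below (stdlib only; the JSON shape, here `{"mood": "..."}`, is an assumption about `emotion_api.py`'s response):

```python
import json
from urllib.request import urlopen

def get_mood(url="http://127.0.0.1:8000/mood", timeout=2.0):
    """Fetch the latest detected mood from the vision system.

    Returns the mood string, or None if the API is unreachable.
    The {"mood": ...} response shape is an assumption.
    """
    try:
        with urlopen(url, timeout=timeout) as resp:
            return json.load(resp).get("mood")
    except OSError:
        return None
```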
Open a new terminal (keep the vision system running):
```bash
# Activate the virtual environment
source venv/bin/activate

# Run the main script
python mainInterview.py
```

Edit these variables in `mainInterview.py`:

```python
FURHAT_IP = "localhost"          # Change if using a physical robot
QUESTIONS_TO_ASK = 5             # Number of interaction rounds
USE_KEYBOARD = False             # True for keyboard input, False for voice
MOOD_CLASSIFIER_ENABLED = True   # Enable/disable emotion detection
```

Edit these in `visionSystem/emotion_api.py`:

```python
WEBCAM_ID = 0                    # Change if using a different camera
PROCESSING_INTERVAL = 0.5        # Seconds between emotion checks
```

Once everything is running:
- The Furhat robot will greet you and ask for your name
- You'll be asked what subject you're working on
- The system will engage in 5 rounds of conversation
- Your facial emotions are continuously detected and influence the robot's responses
- At the end, you'll receive a summary
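The flow above can be sketched as a loop. This is purely an illustrative outline; helper names like `listen`, `get_mood`, and `ask_gemini` are hypothetical, not the actual names in `mainInterview.py`:

```python
def run_session(furhat, questions_to_ask=5):
    """Illustrative outline of one tutoring session; helper names are hypothetical."""
    furhat.say(text="Hi! What's your name?")
    name = listen(furhat)                        # voice or keyboard input
    furhat.say(text="What subject are you working on?")
    subject = listen(furhat)

    for _ in range(questions_to_ask):
        mood = get_mood()                        # current emotion from the vision API
        reply = ask_gemini(name, subject, mood)  # Gemini reply conditioned on mood
        furhat.say(text=reply)
        listen(furhat)

    furhat.say(text=ask_gemini(name, subject, "summary"))  # closing summary
```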
- Voice mode (default): speak to the robot
- Keyboard mode: set `USE_KEYBOARD = True` and type responses
- Verify Furhat SDK is running
- Check that `FURHAT_IP` matches your robot's address
- Ensure the Remote API skill is active
- Check webcam is connected and not in use by other applications
- Verify the model file exists: `visionSystem/vit_rafdb_4class.pth`
- Check the API is running: `curl http://127.0.0.1:8000/mood`
- Verify the `.env` file exists in the project root
- Check the API key is valid at Google AI Studio
- Ensure proper formatting: `API_KEY="your_key_here"`
```bash
# Reinstall dependencies
pip install --upgrade -r requirements.txt
```

```
.
├── mainInterview.py               # Main orchestration script
├── requirements.txt               # Python dependencies
├── .env                           # Environment variables (API keys)
├── skill/
│   └── furhat-remote-api.skill    # Furhat Remote API skill (72 MB)
├── visionSystem/
│   ├── emotion_api.py             # FastAPI emotion detection server
│   └── vit_rafdb_4class.pth       # Trained emotion detection model
└── output/                        # Output directory (if used)
```
Key libraries:
- `furhat-remote-api` - Furhat robot control
- `google-generativeai` - Gemini AI integration
- `fastapi` + `uvicorn` - Emotion API server
- `opencv-python` - Webcam/face detection
- `torch` + `torchvision` - Deep learning inference
- `timm` - Vision Transformer model
See `requirements.txt` for the complete list.
Built for Uppsala University - Intelligent Robot Interaction course.