Streaming Agent API Documentation

Base URL

http://localhost:5000

Authentication

No authentication required for this implementation. In production, consider adding API keys or JWT tokens.

Endpoints

1. Health Check

Check if the server is running and healthy.

Endpoint: GET /health

Response:

{
  "status": "OK",
  "message": "Streaming Agent is running"
}

Example:

curl http://localhost:5000/health

2. Streaming Chat

Send a message and receive a real-time streaming response.

Endpoint: POST /api/chat/stream

Request Body:

{
  "message": "Your message here"
}

Response: Server-Sent Events (SSE) stream

Event Types:

chunk: Partial response content
complete: Full response completed
error: Error occurred

Example Events:

data: {"type": "chunk", "content": "Hello"}

data: {"type": "chunk", "content": " there!"}

data: {"type": "complete", "content": "Hello there!"}

Example Request:

curl -X POST http://localhost:5000/api/chat/stream \
  -H "Content-Type: application/json" \
  -d '{"message": "Tell me a joke"}'

JavaScript Example:

const eventSource = new EventSource('/api/chat/stream', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({ message: 'Hello!' })
});

eventSource.onmessage = function(event) {
  const data = JSON.parse(event.data);
  
  switch(data.type) {
    case 'chunk':
      console.log('Chunk:', data.content);
      break;
    case 'complete':
      console.log('Complete:', data.content);
      eventSource.close();
      break;
    case 'error':
      console.error('Error:', data.error);
      eventSource.close();
      break;
  }
};

3. Non-Streaming Chat

Send a message and receive a complete response.

Endpoint: POST /api/chat

Request Body:

{
  "message": "Your message here"
}

Response:

{
  "response": "Complete response from the agent"
}

Example:

curl -X POST http://localhost:5000/api/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "What is 2+2?"}'

Response:

{
  "response": "2 + 2 equals 4."
}

4. Get Conversation History

Retrieve the current conversation history.

Endpoint: GET /api/history

Response:

{
  "history": [
    {
      "role": "user",
      "content": "Hello, how are you?"
    },
    {
      "role": "assistant", 
      "content": "Hello! I'm doing well, thank you for asking."
    }
  ]
}

Example:

curl http://localhost:5000/api/history

5. Clear Conversation History

Clear the current conversation history.

Endpoint: POST /api/clear

Response:

{
  "message": "Conversation history cleared"
}

Example:

curl -X POST http://localhost:5000/api/clear

Error Responses

All endpoints may return error responses in the following format:

HTTP Status: 400 Bad Request or 500 Internal Server Error

Response:

{
  "error": "Error message description"
}

Common Error Scenarios:

Missing message field in request body
Invalid JSON in request body
OpenAI API errors (rate limits, invalid API key, etc.)
Server internal errors

Rate Limits

Currently no rate limiting is implemented. Consider adding rate limiting in production to prevent abuse.

CORS

CORS is enabled for all origins (*). In production, configure specific allowed origins.

WebSocket Alternative

For real-time bidirectional communication, consider implementing WebSocket support:

// Future WebSocket implementation example
const ws = new WebSocket('ws://localhost:5000/ws');
ws.onmessage = function(event) {
  const data = JSON.parse(event.data);
  // Handle streaming response
};

Testing

Use the included test script to verify functionality:

node test-agent.js

Environment Variables

Required environment variables:

OPENAI_API_KEY=your_openai_api_key_here
OPENAI_MODEL=gpt-3.5-turbo
PORT=5000

Response Times

Streaming responses: Start streaming immediately, complete in 2-10 seconds depending on response length
Non-streaming responses: 1-5 seconds for complete response
History/clear endpoints: < 100ms

Best Practices

Use streaming for long responses to improve user experience
Handle errors gracefully in your client implementation
Close EventSource connections when done to prevent memory leaks
Implement retry logic for network failures
Monitor conversation history to avoid context length limits

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Streaming Agent API Documentation

Base URL

Authentication

Endpoints

1. Health Check

2. Streaming Chat

3. Non-Streaming Chat

4. Get Conversation History

5. Clear Conversation History

Error Responses

Rate Limits

CORS

WebSocket Alternative

Testing

Environment Variables

Response Times

Best Practices

FilesExpand file tree

API.md

Latest commit

History

API.md

File metadata and controls

Streaming Agent API Documentation

Base URL

Authentication

Endpoints

1. Health Check

2. Streaming Chat

3. Non-Streaming Chat

4. Get Conversation History

5. Clear Conversation History

Error Responses

Rate Limits

CORS

WebSocket Alternative

Testing

Environment Variables

Response Times

Best Practices