This README file contains instructions to set up and run the Doccano project on your local computer.
Before you begin, make sure you have the following installed on your local computer:
- Python 3.10 or higher
- pip (Python package installer)
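To confirm both prerequisites before continuing, a short script along these lines can help. This is a minimal sketch: the helper name `meets_minimum` is illustrative and not part of the project, and the 3.10 floor is taken from the list above.

```python
import shutil
import sys

MINIMUM = (3, 10)  # minimum version taken from the prerequisites list above


def meets_minimum(version, minimum=MINIMUM):
    """Return True if a (major, minor, ...) version tuple satisfies the minimum."""
    return tuple(version[:2]) >= tuple(minimum)


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    status = "OK" if meets_minimum(sys.version_info) else "too old"
    print(f"Python {found}: {status}")
    # This only checks that a pip launcher is on PATH, not that it works.
    has_pip = shutil.which("pip") is not None or shutil.which("pip3") is not None
    print("pip on PATH:", has_pip)
```

Run it with the same interpreter you plan to use for the virtual environment, since several Python versions can be installed side by side.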
Follow these steps to set up the project:
- Clone the Repository:
    git clone <repository-url>
    cd <repository-directory>
- Create a Virtual Environment:
    python -m venv doccano_venv
    source doccano_venv/bin/activate  # On Windows, use `doccano_venv\Scripts\activate`
- Install Dependencies:
    pip install -r requirements.txt
- Initialize Doccano:
    doccano init
    doccano createuser --username admin --password password --email admin@example.com
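- Run the Doccano Server:
    doccano webserver --port 8000
  In a second terminal, with the virtual environment activated, start the task queue that handles dataset imports and exports:
    doccano task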
- Access Doccano:
  - Open your web browser and navigate to http://localhost:8000.
  - Log in with the admin credentials you created.
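If the page does not load, it helps to check whether anything is answering on the port at all. The sketch below is one way to do that with the standard library only; the function name `is_reachable` is hypothetical, and the URL assumes the default port used in this README.

```python
import urllib.error
import urllib.request


def is_reachable(url, timeout=3.0):
    """Return True if an HTTP server answers at url, even with an error status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        return True  # the server responded, just not with a success status
    except OSError:
        return False  # connection refused, timeout, DNS failure, and similar


if __name__ == "__main__":
    print("server reachable:", is_reachable("http://localhost:8000"))
```

A `False` result usually means the server process is not running or is listening on a different port.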
- Create a Project:
  - Once logged in, click on "Projects" and then "Create Project".
  - Choose "Sequence Labeling" as the project type for word selection tasks.
  - Fill in the project details and create the project.
- Upload Dataset:
  - Go to your project and click on "Upload Data".
  - Upload the dataset containing the quotes (in CSV, JSON, or text format).
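If the quotes are not yet in an importable file, a small script can produce one. The sketch below writes one JSON object per line with a `text` field, which, to the best of our knowledge, is the default layout doccano's JSONL import reads for sequence labeling. The sample quotes, the `write_quotes_jsonl` helper, and the `quotes.jsonl` file name are placeholders, not part of this project.

```python
import json
from pathlib import Path


def write_quotes_jsonl(quotes, path):
    """Write each quote as one JSON object per line under a "text" key."""
    path = Path(path)
    with path.open("w", encoding="utf-8") as handle:
        for quote in quotes:
            # ensure_ascii=False keeps accented characters readable in the file.
            handle.write(json.dumps({"text": quote}, ensure_ascii=False) + "\n")
    return path


if __name__ == "__main__":
    # Placeholder quotes for illustration only.
    sample_quotes = [
        "The only way to do great work is to love what you do.",
        "Simplicity is the ultimate sophistication.",
    ]
    print(write_quotes_jsonl(sample_quotes, "quotes.jsonl"))
```

When importing, pick the JSONL option and check that the text column name matches the key used here.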
- Annotate Quotes:
  - Students can log in to Doccano, access the project, and start selecting words in the quotes.
  - Use the annotation interface to highlight and label the words.
- Export Annotations:
  - Once the annotations are complete, you can export the annotated data.
  - Go to the project and click "Download" to get the annotated dataset in a desired format (JSONL, CSV).
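Once you have a JSONL export, the selected words can be pulled out with a few lines of Python. This is a sketch under an assumption: each exported line is taken to hold a `text` string and a `label` list of `[start, end, label]` character offsets. The key name can differ between doccano versions, so it is a parameter here, and `selected_words` is an illustrative helper name.

```python
import json


def selected_words(jsonl_lines, label_key="label"):
    """Yield (label, selected text) pairs from exported annotation lines."""
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        for start, end, label in record.get(label_key, []):
            # Offsets are character positions into the record's text.
            yield label, record["text"][start:end]


if __name__ == "__main__":
    # Hand-written sample line in the assumed export layout, not real project data.
    sample = [
        '{"id": 1, "text": "Simplicity is the ultimate sophistication.", '
        '"label": [[0, 10, "KEY"]]}'
    ]
    for label, words in selected_words(sample):
        print(label, words)
```

To run it on a real export, pass an open file instead of the sample list, for example `selected_words(open("export.jsonl", encoding="utf-8"))`, where `export.jsonl` stands for whatever file you downloaded.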
Feel free to reach out if you have any questions or run into any issues. Happy annotating! 😊