Doccano is an open-source text annotation tool for machine learning professionals. It sets annotation features for sequence labeling, text classification and sequence to sequence tasks. It has multiple applications like creating labeled data for sentiment analysis, named entity recognition, text summarization and so on.
Unlike other free open source annotation tools such as Brat and Anafora, Doccano has better modern UX experience. Other modern text annotations tools exist like Prodigy and LightTag, but they cost a lot.
- Collaborative annotation
- Multi-language support
- Mobile support
- Emoji 😄 support
- Dark theme
- RESTful API
Two options to run Doccano
- Docker Compose
$ git clone https://github.com/chakki-works/doccano.git $ cd doccano $ docker-compose -f docker-compose.prod.yml up
docker pull chakkiworks/doccano docker container create --name doccano \ -e "ADMIN_USERNAME=admin" \ -e "ADMIN_EMAILemail@example.com" \ -e "ADMIN_PASSWORD=password" \ -p 8000:8000 chakkiworks/doccano