Dataset
|
Domain
|
License
|
Reference
|
Availablility
|
CONLL 2003
|
News
|
DUA
|
Sang and Meulder, 2003
| |
NIST-IEER
|
News
|
None
|
NIST 1999 IE-ER
| |
MUC-6
|
News
|
LDC
|
Grishman and Sundheim, 1996
| |
OntoNotes 5
|
Various
|
LDC
|
Weischedel et al., 2013
| |
BBN
|
Various
|
LDC
|
Weischedel and Brunstein, 2005
| |
GMB-1.0.0
|
Various
|
None
|
Bos et al., 2017
| |
GUM-3.1.0
|
Wiki
|
Several (*2)
|
Zeldes, 2016
|
✔ Included here
|
wikigold
|
Wikipedia
|
CC-BY 4.0
|
Balasuriya et al., 2009
|
✔ Included here
|
Ritter
|
Twitter
|
None
|
Ritter et al., 2011
| |
BTC
|
Twitter
|
CC-BY 4.0
|
Derczynski et al., 2016
|
✔ Included here
|
WNUT17
|
Social media
|
CC-BY 4.0
|
Derczynski et al., 2017
|
✔ Included here
|
i2b2-2006
|
Medical
|
DUA
|
Uzuner et al., 2007
| |
i2b2-2014
|
Medical
|
DUA
|
Stubbs et al., 2015
| |
CADEC
|
Medical
|
CSIRO
|
Karimi et al., 2015
| |
AnEM
|
Anatomical
|
CC-BY-SA 3.0
|
Ohta et al., 2012
|
✔ Included here
|
MITRestaurant
|
Queries
|
None
|
Liu et al., 2013a
| |
MITMovie
|
Queries
|
None
|
Liu et al., 2013b
| |
MalwareTextDB
|
Malware
|
None
|
Lim et al., 2017
| |
re3d
|
Defense
|
Several (*1)
|
DSTL, 2017
|
✔ Included here
|
SEC-filings
|
Finance
|
CC-BY 3.0
|
Alvarado et al., 2015
|
✔ Included here
|
Assembly
|
Robotics
|
X
|
Costa et al., 2017
|
X
|
Thursday, 16 April 2020
English NER dataset
Subscribe to:
Post Comments (Atom)
Popular Posts
-
Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images We present Deep Fashion3D, a large-scale...
-
👉🏻 http://www.crowd-counting.com/#download A comprehensive dataset with 4,372 images and 1.51 million annotations. In comparison to...
-
**Paper:** https://arxiv.org/abs/1908.08345 **Dataset:** 1) the CNN/DailyMail news highlights dataset: somewhat Extractive - News Articles...
-
Recent Additions The UZH-FPV Drone Racing Dataset: High-speed, Aggressive 6DoF Trajectories for State Estimation and Drone Racing Hotels...
-
Data size is 100GB. Torrent files Link : https://bit.ly/2z8Rryd
-
https://metatext.io/datasets-list/finnish-language FI News Corpus Dataset is a collection of news headlines and short summaries of text, o...
-
CT images with clinical findings of COVID-19 The COVID-CT-Dataset has 275 CT images containing clinical findings of COVID-19. The images ...
-
github: https://github.com/layumi/University1652-Baseline
-
The goal: Given a sequence of click events performed by some user during a typical session in an e-commerce website, the goal is to predict...
-
This data set was created to understand the potential for machine learning, computer vision, and HPC to improve the energy efficiency aspec...
No comments:
Post a Comment