Dataset
|
Domain
|
License
|
Reference
|
Availablility
|
CONLL 2003
|
News
|
DUA
|
Sang and Meulder, 2003
| |
NIST-IEER
|
News
|
None
|
NIST 1999 IE-ER
| |
MUC-6
|
News
|
LDC
|
Grishman and Sundheim, 1996
| |
OntoNotes 5
|
Various
|
LDC
|
Weischedel et al., 2013
| |
BBN
|
Various
|
LDC
|
Weischedel and Brunstein, 2005
| |
GMB-1.0.0
|
Various
|
None
|
Bos et al., 2017
| |
GUM-3.1.0
|
Wiki
|
Several (*2)
|
Zeldes, 2016
|
✔ Included here
|
wikigold
|
Wikipedia
|
CC-BY 4.0
|
Balasuriya et al., 2009
|
✔ Included here
|
Ritter
|
Twitter
|
None
|
Ritter et al., 2011
| |
BTC
|
Twitter
|
CC-BY 4.0
|
Derczynski et al., 2016
|
✔ Included here
|
WNUT17
|
Social media
|
CC-BY 4.0
|
Derczynski et al., 2017
|
✔ Included here
|
i2b2-2006
|
Medical
|
DUA
|
Uzuner et al., 2007
| |
i2b2-2014
|
Medical
|
DUA
|
Stubbs et al., 2015
| |
CADEC
|
Medical
|
CSIRO
|
Karimi et al., 2015
| |
AnEM
|
Anatomical
|
CC-BY-SA 3.0
|
Ohta et al., 2012
|
✔ Included here
|
MITRestaurant
|
Queries
|
None
|
Liu et al., 2013a
| |
MITMovie
|
Queries
|
None
|
Liu et al., 2013b
| |
MalwareTextDB
|
Malware
|
None
|
Lim et al., 2017
| |
re3d
|
Defense
|
Several (*1)
|
DSTL, 2017
|
✔ Included here
|
SEC-filings
|
Finance
|
CC-BY 3.0
|
Alvarado et al., 2015
|
✔ Included here
|
Assembly
|
Robotics
|
X
|
Costa et al., 2017
|
X
|
Thursday 16 April 2020
English NER dataset
Subscribe to:
Post Comments (Atom)
Popular Posts
-
github: https://github.com/layumi/University1652-Baseline
-
image segmentation dataset github : https://github.com/divamgupta/image-segmentation-keras google drive : https://drive.google.com/uc...
-
**Paper:** https://arxiv.org/abs/1908.08345 **Dataset:** 1) the CNN/DailyMail news highlights dataset: somewhat Extractive - News Articles...
-
Dataset Domain License Reference Availablility CONLL 2003 News DUA Sang and Meulder, 2003 Easy to find NIST-IEER...
-
Best interesting data is football network refer to this page: http://www-personal.umich.edu/~mejn/netdata/
-
https://www.biomotionlab.ca/movi/
-
👉🏻 http://www.crowd-counting.com/#download A comprehensive dataset with 4,372 images and 1.51 million annotations. In comparison to...
-
Recent Additions The UZH-FPV Drone Racing Dataset: High-speed, Aggressive 6DoF Trajectories for State Estimation and Drone Racing Hotels...
-
Data size is 100GB. Torrent files Link : https://bit.ly/2z8Rryd
-
The goal: Given a sequence of click events performed by some user during a typical session in an e-commerce website, the goal is to predict...
No comments:
Post a Comment