Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How many training data(sentences) are required for custom NER using spacy python?[Just rought idea]

I want to know let's say I have 10 custom entities to recognize how much annotated training sentences should I give (Any rough idea) ??

Thank You, in Advance!! :)

I am new to this, please help

like image 880
Lalit Vyas Avatar asked Sep 19 '25 13:09

Lalit Vyas


1 Answers

For developing custom ner model at least 50-100 occurrences of each entity will be required along with their proper context. Otherwise if you have less data than your custom model will overfit on that. So, depending upon your data you will require atleast 200 to 300 sentences.

like image 181
Hitesh Laddha Avatar answered Sep 21 '25 03:09

Hitesh Laddha