Viewing a single comment thread. View all comments

pythoslabs t1_j0pqrsp wrote

Custom NER is the way to go. I believe you will have to run a custom annotation pipeline defining your custom NERs . In your case - do a fine tune on a model to train on the defined spans on a few documents on 'Goals' . ( If you have more than one NER, add spancategorizer into your pipeline ) https://spacy.io/api/spancategorizer

Check out "training custom NER in spacy" on youtube - you should get plenty of detailed videos.

And if you want to go an extra step and extract a cause and effect relationship (this is out of scope for your project though ), but for the benefit of any future reader coming here - in case you have a relation like "Goal" - "Action" , you can use the following two methods -

  1. Spacy has a model for this. ( you can create your entity relation extractor ) on this. Check out this video . https://www.youtube.com/watch?v=8HL-Ap5_Axo
  2. Kindred is a project which is specially for Biomedical text. eg : in case there is a cause - effect relationship ( check it out here - https://spacy.io/universe/project/kindred )

DM me in case you need any further points.

2