I was looking Stanford NER and thinking of using JAVA Apis it to extract postal address from a text document. The document may be any document where there is an postal address section e.g. Utility Bills, electricity bills.
So what I am thinking as the approach is,
I am trying to find a example pipeline for the same (what are the steps in details required), anyone has done this before? Suggestions welcome.
To be clear: all credit goes to Raj Vardhan (and John Bauer) who had an interaction on the [java-nlp-user] mailing list.
Raj Vardhan wrote about the plan to work on "finding street address in a sentence":
Here is an approach I have thought of:
- Find the event-anchor in a sentence
- Select outgoing-edges in the SemanticGraph from that event-node with relations such as *"prep-in" *or "prep-at".
- IF the dependent value in the relation has POS tag as NNP
a) Find outgoing-edges from dependent value's node with relations such as "nn"
b) Connect all such nodes in increasing order of occurrence in the sentence.
c) PRINT resulting value as Location where the event occurred
This is obviously with certain assumptions such as direct dependency between the event-anchor and location in a sentence.
Not sure whether this could help you, but I wanted to mention it just in case. Again, any credit should go to Raj Vardhan (and John Bauer).
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With