Skip to content
This repository has been archived by the owner on Jan 29, 2024. It is now read-only.

Feature: Named entity recognition #1

Open
parhamr opened this issue Feb 14, 2015 · 0 comments
Open

Feature: Named entity recognition #1

parhamr opened this issue Feb 14, 2015 · 0 comments

Comments

@parhamr
Copy link
Owner

parhamr commented Feb 14, 2015

From Wikipedia:

Given a stream of text, determine which items in the text map to proper names, such as people or places, and what the type of each such name is (e.g. person, location, organization). Note that, although capitalization can aid in recognizing named entities in languages such as English, this information cannot aid in determining the type of named entity, and in any case is often inaccurate or insufficient. For example, the first word of a sentence is also capitalized, and named entities often span several words, only some of which are capitalized. Furthermore, many other languages in non-Western scripts (e.g. Chinese or Arabic) do not have any capitalization at all, and even languages with capitalization may not consistently use it to distinguish names. For example, German capitalizes all nouns, regardless of whether they refer to names, and French and Spanish do not capitalize names that serve as adjectives.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

1 participant