Guide for NER Augmentation #19

DecentMakeover · 2019-08-08T10:08:47Z

Thanks for sharing your work. I could not find any NLP augmentation library other than this one.

Will this library help in augmenting NER data?

My data looks like this:

Ryan B-PER
Dsouza B-PER
/DOB O
11/11/1997 B-DOB
/MALE O
22 B-NUM
56565 B-NUM

Thanks in advance

makcedward · 2019-08-09T16:48:39Z

This library does not yet support generating augmented data for NER problems.

I can enhance it if there are any research papers related to this problem.

DecentMakeover · 2019-08-09T16:50:43Z

Maybe I can help. I have a custom dataset that I need to augment; maybe you could include that in your library?

makcedward · 2019-08-10T16:47:30Z

Thanks for your contribution.

Please share the corresponding papers with me so I can check whether this can be supported.

Zylatis · 2019-11-10T03:54:51Z

I'm really interested in this as well, as I am trying to do NER with a limited dataset. I'm not aware of any papers looking at this specifically, but I think it might be interesting to combine it with a data-generating DSL like Chatette (I actually asked about the problems nlpaug tackles in this issue: SimGus/Chatette#25).

I think a useful first step might be to just make the substitutions tag-aware, so that a substitution never changes a token's tag. Potentially you might also want a flag which prevents substitutions on tagged (i.e. not 'O') words altogether.
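A minimal sketch of the stricter variant described above (no substitutions on tagged words at all) might look like the following. It does not use nlpaug; `SYNONYMS` and `tag_aware_substitute` are hypothetical names invented for illustration, and the toy synonym table stands in for whatever augmenter would actually propose replacements.

```python
import random

# Hypothetical toy synonym table; a real setup would plug in an augmenter here.
SYNONYMS = {"likes": ["loves", "enjoys"], "dogs": ["puppies", "hounds"]}


def tag_aware_substitute(tokens, tags, synonyms, prob=1.0, rng=None):
    """Substitute only tokens tagged 'O'; tagged tokens and all tags stay unchanged."""
    rng = rng or random.Random()
    out = []
    for token, tag in zip(tokens, tags):
        candidates = synonyms.get(token.lower())
        if tag == "O" and candidates and rng.random() < prob:
            out.append(rng.choice(candidates))
        else:
            # Either the token is part of an entity or no replacement is known.
            out.append(token)
    return out, list(tags)


tokens = ["Peter", "likes", "dogs"]
tags = ["B-PER", "O", "O"]
new_tokens, new_tags = tag_aware_substitute(
    tokens, tags, SYNONYMS, rng=random.Random(0)
)
print(list(zip(new_tokens, new_tags)))
```

Because each substitution is one token for one token, the tag sequence stays aligned with the augmented sentence without any re-labelling.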

This of course presumes the existence of a labelled, if small, dataset, which I think is totally reasonable. I think combining context-aware vector substitutions with a DSL, and maybe some gazetteer pipelines to streamline external inputs, could be really powerful, and a cool project to work on if anyone is interested!

makcedward · 2019-11-10T07:20:10Z

@Zylatis
Thank you for your input. A DSL can be one solution for that. I will work out how nlpaug can support a DSL.

Before that, you may consider using the "stopwords" attribute to simulate tag-aware behavior. You can change the list of stopwords per augmentation.

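import nlpaug.augmenter.word as naw

text = "Peter likes dogs"
aug = naw.ContextualWordEmbsAug()
# Words listed as stopwords are skipped by the augmenter,
# so the tagged entity "Peter" is left untouched.
aug.stopwords = ['Peter']
aug.augment(text)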
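With a labelled dataset, the stopword list for each sentence could be derived from the tags instead of being written by hand. The helper below is a rough sketch under that assumption; `protected_tokens` is a hypothetical name, not part of nlpaug, and the input is assumed to be a list of (token, tag) pairs with 'O' marking untagged words.

```python
def protected_tokens(tagged_tokens):
    """Collect the unique tokens whose tag is not 'O', in first-seen order."""
    seen = []
    for token, tag in tagged_tokens:
        if tag != "O" and token not in seen:
            seen.append(token)
    return seen


sentence = [("Peter", "B-PER"), ("likes", "O"), ("dogs", "O")]
stopwords = protected_tokens(sentence)
print(stopwords)  # ['Peter']

# The result could then be assigned as in the snippet above, e.g.
# aug.stopwords = stopwords
```

One caveat with this approach: a word that is an entity in one position and an ordinary word elsewhere in the same sentence would be protected in both places, since the stopword list works on surface forms rather than positions.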

manishiitg · 2020-01-29T05:20:13Z

Hi,

I was looking for this too. The code snippet above is certainly helpful.

But there is another use case, in which we might want to substitute an NER-tagged entity with another word.

Is there an example of this?

manishiitg · 2020-01-29T07:28:17Z

This is a simple custom NER augmenter which might help

https://gist.github.com/manishiitg/8fd4209fcb3c6cb08ed34705c1f32c86

pratikchhapolika · 2023-03-03T05:58:56Z

Hi @makcedward @manishiitg, have there been any recent improvements for creating synthetic NER data?

Original text: 'My name is Pratik. I live in India'

Augmented versions could be:

'My name is Jon. I live in U.S.A'
'My name is Manish. I live in China'
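One way to get this kind of output from BIO-tagged data is to swap each entity span for another mention of the same type. The sketch below is not an nlpaug feature and is not taken from the gist linked earlier; `GAZETTEER`, `entity_spans`, and `replace_entities` are hypothetical names, and the replacement lists are made-up examples. It assumes well-formed BIO tags (each span starts with `B-` and continues with `I-` of the same type).

```python
import random

# Hypothetical gazetteer: entity type -> replacement mentions (already tokenised).
GAZETTEER = {
    "PER": [["Jon"], ["Manish"]],
    "LOC": [["China"], ["United", "States"]],
}


def entity_spans(tags):
    """Yield (start, end, type) for each BIO span; end is exclusive."""
    start = None
    # A trailing 'O' sentinel closes a span that runs to the end of the sentence.
    for i, tag in enumerate(list(tags) + ["O"]):
        if start is not None and not tag.startswith("I-"):
            yield start, i, tags[start][2:]
            start = None
        if tag.startswith("B-"):
            start = i


def replace_entities(tokens, tags, gazetteer, rng=None):
    """Swap every entity span for a random same-type mention and rebuild the tags."""
    rng = rng or random.Random()
    new_tokens, new_tags, cursor = [], [], 0
    for start, end, etype in entity_spans(tags):
        # Copy the untagged stretch before this entity unchanged.
        new_tokens.extend(tokens[cursor:start])
        new_tags.extend(tags[cursor:start])
        options = gazetteer.get(etype)
        mention = rng.choice(options) if options else tokens[start:end]
        new_tokens.extend(mention)
        new_tags.extend(["B-" + etype] + ["I-" + etype] * (len(mention) - 1))
        cursor = end
    new_tokens.extend(tokens[cursor:])
    new_tags.extend(tags[cursor:])
    return new_tokens, new_tags


tokens = "My name is Pratik . I live in India".split()
tags = ["O", "O", "O", "B-PER", "O", "O", "O", "O", "B-LOC"]
new_tokens, new_tags = replace_entities(tokens, tags, GAZETTEER, rng=random.Random(1))
print(" ".join(new_tokens))
print(new_tags)
```

The tags are regenerated for each replacement mention, so a one-token entity can be replaced by a multi-token one (or the reverse) while tokens and tags stay aligned.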

makcedward added the enhancement New feature or request label Aug 9, 2019

makcedward added the help wanted Extra attention is needed label Aug 26, 2019
