Suggestion for doing core for longer sequences?

Hi,

First, thank you for providing this valuable resource.
According to Table 4 of the Bert paper, for long sequences with length 1152+ the performance declined.
I wonder if I want to do the coref for my dataset in which average sequence length is 1500+, do you suggest using 'spanbert' on my data as it is. Or it is better to segment the data into pieces of length 512?
Of course both has it's drawbacks in negatively effecting the performance of pertained model but which approach do you suggest?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Suggestion for doing core for longer sequences? #33

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Suggestion for doing core for longer sequences? #33

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions