Is there a way to make wordninja aware of capital letters and be able to cut at caps boundaries if not informed by the dictionary? Or is there a way to build a caps aware dictionary?
for example: rCBVmeanSD should be cut as rCBV mean SD. But of course my domain specific language model must both define mean and means and the resulting cut is rCBV', 'meanS', 'D'.
Thoughts?
Is there a way to make wordninja aware of capital letters and be able to cut at caps boundaries if not informed by the dictionary? Or is there a way to build a caps aware dictionary?
for example:
rCBVmeanSDshould be cut asrCBVmeanSD. But of course my domain specific language model must both definemeanandmeansand the resulting cut isrCBV', 'meanS', 'D'.Thoughts?