Tags · linkedin/detext

v2.0.8

Use matrix for multiple py versions: python-app-py3.yml (#1)

Aug 22, 2020
180433f
zip
tar.gz
Notes

v2.0.6

fix text preprocessing for inference model (#40)

* fix text preprocessing for inference model

* add filter_window_sizes override for non-cnn models

Aug 5, 2020
7c1fac6
zip
tar.gz
Notes

v2.0.5-alpha

Update python-publish.yml

update username and secret name

Jul 21, 2020
64478f7
zip
tar.gz
Notes

2.0.4

Create python-publish.yml

Jul 18, 2020
cc59ff7
zip
tar.gz

v1.2.0

Add embedding and MLP support for sparse wide features (#24)

# Description

Currently DeText's design for sparse feature has simple modeling power for sparse features.
1. only linear model is applied on sparse features
2. there's no interaction between sparse features and dense features (model_score = dense_score + sparse_score)

This PR resolves the above limitation on sparse feature by
1. computing dense representation of sparse features
2. allowing interactions between sparse features and wide features

More specifically, the model architecture changes from
```
dense_score = dense_ftrs -> MLP
sparse_score = sparse_ftrs -> Linear
final_score = dense_score + sparse_score
```
to
```
sparse_emb_ftrs = sparse_ftrs -> Dense(sp_emb_size)
all_ftrs = (dense_ftrs, sparse_emb_ftrs) -> Concatenate
final_score= all_ftrs -> MLP
```
## Type of change

- [ ] New feature (non-breaking change which adds functionality)

## List all changes 
Please list all changes in the commit.
* Change sp_linear_model to sp_emb_model and add an option sp_emb_size to allow the sparse matrix to have output dimension > 1
* Change structure of dense & sparse feature interaction as mentioned in the PR description
* Add and restructure unit test for sparse embedding model
* Add new data for testing
* Add a sample tfrecord generation helper function in misc_utils.py
* Add instructions in TRAINING.md

# Testing
- Successfully run run_detext.sh for data including wide_sp_val and sp_emb_size=10
- Successfully run run_detext_multitask.sh for data
- Unit test for sparse_emb_model when sp_emb_size is 1 and > 1
# Checklist

- [ ] My code follows the style guidelines of this project
- [ ] I have performed a self-review of my own code
- [ ] I have commented my code, particularly in hard-to-understand areas
- [ ] I have made corresponding changes to the documentation
- [ ] My changes generate no new warnings
- [ ] I have added tests that prove my fix is effective or that my feature works
- [ ] New and existing unit tests pass locally with my changes
- [ ] Any dependent changes have been merged and published in downstream modules

May 29, 2020
f0ee982
zip
tar.gz
Notes

v1.1.0

update optimization test

May 13, 2020
3628982
zip
tar.gz
Notes

v1.0.12

expose tfrecord dataset transformation function for LinkedIn usage (#10)

Co-authored-by: Leon Gao <legao@linkedin.com>

May 1, 2020
97f51f0
zip
tar.gz
Notes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v2.0.8

v2.0.6

v2.0.5-alpha

2.0.4

v1.2.0

v1.1.0

v1.0.12

Tags: linkedin/detext

v2.0.8

v2.0.6

v2.0.5-alpha

2.0.4

v1.2.0

v1.1.0

v1.0.12