-
Notifications
You must be signed in to change notification settings - Fork 355
/
Copy pathTODO.txt
28 lines (23 loc) · 1.04 KB
/
TODO.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
Design of ConceptNet 5.5:
- [x] link the API responses (fix Pygments order)
- [x] Don't stem things on the way in
- [x] Figure out how to keep more Verbosity
- [x] Coarsen Wiktionary using etymologies
- [x] Handle 'sw-maps' correctly, no proliferation of N-triples, no double-escaping
- [x] Store the index in a reasonable constant database
- [x] Use the blacklist when making assertions
- [ ] Use JSON-LD when possible
- [x] Build ConceptNet vectors in build process
- [x] include word2vec and GloVe inputs in raw/
- [x] Automatically test ConceptNet vectors
- [x] Build vectors using less than 16 GB RAM
- [x] fix 'FormOf' links to self
- [ ] Try new lemmatizer as part of Numberbatch
- [ ] Use standardized language codes in Wiktionary forms
ConceptNet 5.6 perhaps:
- [ ] Nodes should have labels that are stored separately from their edges
- [ ] Break down the and-or trees into more machine-readable structures
- [ ] Use symmetrical relations
Cleaning up the data:
- [ ] Relabel vague nodes like "someone" and "something"
- [ ] Infer things from verb phrases