Skip to content

Commit 56a810c

Browse files
author
Patrick Wang
committed
rem print in get datastore data
1 parent 4ac4d2d commit 56a810c

File tree

2 files changed

+1
-1
lines changed

2 files changed

+1
-1
lines changed

datastore/get_datastore_chat.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,6 @@
4343
dataset = load_dataset('Aeala/ShareGPT_Vicuna_unfiltered', split='train')
4444
total_length = len(dataset)
4545
for conversations in tqdm(dataset, total=total_length):
46-
print('conversations', conversations, type(conversations))
4746
for sample in conversations['conversations']:
4847
token_list = tokenizer.encode(sample['value'])
4948
writer.add_entry(token_list)

requirements.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,3 +9,4 @@ openai
99
anthropic
1010
sentencepiece
1111
protobuf
12+
shortuuid

0 commit comments

Comments
 (0)