Test replit-code-v1-3b model #1299

abetlen · 2023-05-03T15:47:01Z

Replit recently trained a 3B-parameter Llama-style code model with some very promising results. Weights have been released here.

execveat · 2023-05-03T16:08:32Z

The demo looks extremely underwhelming. Nowhere close to Codex/Copilot: https://huggingface.co/spaces/replit/replit-code-v1-3b-demo

ggerganov · 2023-05-03T16:09:18Z

Would be nice to get a breakdown of the differences with the LLaMA architecture to get a feeling of how big of a task this would be

Heath123 · 2023-05-03T16:13:46Z

> The demo looks extremely underwhelming. Nowhere close to Codex/Copilot: https://huggingface.co/spaces/replit/replit-code-v1-3b-demo

Do you expect it to be, at 3B parameters? It might not be as good as Copilot, but it's still very good.

abetlen · 2023-05-03T16:35:59Z

> Would be nice to get a breakdown of the differences with the LLaMA architecture to get a feeling of how big of a task this would be

Ah, looks like I misread their press release: I took "Llama-style" to mean the exact architecture, not that it was trained past Chinchilla optimality. I'll try to look into this.

> The demo looks extremely underwhelming. Nowhere close to Codex/Copilot: https://huggingface.co/spaces/replit/replit-code-v1-3b-demo

I think the better comparison is against the Salesforce CodeGen models, as they're the best option for self-hosted code completion. It would be cool to build something like turbopilot, but this may require a separate ggml implementation for this model.
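
The "trained past Chinchilla optimality" point can be made concrete with a back-of-the-envelope check. This is only a sketch: the roughly 20 tokens-per-parameter rule of thumb and the 2.7B-parameter / 525B-token figures are assumptions recalled from the Chinchilla paper and Replit's model card, not numbers stated in this thread, so double-check them before relying on the result.

```python
# Rough check of how far past the Chinchilla rule of thumb a model was trained.
# All three constants below are ASSUMPTIONS (not from this thread).

CHINCHILLA_TOKENS_PER_PARAM = 20.0  # assumed rule of thumb
N_PARAMS = 2.7e9                    # assumed parameter count of replit-code-v1-3b
TRAIN_TOKENS = 525e9                # assumed total training tokens


def tokens_per_parameter(train_tokens: float, n_params: float) -> float:
    """Return how many training tokens were seen per model parameter."""
    return train_tokens / n_params


ratio = tokens_per_parameter(TRAIN_TOKENS, N_PARAMS)
multiple = ratio / CHINCHILLA_TOKENS_PER_PARAM

print(f"{ratio:.0f} tokens per parameter, {multiple:.1f}x the rule of thumb")
```

Under those assumed figures the model saw far more tokens per parameter than the rule of thumb suggests, which is the sense in which it resembles LLaMA's training recipe rather than its architecture.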

Green-Sky · 2023-05-03T17:00:33Z

> replit-code-v1-3b is powered by state-of-the-art LLM techniques, such as: Flash Attention for fast training and inference, AliBi positional embeddings to support variable context length at inference time, LionW optimizer, etc.

ALiBi got merged recently.
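
For anyone sizing up the port, here is a minimal pure-Python sketch of what ALiBi adds to attention: instead of positional embeddings, each head gets a fixed slope and a linear penalty on query-key distance is added to the attention scores. The function names are hypothetical, the sketch assumes a power-of-two head count, and it folds the causal mask into the bias for brevity. It is not the ggml implementation, just an illustration of the technique.

```python
def alibi_slopes(n_heads: int) -> list[float]:
    """Per-head slopes: a geometric sequence starting at 2^(-8 / n_heads).

    Assumption: n_heads is a power of two (other counts need extra handling).
    """
    if n_heads <= 0 or n_heads & (n_heads - 1):
        raise ValueError("this sketch only handles power-of-two head counts")
    start = 2.0 ** (-8.0 / n_heads)
    return [start ** (i + 1) for i in range(n_heads)]


def alibi_bias(n_heads: int, seq_len: int) -> list[list[list[float]]]:
    """Return bias[h][i][j], to be added to the score of query i and key j.

    Keys at or before the query get a penalty proportional to their distance;
    future keys (j > i) are masked with -inf (causal mask folded in here).
    """
    neg_inf = float("-inf")
    return [
        [
            [slope * (j - i) if j <= i else neg_inf for j in range(seq_len)]
            for i in range(seq_len)
        ]
        for slope in alibi_slopes(n_heads)
    ]


if __name__ == "__main__":
    bias = alibi_bias(n_heads=8, seq_len=4)
    # Head 0 has the steepest slope, so distant keys are penalized the most.
    for row in bias[0]:
        print(row)
```

Because the bias depends only on relative distance, the same formula works for any sequence length at inference time, which is the "variable context length" property mentioned in the quote.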

v3ss0n · 2023-05-04T04:55:04Z

> The demo looks extremely underwhelming. Nowhere close to Codex/Copilot: https://huggingface.co/spaces/replit/replit-code-v1-3b-demo

Can you show your results? Mine look quite good and already usable; this with a chat-tuned LoRA will be quite amazing.

""" SQLAlchemy model for relationship betweek tasks and projects """ # < thats the only input i gave
class Project(Base):
    __tablename__ = 'project'

    id = Column(Integer, primary_key=True)
    name = Column(String, nullable=False)
    description = Column(String, nullable=False)
    tasks = relationship('Task', back_populates='project')

    def __init__(self, name, description):
        self.name = name
        self.description = description

    def __repr__(self):
        return f"<Project(name='{self.name}', description='{self.description}')>"

class Task(Base):
    __tablename__ = 'task'

    id = Column(Integer, primary_key=True)
    name = Column(String, nullable=False)
    description = Column(String, nullable=False)
    project_id = Column(Integer, ForeignKey('project.id'))
    project = relationship('Project', back_populates='tasks')

    def __init__(self, name, description, project_id):
        self.name = name
        self.description = description
        self.project_id = project_id

    def __repr__(self):
        return f"<Task(name='{self.name}', description='{self.description}', project_id='{self.project_id}')>"

ElYaiko · 2023-05-04T17:20:21Z

> The demo looks extremely underwhelming. Nowhere close to Codex/Copilot: https://huggingface.co/spaces/replit/replit-code-v1-3b-demo

> Can you show your results? Mine look quite good and already usable; this with a chat-tuned LoRA will be quite amazing.

Yes, it gives OK results.
Is there any chance of using this model with ggml?

Green-Sky mentioned this issue on May 4, 2023: replit-code-v1-3b ggerganov/ggml#131 (Closed)

ggerganov added the "help wanted" and "model" labels on May 5, 2023

gardner mentioned this issue on Sep 17, 2023: Requesting Support for phi-1_5 by Microsoft #3146 (Closed)
