You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
From the paper "CODEGEN: AN OPEN LARGE LANGUAGE MODEL FOR CODE WITH MULTI-TURN PROGRAM SYNTHESIS", the architecture of CodeGen follows a standard transformer decoder with left-to-right causal masking. How do you use CodeGen-mono 350M (decoder) to initialize the encoder? As far as I know, there is a slight difference between encoder and decoder.
The text was updated successfully, but these errors were encountered:
From the paper "CODEGEN: AN OPEN LARGE LANGUAGE MODEL FOR CODE WITH MULTI-TURN PROGRAM SYNTHESIS", the architecture of CodeGen follows a standard transformer decoder with left-to-right causal masking. How do you use CodeGen-mono 350M (decoder) to initialize the encoder? As far as I know, there is a slight difference between encoder and decoder.
The text was updated successfully, but these errors were encountered: